YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Arabic End-of-Utterance (EOU) Classifier

Overview

This repository contains a custom PyTorch model for End-of-Utterance (EOU) detection in Arabic conversational text.
The model predicts whether a given text segment represents the end of a speaker’s turn.

This is a custom architecture (not a Hugging Face AutoModel) and is intended for research and development use.


Task

Given an input text segment, the model outputs a binary prediction:

  • 0 → The speaker is expected to continue speaking
  • 1 → The speaker has finished their turn

Model Details

  • Framework: PyTorch
  • Architecture: Custom EOUClassifier
  • Task: Binary classification (EOU detection)
  • Language: Arabic

Tokenizer

This model uses the tokenizer from:

Omartificial-Intelligence-Space/SA-BERT-V1

The tokenizer is not included in this repository and must be loaded separately.


Files

  • model.py — Model architecture (EOUClassifier)
  • model.pt — Trained model weights
  • config.json — Model configuration
  • README.md — This file

Loading the Model

import torch
from transformers import AutoTokenizer
from model import EOUClassifier

tokenizer = AutoTokenizer.from_pretrained(
    "Omartificial-Intelligence-Space/SA-BERT-V1"
)

model = EOUClassifier()
model.load_state_dict(
    torch.load("model.pt", map_location="cpu")
)
model.eval()

examples = ["مقصدي من الموضوع انه", "اتمنى تقدر تساعدني"]


batch = tokenizer(examples, padding=True, truncation=True, return_tensors="pt")
batch.to(device)

out = model(batch["input_ids"], batch["attention_mask"])

license

MIT

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support