CreoleEval: Martinican Creole POS Tagger

Model type: AutoModelForTokenClassification
Task: Universal Part-Of-Speech Tagging (UPOS)
Language: Martinican Creole (Kréyol Matnik)
License: CC BY 4.0

Model Description

This model is an mBERT (bert-base-multilingual-cased) checkpoint fine-tuned on the Martinican Creole treebank for UPOS tagging. It takes tokenized text and predicts one of the 17 universal part-of-speech tags for each token.
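
For reference, the 17 tags of the Universal Dependencies UPOS inventory can be written out as a label mapping. This is only a sketch: the alphabetical ordering and the names `UPOS_TAGS`, `label2id`, and `id2label` are assumptions made for illustration, and the label order actually stored in this checkpoint's configuration is not stated in this card.

```python
# The 17 universal POS tags defined by Universal Dependencies.
UPOS_TAGS = (
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
)

# Assumed ordering (alphabetical); the checkpoint's own config may differ.
label2id = {tag: idx for idx, tag in enumerate(UPOS_TAGS)}
id2label = {idx: tag for tag, idx in label2id.items()}

print(len(UPOS_TAGS))
```

A token-classification head for this task has one output per tag, so a mapping like this is what turns a predicted class index back into a readable label.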

Accuracy: 0.88
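
The card does not say which data split or which exact metric definition produced this figure. As a rough sketch, assuming it is token-level accuracy (correctly tagged tokens divided by all tokens), it could be computed as below. The function name `token_accuracy` and the tag sequences are hypothetical and are not taken from the treebank.

```python
def token_accuracy(gold_sentences, predicted_sentences):
    """Return the fraction of tokens whose predicted tag equals the gold tag."""
    correct = 0
    total = 0
    for gold, pred in zip(gold_sentences, predicted_sentences):
        if len(gold) != len(pred):
            raise ValueError("gold and predicted sentences must align token by token")
        correct += sum(g == p for g, p in zip(gold, pred))
        total += len(gold)
    # Avoid division by zero on empty input.
    return correct / total if total else 0.0


# Made-up tag sequences, for illustration only.
gold = [["PRON", "AUX", "VERB", "PUNCT"], ["DET", "NOUN", "VERB", "ADJ"]]
pred = [["PRON", "AUX", "VERB", "PUNCT"], ["DET", "NOUN", "NOUN", "ADJ"]]

print(token_accuracy(gold, pred))  # 7 of 8 tokens match -> 0.875
```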

First Version: 4/22/2025 - Last Updated on 4/25/2025


Intended Use

  • Primary use: Automatic UPOS tagging of Martinican Creole text.
  • Not for: Named-entity recognition, dependency parsing, or languages other than Martinican Creole without further fine-tuning.
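
Example usage with the transformers pipeline:

from transformers import pipeline

# Load the fine-tuned checkpoint and its tokenizer from the Hugging Face Hub.
tagger = pipeline(
    "token-classification",
    model="lmompelat/mbert-martinican-pos-tagger",
    tokenizer="lmompelat/mbert-martinican-pos-tagger",
)

# Tag a Martinican Creole sentence; the result holds one prediction per token.
example = "Mwen té ka alé an lékol."
predictions = tagger(example)
print(predictions)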
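
Because mBERT uses a WordPiece tokenizer, the raw pipeline output is per subword piece, with continuation pieces prefixed by `##`. The sketch below shows one possible way to merge pieces back into words, keeping the tag of the first piece. The helper `merge_wordpieces` is hypothetical, and `mock_predictions` is hand-written data shaped like pipeline output (keys `word`, `entity`, `score`), not real output from this model.

```python
def merge_wordpieces(predictions):
    """Join '##' continuation pieces onto the preceding word, keeping the first piece's tag."""
    words = []
    for pred in predictions:
        piece = pred["word"]
        if piece.startswith("##") and words:
            # Continuation piece: append its text to the current word.
            words[-1][0] += piece[2:]
        else:
            # New word: remember its text and the tag of its first piece.
            words.append([piece, pred["entity"]])
    return [(word, tag) for word, tag in words]


# Hand-written mock data, not actual model predictions.
mock_predictions = [
    {"word": "Mwen", "entity": "PRON", "score": 0.99},
    {"word": "lék", "entity": "NOUN", "score": 0.95},
    {"word": "##ol", "entity": "NOUN", "score": 0.93},
    {"word": ".", "entity": "PUNCT", "score": 0.99},
]

print(merge_wordpieces(mock_predictions))
```

Taking the first piece's tag is a common convention for word-level tagging with subword models, but it is a design choice here rather than something this card specifies.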

Citation

Author: Ludovic Mompelat

Email: [email protected]

Model size: 177M params (Safetensors, tensor type F32)