This is a MicroBERT model for Maltese.

  • Its suffix is -mxp, which means that it was pretrained using supervision from masked language modeling, XPOS tagging, and UD dependency parsing.
  • The unlabeled Maltese data was taken from a February 2022 dump of Maltese Wikipedia, totaling 2,113,223 tokens.
  • The UD treebank UD_Maltese-GSD, v2.9, totaling 44,162 tokens, was used for labeled data.

Please see the repository and the paper for more details.

Downloads last month
10
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support