TweebankNLP
/

bertweet-tb2_ewt-pos-tagging

Token Classification

Inference Endpoints

Model card Files Files and versions Community

hjian42 commited on May 4, 2022

Commit

6cda070

·

1 Parent(s): 482c3ee

Update README.md

Files changed (1) hide show

README.md +28 -0

README.md CHANGED Viewed

@@ -1,3 +1,31 @@
 ---
 license: cc-by-nc-4.0
 ---

 ---
 license: cc-by-nc-4.0
 ---
+## Model Specification
+- This is the **state-of-the-art Twitter POS tagging model (with 95.38\% Accuracy)** on Tweebank V2's NER benchmark (also called `Tweebank-NER`), trained on the corpus combining both Tweebank-NER and English-EWT training data.
+- For more details about the `TweebankNLP` project, please refer to this [our paper](https://arxiv.org/pdf/2201.07281.pdf) and [github](https://github.com/social-machines/TweebankNLP) page.
+- In the paper, it is referred as `HuggingFace-BERTweet (TB2+EWT)` in the POS table.
+## How to use the model
+```python
+from transformers import AutoTokenizer, AutoModelForTokenClassification
+tokenizer = AutoTokenizer.from_pretrained("TweebankNLP/bertweet-tb2_ewt-pos-tagging")
+model = AutoModelForTokenClassification.from_pretrained("TweebankNLP/bertweet-tb2_ewt-pos-tagging")
+```
+## References
+If you use this repository in your research, please kindly cite [our paper](https://arxiv.org/pdf/2201.07281.pdf):
+```bibtex
+@article{jiang2022tweetnlp,
+    title={Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis},
+    author={Jiang, Hang and Hua, Yining and Beeferman, Doug and Roy, Deb},
+    journal={In Proceedings of the 13th Language Resources and Evaluation Conference (LREC)},
+    year={2022}
+}
+```