samirmsallem
/

distilbert-base-multilingual-cased-definitions_ner

Token Classification

Model card Files Files and versions Community

distilbert-base-multilingual-cased-definitions_ner / README.md

samirmsallem's picture

Update README.md

0f41209 verified 9 days ago

|

history blame contribute delete

2.92 kB

	---
	datasets:
	- samirmsallem/wiki_def_de_multitask
	language:
	- de
	base_model:
	- distilbert/distilbert-base-multilingual-cased
	library_name: transformers
	tags:
	- science
	- ner
	- def_extraction
	- definitions
	metrics:
	- precision
	- recall
	- f1
	- accuracy
	model-index:
	- name: checkpoints
	results:
	- task:
	name: Token Classification
	type: token-classification
	dataset:
	name: samirmsallem/wiki_def_de_multitask
	type: samirmsallem/wiki_def_de_multitask
	metrics:
	- name: F1
	type: f1
	value: 0.812455003599712
	- name: Precision
	type: precision
	value: 0.8076097328244275
	- name: Recall
	type: recall
	value: 0.8173587638821825
	- name: Loss
	type: loss
	value: 0.329479843378067
	---

	## NER model for definition component recognition in German scientific texts

	distilbert-base-multilingual-cased-definitions_ner is a NER model (token classification) in the scientific domain in German, finetuned from the model [distilbert-base-multilingual-cased](https://huggingface.co/distilbert/distilbert-base-multilingual-cased).
	It was trained using a custom annotated dataset of around 10,000 training and 2,000 test examples containing definition- and non-definition-related sentences from wikipedia articles in german.

	The model is specifically designed to recognize and classify components of definitions, using the following entity labels:
	- DF: Definiendum (the term being defined)
	- VF: Definitor (the verb or phrase introducing the definition)
	- GF: Definiens (the explanation or meaning)

	Training was conducted using a standard NER objective. The model achieves an F1 score of approximately 81% on the evaluation set.

	Here are the overall final metrics on the test dataset after 5 epochs of training:
	- f1: 0.812455003599712
	- precision: 0.8076097328244275
	- recall: 0.8173587638821825
	- loss: 0.329479843378067


	## Model Performance Comparision on wiki_definitions_de_multitask:

	\| Model \| Precision \| Recall \| F1 Score \| Eval Samples per Second \| Epoch \|
	\| --- \| --- \| --- \| --- \| --- \| --- \|
	\| [distilbert-base-multilingual-cased-definitions_ner](https://huggingface.co/samirmsallem/distilbert-base-multilingual-cased-definitions_ner/) \| 80.76 \| 81.74 \| 81.25 \| 457.53 \| 5.0 \|
	\| [scibert_scivocab_cased-definitions_ner](https://huggingface.co/samirmsallem/scibert_scivocab_cased-definitions_ner) \| 80.54 \| 82.11 \| 81.32 \| 236.61 \| 4.0 \|
	\| [GottBERT_base_best-definitions_ner](https://huggingface.co/samirmsallem/GottBERT_base_best-definitions_ner) \| 82.98 \| 82.81 \| 82.90 \| 272.26 \| 5.0 \|
	\| [xlm-roberta-base-definitions_ner](https://huggingface.co/samirmsallem/xlm-roberta-base-definitions_ner) \| 81.90 \| 83.35 \| 82.62 \| 241.21 \| 5.0 \|
	\| [gbert-base-definitions_ner](https://huggingface.co/samirmsallem/gbert-base-definitions_ner) \| 82.73 \| 83.56 \| 83.14 \| 278.87 \| 5.0 \|