teoyidu
/

nomic-moe-turkish-v1

Zero-Shot Classification

sentence-transformers

Model card Files Files and versions Community

teoyidu commited on Mar 14

Commit

f976017

·

verified ·

1 Parent(s): 936a387

Update README.md

Files changed (1) hide show

README.md +41 -5

README.md CHANGED Viewed

@@ -1,13 +1,49 @@
-# This is an embedding model trained on triplet turkish corpus data. It performs well compared to other preexisting turkish embedding models. Thats all
 ---
 license: apache-2.0
 datasets:
-- emrecan/stsb-mt-turkish
 language:
 - tr
 base_model:
 - nomic-ai/nomic-embed-text-v2-moe
 library_name: sentence-transformers
-tags:
-- embedding
----

 ---
 license: apache-2.0
 datasets:
+- emrecan/all-nli-tr
 language:
 - tr
+- en
+metrics:
+- spearmanr
+- accuracy
+- bertscore
 base_model:
 - nomic-ai/nomic-embed-text-v2-moe
+pipeline_tag: zero-shot-classification
 library_name: sentence-transformers
+---
+# Model Card: Turkish Triplet Embedding Model (Nomic MoE)
+## Model Description
+This is an embedding model trained on a Turkish triplet corpus, utilizing the dataset [`emrecan/all-nli-tr`](https://huggingface.co/datasets/emrecan/all-nli-tr). The model is based on **Nomic Mixture of Experts (MoE)** and achieves strong performance compared to other existing Turkish embedding models.
+### **Intended Use**
+- Semantic similarity tasks
+- Text clustering
+- Information retrieval
+- Sentence and document-level embedding generation
+### **Training Details**
+- **Architecture:** Nomic Mixture of Experts (MoE)
+- **Dataset:** `emrecan/all-nli-tr`
+- **Training Objective:** Triplet loss for contrastive learning
+### **Evaluation & Performance**
+Compared to other Turkish embedding models, this model demonstrates strong performance in capturing semantic relationships within the language. Further evaluations and benchmarks will be shared as they become available.
+### **How to Use**
+You can use this model with Hugging Face's `transformers` or `sentence-transformers` library:
+```python
+from sentence_transformers import SentenceTransformer
+model = SentenceTransformer("your-huggingface-model-name")
+embeddings = model.encode(["Merhaba dünya!", "Bugün hava çok güzel."])
+print(embeddings)
+```
+### **License & Citation**
+Please refer to the repository for licensing details and citation instructions.