LocalDoc/azerbaijani_spell_corrector

vrashad committed on Nov 30, 2024

Commit 20f6ae7 (verified) · 1 parent: dae7fa6

Update README.md

Files changed (1)

README.md +20 -0

@@ -91,6 +91,26 @@ These substitutions represent phonetic similarities and common mistakes made by
 To use the model for spell correction:
+```python
+import torch
+from transformers import MT5ForConditionalGeneration, MT5Tokenizer
+# Load the tokenizer and model
+tokenizer = MT5Tokenizer.from_pretrained('LocalDoc/azerbaijani_spell_corrector')
+model = MT5ForConditionalGeneration.from_pretrained('LocalDoc/azerbaijani_spell_corrector')
+# Function to correct sentences
+def correct_sentence(sentence):
+    input_text = "correct: " + sentence
+    input_ids = tokenizer.encode(input_text, return_tensors='pt', max_length=128, truncation=True)
+    outputs = model.generate(input_ids=input_ids, max_length=128, num_beams=5, early_stopping=True)
+    corrected_sentence = tokenizer.decode(outputs[0], skip_special_tokens=True)
+    return corrected_sentence
+# Example usage
+incorrect_sentence = "Pul dogru adamlarda deyil"
+print(correct_sentence(incorrect_sentence))
+```
 ## License
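
The snippet added in this commit corrects one sentence per call. As a rough sketch of how the same calls might be extended to several sentences at once, the variant below pads the inputs into a single batch. The helper names `add_prefix`, `correct_batch`, and `demo` are hypothetical and are not part of the README. The `"correct: "` prefix, `max_length=128`, and `num_beams=5` are copied from the committed example. Whether batching changes output quality for this model has not been checked here.

```python
# Hypothetical batch variant of the README example; helper names are illustrative only.
PREFIX = "correct: "  # task prefix taken from the committed README snippet


def add_prefix(sentences):
    """Prepend the task prefix to every input sentence."""
    return [PREFIX + s for s in sentences]


def correct_batch(sentences, tokenizer, model, max_length=128):
    """Correct a list of sentences in one generate() call.

    `tokenizer` and `model` are expected to be loaded as in the README example.
    """
    import torch  # imported lazily so the helpers above work without PyTorch

    # Pad to the longest sentence so the batch forms a rectangular tensor.
    batch = tokenizer(
        add_prefix(sentences),
        return_tensors="pt",
        padding=True,
        truncation=True,
        max_length=max_length,
    )
    with torch.no_grad():  # inference only, no gradients needed
        outputs = model.generate(
            input_ids=batch["input_ids"],
            attention_mask=batch["attention_mask"],  # keeps padding from being attended to
            max_length=max_length,
            num_beams=5,
            early_stopping=True,
        )
    return tokenizer.batch_decode(outputs, skip_special_tokens=True)


def demo():
    """Download the model and correct two sample sentences (requires network access)."""
    from transformers import MT5ForConditionalGeneration, MT5Tokenizer

    name = "LocalDoc/azerbaijani_spell_corrector"
    tokenizer = MT5Tokenizer.from_pretrained(name)
    model = MT5ForConditionalGeneration.from_pretrained(name)
    for line in correct_batch(["Pul dogru adamlarda deyil"] * 2, tokenizer, model):
        print(line)
```

Calling `demo()` downloads the model weights and prints the corrected sentences. Passing the attention mask explicitly matters once inputs are padded, which is the main difference from the single-sentence version.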