SIGIR2025: Lost in Transliteration
Collection
Model Checkpoints and Data for the SIGIR Short Paper: Lost in Transliteration: Bridging the Script Gap in Neural IR
•
19 items
•
Updated
This is a BGE-M3 model post-trained on the Chinese dataset from MMARCO/v2.
The model was used for the SIGIR 2025 Short paper: Lost in Transliteration: Bridging the Script Gap in Neural IR.
Base model
BAAI/bge-m3