JQL: Judging Quality Across Languages
Filter multilingual data to improve LLM training
Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models