Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
76
22
David Dale
cointegrated
Follow
leolloyd's profile picture
d0rj's profile picture
xmzhao's profile picture
77 followers
·
8 following
https://daviddale.ru/en
cointegrated
avidale
AI & ML interests
Research engineer at FAIR, Meta. Some pet projects on NLP for under-resourced languages. Interests: Machine translation, Chatbots, applied NLU, controllable text generation (in particular, text style transfer), miniature models.
Recent Activity
liked
a dataset
3 days ago
rombodawg/Everything_Instruct_Multilingual
new
activity
11 days ago
openlanguagedata/flores_plus:
[DRAFT] Fix orthography in the Russian dev set
new
activity
11 days ago
openlanguagedata/flores_plus:
Fix encoding at chv devtest
View all activity
Organizations
cointegrated
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
3 days ago
rombodawg/Everything_Instruct_Multilingual
Viewer
•
Updated
Oct 8, 2024
•
5.81M
•
242
•
23
New activity in
openlanguagedata/flores_plus
11 days ago
[DRAFT] Fix orthography in the Russian dev set
4
#4 opened 3 months ago by
cointegrated
Fix encoding at chv devtest
4
#9 opened about 2 months ago by
alexantonov
liked
a dataset
23 days ago
google/wmt24pp
Viewer
•
Updated
3 days ago
•
54.9k
•
5.06k
•
30
New activity in
slone/nllb-rus-tyv-v1
24 days ago
Adding `safetensors` variant of this model
#1 opened 24 days ago by
SFconvertbot
New activity in
cointegrated/LaBSE-en-ru
26 days ago
Warn Some weights of the model checkpoint at cointegrated/LaBSE-en-ru were not used when initializing BertModel:
1
#4 opened 5 months ago by
alashkov83
New activity in
slone/LaBSE-shallow-distilled-bak
about 1 month ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
cointegrated/SONAR_200_text_encoder
about 2 months ago
can you please do the same for decoder
1
#2 opened 3 months ago by
damerajee
New activity in
slone/finugorbib
about 2 months ago
[bot] Conversion to Parquet
#1 opened about 2 months ago by
parquet-converter
liked
a dataset
about 2 months ago
udmurtNLP/udmurt-russian-parallel-corpora
Viewer
•
Updated
Feb 1
•
102k
•
80
•
3
New activity in
openlanguagedata/flores_plus
about 2 months ago
Added Dargwa dev set to flores_plus
2
#3 opened 3 months ago by
Murtazali
published
a dataset
about 2 months ago
slone/finugorbib
Viewer
•
Updated
Jan 27
•
849k
•
170
•
1
updated
a dataset
about 2 months ago
slone/finugorbib
Viewer
•
Updated
Jan 27
•
849k
•
170
•
1
liked
a dataset
about 2 months ago
alexantonov/chukot_russian_flores_sample
Viewer
•
Updated
Jan 31
•
100
•
114
•
4
liked
a model
2 months ago
Helsinki-NLP/opus-mt-tc-bible-big-mul-mul
Translation
•
Updated
Oct 12, 2024
•
707
•
•
4
New activity in
openlanguagedata/flores_plus
3 months ago
Add data integrity tests
1
#7 opened 3 months ago by
cointegrated
updated
a dataset
3 months ago
openlanguagedata/flores_plus
Viewer
•
Updated
20 days ago
•
434k
•
2.09k
•
28
New activity in
openlanguagedata/flores_plus
3 months ago
Two sentences in the dev set (one Lombard and one Tamasheq-Tifinagh) seem to be missing
#6 opened 3 months ago by
cointegrated
liked
2 datasets
3 months ago
aronlp/aromanian-romanian-MT-corpus
Viewer
•
Updated
Jan 15
•
105k
•
14
•
1
ontocord/fineweb-permissive-multilingual-2m
Viewer
•
Updated
Oct 9, 2024
•
2.23M
•
171
•
2
Load more