Hugging Face
Gordon
GordonM
2 followers · 25 following
https://gordonmcd.com
DrGordonMcD
gdmcdonald
AI & ML interests
Data Science for good
Recent Activity
reacted to MoritzLaurer's post 5 days ago
Quite excited by the ModernBERT release! 0.15/0.4B small, 2T modern pre-training data and tokenizer with code, 8k context window, great efficient model for embeddings & classification! This will probably be the basis for many future SOTA encoders! And I can finally stop using DeBERTav3 from 2021 :D Congrats @answerdotai, @LightOnIO and collaborators like @tomaarsen ! Paper and models here: https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb
reacted to MoritzLaurer's post with 🔥 5 days ago
Releasing a new zeroshot-classifier based on ModernBERT! Some key takeaways:
- Speed & efficiency: It's multiple times faster and uses significantly less memory than DeBERTav3. You can use larger batch sizes and enabling bf16 (instead of fp16) gave me a ~2x speed boost as well
- Performance tradeoff: It performs slightly worse than DeBERTav3 on average across my zeroshot classification task collection
- Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k).
- What's next? I'm preparing a newer version trained on better + longer synthetic data to fully leverage the 8k context window and improve upon the training mix of my older zeroshot-v2.0 models. I also hope that there will be a multilingual variant in the future.
Great work by https://huggingface.co/answerdotai ! If you're looking for a high-speed zeroshot classifier, give it a try!
Resources below:
Base model: https://huggingface.co/MoritzLaurer/ModernBERT-base-zeroshot-v2.0
Large model: https://huggingface.co/MoritzLaurer/ModernBERT-large-zeroshot-v2.0
Updated zeroshot collection: https://huggingface.co/collections/MoritzLaurer/zeroshot-classifiers-6548b4ff407bb19ff5c3ad6f
ModernBERT collection with paper: https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb
liked a model about 1 month ago: Varosa/SeamlessExpressive
Organizations
spaces
2
Vfhgkbbjl (No application file)
BandiCount (Runtime error)
models
0
None public yet
datasets
0
None public yet