nlpguy
nlpguy
AI & ML interests
large language models
------------------
profile pic
-----------------------------------
picmix.com/pic/98-tan-12224111
picmix.com/profile/Lily1027
Recent Activity
liked
a model
8 days ago
kalomaze/Qwen3-16B-A3B
new activity
8 days ago
Qwen/Qwen3-30B-A3B:Qwen3 is great, but could be better.
new activity
27 days ago
meta-llama/Llama-4-Scout-17B-16E-Instruct:Less Knowledge Than Llama 3.3 70b?
Organizations
None yet
nlpguy's activity
Qwen3 is great, but could be better.
7
19
#18 opened 8 days ago
by
phil111
Less Knowledge Than Llama 3.3 70b?
2
5
#60 opened 28 days ago
by
phil111
Uhh... does voting still work?
4
#1099 opened 3 months ago
by
nlpguy

This Mistral Small has FAR less knowledge than the last.
5
20
#5 opened 3 months ago
by
phil111
🚩 Report: Not working
2
11
#1082 opened 3 months ago
by
ehristoforu

Mergekit config
1
2
#2 opened 3 months ago
by
ehartford

We are working on creating a single 22b from this model
16
21
#5 opened about 1 year ago
by
rombodawg

Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
3
2
#27 opened 4 months ago
by
phil111
🚩 Report: Legal issue(s)
3
7
#9 opened 5 months ago
by
nightflightdk
Notably better than Phi3.5 in many ways, but something is wrong.
1
8
#5 opened 5 months ago
by
phil111
How do you quantitize that so quickly?
1
#1 opened 6 months ago
by
nlpguy

Adding Evaluation Results
#1 opened 8 months ago
by
leaderboard-pr-bot

Was this dataset created with Claude Sonnet 3 or 3.5?
2
#2 opened 8 months ago
by
nlpguy

leaderboard should be more curated
7
#908 opened 8 months ago
by
ehartford

Licence issue
2
#55 opened 9 months ago
by
Ayaz550
Model Failed: StableProse
3
#894 opened 9 months ago
by
nlpguy

would you consider publishing the intermediate models from step 1 and 2
2
#1 opened 9 months ago
by
nlpguy

Voting System: You can vote for your own model.
2
3
#851 opened 10 months ago
by
nlpguy

Submitted models aren't showing up
4
#835 opened 10 months ago
by
Stark2008
