nlpguy

AI & ML interests

large language models

profile pic: picmix.com/pic/98-tan-12224111 picmix.com/profile/Lily1027

Recent Activity

liked a model 8 days ago

kalomaze/Qwen3-16B-A3B

new activity 8 days ago

Qwen/Qwen3-30B-A3B:Qwen3 is great, but could be better.

new activity 27 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct:Less Knowledge Than Llama 3.3 70b?

Organizations

None yet

nlpguy's activity

New activity in Qwen/Qwen3-30B-A3B 8 days ago

Qwen3 is great, but could be better.

#18 opened 8 days ago by

phil111

New activity in meta-llama/Llama-4-Scout-17B-16E-Instruct 27 days ago

Less Knowledge Than Llama 3.3 70b?

#60 opened 28 days ago by

phil111

New activity in open-llm-leaderboard/open_llm_leaderboard 3 months ago

Uhh... does voting still work?

#1099 opened 3 months ago by

nlpguy

New activity in mistralai/Mistral-Small-24B-Instruct-2501 3 months ago

This Mistral Small has FAR less knowledge than the last.

#5 opened 3 months ago by

phil111

New activity in mistralai/Mistral-Small-24B-Base-2501 3 months ago

First

#2 opened 3 months ago by

teknium

New activity in open-llm-leaderboard/open_llm_leaderboard 3 months ago

🚩 Report: Not working

#1082 opened 3 months ago by

ehristoforu

New activity in mkurman/Qwen2.5-14B-DeepSeek-R1-1M 3 months ago

Mergekit config

#2 opened 3 months ago by

ehartford

New activity in mistral-community/Mixtral-8x22B-v0.1 4 months ago

We are working on creating a single 22b from this model

#5 opened about 1 year ago by

rombodawg

New activity in deepseek-ai/DeepSeek-V3-Base 4 months ago

Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.

#27 opened 4 months ago by

phil111

New activity in matteogeniaccio/phi-4 5 months ago

🚩 Report: Legal issue(s)

#9 opened 5 months ago by

nightflightdk

Notably better than Phi3.5 in many ways, but something is wrong.

#5 opened 5 months ago by

phil111

New activity in mradermacher/smolchess-v2-GGUF 6 months ago

How do you quantitize that so quickly?

#1 opened 6 months ago by

nlpguy

New activity in nlpguy/StableProse 8 months ago

Adding Evaluation Results

#1 opened 8 months ago by

leaderboard-pr-bot

New activity in PocketDoc/Dans-MemoryCore-CoreCurriculum-Small 8 months ago

Was this dataset created with Claude Sonnet 3 or 3.5?

#2 opened 8 months ago by

nlpguy

New activity in open-llm-leaderboard/open_llm_leaderboard 8 months ago

leaderboard should be more curated

#908 opened 8 months ago by

ehartford

New activity in black-forest-labs/FLUX.1-schnell 9 months ago

Licence issue

#55 opened 9 months ago by

Ayaz550

New activity in open-llm-leaderboard/open_llm_leaderboard 9 months ago

Model Failed: StableProse

#894 opened 9 months ago by

nlpguy

New activity in v000000/MN-12B-Estrella-v1 9 months ago

would you consider publishing the intermediate models from step 1 and 2

#1 opened 9 months ago by

nlpguy

New activity in open-llm-leaderboard/open_llm_leaderboard 10 months ago

Voting System: You can vote for your own model.

#851 opened 10 months ago by

nlpguy

Submitted models aren't showing up

#835 opened 10 months ago by

Stark2008