3 1 106

Olamedia

olamedia

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

Lightricks/LTX-Video

liked a model 7 days ago

RiverZ/normal-lora

liked a model 8 days ago

nvidia/parakeet-tdt-0.6b-v2

View all activity

Organizations

None yet

olamedia's activity

liked 2 models 7 days ago

Lightricks/LTX-Video

Text-to-Video • Updated 2 days ago • 291k • • 1.49k

RiverZ/normal-lora

Image-to-Image • Updated 13 days ago • 25.5k • 33

liked a model 8 days ago

nvidia/parakeet-tdt-0.6b-v2

Automatic Speech Recognition • Updated 1 day ago • 167k • 874

liked a model 10 days ago

mradermacher/Kevin-32B-GGUF

Updated 10 days ago • 530 • 3

liked 4 models 12 days ago

liked a model 15 days ago

moonshotai/Kimi-Audio-7B-Instruct

Text-to-Speech • Updated 4 days ago • 4.41k • 306

liked a model 18 days ago

mradermacher/The-Omega-Directive-Qwen3-14B-v1.1-GGUF

Updated 18 days ago • 4.46k • 3

reacted to merterbak's post with 🔥 18 days ago

Post

4837

Qwen 3 models released🔥
It offers 2 MoE and 6 dense models with following parameter sizes: 0.6B, 1.7B, 4B, 8B, 14B, 30B(MoE), 32B, and 235B(MoE).
Models: Qwen/qwen3-67dd247413f0e2e4f653967f
Blog: https://qwenlm.github.io/blog/qwen3/
Demo: Qwen/Qwen3-Demo
GitHub: https://github.com/QwenLM/Qwen3

✅ Pre-trained 119 languages(36 trillion tokens) and dialects with strong translation and instruction following abilities. (Qwen2.5 was pre-trained on 18 trillion tokens.)
✅Qwen3 dense models match the performance of larger Qwen2.5 models. For example, Qwen3-1.7B/4B/8B/14B/32B perform like Qwen2.5-3B/7B/14B/32B/72B.
✅ Three stage done while pretraining:
• Stage 1: General language learning and knowledge building.
• Stage 2: Reasoning boost with STEM, coding, and logic skills.
• Stage 3: Long context training
✅ It supports MCP in the model
✅ Strong agent skills
✅ Supports seamless between thinking mode (for hard tasks like math and coding) and non-thinking mode (for fast chatting) inside chat template.
✅ Better human alignment for creative writing, roleplay, multi-turn conversations, and following detailed instructions.