sometimesanotion PRO

sometimesanotion

https://ko-fi.com/sometimesanotion

AI & ML interests

Agentic LLM services, model merging, finetunes, distillation

Recent Activity

new activity about 19 hours ago

sometimesanotion/Lamarck-14B-v0.7:Excellent model!

liked a model 7 days ago

kalomaze/Qwen3-16B-A3B

posted an update 8 days ago

The capabilities of the new Qwen 3 models are fascinating, and I am watching that space! My experience, however, is that context management is vastly more important with them. If you use a client that keeps a typical session log with rolling compression, a Qwen 3 model will start to generate the same messages over and over. I don't think that detracts from them. They're optimized for a more advanced MCP environment. I honestly think the 8B is optimal for home use, given proper RAG/CAG. In typical session chats, Lamarck and Chocolatine are still my daily drivers. I worked hard to give Lamarck v0.7 a sprinkling of CoT from both DRT and Deepseek R1. While those models have been surpassed on the leaderboards, in practice I still really enjoy their output. My projects are focusing on application and context management, because that's where the payoff in improved quality is right now. But should a set of finetunes come along that makes for just the right mix, my recipes are standing by.

sometimesanotion's activity

liked a model 7 days ago

kalomaze/Qwen3-16B-A3B

Updated 8 days ago • 957 • 69

liked a model 8 days ago

huihui-ai/Qwen3-14B-abliterated

Text Generation • Updated 7 days ago • 382 • 16

liked a model 10 days ago

huihui-ai/Qwen3-8B-abliterated

Text Generation • Updated 7 days ago • 204 • 8

liked 2 models about 1 month ago

allura-org/Gemma-3-Glitter-12B

Image-Text-to-Text • Updated Mar 28 • 239 • 16

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 10 days ago • 187k • 1.58k

liked a model about 2 months ago

google/gemma-3-12b-it

Image-Text-to-Text • Updated Mar 21 • 342k • 349

liked 14 models 2 months ago

OpenLLM-France/Lucie-7B-Instruct-v1.1

Text Generation • Updated Mar 21 • 13.1k • 8

Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8

Text Generation • Updated Mar 9 • 14 • 2

TimeLordRaps/DS-R1-Lamarckvergence-14B-1M-test3

Text Generation • Updated Mar 1 • 6 • 1

microsoft/Phi-4-mini-instruct

Text Generation • Updated 8 days ago • 444k • 466

YOYO-AI/Qwen2.5-14B-YOYO-V4-p2

Text Generation • Updated Mar 3 • 9 • 2

Lunzima/NQLSG-Qwen2.5-14B-OriginalFusion

Text Generation • Updated Mar 1 • 8 • 2

Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7

Text Generation • Updated Mar 9 • 10 • 3

wanlige/li-14b-v0.4-slerp0.1

Text Generation • Updated Mar 5 • 73 • 6

CultriX/Qwen2.5-14B-GeneralReasoning

Text Generation • Updated Feb 18 • 14 • 2

wanlige/li-14b-v0.4

Text Generation • Updated 10 days ago • 100 • 17

CultriX/Qwen2.5-14B-ReasoningMerge

Text Generation • Updated Feb 18 • 20 • 3

YOYO-AI/Qwen2.5-14B-YOYO-V3

Text Generation • Updated Mar 22 • 18 • 4

mlx-community/Lamarck-14B-v0.7-6bit

Text Generation • Updated Feb 19 • 7 • 1

mlx-community/Lamarck-14B-v0.7-4bit

Text Generation • Updated Feb 19 • 7 • 1