Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
48
4
160
sometimesanotion
PRO
sometimesanotion
Follow
21world's profile picture
jrruethe's profile picture
francoj's profile picture
101 followers
·
120 following
https://ko-fi.com/sometimesanotion
AI & ML interests
Agentic LLM services, model merging, finetunes, distillation
Recent Activity
new
activity
about 19 hours ago
sometimesanotion/Lamarck-14B-v0.7:
Excellent model!
liked
a model
7 days ago
kalomaze/Qwen3-16B-A3B
posted
an
update
8 days ago
The capabilities of the new Qwen 3 models are fascinating, and I am watching that space! My experience, however, is that context management is vastly more important with them. If you use a client with a typical session log with rolling compression, a Qwen 3 model will start to generate the same messages over and over. I don't think that detracts from them. They're optimized for a more advanced MCP environment. I honestly think the 8B is optimal for home use, given proper RAG/CAG. In typical session chats, Lamarck and Chocolatine are still my daily drives. I worked hard to give Lamarck v0.7 a sprinkling of CoT from both DRT and Deepseek R1. While those models got surpassed on the leaderboards, in practice, I still really enjoy their output. My projects are focusing on application and context management, because that's where the payoff in improved quality is right now. But should there be a mix of finetunes to make just the right mix of - my recipes are standing by.
View all activity
Organizations
sometimesanotion
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
7 days ago
kalomaze/Qwen3-16B-A3B
Updated
8 days ago
•
957
•
69
liked
a model
8 days ago
huihui-ai/Qwen3-14B-abliterated
Text Generation
•
Updated
7 days ago
•
382
•
16
liked
a model
10 days ago
huihui-ai/Qwen3-8B-abliterated
Text Generation
•
Updated
7 days ago
•
204
•
8
liked
2 models
about 1 month ago
allura-org/Gemma-3-Glitter-12B
Image-Text-to-Text
•
Updated
Mar 28
•
239
•
•
16
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
10 days ago
•
187k
•
1.58k
liked
a model
about 2 months ago
google/gemma-3-12b-it
Image-Text-to-Text
•
Updated
Mar 21
•
342k
•
•
349
liked
14 models
2 months ago
OpenLLM-France/Lucie-7B-Instruct-v1.1
Text Generation
•
Updated
Mar 21
•
13.1k
•
8
Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
Text Generation
•
Updated
Mar 9
•
14
•
2
TimeLordRaps/DS-R1-Lamarckvergence-14B-1M-test3
Text Generation
•
Updated
Mar 1
•
6
•
1
microsoft/Phi-4-mini-instruct
Text Generation
•
Updated
8 days ago
•
444k
•
466
YOYO-AI/Qwen2.5-14B-YOYO-V4-p2
Text Generation
•
Updated
Mar 3
•
9
•
2
Lunzima/NQLSG-Qwen2.5-14B-OriginalFusion
Text Generation
•
Updated
Mar 1
•
8
•
2
Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7
Text Generation
•
Updated
Mar 9
•
10
•
3
wanlige/li-14b-v0.4-slerp0.1
Text Generation
•
Updated
Mar 5
•
73
•
6
CultriX/Qwen2.5-14B-GeneralReasoning
Text Generation
•
Updated
Feb 18
•
14
•
2
wanlige/li-14b-v0.4
Text Generation
•
Updated
10 days ago
•
100
•
•
17
CultriX/Qwen2.5-14B-ReasoningMerge
Text Generation
•
Updated
Feb 18
•
20
•
3
YOYO-AI/Qwen2.5-14B-YOYO-V3
Text Generation
•
Updated
Mar 22
•
18
•
4
mlx-community/Lamarck-14B-v0.7-6bit
Text Generation
•
Updated
Feb 19
•
7
•
1
mlx-community/Lamarck-14B-v0.7-4bit
Text Generation
•
Updated
Feb 19
•
7
•
1
Load more