Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
sequelbox 
posted an update 5 days ago
Post
268
EARLY RELEASE PREVIEW of Esper 3 for Qwen 3 8b!

- Reasoning finetune focused on coding, architecture, DevOps, and general reasoning
- Trained using DeepSeek-R1 685b synthetic data
- Official Apache 2.0 release coming soon on Valiant Labs: try out the preview for now and see what you think!

Try it out: sequelbox/Qwen3-8B-Esper3-PREVIEW

with my love,
allegra
In this post