Krishna Kaasyap

KrishnaKaasyap

AI & ML interests

Test Time Training Multimodal & Inter-Modality Transfer Learning Mechanistic Interpretability Evolutionary Model Merging Swarm Intelligence of multiple models with different architectures and different algorithms MuZero approach to general tasks

Recent Activity

upvoted a collection 9 days ago
Qwen2.5-Omni
upvoted a collection 9 days ago
Qwen3
liked a model 15 days ago
nari-labs/Dia-1.6B
View all activity

Organizations

Blog-explorers's profile picture

KrishnaKaasyap's activity

upvoted an article 9 months ago
view article
Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

• 232