Diwank Tomer's picture

Diwank Tomer PRO

diwank

·

https://diwank.name

AI & ML interests

None yet

Recent Activity

updated a collection about 11 hours ago

liked a model about 11 hours ago

declare-lab/nora-long

upvoted a collection about 11 hours ago

View all activity

Organizations

diwank's activity

upvoted a collection about 11 hours ago

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 4 items • Updated about 12 hours ago • 28

upvoted a paper 6 days ago

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Paper • 2504.16080 • Published 11 days ago • 15

upvoted an article 15 days ago

Article

Introducing HELMET

18 days ago

• 23

upvoted a collection 30 days ago

MegaPairs

6 items • Updated 18 days ago • 8

upvoted 4 collections about 1 month ago

OneSQL-v0.1-Qwen

Text-to-SQL model • 15 items • Updated about 1 month ago • 5

LipSync and Face Operations

19 items • Updated 4 days ago • 49

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 5 items • Updated 4 days ago • 105

Granite Speech

3 items • Updated 1 day ago • 10

upvoted a paper about 1 month ago

Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning

Paper • 2503.18013 • Published Mar 23 • 19

upvoted a collection about 1 month ago

Utilities

No crazy stuff, but useful ones for in-between steps • 16 items • Updated Mar 19 • 6

upvoted a paper about 1 month ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 123

upvoted 4 papers about 2 months ago

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published Mar 10 • 45

Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling

Paper • 2503.08605 • Published Mar 11 • 26

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10 • 47

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published Feb 27 • 28

upvoted a collection about 2 months ago

Wan2.1 14B 480p I2V LoRAs

A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 39 items • Updated Apr 1 • 109

upvoted 3 papers about 2 months ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7 • 58

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published Mar 7 • 37