Bur

KSa

AI & ML interests

None yet

Organizations

None yet

KSa's activity

upvoted a paper 6 days ago

PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer

Paper • 2505.04622 • Published 7 days ago • 25

liked a Space 6 days ago

PrimitiveAnything

Convert 3D models into primitive assemblies

upvoted a collection 6 days ago

OpenVision

27 items • Updated 6 days ago • 24

upvoted a paper 6 days ago

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published 7 days ago • 20

upvoted 2 papers 14 days ago

TesserAct: Learning 4D Embodied World Models

Paper • 2504.20995 • Published 15 days ago • 20

The Leaderboard Illusion

Paper • 2504.20879 • Published 16 days ago • 68

upvoted 2 papers about 1 month ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 124

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 84

upvoted 6 papers 4 months ago

SEAL: Entangled White-box Watermarks on Low-Rank Adaptation

Paper • 2501.09284 • Published Jan 16 • 10

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14 • 66

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 27

GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking

Paper • 2501.02690 • Published Jan 5 • 17

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published Jan 2 • 55

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107

upvoted 2 papers 6 months ago

Patience Is The Key to Large Language Model Reasoning

Paper • 2411.13082 • Published Nov 20, 2024 • 7

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Paper • 2411.11922 • Published Nov 18, 2024 • 19

upvoted 4 papers 8 months ago

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published Sep 19, 2024 • 16

Vista3D: Unravel the 3D Darkside of a Single Image

Paper • 2409.12193 • Published Sep 18, 2024 • 10

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 147

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 78