Yi-Hao's picture

31 21

Yi-Hao

yihaopeng

·

https://www.yihaopeng.tw/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action

updated a collection 1 day ago

upvoted a collection 12 days ago

View all activity

Organizations

yihaopeng's activity

upvoted a paper about 7 hours ago

TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action

Paper • 2505.01583 • Published 4 days ago • 7

upvoted a collection 12 days ago

Perception LM

7 items • Updated 20 days ago • 39

upvoted a collection 25 days ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 25 days ago • 66

upvoted a paper about 1 month ago

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published Mar 21 • 36

upvoted a collection about 1 month ago

MambaVision

MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 1 day ago • 31

upvoted 3 collections about 2 months ago

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 93

SigLIP2

36 items • Updated Apr 3 • 69

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25 • 60

upvoted a collection 3 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 8 days ago • 461

upvoted a paper 4 months ago

AutoPresent: Designing Structured Visuals from Scratch

Paper • 2501.00912 • Published Jan 1 • 8

upvoted 2 papers 5 months ago

SketchAgent: Language-Driven Sequential Sketch Generation

Paper • 2411.17673 • Published Nov 26, 2024 • 19

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 52

upvoted a collection 5 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 8 days ago • 310

upvoted 2 collections 6 months ago

LLM2Vec

16 items • Updated Oct 8, 2024 • 46

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 1 day ago • 257

upvoted 3 collections 7 months ago

Pangea

A Fully Open Multilingual Multimodal LLM for 39 Languages • 26 items • Updated Feb 1 • 18

MulitUI

MultiUI: 7M multimodal UI instructions • 5 items • Updated Oct 19, 2024 • 7

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 6 days ago • 303

upvoted a paper 9 months ago

Generative Photomontage

Paper • 2408.07116 • Published Aug 13, 2024 • 21

upvoted a paper 10 months ago

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 72