Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning Paper • 2505.13866 • Published 15 days ago • 16
Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models Paper • 2406.12311 • Published Jun 18, 2024 • 7
FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation Paper • 2502.01068 • Published Feb 3 • 17