gn00029914's picture

90 278

gn00029914

gn00029914

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Qwen2.5-Omni Technical Report

liked a Space 4 days ago

Qwen/Qwen2.5-Omni-7B-Demo

liked a model 4 days ago

Qwen/Qwen2.5-Omni-7B

View all activity

Organizations

gn00029914's activity

upvoted a paper 4 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 150

upvoted 2 papers 5 days ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published 8 days ago • 37

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published 8 days ago • 34

upvoted a collection 5 days ago

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 13 items • Updated 6 days ago • 141

upvoted a paper 9 days ago

YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 71

upvoted a collection 9 days ago

Qwen3

27 items • Updated about 3 hours ago • 541

upvoted a paper 14 days ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published 16 days ago • 42

upvoted a paper 16 days ago

Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation

Paper • 2504.08758 • Published Mar 30 • 3

upvoted a collection 18 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 20 days ago • 187

upvoted 2 papers 21 days ago

T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge

Paper • 2407.00088 • Published Jun 25, 2024 • 12

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

Paper • 2410.16144 • Published Oct 21, 2024 • 5

upvoted a collection 21 days ago

BitNet

🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated 7 days ago • 36

upvoted a collection 23 days ago

Cogito v1 Preview

5 items • Updated about 1 month ago • 108

upvoted an article about 1 month ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

Apr 5

• 142

upvoted a collection about 1 month ago

Cognition

Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend. • 200 items • Updated 23 days ago • 5

upvoted 2 papers about 1 month ago

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published Dec 30, 2024 • 23

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 50

upvoted a paper about 2 months ago

Mirostat: A Neural Text Decoding Algorithm that Directly Controls Perplexity

Paper • 2007.14966 • Published Jul 29, 2020 • 1