Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
9 days ago
sand-ai/MAGI-1
liked
a model
about 1 month ago
Qwen/Qwen2.5-VL-32B-Instruct
liked
a dataset
about 1 month ago
starvector/text2svg-stack
Organizations
Collections
3
models
14

peakji/qwen2.5-coder-7b-awq
Updated
•
91

peakji/steiner-32b-preview-gguf
Updated
•
76
•
22

peakji/steiner-32b-preview-awq
Updated
•
4
•
4

peakji/steiner-32b-preview
Updated
•
14
•
92

peakji/peak-reasoning-7b-gguf
Updated
•
164
•
4

peakji/peak-reasoning-7b-awq
Updated

peakji/peak-reasoning-7b
Updated

peakji/qwen2.5-72b-instruct-trim
Updated

peakji/qwen2.5-32b-instruct-trim
Updated

peakji/qwen2.5-14b-instruct-trim
Updated
datasets
8
peakji/peak-text-with-context-2m
Viewer
•
Updated
•
2.07M
•
146
peakji/peak-anchor-content-plain-20k
Viewer
•
Updated
•
20.1k
•
115
peakji/peak-search-content-plain-40k
Viewer
•
Updated
•
40.4k
•
40
peakji/peak-anchor-content-35k
Viewer
•
Updated
•
35.6k
•
82
peakji/peak-search-content-70k
Viewer
•
Updated
•
70.2k
•
118
peakji/peak-anchor-40k
Viewer
•
Updated
•
42.7k
•
152
peakji/peak-search-300k
Viewer
•
Updated
•
312k
•
104
peakji/peak-intent-50
Viewer
•
Updated
•
265k
•
60