Models and Datasets
Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
updated
a model
7 days ago
togethercomputer/M1-3B
updated
a collection
7 days ago
M1
updated
a model
24 days ago
JunxiongWang/M1-3B
Organizations
Collections
9
models
52
JunxiongWang/M1-3B
Text Generation
•
Updated
•
1.27k
JunxiongWang/M1-3B-SFT
Text Generation
•
Updated
•
10
•
1
JunxiongWang/MambaInLlama1B_SFT_MATH
Updated
JunxiongWang/MambaInLlama3B_SFT_MATH
Updated
•
143
JunxiongWang/MambaInLlama3B_DPO2
Updated
JunxiongWang/MambaInLlama3B_DPO1
Updated
JunxiongWang/open_instruct_dev
Updated
JunxiongWang/MambaInLlama3B_Distill_MATH
Updated
•
2
JunxiongWang/MambaInLlama3B_v3
Updated
JunxiongWang/MambaInLlama1B_Distill_MATH
Updated
•
28
datasets
17
JunxiongWang/R1_GR_SFT
Viewer
•
Updated
•
44k
•
76
JunxiongWang/R1_SFT
Updated
•
50
JunxiongWang/R1_Sythetic_SFT
Viewer
•
Updated
•
1M
•
61
JunxiongWang/MATH_SFT
Viewer
•
Updated
•
19.1M
•
163
JunxiongWang/qwen1b_it_math
Viewer
•
Updated
•
19.1M
•
314
JunxiongWang/test_math
Viewer
•
Updated
•
89.1k
•
155
JunxiongWang/FineMathV4
Viewer
•
Updated
•
6.7M
•
135
JunxiongWang/MATH-V3
Viewer
•
Updated
•
14M
•
5
JunxiongWang/model_revision_max_4_closest_and_random
Viewer
•
Updated
•
530k
•
28
JunxiongWang/sftdatasetv4
Viewer
•
Updated
•
4.96M
•
60