- Stabilizing RLHF through Advantage Model and Selective Rehearsal
  Paper • 2309.10202 • Published • 11
- Multimodal Foundation Models: From Specialists to General-Purpose Assistants
  Paper • 2309.10020 • Published • 41
- OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
  Paper • 2402.14658 • Published • 84
M1n9X
m1n9x
AI & ML interests
None yet
Recent Activity
Updated a dataset 19 days ago: Percena/lme-mc10
Liked a model 19 days ago: nari-labs/Dia-1.6B
Liked a Space 3 months ago: nanotron/ultrascale-playbook
Organizations
Collections: 1
Models: 0 (none public yet)
Datasets: 0 (none public yet)