7b tulu 2.5 - a hamishivi Collection

updated Mar 4

A small PPO run at 7B scale, following the "Unpacking DPO and PPO" paper.

hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm

Text Generation • Updated Jun 25, 2024 • 7

hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm-value

Token Classification • Updated Jun 25, 2024 • 1

hamishivi/tulu-v2.5-7b-uf-rm

Text Classification • Updated Jun 25, 2024 • 4