tarsur909/pythia1b-oai-summary-ppo-1ep-translated-seperated Text Generation • Updated 2 days ago • 40
tarsur909/pythia1b-oai-summary-rm-10ep-seperated-translated Text Classification • Updated 3 days ago • 5
tarsur909/pythia1b-oai-summary-ppo-1ep-translated-seperated_new Text Generation • Updated 13 days ago • 3
tarsur909/pythia1b-oai-summary-rm-1ep-translated-seperated Text Classification • Updated 17 days ago • 10
tarsur909/summarize_sft-test_lm-pythia1b-oai-summary-ppo-1ep-translated-seperated_42_250_64 Viewer • Updated 2 days ago • 250 • 7
tarsur909/summarize_sft-test_lm-pythia1b-oai-summary-ppo-1ep-translated-seperated_42_250old Viewer • Updated 2 days ago • 250 • 5
tarsur909/rewards_negative_log-train-with-reward-stats-10ep-seperated-translated Viewer • Updated 3 days ago • 1k • 7