ajagota71/pythia-70m-detox-irl-rlhf-test-facebook-filter • Reinforcement Learning • Updated about 18 hours ago
ajagota71/ajagota71_pythia-1b-detox-epoch-100_2000_samples_detoxified • Viewer • Updated 2 days ago • 2k • 20
ajagota71/ajagota71_pythia-160m-detox-epoch-100_2000_samples_detoxified • Viewer • Updated 3 days ago • 2k • 45
ajagota71/ajagota71_pythia-70m-detox-epoch-100_2000_samples_detoxified • Viewer • Updated 3 days ago • 2k • 43
ajagota71/ajagota71_pythia-70m-detox-epoch-100_500_samples_detoxified • Viewer • Updated 6 days ago • 500 • 38
ajagota71/ajagota71_pythia-160m-detox-epoch-60_2000_samples_detoxified • Viewer • Updated 6 days ago • 2k • 28
ajagota71/ajagota71_pythia-160m-detox-epoch-20_2000_samples_detoxified • Viewer • Updated 6 days ago • 2k • 27