NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos • Reinforcement Learning • Updated about 24 hours ago • 2 • 2