Nellyw888/VeriReason-codeLlama-7b-RTLCoder-Verilog-GRPO-reasoning-tb • Reinforcement Learning • Updated 3 days ago • 1.51k • 1
Nellyw888/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb • Reinforcement Learning • Updated 3 days ago • 1.54k • 3