Nellyw888/VeriReason-codeLlama-7b-RTLCoder-Verilog-GRPO-reasoning-tb • Reinforcement Learning • Updated 3 days ago • 1.51k • 1
Nellyw888/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb • Reinforcement Learning • Updated 3 days ago • 1.54k • 3
CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs • Paper • 2505.13778 • Published 15 days ago • 4