BOUKOUFFALLAH Abdallah

iBado

Abdellahbado

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Large Language Models are Locally Linear Mappings

upvoted a paper 10 days ago

Text Generation Beyond Discrete Token Sampling

upvoted a paper 13 days ago

Reward Reasoning Model

View all activity

Organizations

None yet

iBado's activity

upvoted a paper 1 day ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published 4 days ago • 11

upvoted a paper 10 days ago

Text Generation Beyond Discrete Token Sampling

Paper • 2505.14827 • Published 14 days ago • 10

upvoted a paper 13 days ago

Reward Reasoning Model

Paper • 2505.14674 • Published 14 days ago • 34

upvoted 2 papers 14 days ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 17 days ago • 114

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published 15 days ago • 78

upvoted a paper 16 days ago

Transformer Interpretability Beyond Attention Visualization

Paper • 2012.09838 • Published Dec 17, 2020 • 1

upvoted a paper 17 days ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 19 days ago • 117

upvoted a paper 25 days ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published 27 days ago • 173

upvoted 2 papers 27 days ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 28 days ago • 168

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published 28 days ago • 92

upvoted a collection about 1 month ago

Qwen3

Collection

40 items • Updated 13 days ago • 722

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 93

upvoted a collection about 1 month ago

Reasoning, Thinking, RL and Test-Time Scaling

Collection

147 items • Updated Apr 24 • 11

upvoted a paper about 1 month ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99

upvoted a collection about 1 month ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated Apr 28 • 81

upvoted 2 papers about 1 month ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 44

Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published Apr 21 • 120

upvoted 2 papers about 2 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 284

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 141

upvoted a paper 3 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 46