Kye Gomez

kye

https://discord.gg/qUtxnK2NMf

kyegomezb

AI & ML interests

Neuroscience, Behavior Science, Anti-Matter, Anti-Gravity propulsion

Recent Activity

liked a dataset about 15 hours ago

rajpurkarlab/ReXGradient-160K

upvoted a paper about 23 hours ago

MediAug: Exploring Visual Augmentation in Medical Imaging

kye's activity

upvoted a paper about 19 hours ago

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

Paper • 2504.20966 • Published 4 days ago • 19

upvoted 4 papers about 23 hours ago

upvoted 4 papers 2 days ago

Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think

Paper • 2504.20708 • Published 4 days ago • 19

Taming the Titans: A Survey of Efficient LLM Inference Serving

Paper • 2504.19720 • Published 5 days ago • 9

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published 4 days ago • 24

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published 4 days ago • 32

upvoted 4 papers 3 days ago

X-Fusion: Introducing New Modality to Frozen Large Language Models

Paper • 2504.20996 • Published 4 days ago • 10

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published 4 days ago • 43

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

Paper • 2504.20073 • Published 9 days ago • 8

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 4 days ago • 74

upvoted a paper 4 days ago

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

Paper • 2504.19162 • Published 6 days ago • 14

upvoted a paper 5 days ago

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Paper • 2504.18415 • Published 8 days ago • 40

upvoted 4 papers 6 days ago

Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark

Paper • 2504.16427 • Published 11 days ago • 16

DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs

Paper • 2504.17040 • Published 10 days ago • 13

Process Reward Models That Think

Paper • 2504.16828 • Published 10 days ago • 16

Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation

Paper • 2504.17207 • Published 10 days ago • 28

upvoted a paper 8 days ago

MixVPR: Feature Mixing for Visual Place Recognition

Paper • 2303.02190 • Published Mar 3, 2023 • 1