Jens Roland

FreeHugsForRobots

AI & ML interests

None yet

Recent Activity

liked a model 16 days ago
agentica-org/DeepCoder-14B-Preview
liked a model 6 months ago
Qwen/Qwen2.5-Coder-32B-Instruct
liked a model 7 months ago
black-forest-labs/FLUX.1-schnell

Organizations

None yet

FreeHugsForRobots's activity

upvoted an article 10 months ago
Article

Our Transformers Code Agent beats the GAIA benchmark!

• 84
replied to DmitryRyumin's post 12 months ago
reacted to DmitryRyumin's post with 🔥 12 months ago
Post
2222
🔥🚀🌟 New Research Alert - xLSTM! 🌟🚀🔥
📄 Title: xLSTM: Extended Long Short-Term Memory

📝 Description: xLSTM is a scaled-up LSTM architecture with exponential gating and modified memory structures to mitigate known limitations. xLSTM blocks outperform SOTA transformers and state-space models in performance and scaling.

👥 Authors: Maximilian Beck et al.

📄 Paper: xLSTM: Extended Long Short-Term Memory (2405.04517)

📁 Repository: https://github.com/NX-AI/xlstm

📚 More Papers: more cutting-edge research presented at other conferences is available in the DmitryRyumin/NewEraAI-Papers collection, curated by @DmitryRyumin

🔍 Keywords: #xLSTM #DeepLearning #Innovation #AI
  • 1 reply