Foundational Deep Learning - Architecture - a YedsonUQ Collection

YedsonUQ 's Collections

Foundational Deep Learning - Architecture

AI-Automated Scientific Research

Benchmark and Evaluation

Distributed Training and Federated Learning

Explainable AI - Interpretable AI

Learning Paradigm/Scheme

Models

Reinforcement Learning (RL)

Retrieval Augmented Generation (RAG)

Uncertainty Quantification

Survey

Foundational Deep Learning - Architecture

updated 4 days ago

Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published 17 days ago • 27
L^2M: Mutual Information Scaling Law for Long-Context Language Modeling

Paper • 2503.04725 • Published 14 days ago • 19
Transformers without Normalization

Paper • 2503.10622 • Published 7 days ago • 128