FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness Paper • 2205.14135 • Published May 27, 2022 • 13
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods Jan 18, 2024 • 56