vignesh yaadav

viai957

viai957

AI & ML interests

None yet

Recent Activity

liked a dataset 12 days ago

argilla/distilabel-math-preference-dpo

liked a Space 2 months ago

nanotron/ultrascale-playbook

upvoted a paper 3 months ago

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

View all activity

Organizations

None yet

viai957's activity

liked a dataset 12 days ago

argilla/distilabel-math-preference-dpo

Viewer • Updated Jul 16, 2024 • 2.42k • 325 • 86

liked a Space 2 months ago

2.55k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 3 months ago

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Paper • 2205.14135 • Published May 27, 2022 • 13

updated a model 6 months ago

viai957/LexiGenesis

Updated Oct 24, 2024

liked 2 models 7 months ago

unsloth/Qwen2.5-Math-72B-Instruct

Text Generation • Updated Sep 23, 2024 • 41 • 3

unsloth/Qwen2.5-Math-7B

Text Generation • Updated Nov 12, 2024 • 42 • 3

liked a dataset 7 months ago

open-llm-leaderboard/Qwen__Qwen2.5-Math-7B-Instruct-details

Viewer • Updated Feb 13 • 43.2k • 111 • 1

liked 2 models 7 months ago

unsloth/Qwen2.5-Math-72B-bnb-4bit

Text Generation • Updated Sep 23, 2024 • 10 • 1

Qwen/Qwen2.5-Math-RM-72B

Text Classification • Updated Oct 31, 2024 • 27.2k • 78

liked a model 9 months ago

m-a-p/OpenCodeInterpreter-DS-6.7B

Text Generation • Updated Mar 3, 2024 • 1.31k • 135

upvoted an article 9 months ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18, 2024

• 56

liked a model 10 months ago

Salesforce/blip-image-captioning-large

Image-to-Text • Updated Feb 3 • 2.48M • 1.32k

updated a model 12 months ago

viai957/CodeLlama_34b-SQL

Text Generation • Updated May 4, 2024 • 58 • 1

updated 2 models about 1 year ago

viai957/llama3-scratch

Updated May 1, 2024

viai957/vanillaTransformers

Updated Mar 20, 2024

liked a dataset about 1 year ago

togethercomputer/RedPajama-Data-V2

Updated Nov 21, 2024 • 3.12k • 358

updated a model about 1 year ago

viai957/code_contester-gemma-7b

Updated Feb 22, 2024

liked a dataset about 1 year ago

CohereLabs/aya_collection

Viewer • Updated 18 days ago • 514M • 20.7k • 221

liked 2 models about 1 year ago

stabilityai/stable-code-3b

Text Generation • Updated Jul 10, 2024 • 5.11k • 642

mistralai/Mixtral-8x7B-Instruct-v0.1

Text Generation • Updated Aug 19, 2024 • 495k • • 4.4k