ZeroGPU Explorers

community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

zero-gpu-explorers's activity

AtAndDev 
posted an update about 14 hours ago
view post
Post
417
There seems to multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please dont do this, or try finetuning the models you use...
Sorry for filling yall feed with this bs but yk...
AtAndDev 
posted an update 4 days ago
view post
Post
1451
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
not-lain 
posted an update 4 days ago
dreamerdeo 
posted an update 26 days ago
view post
Post
2788
🚀 Excited to share our technical report on the Southeast Asian multilingual model Sailor2 and its latest updates!

Our 49-page report details Sailor2's development journey, including multilingual data cleaning, small model data mixture simulations, multi-stage continual pre-training, multi-stage post-training, and multi-cultural multi-lingual evaluations. Sailor2 aims to streamline the multilingual model pre-training process efficiently for the community.

🧭 We highlight Sailor2's impressive performance in low-resource language translation scenarios and its cultural understanding advantages in Southeast Asia, promoting practical applications for regional languages.

Model updates include: 
💡 More precise outputs: Reduced redundancy in model outputs through refined post-training data and optimization techniques. 
🌈 Handling longer texts: Expanded to handle up to 128K context length in Southeast Asian languages through long-text training. 
⚡️ Faster inference: Achieved 2.5x faster inference speed with speculative decoding. 
🌪️ More model sizes: Introduced new sizes of 3B and 14B through model pruning.

🌟 All models are Apache-licensed for commercial use; development tools (code, resources) are open-source.

📚 Technical report: Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs (2502.12982) 
🤖️ Models: sail/sailor2-language-models-674d7c9e6b4dbbd9a869906b 
💬 Demo: sail/Sailor2-20B-Chat 
📣 Sailor2 community: https://huggingface.co/sailor2
AtAndDev 
posted an update 29 days ago
lukmanaj 
posted an update about 1 month ago
view post
Post
2891
I am excited to share that I’ve successfully completed Unit 1: Foundations of Agents in the Hugging Face Agents Course.
Exploring the fundamentals of AI agents has been an insightful journey, and I’m looking forward to applying these concepts in real-world applications.
Big thanks to the Hugging Face team for this amazing learning opportunity! 🤗
Check out the course here: https://huggingface.co/learn/agents-course/
  • 2 replies
·
julien-c 
in zero-gpu-explorers/README about 1 month ago

Update README.md

1
#152 opened about 1 month ago by
fdaudens
fdaudens 
updated a Space about 1 month ago
fdaudens 
in zero-gpu-explorers/README about 1 month ago

Update README.md

1
#152 opened about 1 month ago by
fdaudens
not-lain 
posted an update about 2 months ago
AtAndDev 
posted an update about 2 months ago
view post
Post
1892
everywhere i go i see his face
AtAndDev 
posted an update about 2 months ago
view post
Post
533
Deepseek gang on fire fr fr
AtAndDev 
posted an update about 2 months ago
view post
Post
1614
R1 is out! And with a lot of other R1 releated models...
not-lain 
posted an update about 2 months ago
view post
Post
1660
we now have more than 2000 public AI models using ModelHubMixin🤗
mlabonne 
posted an update about 2 months ago
view post
Post
6182
🆕 LLM Course 2025 edition!

I updated the LLM Scientist roadmap and added a ton of new information and references. It covers training, datasets, evaluation, quantization, and new trends like test-time compute scaling.

The LLM Course has been incredibly popular (41.3k stars!) and I've been touched to receive many, many messages about how it helped people in their careers.

I know how difficult this stuff can be, so I'm super proud of the impact it had. I want to keep updating it in 2025, especially with the LLM Engineer roadmap.

Thanks everyone, hope you'll enjoy it!

💻 LLM Course: https://huggingface.co/blog/mlabonne/llm-course