Idea Transformer: Infinity
Idea Transformer: Infinity is an innovative tool that unlocks infinite creativity by generating unique transformation ideas and design images from up to three keywords and a chosen category. Leveraging a state-of-the-art diffusion pipeline, real-time translation, and a powerful LLM, it delivers fresh ideas every time.
openfree/Idea-Transformer
Key Features
Diverse Ideas:
Randomly selects creative variations from your keywords and category; the possibilities are nearly endless!
Unique Design Images:
Your text prompt produces striking, varied design images via the diffusion model.
Real-Time Translation & Expansion:
Korean inputs are automatically translated and enriched using an advanced LLM for high-quality output.
Dual-Language Support:
Enjoy an intuitive Gradio interface with separate English and Korean tabs for a global audience.
Explore a Wide Range of Categories:
Sensor Functions: Creative changes in sensor technologies.
Size & Shape Change: Ideas altering physical dimensions and forms.
Surface & Appearance Change: Transformations in color, texture, and visual effects.
Material State Change: Transitions between different material states.
Movement Characteristics Change: Innovations in motion, speed, and vibration.
Structural Change: Reconfigurations via assembly/disassembly and design modifications.
Spatial Movement: Ideas on repositioning and directional shifts.
Time-Related Change: Concepts influenced by aging, wear, and lifecycle.
Light & Visual Effects: Alterations in illumination, transparency, and holographic effects.
Sound & Vibration Effects: Innovations in auditory and vibrational dynamics.
Business Ideas: Strategies for market redefinition, business model innovation, and more.
Why Choose Idea Transformer?
Infinite Creativity & Cutting-Edge Technology: Your keywords and randomized transformations produce an endless stream of unique ideas!
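The keyword-plus-category mechanic described above can be approximated in a few lines: pick a random transformation from the chosen category's pool and attach it to the keywords. Everything below (the category table, variation strings, and function name) is a hypothetical illustration, not the Space's actual code.

```python
import random

# Hypothetical sketch, not the openfree/Idea-Transformer implementation:
# each category maps to a pool of candidate transformations.
CATEGORY_VARIATIONS = {
    "Size & Shape Change": ["expands tenfold", "folds completely flat", "morphs into a sphere"],
    "Light & Visual Effects": ["turns transparent", "glows from within", "projects a hologram"],
}

def generate_idea(keywords, category, rng=random):
    """Combine up to three keywords with a random variation from the category."""
    if not 1 <= len(keywords) <= 3:
        raise ValueError("provide one to three keywords")
    variation = rng.choice(CATEGORY_VARIATIONS[category])
    return f"A {' '.join(keywords)} that {variation}."

idea = generate_idea(["solar", "backpack"], "Size & Shape Change")
```

The resulting prompt string would then be handed to the diffusion pipeline to render the design image.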

KaiChen1998
posted an update
about 15 hours ago
Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)!
EMOVA is a novel end-to-end omni-modal LLM that can see, hear, and speak. Given omni-modal (i.e., textual, visual, and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional control by utilizing the speech decoder and a style controller.
EMOVA Highlights
State-of-the-art omni-modality: EMOVA achieves results comparable to the state of the art on both vision-language and speech benchmarks simultaneously.
Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)!
Modular design: we integrate multiple implementations of the vision encoder, vision projector, and language model.
You are all welcome to try and star!
- Project page: https://emova-ollm.github.io/
- Github: https://github.com/emova-ollm/EMOVA
- Demo: Emova-ollm/EMOVA-demo
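The modular design highlighted above (swappable vision encoders, projectors, and language models) is commonly implemented with a component registry. The sketch below illustrates that generic pattern only; it is not EMOVA's actual code, and all class and function names are hypothetical.

```python
# Generic registry pattern for swappable components (hypothetical illustration).
ENCODERS = {}

def register_encoder(name):
    """Class decorator that records an encoder implementation under a name."""
    def wrap(cls):
        ENCODERS[name] = cls
        return cls
    return wrap

@register_encoder("clip")
class ClipEncoder:
    def encode(self, image):
        return f"clip-features({image})"

@register_encoder("siglip")
class SiglipEncoder:
    def encode(self, image):
        return f"siglip-features({image})"

def build_encoder(name):
    # A config file would typically supply `name`, so swapping
    # implementations is a one-line config change.
    return ENCODERS[name]()

features = build_encoder("siglip").encode("img.png")
```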

nroggendorff
posted an update
2 days ago
This is the most exciting of this week's releases for me: Gemini Robotics, a SOTA generalist Vision-Language-Action model that brings intelligence to the physical world. It comes with a verifiable real-world-knowledge Embodied Reasoning QA benchmark. The cool part is that the model can be specialized with fast adaptation to new tasks, and such adaptations can be transferred to new robot embodiments like humanoids. Looking forward to the model and data on HF; it's about time I go full physical :)
Technical Report: https://storage.googleapis.com/deepmind-media/gemini-robotics/gemini_robotics_report.pdf
Hello community,
I want to share my work on creating a reasoning Mamba model.
I used GRPO over Falcon3 Mamba Instruct to make this model. It generates blazing-fast responses while building sound logic to answer challenging questions.
Give it a try:
Model repo: hanzla/Falcon3-Mamba-R1-v0
Space: hanzla/Falcon3MambaReasoner
Looking forward to community feedback.

DualityAI-RebekahBogdanoff
posted an update
1 day ago
Think building custom digital twins for AI training is hard? Let us show you how to make it easy!
Next week, Duality AI is offering a free "Creating Your Own 4-Wheeled Vehicle Digital Twins for AI Training with Falcon Editor" live class.
Sign up here: https://forms.gle/2U5xugMjvSkZdeaR8
What we'll cover:
Import & Configure a rigged 4-wheeled vehicle and transform it into a controllable system twin using Blueprints.
Enable Dynamic Control by exposing Python variables for real-time adjustments.
Attach Sensors to capture valuable simulation data.
Assemble & Run a Simulation Scenario to generate training data for AI & robotics applications.
See how Falcon creates synthetic data for faster, easier, and more targeted AI training by creating a FREE account here: https://www.duality.ai/edu
I've published an article showing five ways to use Langfuse with Hugging Face.
My personal favorite is Method #4: Using Hugging Face Datasets for Langfuse Dataset Experiments. This lets you benchmark your LLM app or AI agent with a dataset hosted on Hugging Face. In this example, I chose the GSM8K dataset (openai/gsm8k) to test the mathematical reasoning capabilities of my smolagent :)
Link to the Article here on HF: https://huggingface.co/blog/MJannik/hugging-face-and-langfuse
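For context on scoring such an experiment: GSM8K stores its gold answer after a `####` marker, so a minimal evaluator only needs to compare final numbers. Below is a hedged sketch of that scoring step; the function names are mine, not Langfuse's or the article's API.

```python
# Minimal GSM8K-style scorer (illustrative; not a Langfuse API).
def extract_final_answer(text):
    """Return the number after the last '####' marker, or None if absent."""
    if "####" not in text:
        return None
    return text.rsplit("####", 1)[1].strip().replace(",", "")

def exact_match(model_output, reference):
    """Score 1 if the model's final number equals the dataset's gold answer."""
    return extract_final_answer(model_output) == extract_final_answer(reference)

score = exact_match("... so the total is #### 42", "She sells 6 eggs ... #### 42")
```

In a dataset experiment, a function like `exact_match` would run once per dataset item and the per-item scores would be logged alongside the traces.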

burtenshaw
posted an update
2 days ago
Still speed-running Gemma 3 to think. Today I focused on setting up GPU-poor hardware to run GRPO.
This is a plain TRL and PEFT notebook that works on Apple silicon Macs or a Colab T4. It uses the 1B variant of Gemma 3 and a reasoning version of the GSM8K dataset.
There's more still in the oven, like releasing models, an Unsloth version, and deeper tutorials, but hopefully this should bootstrap your projects.
Here's a link to the 1B notebook: https://colab.research.google.com/drive/1mwCy5GQb9xJFSuwt2L_We3eKkVbx2qSt?usp=sharing
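As a rough illustration of what GRPO optimizes in a setup like this, here is a toy reward function for verifiable math answers. It assumes a completion format where the final answer follows an `Answer:` tag; this is a hypothetical sketch, not the notebook's actual reward.

```python
# Toy GRPO-style reward for verifiable answers (illustrative assumption:
# completions end with "Answer: <value>").
def reward_fn(completion, reference):
    """1.0 for a correct final answer, plus 0.1 if any reasoning precedes it."""
    if "Answer:" not in completion:
        return 0.0
    reasoning, answer = completion.rsplit("Answer:", 1)
    correctness = 1.0 if answer.strip() == reference else 0.0
    bonus = 0.1 if reasoning.strip() else 0.0
    return correctness + bonus

r = reward_fn("3 + 4 = 7, so Answer: 7", "7")
```

GRPO then compares rewards across a group of sampled completions for the same prompt, pushing the policy toward the higher-scoring ones.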
With the recent release of Gemma-3, if you are interested in playing with textual chain-of-thought, the notebook below is a wrapper over the model (native transformers inference API) for passing a predefined schema of prompts in batching mode.
https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_gemma_3.ipynb
Limitation: the schema supports text only (for now), while Gemma-3 itself is text+image-to-text.
Model: google/gemma-3-1b-it
Provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/transformers_gemma3.py
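As a rough sketch of the "predefined schema of prompts in batching mode" idea: fill one template per record, then split the filled prompts into fixed-size batches for the model. The template text and function names below are illustrative, not the wrapper's actual API.

```python
# Illustrative prompt-schema batching (not the nlp-thirdgate API).
SCHEMA = "Classify the sentiment of the text.\nText: {text}\nSentiment:"

def build_batches(records, batch_size=2):
    """Fill the schema for each record, then chunk prompts into batches."""
    prompts = [SCHEMA.format(**r) for r in records]
    return [prompts[i:i + batch_size] for i in range(0, len(prompts), batch_size)]

batches = build_batches([{"text": "great"}, {"text": "bad"}, {"text": "okay"}])
```

Each batch would then go through the model's generate call in one pass, which is where the throughput gain over per-prompt inference comes from.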