John Smith's picture

John Smith PRO

John6666

AI & ML interests

None yet

Recent Activity

published a model about 5 hours ago
John6666/femix-hassakuxl-v212-sdxl
published a model about 5 hours ago
John6666/samlust-nsfw-v12-sdxl
View all activity

Organizations

open/ acc's profile picture Solving Real World Problems's profile picture FashionStash Group meeting's profile picture No More Copyright's profile picture

John6666's activity

reacted to as-cle-bert's post with ๐Ÿ‘ about 7 hours ago
view post
Post
284
Hey there, ๐—ถ๐—ป๐—ด๐—ฒ๐˜€๐˜-๐—ฎ๐—ป๐˜†๐˜๐—ต๐—ถ๐—ป๐—ด ๐˜ƒ๐Ÿญ.๐Ÿฌ.๐Ÿฌ just dropped with major changes:

โœ… Embeddings: now works with Sentence Transformers, Jina AI, Cohere, OpenAI, and Model2Vec
All powered via ๐—–๐—ต๐—ผ๐—ป๐—ธ๐—ถ๐—ฒโ€™๐˜€ ๐—”๐˜‚๐˜๐—ผ๐—˜๐—บ๐—ฏ๐—ฒ๐—ฑ๐—ฑ๐—ถ๐—ป๐—ด๐˜€.
No more local-only limitations ๐Ÿ™Œ
โœ… Vector DBs: now supports ๐—ฎ๐—น๐—น ๐—Ÿ๐—น๐—ฎ๐—บ๐—ฎ๐—œ๐—ป๐—ฑ๐—ฒ๐˜…-๐—ฐ๐—ผ๐—บ๐—ฝ๐—ฎ๐˜๐—ถ๐—ฏ๐—น๐—ฒ ๐—ฏ๐—ฎ๐—ฐ๐—ธ๐—ฒ๐—ป๐—ฑ๐˜€
Think: Qdrant, Pinecone, Weaviate, Milvus, etc.
No more bottlenecks๐Ÿ”“
โœ… File parsing: now plugs into any ๐—Ÿ๐—น๐—ฎ๐—บ๐—ฎ๐—œ๐—ป๐—ฑ๐—ฒ๐˜…-๐—ฐ๐—ผ๐—บ๐—ฝ๐—ฎ๐˜๐—ถ๐—ฏ๐—น๐—ฒ ๐—ฑ๐—ฎ๐˜๐—ฎ ๐—น๐—ผ๐—ฎ๐—ฑ๐—ฒ๐—ฟ
Using LlamaParse, Docling or your own setup? Youโ€™re covered.
Curious of knowing more? Try it out! ๐Ÿ‘‰ https://github.com/AstraBert/ingest-anything
reacted to ProCreations's post with ๐Ÿ‘€ about 7 hours ago
view post
Post
372
๐Ÿง  Post of the Day: Quantum AI โ€“ Your Thoughts + Our Take

Yesterday we asked: โ€œWhat will quantum computing do to AI?โ€
Big thanks to solongeran for this poetic insight:

โ€œQuantum computers are hard to run error-free. But once theyโ€™re reliable, AI will be there. Safer than the daily sunset. Shure โ€“ no more queues ;)โ€

๐Ÿš€ Our Take โ€“ What Quantum Computing Will Do to AI (by 2035)

By the time scalable, fault-tolerant quantum computers arrive, AI wonโ€™t just run faster โ€” itโ€™ll evolve in ways weโ€™ve never seen:

โธป

๐Ÿ”น 1. Huge Speedups in Optimization & Search
Why: Quantum algorithms like Groverโ€™s can cut down search and optimization times exponentially in some cases.
How: Theyโ€™ll power up tasks like hyperparameter tuning, decision-making in RL, and neural architecture search โ€” crunching what now takes hours into seconds.

โธป

๐Ÿ”น 2. Quantum Neural Networks (QNNs)
Why: QNNs can represent complex relationships more efficiently than classical nets.
How: They use entanglement and superposition to model rich feature spaces, especially useful for messy or high-dimensional data โ€” think drug discovery, finance, or even language structure.

โธป

๐Ÿ”น 3. Autonomous Scientific Discovery
Why: Quantum AI could simulate molecular systems that are impossible for classical computers.
How: By combining quantum simulation with AI exploration, we may unlock ultra-fast pathways to new drugs, materials, and technologies โ€” replacing years of lab work with minutes of computation.

โธป

๐Ÿ”น 4. Self-Evolving AI Architectures
Why: Future AI systems will design themselves.
How: Quantum processors will explore massive spaces of model variants in parallel, enabling AI to simulate, compare, and evolve new architectures โ€” fast, efficient, and with little trial-and-error.

โธป

โš›๏ธ The Takeaway:
Quantum computing wonโ€™t just speed up AI. Itโ€™ll open doors to new types of intelligence โ€” ones that learn, discover, and evolve far beyond todayโ€™s limits.
reacted to vincentg64's post with ๐Ÿ‘€ about 7 hours ago
view post
Post
356
How to Design LLMs that Donโ€™t Need Prompt Engineering https://mltblog.com/3GAbAQu



Standard LLMs rely on prompt engineering to fix problems (hallucinations, poor response, missing information) that come from issues in the backend architecture. If the backend (corpus processing) is properly built from the ground up, it is possible to offer a full, comprehensive answer to a meaningful prompt, without the need for multiple prompts, rewording your query, having to go through a chat session, or prompt engineering. In this article, I explain how to do it, focusing on enterprise corpuses. The strategy relies on four principles:

โžก๏ธ Exact and augmented retrieval
โžก๏ธ Showing full context in the response
โžก๏ธ Enhanced UI with option menu
โžก๏ธ Structured response as opposed to long text

I now explain these principles.

Read full article at https://mltblog.com/3GAbAQu

#xLLM #BondingAI #PromptEngineering
reacted to onekq's post with ๐Ÿ‘ about 21 hours ago
view post
Post
668
I didn't noticed that Gemini 2.5 (pro and flash) has been silently launched for API preview. Their performance is solid, but below QwQ 32B and the latest DeepSeek v3.

onekq-ai/WebApp1K-models-leaderboard
  • 2 replies
ยท
reacted to nyuuzyou's post with ๐Ÿ‘ about 21 hours ago
view post
Post
872
๐Ÿ–ผ๏ธ OpenClipart SVG Dataset - nyuuzyou/openclipart

Collection of 178,604 Public Domain Scalable Vector Graphics (SVG) clipart images featuring:
- Comprehensive metadata: title, description, artist name, tags, original page URL, and more.
- Contains complete SVG XML content (minified) for direct use or processing.
- All images explicitly released into the public domain under the CC0 license.
- Organized in a single train split with 178,604 entries.
reacted to mrfakename's post with ๐Ÿค—๐Ÿ‘ about 21 hours ago
view post
Post
723
Hi everyone,

I just launched TTS Arena V2 - a platform for benchmarking TTS models by blind A/B testing. The goal is to make it easy to compare quality between open-source and commercial models, including conversational ones.

What's new in V2:

- **Conversational Arena**: Evaluate models like CSM-1B, Dia 1.6B, and PlayDialog in multi-turn settings
- **Personal Leaderboard**: Optional login to see which models you tend to prefer
- **Multi-speaker TTS**: Random voices per generation to reduce speaker bias
- **Performance Upgrade**: Rebuilt from Gradio โ†’ Flask. Much faster with fewer failed generations.
- **Keyboard Shortcuts**: Vote entirely via keyboard

Also added models like MegaTTS 3, Cartesia Sonic, and ElevenLabs' full lineup.

I'd love any feedback, feature suggestions, or ideas for models to include.

TTS-AGI/TTS-Arena-V2
  • 1 reply
ยท
reacted to merve's post with ๐Ÿš€ about 21 hours ago
reacted to RiverZ's post with ๐Ÿค— about 21 hours ago
view post
Post
671
๐Ÿš€ Excited to Share Our Latest Work: In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer๏ฝž

๐ŸŽจ Daily Paper:
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer (2504.20690)


๐Ÿ”“ Code is now open source!
๐Ÿ”ฅ Huggingface DEMO:
RiverZ/ICEdit

๐ŸŒ Project Website: https://river-zhang.github.io/ICEdit-gh-pages/
๐Ÿ  GitHub Repository: https://github.com/River-Zhang/ICEdit/blob/main/scripts/gradio_demo.py
๐Ÿค— Huggingface:
sanaka87/ICEdit-MoE-LoRA

๐Ÿ“„ arxiv Paper:
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer (2504.20690)


๐Ÿ”ฅ Why itโ€™s cool:
- Achieves high-quality, multi-task image editing.
- Uses only 1% of the training parameters and 0.1% of the training data compared to existing methods โ€” extremely efficient
- Beats several commercial models on background preservation, ID control, and consistency
- Open-source, low-cost, faster, and stronger โ€” think of it as the โ€œDeepSeek of image editingโ€ ๐Ÿ‘€

We also implemented a Gradio demo app, available directly in our GitHub repo! And we made a flashy demo video โ€” happy to send it your way!
reacted to fdaudens's post with ๐Ÿ”ฅ about 21 hours ago
view post
Post
931
Forget everything you know about transcription models - NVIDIA's parakeet-tdt-0.6b-v2 changed the game for me!

Just tested it with Steve Jobs' Stanford speech and was speechless (pun intended). The video isnโ€™t sped up.

3 things that floored me:
- Transcription took just 10 seconds for a 15-min file
- Got a CSV with perfect timestamps, punctuation & capitalization
- Stunning accuracy (correctly captured "Reed College" and other specifics)

NVIDIA also released a demo where you can click any transcribed segment to play it instantly.

The improvement is significant: number 1 on the ASR Leaderboard, 6% error rate (best in class) with complete commercial freedom (cc-by-4.0 license).

Time to update those Whisper pipelines! H/t @Steveeeeeeen for the finding!

Model: nvidia/parakeet-tdt-0.6b-v2
Demo: nvidia/parakeet-tdt-0.6b-v2
ASR Leaderboard: hf-audio/open_asr_leaderboard
reacted to daavoo's post with ๐Ÿš€ about 21 hours ago
view post
Post
652
We've just released a new version of https://github.com/mozilla-ai/any-agent , including a Python implementation of https://huggingface.co/blog/tiny-agents!

Give it a โญ!

from any_agent import AnyAgent, AgentConfig
from any_agent.config import MCPStdioParams

agent = AnyAgent.create(
    "tinyagent",
    AgentConfig(
        model_id="gpt-4.1-nano",
        instructions="You must use the available tools to find an answer",
        tools=[
            MCPStdioParams(
                command="uvx",
                args=["duckduckgo-mcp-server"]
            )
        ]
    )
)

result = agent.run(
    "Which Agent Framework is the best??"
)
print(result.final_output)

reacted to samihalawa's post with ๐Ÿ‘€ about 21 hours ago
view post
Post
375
CURSOR IS OVER ๐Ÿ“ข Big announcement, folks! ๐Ÿ”ฅ I'm making a clean break from Cursor, Copilot, Cline, and all those other AI-focused IDEs. ๐Ÿ‘‹

If you're already shelling out for Cursor, Copilot, or similar, honestly, you'd be much better off putting those bucks towards Claude MAX and just taking off!
๐Ÿ’ฐโžก๏ธ Claude MAX
WHY?๐Ÿค” Claude Coder is now practically UNLIMITED if you're a Claude MAX subscriber. Seriously, it's a game-changer. And of course supports MCP out of the box (they invented it!)
๐Ÿš€And the absolute best bit? You've got a 99% (make that 100% in my experience, LOL) chance that the code it spits out will be PERFECT: zero bugs. โœจ No more head-scratching debugging sessions! ๐Ÿฅณ
It just works so much better, has this massive memory (token limit), and is so autonomous โ€“ you can literally let it grind away on a project for hours. โณ


  • 1 reply
ยท
reacted to as-cle-bert's post with โค๏ธ about 21 hours ago
view post
Post
1099
One of the biggest challenges I've been facing since I started developing [๐๐๐Ÿ๐ˆ๐ญ๐ƒ๐จ๐ฐ๐ง](https://github.com/AstraBert/PdfItDown) was handling correctly the conversion of files like Excel sheets and CSVs: table conversion was bad and messy, almost unusable for downstream tasks๐Ÿซฃ

That's why today I'm excited to introduce ๐ซ๐ž๐š๐๐ž๐ซ๐ฌ, the new feature of PdfItDown v1.4.0!๐ŸŽ‰

With ๐˜ณ๐˜ฆ๐˜ข๐˜ฅ๐˜ฆ๐˜ณ๐˜ด, you can choose among three (for now๐Ÿ‘€) flavors of text extraction and conversion to PDF:

- ๐——๐—ผ๐—ฐ๐—น๐—ถ๐—ป๐—ด, which does a fantastic work with presentations, spreadsheets and word documents๐Ÿฆ†

- ๐—Ÿ๐—น๐—ฎ๐—บ๐—ฎ๐—ฃ๐—ฎ๐—ฟ๐˜€๐—ฒ by LlamaIndex, suitable for more complex and articulated documents, with mixture of texts, images and tables๐Ÿฆ™

- ๐— ๐—ฎ๐—ฟ๐—ธ๐—œ๐˜๐——๐—ผ๐˜„๐—ป by Microsoft, not the best at handling highly structured documents, by extremly flexible in terms of input file format (it can even convert XML, JSON and ZIP files!)โœ’๏ธ

You can use this new feature in your python scripts (check the attached code snippet!๐Ÿ˜‰) and in the command line interface as well!๐Ÿ

Have fun and don't forget to star the repo on GitHub โžก๏ธ https://github.com/AstraBert/PdfItDown
reacted to clem's post with ๐Ÿค—๐Ÿ”ฅ about 21 hours ago
reacted to DevinGrey's post with ๐Ÿ‘€ about 21 hours ago
view post
Post
675
hello All. I am new to all of this and just beginning to learn how to use hugging face and AI in general. How can I access an ai code developer for help in setting up a website?
  • 2 replies
ยท
replied to DevinGrey's post about 21 hours ago
reacted to sometimesanotion's post with ๐Ÿ‘€ 1 day ago
view post
Post
1163
The capabilities of the new Qwen 3 models are fascinating, and I am watching that space!

My experience, however, is that context management is vastly more important with them. If you use a client with a typical session log with rolling compression, a Qwen 3 model will start to generate the same messages over and over. I don't think that detracts from them. They're optimized for a more advanced MCP environment. I honestly think the 8B is optimal for home use, given proper RAG/CAG.

In typical session chats, Lamarck and Chocolatine are still my daily drives. I worked hard to give Lamarck v0.7 a sprinkling of CoT from both DRT and Deepseek R1. While those models got surpassed on the leaderboards, in practice, I still really enjoy their output.

My projects are focusing on application and context management, because that's where the payoff in improved quality is right now. But should there be a mix of finetunes to make just the right mix of - my recipes are standing by.
reacted to ProCreations's post with ๐Ÿ‘€ 1 day ago
view post
Post
1027
Quantum Computing + AI = ๐Ÿคฏ?
What do you think quantum computing will do to AI?
Will it revolutionize training speed? Unlock whole new algorithms? Or maybeโ€ฆ just complicate things?

๐Ÿ’ฌ Drop your thoughts below โ€” weโ€™ll share our take and highlight some of your replies in tomorrowโ€™s post!
  • 1 reply
ยท
reacted to ginipick's post with ๐Ÿ”ฅ 1 day ago
view post
Post
2303
๐ŸŽจ Renoir Studio: Impressionist Masterpieces Reborn Through AI โœจ

๐ŸŒŸ Experience Renoir's Magical Brushstrokes with AI!

๐Ÿ”— Try it now: ginigen/flux-lora-renoir
๐Ÿ”— Model page: openfree/pierre-auguste-renoir
๐Ÿ”— Collection: openfree/painting-art-ai-681453484ec15ef5978bbeb1

Hello, AI art enthusiasts! ๐Ÿ’–
Today I'm introducing a special model - Pierre-Auguste Renoir Studio. Create your own beautiful artwork in the style of the 19th century French Impressionist master! ๐Ÿ–ผ๏ธ
โœจ Why Renoir's Style?
Renoir is famous for his luminous colors and soft brushstrokes. His works feature:

๐ŸŒž Warm sunshine and dancing light
๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘งโ€๐Ÿ‘ฆ The beauty of everyday life and joyful moments
๐ŸŒธ Vibrant nature and portraits of beautiful women
๐ŸŽญ Lively Parisian social gatherings and outdoor scenes

๐Ÿ”ฌ Technical Features
This model was developed as a flux-based learning model trained on a curated collection of high-resolution masterpieces from renowned global artists. The LoRA fine-tuning process leveraged exceptional quality open-access imagery released by prestigious institutions including the Art Institute of Chicago. The resulting model demonstrates remarkable capability in capturing the nuanced artistic techniques and stylistic elements across diverse historical art movements! ๐Ÿง ๐Ÿ’ซ
๐Ÿš€ How to Use

Describe your desired scene in the prompt box
Add the "renoir" keyword at the end (this is the trigger keyword!)
Click the 'Generate' button
Enjoy your ideas reborn in Renoir's style!

๐Ÿ’ก Recommended Prompt Examples

"Elegant ladies enjoying a picnic in a sunlit garden, wearing pastel dresses and hats renoir"
"People boating by a riverbank, light reflecting on water, warmth of summer renoir"
"Paris cafe terrace, people chatting over coffee, evening sunset renoir"

๐ŸŒˆ Now It's Your Turn!
#AI#Renoir #ArtificialIntelligence#HuggingFace #FLUX #LoRA