redaelkate

Pussinsilicon

AI & ML interests

None yet

Organizations

None yet

Pussinsilicon's activity

upvoted a paper 7 days ago

BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation

Paper • 2504.14538 • Published 10 days ago • 26

upvoted a paper 27 days ago

Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback

Paper • 2405.20216 • Published May 30, 2024 • 22

upvoted a paper 29 days ago

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics

Paper • 2503.20308 • Published Mar 26 • 22

upvoted 2 papers about 1 month ago

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

Paper • 2503.21765 • Published Mar 27 • 11

Towards Scientific Discovery with Generative AI: Progress, Opportunities, and Challenges

Paper • 2412.11427 • Published Dec 16, 2024 • 3

upvoted an article about 2 months ago

FastRTC: The Real-Time Communication Library for Python

Article • Feb 25 • 159

upvoted an article 2 months ago

Open-source DeepResearch – Freeing our search agents

Article • Feb 4 • 1.23k

liked a Space 2 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

reacted to sanaka87's post with 🔥 3 months ago

Post

1762

🚀 Excited to Share Our Latest Work: 3DIS & 3DIS-FLUX for Multi-Instance Layout-to-Image Generation! ❤️❤️❤️

🎨 Daily Paper: 3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering (2501.05131)
🔓 Code is now open source!
🌐 Project Website: https://limuloo.github.io/3DIS/
🏠 GitHub Repository: https://github.com/limuloo/3DIS
📄 3DIS Paper: https://arxiv.org/abs/2410.12669
📄 3DIS-FLUX Tech Report: https://arxiv.org/abs/2501.05131

🔥 Why 3DIS & 3DIS-FLUX?
Current SOTA multi-instance generation methods are typically adapter-based, requiring additional control modules trained on pre-trained models for layout and instance attribute control. However, with the emergence of more powerful models like FLUX and SD3.5, these methods demand constant retraining and extensive resources.

✨ Our Solution: 3DIS
We introduce a decoupled approach that requires training only a low-resolution Layout-to-Depth model, which converts layouts into coarse-grained scene depth maps. By leveraging pre-trained models from the community and industry, such as ControlNet and SAM2, we enable training-free controllable image generation on high-resolution models such as SDXL and FLUX.

🌟 Benefits of Our Decoupled Multi-Instance Generation:
1. Enhanced Control: By constructing scenes using depth maps in the first stage, the model focuses on coarse-grained scene layout, improving control over instance placement.
2. Flexibility & Preservation: The second stage employs training-free rendering methods, allowing seamless integration with various models (e.g., fine-tuned weights, LoRA) while maintaining the generative capabilities of pre-trained models.

Join us in advancing Layout-to-Image Generation! Follow and star our repository to stay updated! ⭐

liked a model 4 months ago

Datou1111/shou_xin

Text-to-Image • Updated Mar 16 • 195 • 871

updated a collection 7 months ago

roleplay

3 items • Updated Sep 20, 2024

liked a model 7 months ago

aifeifei798/DarkIdol-Llama-3.1-8B-Instruct-1.2-Uncensored

Text Generation • Updated Jul 29, 2024 • 4.74k • 193

updated a collection 7 months ago

roleplay

3 items • Updated Sep 20, 2024

liked a model 7 months ago

sophosympatheia/New-Dawn-Llama-3.1-70B-v1.1

Text Generation • Updated Aug 16, 2024 • 29 • 32

updated a collection 7 months ago

roleplay

3 items • Updated Sep 20, 2024

liked 3 models 7 months ago

sophosympatheia/Midnight-Miqu-70B-v1.5

Text Generation • Updated Dec 10, 2024 • 727 • 199

ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.1

Updated Oct 18, 2024 • 11 • 31

TheDrummer/Gemmasutra-Mini-2B-v1

Updated Sep 27, 2024 • 122 • 60

liked a Space 8 months ago

Kolors Virtual Try-On

Try on virtual garments on person images