ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Abstract
ComfyMind, a collaborative AI system built on ComfyUI, enhances generative workflows with a Semantic Workflow Interface and Search Tree Planning mechanism, outperforming existing open-source systems across generation, editing, and reasoning tasks.
With the rapid advancement of generative models, general-purpose generation has gained increasing attention as a promising approach to unify diverse tasks across modalities within a single system. Despite this progress, existing open-source frameworks often remain fragile and struggle to support complex real-world applications due to the lack of structured workflow planning and execution-level feedback. To address these limitations, we present ComfyMind, a collaborative AI system designed to enable robust and scalable general-purpose generation, built on the ComfyUI platform. ComfyMind introduces two core innovations: Semantic Workflow Interface (SWI) that abstracts low-level node graphs into callable functional modules described in natural language, enabling high-level composition and reducing structural errors; Search Tree Planning mechanism with localized feedback execution, which models generation as a hierarchical decision process and allows adaptive correction at each stage. Together, these components improve the stability and flexibility of complex generative workflows. We evaluate ComfyMind on three public benchmarks: ComfyBench, GenEval, and Reason-Edit, which span generation, editing, and reasoning tasks. Results show that ComfyMind consistently outperforms existing open-source baselines and achieves performance comparable to GPT-Image-1. ComfyMind paves a promising path for the development of open-source general-purpose generative AI systems. Project page: https://github.com/LitaoGuo/ComfyMind
Community
๐ Github Code: https://github.com/LitaoGuo/ComfyMind
๐ Project Page: https://litaoguo.github.io/ComfyMind.github.io/
๐งช Online demo: Will be released in a few days. Stay tuned! ๐
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- AgentSGEN: Multi-Agent LLM in the Loop for Semantic Collaboration and GENeration of Synthetic Data (2025)
- HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems (2025)
- Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution (2025)
- HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation (2025)
- HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking (2025)
- Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning (2025)
- Large Language Models for Planning: A Comprehensive and Systematic Survey (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper