CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature
Abstract
A large-scale knowledge base of recombination examples is built from scientific paper abstracts using an LLM-based extraction model to analyze and inspire new creative directions in AI.
A hallmark of human innovation is the process of recombination -- creating original ideas by integrating elements of existing mechanisms and concepts. In this work, we automatically mine the scientific literature and build CHIMERA: a large-scale knowledge base (KB) of recombination examples. CHIMERA can be used to empirically explore at scale how scientists recombine concepts and take inspiration from different areas, or to train supervised machine learning models that learn to predict new creative cross-domain directions. To build this KB, we present a novel information extraction task of extracting recombination from scientific paper abstracts, collect a high-quality corpus of hundreds of manually annotated abstracts, and use it to train an LLM-based extraction model. The model is applied to a large corpus of papers in the AI domain, yielding a KB of over 28K recombination examples. We analyze CHIMERA to explore the properties of recombination in different subareas of AI. Finally, we train a scientific hypothesis generation model using the KB, which predicts new recombination directions that real-world researchers find inspiring. Our data and code are available at https://github.cs.huji.ac.il/tomhope-lab/CHIMERA
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Sparks of Science: Hypothesis Generation Using Structured Paper Data (2025)
- BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text (2025)
- Structured Extraction of Process Structure Properties Relationships in Materials Science (2025)
- Enhancing Document Retrieval for Curating N-ary Relations in Knowledge Bases (2025)
- AutoRev: Automatic Peer Review System for Academic Research Papers (2025)
- Spark: A System for Scientifically Creative Idea Generation (2025)
- Symbol-based entity marker highlighting for enhanced text mining in materials science with generative AI (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 9
Browse 9 models citing this paperDatasets citing this paper 2
Spaces citing this paper 0
No Space linking this paper