Update README.md
Browse files
README.md
CHANGED
@@ -16,13 +16,13 @@ tags:
|
|
16 |
- formal-mathematics
|
17 |
---
|
18 |
|
19 |
-
# BFS-Prover Tactic Generator
|
20 |
|
21 |
This repository contains the latest tactic generator model checkpoint from BFS-Prover, a state-of-the-art theorem proving system in Lean4. While the full BFS-Prover system integrates multiple components for scalable theorem proving, we are releasing the core tactic generation model that achieved state-of-the-art performance on formal mathematics tasks. Given a proof state in Lean4, the model generates a tactic that transforms the current proof state into a new state, progressively working towards completing the proof.
|
22 |
|
23 |
**๐ Paper: [BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving](https://arxiv.org/abs/2502.03438)**
|
24 |
|
25 |
-
## Model Details
|
26 |
|
27 |
- Base Model: Qwen2.5-Math-7B
|
28 |
- Training Approach:
|
@@ -34,7 +34,7 @@ This repository contains the latest tactic generator model checkpoint from BFS-P
|
|
34 |
- Lean-Workbook
|
35 |
- Autoformalized NuminaMath-CoT dataset
|
36 |
|
37 |
-
## Performance
|
38 |
BFS-Prover achieves state-of-the-art performance on the MiniF2F test benchmark. Here's a detailed comparison:
|
39 |
|
40 |
### MiniF2F Test Benchmark Results
|
@@ -48,11 +48,11 @@ BFS-Prover achieves state-of-the-art performance on the MiniF2F test benchmark.
|
|
48 |
| DeepSeek-Prover-V1.5 | MCTS | No | 32ร16ร400 | 63.5% |
|
49 |
|
50 |
|
51 |
-
### Key Advantages
|
52 |
-
- Achieves better performance without requiring a critic model (value function)
|
53 |
-
- Combined with simpler search method (BFS) rather than MCTS
|
54 |
|
55 |
-
## Usage
|
56 |
- The model expects Lean4 tactic states in the format `"{state}:::"`
|
57 |
- `:::` serves as a special indicator to signal the model to generate a tactic for the given state.
|
58 |
- The model will echo back the input state followed by the generated tactic.
|
@@ -80,7 +80,7 @@ print(tactic)
|
|
80 |
# Final tactic: "simp [h]"
|
81 |
```
|
82 |
|
83 |
-
## Citation
|
84 |
|
85 |
If you use this model in your research, please cite our paper:
|
86 |
|
@@ -93,11 +93,11 @@ If you use this model in your research, please cite our paper:
|
|
93 |
}
|
94 |
```
|
95 |
|
96 |
-
## License
|
97 |
|
98 |
https://choosealicense.com/licenses/apache-2.0/
|
99 |
|
100 |
-
## Contact
|
101 |
|
102 |
For questions and feedback about the tactic generator model, please contact:
|
103 |
- Ran Xin ([email protected])
|
|
|
16 |
- formal-mathematics
|
17 |
---
|
18 |
|
19 |
+
# ๐งฎ BFS-Prover Tactic Generator
|
20 |
|
21 |
This repository contains the latest tactic generator model checkpoint from BFS-Prover, a state-of-the-art theorem proving system in Lean4. While the full BFS-Prover system integrates multiple components for scalable theorem proving, we are releasing the core tactic generation model that achieved state-of-the-art performance on formal mathematics tasks. Given a proof state in Lean4, the model generates a tactic that transforms the current proof state into a new state, progressively working towards completing the proof.
|
22 |
|
23 |
**๐ Paper: [BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving](https://arxiv.org/abs/2502.03438)**
|
24 |
|
25 |
+
## โจ Model Details
|
26 |
|
27 |
- Base Model: Qwen2.5-Math-7B
|
28 |
- Training Approach:
|
|
|
34 |
- Lean-Workbook
|
35 |
- Autoformalized NuminaMath-CoT dataset
|
36 |
|
37 |
+
## ๐ Performance
|
38 |
BFS-Prover achieves state-of-the-art performance on the MiniF2F test benchmark. Here's a detailed comparison:
|
39 |
|
40 |
### MiniF2F Test Benchmark Results
|
|
|
48 |
| DeepSeek-Prover-V1.5 | MCTS | No | 32ร16ร400 | 63.5% |
|
49 |
|
50 |
|
51 |
+
### ๐ Key Advantages
|
52 |
+
- โ
Achieves better performance without requiring a critic model (value function)
|
53 |
+
- โ
Combined with simpler search method (BFS) rather than MCTS
|
54 |
|
55 |
+
## ๐ Usage
|
56 |
- The model expects Lean4 tactic states in the format `"{state}:::"`
|
57 |
- `:::` serves as a special indicator to signal the model to generate a tactic for the given state.
|
58 |
- The model will echo back the input state followed by the generated tactic.
|
|
|
80 |
# Final tactic: "simp [h]"
|
81 |
```
|
82 |
|
83 |
+
## ๐ Citation
|
84 |
|
85 |
If you use this model in your research, please cite our paper:
|
86 |
|
|
|
93 |
}
|
94 |
```
|
95 |
|
96 |
+
## ๐ License
|
97 |
|
98 |
https://choosealicense.com/licenses/apache-2.0/
|
99 |
|
100 |
+
## ๐ง Contact
|
101 |
|
102 |
For questions and feedback about the tactic generator model, please contact:
|
103 |
- Ran Xin ([email protected])
|