Update README.md
README.md
CHANGED
@@ -41,6 +41,8 @@ Designed for multi-step, logic-intensive mathematical problem-solving tasks
 
 Additionally trained with Reinforcement Learning for higher accuracy but generates on average 50% more tokens
 
+Find more details in their paper here: https://www.microsoft.com/en-us/research/wp-content/uploads/2025/04/phi_4_reasoning.pdf
+
 ## Special thanks
 
 🙏 Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.