|
|
|
|
|
<br>DeepSeek open-sourced DeepSeek-R1, an [LLM fine-tuned](https://sebeke.website) with reinforcement learning (RL) to enhance thinking ability. DeepSeek-R1 attains results on par with [OpenAI's](http://logzhan.ticp.io30000) o1 design on a number of standards, consisting of MATH-500 and SWE-bench.<br> |