<br>[DeepSeek open-sourced](https://gitlab.kitware.com) DeepSeek-R1, an LLM fine-tuned with reinforcement learning (RL) to [improve reasoning](http://62.234.223.2383000) ability. DeepSeek-R1 achieves results on par with OpenAI's o1 model on several benchmarks, including MATH-500 and [SWE-bench](http://118.31.167.22813000).<br>