DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL (arxiv.org)
1338 points by gradus_ad 6 days ago | 1051 comments
1511338 points by gradus_ad 6 days ago | 1051 comments
15165 points by naggie 3 days ago | 26 comments
152138 points by tsadoq 4 days ago | 50 comments
15314 points by usaphp 8 hours ago | 2 comments
15483 points by thunderbong 5 days ago | 25 comments
1559 points by rbanffy a day ago | 0 comments
156519 points by AnhTho_FR 2 days ago | 287 comments
157140 points by rrampage a day ago | 64 comments
15856 points by gmays 3 days ago | 6 comments
1597 points by tlombardozzi 13 hours ago | 2 comments
160160 points by boriskourt 4 days ago | 27 comments
16192 points by todsacerdoti 5 days ago | 34 comments
16272 points by pseudolus 5 days ago | 32 comments
163177 points by naggie 8 days ago | 47 comments
16430 points by rookie123 20 hours ago | 10 comments
165400 points by ada1981 4 days ago | 350 comments
166876 points by sbarre 5 days ago | 260 comments
16788 points by cwillu 8 days ago | 6 comments
168155 points by arti_chaud 3 days ago | 45 comments
16913 points by Philpax 17 hours ago | 0 comments
170329 points by todsacerdoti 4 days ago | 97 comments
171118 points by wulujia 3 days ago | 35 comments
172169 points by kawera 5 days ago | 41 comments
173604 points by todsacerdoti 5 days ago | 483 comments
17486 points by zetalyrae 7 days ago | 120 comments
1756 points by Fajar_Rahmad a day ago | 3 comments
176359 points by todsacerdoti 4 days ago | 243 comments
177471 points by Einenlum 3 days ago | 137 comments
178175 points by FinnLobsien 4 days ago | 77 comments
17956 points by brakmic 5 days ago | 22 comments
180