New best story on Hacker News: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL
1079 points by gradus_ad | 899 comments on Hacker News.
Andrew S Gomes, January 26, 2025