New best story on Hacker News: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL Andrew S Gomes January 26, 2025 Share to: Twitter Facebook URL Print Email DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL 1079 by gradus_ad | 899 comments on Hacker News. Tags Hacker News
Post a Comment
Click to see the code!
To insert emoticon you must added at least one space before the code.