Paper 1: Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? https://arxiv.org/pdf/2504.13837
Paper 2: Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification https://arxiv.org/pdf/2502.01839
Tweet: https://x.com/YangYue_THU/status/1914690345964855566
Noam Brown Interview: https://www.youtube.com/watch?v=c675KAlmo8k
[Download Link]: https://drive.google.com/file/d/1ierwwx3KiKuTl1X7Lt8oDQ1iHiAXyczQ/view?usp=sharing
Bob Rein
2025-05-04 21:56:19 +0000 UTCGrant Singleton
2025-05-03 16:28:13 +0000 UTCEagleshadow
2025-04-26 09:47:24 +0000 UTCTristan Reid
2025-04-25 23:52:44 +0000 UTC