DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Adam Suma, Sam Dauncey

2025 arXiv.org Cited 2,135 times

Cited in this thesis

Frequently Cited Together

Identification of the Species of Origin for Meat Products by Rapid Evaporative IBalog 20161 chapter
Fishers' preference for mobile traceability platform: challenges in achieving a Untal 20251 chapter
Automatic design of convolutional neural network architectures under resource coLi 20211 chapter
Unlocking the combined impact of microplastics and emerging contaminants on fishWu 20251 chapter
Microplastic contamination in wild freshwater fish: global trends, challenges ande Araujo 20251 chapter
Adaptive mixtures of local expertsJacobs 19911 chapter

BibTeX

@article{Guo2025,
  author = {Guo, Daya and Yang, Damai and Zhang, Hanze and Song, Jiangtao and Zhang, Rujie and Xu, Rundong and He, Yujia},
  journal = {arXiv preprint arXiv:2501.12948},
  title = {DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning},
  year = {2025},
}