DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Cited in this thesis
Frequently Cited Together
- Minimally Invasive Evaluation of Venous Leg Ulcers in an Outpatient Setting Usin... (1 chapter)
- Fish mislabelling in France: substitution rates and retail types (1 chapter)
- Application of rapid evaporative ionization mass spectrometry in preclinical and... (1 chapter)
- Qualitative and quantitative analysis of adulterated Antarctic Krill Oil (AKO) b... (1 chapter)
- DNA barcoding reveals mislabeling of endangered sharks sold as swordfish in New... (1 chapter)
- Masked siamese convnets (1 chapter)
BibTeX
@article{Guo2025,
author = {Guo, Daya and Yang, Dejian and Zhang, Haowei and Song, Junxiao and Zhang, Ruoyu and Xu, Runxin and He, Yujia},
journal = {arXiv preprint arXiv:2501.12948},
title = {DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning},
year = {2025},
}