PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan et al.

2019 Neural Information Processing Systems Cited 50,820 times

Abstract

Deep learning frameworks have often focused on either usability or speed, but not both. PyTorch is a machine learning library that shows that these two goals are in fact compatible: it was designed from first principles to support an imperative and Pythonic programming style that supports code as a model, makes debugging easy and is consistent with other popular scientific computing libraries, while remaining efficient and supporting hardware accelerators such as GPUs. In this paper, we detail the principles that drove the implementation of PyTorch and how they are reflected in its architecture. We emphasize that every aspect of PyTorch is a regular Python program under the full control of its user. We also explain how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance. We demonstrate the efficiency of individual subsystems, as well as the overall speed of PyTorch on several commonly used benchmarks.

Cited in this thesis

Fish Species and Part Identification

Frequently Cited Together

Generalization and parameter estimation in feedforward nets: Some experimentsMorgan 19891 chapter
Bert: Pre-training of deep bidirectional transformers for language understandingDevlin 20181 chapter
Idiot's Bayes—not so stupid after all?Hand 20011 chapter
Adaptive mixtures of local expertsJacobs 19911 chapter
Gaussian error linear units (gelus)Hendrycks 20161 chapter
Identification of biological tissues by rapid evaporative ionization mass spectrBalog 20101 chapter

BibTeX

@article{Paszke2019,
  title = {Pytorch: An imperative style, high-performance deep learning library},
  author = {Paszke, A},
  journal = {arXiv preprint arXiv:1912.01703},
  year = {2019},
}