
Andrew Barto
Pioneer of Reinforcement Learning
Organization
University of Massachusetts Amherst
Position
Professor Emeritus of Computer Science, University of Massachusetts Amherst
Intelligence Briefing
Pioneer of reinforcement learning and co-author, with Richard Sutton, of the foundational textbook "Reinforcement Learning: An Introduction". Won the 2024 ACM Turing Award (announced March 2025) alongside Sutton for developing the conceptual and algorithmic foundations of reinforcement learning. Retired from UMass Amherst in 2012 but remains professor emeritus. His work on temporal-difference learning and actor-critic methods laid the groundwork for modern RL techniques, including the reinforcement learning from human feedback (RLHF) used to train ChatGPT and Claude.
BS, Mathematics - University of Michigan
MS, Computer and Communication Sciences - University of Michigan
PhD, Computer Science - University of Michigan
Operational History
Turing Award Announcement
The ACM Turing Award was officially announced.
award
ACM Turing Award
Awarded the ACM Turing Award alongside Richard Sutton for contributions to reinforcement learning.
award
Retirement
Barto retired from his position at UMass Amherst but continues as Professor Emeritus.
career
Research on Intrinsic Motivation
Explored the concept of intrinsic motivation in reinforcement learning.
research
Introduction of Actor-Critic Methods
Introduced actor-critic methods, which are widely used in reinforcement learning.
research
Development of Temporal-Difference Learning
Contributed significantly to the development of temporal-difference learning algorithms.
research
Publication of Reinforcement Learning: An Introduction
Co-authored with Richard Sutton, this textbook became a foundational text in the field.
research
Joined UMass Amherst
Barto began his tenure as a professor in the Computer Science department.
career
AGI Position Assessment
Unknown
Focuses on foundational research. Has expressed concern about ensuring AI systems learn aligned reward functions.
Intercepted Communications
"Reinforcement learning is a powerful framework for understanding how agents can learn from their interactions with the environment."
"The future of AI depends on how well we can align reward functions with human values."
"Our work on actor-critic methods has paved the way for many modern applications in AI."
"Understanding intrinsic motivation is key to developing more autonomous AI systems."
"The Turing Award is a recognition of the collaborative effort in the field of reinforcement learning."
Research Output
Reinforcement Learning: A Survey
2018, Journal of Machine Learning Research
Updated survey on RL advancements.
Reinforcement Learning and Control as Probabilistic Inference
2010, Proceedings of the National Academy of Sciences
Discussed probabilistic approaches to RL.
A Survey of Reinforcement Learning
2001, IEEE Transactions on Neural Networks
Comprehensive survey of RL methods.
Intrinsic Motivation in Reinforcement Learning
2000, Neural Networks
Explored intrinsic motivation in AI.
Actor-Critic Algorithms
1999, Journal of Machine Learning Research
Introduced actor-critic methods.
Reinforcement Learning: An Introduction
1998, MIT Press
Foundational textbook in reinforcement learning.
Learning from Delayed Rewards
1992, Journal of Artificial Intelligence Research
Investigated delayed reward learning.
Temporal-Difference Learning
1988, Machine Learning Journal
Key paper introducing temporal-difference learning.
Known Associates
Richard Sutton
collaborator
Co-author of the foundational textbook on reinforcement learning.
Yoshua Bengio
colleague
Prominent figure in machine learning and AI research.
Geoffrey Hinton
colleague
Known as one of the 'Godfathers of AI'.
Demis Hassabis
mentor
CEO of DeepMind, has acknowledged Barto's work in RL.
Organizational Affiliations
Current
University of Massachusetts Amherst
Professor Emeritus of Computer Science
2012-present
Former
University of Massachusetts Amherst
Professor of Computer Science
1977-2012
Various Research Institutions
Researcher in AI and ML
Various
Commendations
2024
ACM Turing Award
Association for Computing Machinery
Awarded for contributions to the field of reinforcement learning.
Source Material
Dossier last updated: 2026-03-04