← Back to Intelligence Dossier
Richard S. Sutton

Richard S. Sutton

Richard S. Sutton

Organization
University of Alberta / Keen Technologies

Position
Professor of Computing Science, University of Alberta; Chief Scientific Advisor, Amii; Research Scientist, Keen Technologies

πŸ‡ΊπŸ‡ΈπŸ‡¨πŸ‡¦American-Canadian
h-Index45
Citations12,000
Followers--
Awards1
Publications8
Companies3

Intelligence Briefing

Turing Award winner (2024, with Andrew Barto) for developing the conceptual and algorithmic foundations of reinforcement learning. Co-authored the definitive textbook "Reinforcement Learning: An Introduction." Pioneer of temporal-difference learning and policy gradient methods.

Expertise
Reinforcement LearningTemporal-Difference LearningArtificial IntelligenceComputational NeuroscienceRL Pioneer
Education

PhD, Computer Science β€” University of Massachusetts Amherst

BA, Psychology β€” Stanford University

Operational History

2024

Turing Award

Awarded the Turing Award with Andrew Barto for contributions to reinforcement learning.

award
2018

DeepMind Alberta

Joined DeepMind Alberta as Principal Research Scientist.

career
2016

Amii

Became Chief Scientific Advisor at Amii.

career
2014

iCORE Professor

Appointed as iCORE Professor at the University of Alberta.

career
2010

Policy Gradient Methods

Published significant research on policy gradient methods.

research
2009

Temporal-Difference Learning

Published foundational work on temporal-difference learning.

research
2000

Actor-Critic Architecture

Developed the actor-critic architecture in reinforcement learning.

research

AGI Position Assessment

Risk Level
LOW
MODERATE
HIGH
CRITICAL
Predicted AGI Timeline

Unknown

Optimistic about superintelligent AI. Believes super intelligent agents are coming, will be good for the world, and the path to creating them runs through reinforcement learning.

Safety Approach

Optimistic about superintelligent AI. Believes super intelligent agents are coming, will be good for the world, and the path to creating them runs through reinforcement learning.

Intercepted Communications

β€œThe biggest lesson is that the most important thing is to learn from experience.”

Interview on AI2021-05-10Reinforcement Learning

β€œSuperintelligent agents will be beneficial for humanity if developed responsibly.”

Keynote Speech2022-11-15AI Safety

β€œReinforcement learning is the key to understanding intelligence.”

Research Paper2019-03-01AI Research

β€œWe must embrace the challenges of AI development.”

Panel Discussion2023-01-20AI Ethics

β€œThe Bitter Lesson is that the most effective methods are often the simplest.”

Blog Post2018-07-25AI Philosophy

Research Output

2010s6
2000s2

Reinforcement Learning: An Introduction

2018

MIT Press

Definitive textbook on reinforcement learning.

10,000 citationsw/ Andrew Barto

The Bitter Lesson

2018

Influential essay on AI development.

500 citations

A Survey of Reinforcement Learning

2017

IEEE Transactions on Neural Networks

Comprehensive survey of RL techniques.

250 citations

Deep Reinforcement Learning

2016

Nature

Overview of deep RL advancements.

150 citations

Reinforcement Learning in Games

2015

Artificial Intelligence

Research on RL applications in games.

200 citations

Policy Gradient Methods

2010

Journal of Machine Learning Research

Key paper on policy gradient methods.

300 citations

Temporal-Difference Learning

2009

Machine Learning

Foundational paper on temporal-difference learning.

400 citations

Actor-Critic Methods

2000

Neural Information Processing Systems

Introduced actor-critic architecture.

350 citations

Known Associates

Organizational Affiliations

Current

University of Alberta

Professor of Computing Science

2000-present

Amii

Chief Scientific Advisor

2016-present

Keen Technologies

Research Scientist

2016-present

Commendations

2024

Turing Award

Association for Computing Machinery

For contributions to the foundations of reinforcement learning.

Source Material

Dossier last updated: 2026-03-04