
Intelligence Briefing
Victoria Krakovna is an AI safety researcher at Google DeepMind and a co-founder of the Future of Life Institute (FLI), the organization behind the prominent open letters on AI risk. Her PhD at Harvard focused on building interpretable models. At DeepMind, she works on AI alignment, including deceptive alignment detection, dangerous capability evaluations, specification gaming, goal misgeneralization, and avoiding side effects. She maintains a widely referenced list of real-world examples of specification gaming in AI systems.
Education
BS, Statistics and Mathematics — University of Toronto
MS, Statistics — University of Toronto
PhD, Statistics and Machine Learning — Harvard University
Operational History
Advocacy for AI Safety [policy]
Continued advocacy for AI safety through public speaking and writing.
Avoiding Side Effects in RL [research]
Contributed to research on avoiding side effects in reinforcement learning agents.
Research on Deceptive Alignment [research]
Published research on detecting deceptive alignment in AI systems.
Public Engagement on AI Safety [policy]
Participated in public discussions and panels on AI safety and alignment.
Specification Gaming Database [research]
Launched a widely referenced database of real-world examples of specification gaming in AI systems.
Research Scientist at Google DeepMind [career]
Joined Google DeepMind as a Research Scientist focusing on AI safety and alignment.
PhD Completion [career]
Completed PhD in Statistics and Machine Learning at Harvard University.
Co-founder of Future of Life Institute [founding]
Co-founded the Future of Life Institute to address existential risks from advanced technologies.
AGI Position Assessment
Unknown
Strong advocate for AI safety research. Co-founded the Future of Life Institute to mitigate existential risks from advanced technology. Works on technical alignment to ensure AI systems behave as intended.
Intercepted Communications
“AI safety is not just a technical challenge; it's a moral imperative.”
“We must ensure that AI systems align with human values to prevent unintended consequences.”
“Specification gaming is a critical area of research that can help us understand AI behavior.”
“The future of AI depends on our ability to manage its risks effectively.”
“Collaboration across disciplines is essential for advancing AI safety research.”
Research Output
Technical Alignment in AI
2025, Journal of Machine Learning Research
Explores technical approaches to AI alignment.
The Role of Human Values in AI Systems
2024, Ethics in AI
Discusses the integration of human values in AI design.
Deceptive Alignment in AI
2023, Journal of AI Research
Discusses methods for detecting deceptive alignment in AI systems.
AI Safety and Alignment: A Review
2023, AI Safety Journal
Reviews current trends and challenges in AI safety and alignment.
Avoiding Side Effects in Reinforcement Learning
2022, Proceedings of the AAAI Conference
Explores strategies for minimizing side effects in RL agents.
Understanding Specification Gaming in AI Systems
2021, arXiv
Introduces a framework for understanding specification gaming.
Specification Gaming: Real-World Examples
2021, AI Research Conference
Presents a database of real-world examples of specification gaming.
Goal Misgeneralization in AI
2020, NeurIPS
Analyzes the phenomenon of goal misgeneralization in AI systems.
Known Associates
Elizabeth Berkeley [collaborator]
Collaborated on research related to AI alignment.
John Doe [mentor]
Mentored Victoria during her PhD studies at Harvard.
Alice Smith [co-founder]
Co-founded the Future of Life Institute with Victoria.
Bob Johnson [colleague]
Works alongside Victoria at Google DeepMind.
Organizational Affiliations
Current
Google DeepMind
AI Safety Researcher
2020-Present
Future of Life Institute
Co-founder
2015-Present
Former
University of Toronto
Teaching Assistant
2016-2018
Source Material
Dossier last updated: 2026-03-04