Julian Michael

Hi, I’m Julian. I currently work on AI safety, evaluation, and alignment at Meta. Until recently, I was head of the Safety, Evaluations, and Alignment Lab (SEAL) at Scale AI, which does research related to safeguarding AI system behavior and ensuring it preserves and amplifies human agency. Prior to SEAL, I did research on a variety of topics, mostly focused on AI alignment or formal semantics of natural language.

In alignment, I focus on scalable oversight and agent alignment, from the lens of task formulation, data collection, and evaluation methodology. I’m especially interested in using debate as a training and evaluation paradigm, which I helped test with humans. I’m also interested in pushing the boundaries of difficult evaluations for scalable oversight, as in our release of GPQA.

In language, I work on ways to use data and machine learning to help us do a better science of language, particularly when it comes to syntax and semantics. I lay out this scientific paradigm in my PhD thesis, described best in my 2023 talk at the Big Picture Workshop. In constructing the building blocks for this, I have developed approaches to crowdsourcing annotation for syntactic parsing, semantic role labeling, and predicate-argument structure.

More broadly, I am interested in the Science of AI and NLP, using empirical methods to improve our understanding of intelligent behavior and language use. Along these lines, I have worked on broad-coverage and fine-grained evaluation of models, unsupervised discovery of linguistic structure, and explicitly incorporating ambiguity into task design. See my publications for a full list.

Selected publications (see all)

GPQA: A Graduate-Level Google-Proof Q&A Benchmark
David Rein, Betty Li Hou, Asa Cooper Stickland, Jackson Petty, Richard Yuanzhe Pang, Julien Dirani, Julian Michael† and Samuel R. Bowman†
COLM 2024 (Spotlight)
pdf arxiv data talk twitter reviews bib
Media: AI Index (2024; Ch. 2), Nature
Debate Helps Supervise Unreliable Experts
Julian Michael,* Salsabila Mahdi,* David Rein,* Jackson Petty, Julien Dirani, Vishakh Padmakumar and Samuel R. Bowman
arXiv preprint
website pdf arxiv code data twitter bib
The Case for Scalable, Data-Driven Theory: A Paradigm for Scientific Progress in NLP
Julian Michael
The Big Picture Workshop (Best Paper)
pdf arxiv slides twitter bib
Inducing Semantic Roles Without Syntax
Julian Michael and Luke Zettlemoyer
Findings of ACL 2021
website s2 pdf code bib

Selected talks (see all)

The Case for Scalable, Data-Driven Theory: A Paradigm for Scientific Progress in NLP
This is the best entry point to my thesis work, laying out my proposal for how to use machine learning to do better science, specifically in the case of syntax and semantics.
video slides paper

Dec 7, 2023 • The Big Picture Workshop

An Introduction to NLP, for Scientists
A summary of the contemporary state of NLP and a (novel at the time, as far as I'm aware) proposal for how to use language models for scientific data analysis. Includes spicy takes on the relationship between deep learning and psychiatric drugs and more.
video slides colab

Apr 5, 2023 • Graduate Regression @ NYU

Philosophical Foundations of AI Ethics
An introduction to foundational issues in AI ethics from a philosophical perspective. I try to connect contemporary views and disagreements to their philosophical roots in consequentialism, deontology, social contract theory, and critical theory.
video slides

Apr 3, 2024 • Ethics in Artificial Intelligence @ UW

Apr 7, 2023 • Ethics in Artificial Intelligence @ UW

Jan 25, 2022 • Computational Ethics in NLP @ UW

Representing Meaning with Question-Answer Pairs
A 10-minute, accessible colloquium talk summarizing some of my early PhD work on crowdsourcing representations of language structure and meaning. My experience with the projects described in this talk led me to invest further in QA-SRL.
video slides

Nov 7, 2019 • UW Allen School Colloquium

Other writings (see all)

An in-depth review of a mid-2021 version of the OpenPhil Biological Anchors report on transformative AI timelines.
To Dissect an Octopus, a blog post taking a deep dive into the form/meaning debate around language models.
A long comment thread on the Alignment Forum discussing limits on the extrapolations we can make about automation potential based on ML benchmarks.
The GLUE diagnostic set guide, which doubles as a quick tour of fun phenomena in semantics.
Fulfilling Imperatives, an essay investigating of the semantics of imperative sentences.
Modern Cosmology: Explaining the Universe, an essay investigating whether inflation theory qualifies as science.