Distinguished Lecture Series: Been Kim (Google DeepMind) - Alignment and interpretability: how we might get it right

Date & Time:

November 19, 2024 2:00 pm – 3:00 pm

Location:

Crerar 390, 5730 S. Ellis Ave., Chicago, IL

Part of the 2024-25 DSI Distinguished Speaker Series and the Computer Science Distinguished Lecture Series.

Abstract: The main goal of interpretability is to enable communication between humans and machines, whether it’s a value, knowledge, or an objective. In this talk, I argue that a better way to enable this communication is for humans to expand what they know and learn new things. Doing so enables us to also expand what machines know—by building better-aligned machines. I share why considering the representational gap is crucial in solving the alignment problem, and I provide an example of bridging the knowledge gap.

Speakers

Been Kim

Senior Staff Research Scientist, Google DeepMind

Been Kim is a senior staff research scientist at Google DeepMind. Her research focuses on helping humans communicate with complex machine learning models by 1) building tools to aid humans’ collaboration with machines (and detecting when those tools fail), 2) studying machines’ general nature, and 3) leveraging machines’ knowledge to benefit humans. She gave a talk at the G20 meeting in Argentina in 2019 and keynotes at ICLR 2022 and ECML 2020. Her work on TCAV received the UNESCO Netexplo award and was featured at Google I/O ’19. Her work is featured in a chapter of Brian Christian’s book “The Alignment Problem”. She is the General Chair of ICLR 2024, was a Senior Program Chair of ICLR 2023, and serves on the advisory board of TRAILS. She has been a senior area chair at NeurIPS, ICML, ICLR, AISTATS, and other conferences for the past few years. She is a steering committee member of the FAccT conference and SATML. She received her PhD from MIT.
