Computer Science/Discrete Mathematics Seminar I

How to Sketch a Learning Algorithm

How does the choice of training data influence an AI model? This question is of central importance to interpretability, privacy, and basic science.  At its core is the data deletion problem: after a reasonable amount of precomputation, quickly predict how the model would behave if a given subset of training data had been deleted.

I will present a data deletion scheme capable of predicting model outputs with arbitrarily small error ε and failure probability δ in the deep learning setting.  The precomputation and prediction algorithms are only Õ(log(1/δ)/ε^2) factors slower than regular training and inference, respectively. The storage requirements are those of Õ(log(1/δ)/ε^2) models. 

The proof is based on a simple assumption about the stability of training. In contrast to the assumptions made by prior work, stability appears to be fully compatible with learning powerful AI models.  At a technical level, our work is based on a new method for locally sketching an arithmetic circuit by computing higher-order derivatives in random complex directions.

Based on my recent work: https://arxiv.org/pdf/2604.07328.

Date & Time

May 28, 2026 | 11:00am – 12:00pm
Add to calendar 05/28/2026 11:00 05/28/2026 12:00 Computer Science/Discrete Mathematics Seminar I use-title Topic: How to Sketch a Learning Algorithm Speakers: Sam Gunn, University of California, Berkeley More: https://www.ias.edu/math/events/computer-sciencediscrete-mathematics-seminar-i-629 How does the choice of training data influence an AI model? This question is of central importance to interpretability, privacy, and basic science.  At its core is the data deletion problem: after a reasonable amount of precomputation, quickly predict how the model would behave if a given subset of training data had been deleted. I will present a data deletion scheme capable of predicting model outputs with arbitrarily small error ε and failure probability δ in the deep learning setting.  The precomputation and prediction algorithms are only Õ(log(1/δ)/ε^2) factors slower than regular training and inference, respectively. The storage requirements are those of Õ(log(1/δ)/ε^2) models.  The proof is based on a simple assumption about the stability of training. In contrast to the assumptions made by prior work, stability appears to be fully compatible with learning powerful AI models.  At a technical level, our work is based on a new method for locally sketching an arithmetic circuit by computing higher-order derivatives in random complex directions. Based on my recent work: https://arxiv.org/pdf/2604.07328 [https://urldefense.com/v3/__https:/arxiv.org/pdf/2604.07328__;!!NubF!PFXw2mJ-yoFQuwsOIvHLyT4MtxL2svba0K0bgDnTiwpMEG4GME2rRAYksqPzGp9ii8DUhKjcU6jK$]. Simonyi Hall 101 and Remote Access a7a99c3d46944b65a08073518d638c23

Location

Simonyi Hall 101 and Remote Access

Speakers

Sam Gunn, University of California, Berkeley