Main content start

Efficient AI-assisted data annotation

Date
Wed April 22nd 2026, 4:30pm
Location
UC Berkeley Evans Hall Room 1015
Speaker
Tijana Zrnic, Stanford Statistics

Obtaining high-quality labeled datasets is often costly, requiring either extensive human annotation or expensive experiments. In this talk I will discuss methods that supplement such "expert" labels with AI predictions from pre-trained models to construct labeled datasets more cost-effectively. Our approach results in labels with provable error guarantees, enabling rigorous yet efficient dataset curation using modern AI models.