Addressing the Unexpected – Anomaly Detection and AI Safety – Job Talk
Sun 25.01 11:15 - 12:15
- Faculty Seminar
-
Bloomfield 527
Abstract: While AI models are becoming an ever-increasing part of our lives, our understanding of their behavior in unexpected situations is drifting even further out of reach. This gap poses significant risks to users, model owners, and society at large.
In the first part of the talk, I will overview my research on detecting unexpected phenomena with and within deep learning models. Specifically, detecting (i) anomalous samples, (ii) unexpected model behavior, and (iii) unexpected security threats. In the second part of the talk, I will dive into my recent research on a specific type of unexpected security threat: attacks on image watermarks. I will review such attacks and present my recent work toward addressing them. I will conclude with a discussion of future research directions.

