MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Privacy Auditing and Protection in Large Language Models

Fatemehsadat Mireshghallah
University of Washington
SWS Colloquium

Fatemehsadat Mireshghallah is a post-doctoral scholar at the Paul G. Allen Center for Computer Science & Engineering at University of Washington. She received her Ph.D. from the CSE department of UC San Diego in 2023. Her research interests are Trustworthy Machine Learning and Natural Language Processing. She is a recipient of the National Center for Women & IT (NCWIT) Collegiate award in 2020 for her work on privacy-preserving inference, a finalist of the Qualcomm Innovation Fellowship in 2021 and a recipient of the 2022 Rising star in Adversarial ML award.

MMCI  
AG Audience
English

Date, Time and Location

Monday, 18 September 2023
10:00
90 Minutes
E1 5
029
Saarbrücken

Abstract

Large language Models (LLMs, e.g., GPT-3, OPT, TNLG,…) are shown to have a remarkably high performance on standard benchmarks, due to their high parameter count, extremely large training datasets, and significant compute. Although the high parameter count in these models leads to more expressiveness, it can also lead to higher memorization, which, coupled with large unvetted, web-scraped datasets can cause different negative societal and ethical impacts such as leakage of private, sensitive information and generation of harmful text. In this talk, we will go over how these issues affect the trustworthiness of LLMs, and zoom in on how we can measure the leakage and memorization of these models, and mitigate it through differentially private training. Finally we will discuss what it would actually mean for LLMs to be privacy preserving, and what are the future research directions on making large models trustworthy.

Contact

Gretchen Gravelle
+49 681 9303 9102
--email hidden

Video Broadcast

Yes
Kaiserslautern
G26
112

Virtual Meeting Details

Zoom
966 8141 4048
passcode not visible
logged in users only

Gretchen Gravelle, 09/13/2023 12:46
Gretchen Gravelle, 09/13/2023 12:43 -- Created document.