Donnerstag, 09. Januar 2025, 16:45 - 18:15 iCal

Invited Talk: Paul Röttger

A Brief Introduction to AI Safety

Hörsaal 33 (HS) im Hauptgebäude der Universität Wien
Universitätsring 1, 1010 Wien

Vortrag


Abstract: AI systems such as ChatGPT, are now being used by millions of people across the world. In this lecture, I will give a brief introduction to the field of AI safety, which works to ensure that AI is safe to use today and will continue to be safe as it grows more capable. I will outline what it means for an AI system to be safe, how we can test for safety and how we can improve it. I will discuss open challenges in AI safety today as well as risks that may materialise in the future. The lecture will include a small participatory component, so I encourage the audience to bring their laptops.

 

Bio: Paul is a postdoctoral researcher in Dirk Hovy's MilaNLP Lab at Bocconi University, working on evaluating and improving the alignment and safety of large language models (LLMs), as well as measuring their societal impacts. Before coming to Milan in June 2023, Paul completed his PhD at the University of Oxford, where he worked on LLMs for hate speech detection. During his PhD, Paul also co-founded Rewire, a start-up building AI for content moderation, which in March 2023 was acquired by another large online safety company.


Veranstalter

Lecture series: Machines that understand? Large Language Models and Artificial Intelligence


Kontakt

Lukas Thoma
Universität Wien
Forschungsgruppe Data Mining and Machine Learning
+43-1-4277-79501
lukas.thoma@univie.ac.at