MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Advancing Image and Video Recognition with Less Supervision

Anna Kukleva
Max-Planck-Institut für Informatik - D2
Promotionskolloquium
AG 1, INET, AG 5, RG1, SWS, AG 2, AG 4, D6, AG 3  
Public Audience
English

Date, Time and Location

Thursday, 1 August 2024
14:00
60 Minutes
E 1.4
024
Saarbrücken

Abstract

Deep learning has become an essential component of modern life,transforming various tasks across multiple domains such as entertainment, education, and autonomous driving. However, the increasing demand for data to train models for emerging tasks poses significant challenges. Deep learning models heavily rely on high-quality labeled datasets, yet obtaining comprehensive supervision is resource-intensive and can introduce biases.Therefore, we explore strategies to mitigate the need for full supervision and reduce data acquisition costs. The first part of the discussion focuses on self-supervised and unsupervised learning methods, which enable learning without explicit labels by leveraging inherent data structures and injecting prior knowledge for robust data representations. The second part of the presentation discusses strategies such as minimizing precise annotations in multimodal learning, allowing for effective utilization of correlated information across different modalities. Moreover, we discuss open-world scenarios, proposing novel setup and method to adapt vision-language models to the new domains. Overall, this research contributes to understanding learning dynamics and biases present in data, advancing training methods that require less supervision.

Contact

Connie Balzert
+49 681 9325 2000
--email hidden

Virtual Meeting Details

Zoom
665 3344 4339
passcode not visible
logged in users only

Connie Balzert, 07/16/2024 10:25 -- Created document.