Pour vous authentifier, privilégiez eduGAIN / To authenticate, prefer eduGAINeu

2–13 juin 2025
Institut Pascal
Fuseau horaire Europe/Paris

Datasets dictionary learning and applications to frugal AI

6 juin 2025, 15:30
20m
Institut Pascal

Institut Pascal

Rue André Rivière 91400 Orsay France

Orateur

Fred Ngolè (CEA Paris-Saclay)

Description

Empirical risk minimization learning paradigm works under the assumption that training and test data are identically distributed. However, this hypothesis is seldom met in practice, due to several factors including changes in the underlying physical process generating the data, data acquisition conditions or sensors drifts. This problem is known as distributional shift between the reference and target data. A straightforward generalization of this problem is when training with multiple datasets exhibiting distributional shifts.

In this context, optimal transport provides useful tools for comparing and manipulating probability measures. In particular, a Wasserstein barycenter is a meaninful way of averaging datasets in the space of probability measures equipped with the optimal transport based distance. Leveraging this concept, we introduce a novel dictionary learning problem over empirical probability measures and present its applications to different learning setting where no labels are available on the target dataset.

Documents de présentation