Empirical optimal transport under estimated costs: Distributional limits and 
statistical applications

Hundrieser, S.; Mordant, G.; Weitkamp, C.A.; Munk, A.

doi:10.1016/j.spa.2024.104462

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Journal Article

Empirical optimal transport under estimated costs: Distributional limits and statistical applications

MPS-Authors

/persons/resource/persons32719

Munk, A.
Research Group of Statistical Inverse Problems in Biophysics, Max Planck Institute for Multidisciplinary Sciences, Max Planck Society;

External Resource

No external resources are shared

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

Publisher Version
(Publisher version), 2MB

Supplementary Material (public)

There is no public supplementary material available

Citation

Hundrieser, S., Mordant, G., Weitkamp, C., & Munk, A. (2024). Empirical optimal transport under estimated costs: Distributional limits and statistical applications. Stochastic Processes and their Applications, 178: 104462. doi:10.1016/j.spa.2024.104462.

Cite as: https://hdl.handle.net/21.11116/0000-000F-CB1E-C

Abstract

Optimal transport (OT) based data analysis is often faced with the issue that the underlying cost function is (partially) unknown. This is addressed in this paper with the derivation of distributional limits for the empirical OT value when the cost function and the measures are estimated from data. For statistical inference purposes, but also from the viewpoint of a stability analysis, understanding the fluctuation of such quantities is paramount. Our results find direct application in the problem of goodness-of-fit testing for group families, in machine learning applications where invariant transport costs arise, in the problem of estimating the distance between mixtures of distributions, and for the analysis of empirical sliced OT quantities.

The established distributional limits assume either weak convergence of the cost process in uniform norm or that the cost is determined by an optimization problem of the OT value over a fixed parameter space. For the first setting we rely on careful lower and upper bounds for the OT value in terms of the measures and the cost in conjunction with a Skorokhod representation. The second setting is based on a functional delta method for the OT value process over the parameter space. The proof techniques might be of independent interest.