RT info:eu-repo/semantics/doctoralThesis T1 Statistical analysis of the optimal transport problem A1 González Sanz, Alberto A2 Universidad de Valladolid. Escuela de Doctorado K1 Estadística matemática - Investigación operativa K1 Optimal transport K1 Transporte óptimo K1 Statistics K1 Estadística K1 Empirical processes K1 procesos empíricos K1 1209 Estadística AB Optimal transportation is a resource allocation problem present in fields such as economics, finance, physics or artificial intelligence. From a probabilistic point of view, the optimal transport cost endows the space of probability measures with a metric topology. In particular, this topology is equivalent to the weak topology of probability measures together with the convergence of moments. This makes the transport cost an appropriate tool for measuringdiscrepancies between distributions. On the other hand, the solution of the transport problem is known as optimal plan. That is, an unambiguous way to relate two distributions following an optimality criterion. This optimal plan, when deterministic, is called a transport map.However, in many cases the probability distribution is a theoretical, unattainable entity. It is only visible to the practitioner through its empirical version, i.e. a finite data set of size n. This work examines the asymptotic behaviour of the transport cost in its empirical version. In other words, we study the limits of the empirical cost and plans when the data grows to infinity. It is well-known that the empirical transport cost converges to the population one. Moreover, for continuous measures it does so at a rate that decreases with dimension. In this thesis we prove the consistency of the transport map using topology of set-valued maps. This leads, indirectly, to being able to state that the rate at which the fluctuations–difference between the expected empirical cost and the empirical cost itself–approximate zero is theparametric one, irrespective of the dimension. Moreover, these fluctuations multiplied by the parametric rate tend toward a Gaussian random variable. In economics the transportation problem appears in numerous occasions in its semi-discrete version, i.e. one of the probability distributions isdiscrete. In this case, we show that the rate at which the empirical transport cost converges to the population one does not depend on the dimension.We also show that the well-known entropy regularization (or Sinkhorn regularization), apart from simplifying the computation of the transport problem by giving it a differentiable structure, has highly satisfactory statistical properties. In particular, its bias and the divergence–that the regularization defines–converge with speed greater than the parametric one; the empirical regularized plans converge to the population ones with paramtetric ratemoreover, tending to a Gaussian process. The transport map endows a probability measure P with an order with respect to a given reference. This property leads to the successful definition of M.Hallin’s multivariatedistribution function by choosing as a reference measure the spherical uniform. This thesis provides sufficient conditions under which this function defines a homeomorphism between the support of the probability measure P and the unitary ball–i.e. to support of the spherical uniform. Finally, we provide a conditional version of the multivariate distribution function, with applications to quantile regression. YR 2023 FD 2023 LK https://uvadoc.uva.es/handle/10324/62639 UL https://uvadoc.uva.es/handle/10324/62639 LA eng NO Escuela de Doctorado DS UVaDOC RD 28-nov-2024