Home

A user's guide to optimal transport

1. Q lt Blom Ee Ble F gr 4 tn Tm Therefore 1 A7 1 d an 2 d Em x xe Kun LI E d Ln pm e oS 2 s or E am 2T I 0 and thus the sequence xn is a Cauchy sequence as soon as 0 lt 7 lt 1 A This shows uniqueness existence follows by the l s c of E One step estimates We claim that the following discrete version of the EVI 3 30 holds for any TEX d x y d CU C09 Mor y lt E Eu VyeX 39 where x is the minimizer of 3 12 Indeed pick a curve y satisfying 3 29 for zo z x y and y x and use the minimality of x to get d z ye 2T SEM lt 0 0E 7 HEU 240 0d y F 1 t d x x7 td z y t 1 t a a y l 2T E x lt E t Rearranging the terms dropping the positive addend td x x and dividing by t gt 0 we get 2 VT 2 C DE TEN LA ary lt Bly El 2T 2T 2 so that letting t 0 we get 3 33 Now we pass to the discrete version of the error estimate which will also give the full conver gence of the discrete solutions to the limit curve Given T y D E and the associate discrete solutions x7 yj we are going to bound the distance d at y7 in terms of the distance d T 7 Write two times the discrete EVI 3 33 for 7 7 2 and y y first with x 7 then with T 2 pig 7 2 tO get we use the assumption A gt 0 d Cae y E
2. 33 Plugging v 1 i 1 and recalling that W2 Toei en Tk olein gu Oas m m 00 for every n N we get that Wu Wi un lt zs fondu f P6ssodw 62 m m oo PARE 1 sacs W2 uo un Letting n oo we get that i C 4 Geod X is a Cauchy sequence and the conclusion Lemma 2 11 The multivalued map from G X Geod X which associates to each pair x y the set G x y of constant speed geodesics connecting x to y has closed graph Proof Straightforward Lemma 2 12 A variant of gluing Let Y Z be Polish spaces v v P Y and f g Y Z be two Borel maps Let y Adm f v gy0 Then there exists a plan B P Y such that zB v TA B D fon gon 4B v Proof Let vz vz be the disintegrations of v v w r t f g respectively Then define B x V X V dy z Z2 Remark 2 13 The Hilbert case If X is an Hilbert space then for every x y X there exists only one constant speed geodesic connecting them the curve t gt 1 t a ty Thus Theorem 2 10 reads as t gt Ju is a constant speed geodesic if and only if there exists an optimal plan y Opt j10 1 such that pi 1 t n tr 4 If y is induced by a map T the formula further simplifies to pi 1 t Id tT uo 2 11 Remark 2 14 A slight modification of the arguments presented in the second part of the proof of Theorem 2 10 shows that if X
3. lt J Pait Bajaj dv Bn xo BRn yo 26 valid for any y Adm p v and the analogous one with the integral of d y yo in place of d3 x xo show that if K C X and K2 C P Y are 2 uniformly integrable so is the set u Z X x Y THY Ki TY K2 We say that a function f X R has quadratic growth provided f x lt a d x xo 4 1 2 2 for some a Rand zo X Itis immediate to check that if f has quadratic growth and u P2 X then f L1 X p The concept of 2 uniform integrability in conjunction with tightness in relation with conver gence of integral of functions with quadratic growth plays a role similar to the one played by tight ness in relation with convergence of integral of bounded functions as shown in the next proposition Proposition 2 4 Let un C Ao X be a sequence narrowly converging to some u Then the fol lowing 3 properties are equivalent i un is 2 uniformly integrable ii fdun gt f fdu for any continuous f with quadratic growth iii f d xo du gt f d vo du for some xo X Proof i ii It is not restrictive to assume f gt 0 Since any such f can be written as supremum of a family of continuous and bounded functions it clearly holds J faves imin f ran n gt o0 Thus we only have to prove the limsup inequality Fix gt 0 xo X and find Re gt 1 such that J X Br z0 d xo dju lt for every n Now
4. M i e the second derivative along a unit speed geodesic is bounded above by a universal constant depending only on M see e g the third appendix of Chapter 10 of 80 for the simple proof Proposition 1 30 Let M be a smooth compact Riemannian manifold without boundary Let M RU oc be a c concave function not identically equal to oo Then is Lipschitz semiconcave and real valued Also assume that y 0 p x Then exp y C 0 y x Conversely if p is differentiable at x then exp WVq x 0 p x 18 Proof The fact that is real valued follows from the fact that the cost function d z y 2 is uni formly bounded in x y M Smoothness and compactness ensure that the functions d y 2 are uniformly Lipschitz and uniformly semiconcave in y M this gives that i is Lipschitz and semiconcave Now pick y 0 y x and v exp y Recall that v belongs to the superdifferential of d y 2 at x i e d z y x y 2 2 Thus from y 0 p x we have T e a tp _d cn v exp z o d a z lt v exp z o d z z that is v OT y x To prove the converse implication it is enough to show that the c superdifferential of p at x is non empty To prove this use the c concavity of y to find a sequence yn C M such that d a Yn z _ ex p z lim 7777 v yn d zy plz Cow q y VzeM neN
5. d2 a Yt lt s x y X ES a y g Observe that fort lt r lt s eater ua gt e Veyiex and equality holds if and only if there is a constant speed geodesic y t s X such that z Vt Y Yr Z Ys The notions of ae and c transforms convexity concavity and sub super differential are defined as in Section 1 2 Definitions 1 8 1 9 and 1 10 The basic properties of the Hopf Lax formula are collected in the following proposition 36 Proposition 2 17 Basic properties of the Hopf Lax formula We have the following three prop erties i For any t s 0 1 the map Hj is order preserving that is p lt v gt H HE ii For any t lt s 0 1 it holds iii For any t s 0 1 it holds H o Ht o H Hg Proof The order preserving property is a straightforward consequence of the definition To prove property ii observe that H H 9 6 sup inf o 2 h a y s y s which gives the equality H Hf p cU in particular choosing xz x we get the claim the proof of the other equation is similar For the last property assume t lt s the other case is similar and observe that by 7 we have Hj o HoH 2H 2Id and H oHtoH lt Hg Id The fact that Kantorovich potentials evolve according to the Hopf Lax formula is expressed in the following theorem We remark that in the statement below one must deal at the
6. y is uniformly semiconcave the sit uation is the following We have a semiconcave function f R R and a semiconvex function g R R such that f gt g on R f g ona certain closed set K and we have to prove that the vector field u K R4 defined by u x V f x Vg x is Lipschitz Up to rescaling we may assume that f and g are such that f is concave and g is convex Then for every x K and y IR we have u x y 2 Iz y o y g x f y f x u x y 2 ly al and thus for every z K y IR it holds If y f x u x y 2 Iz yl Picking z1 2 K and y R we have f 22 f z1 ue 22 21 za f za y f z2 ule y lu f wa y f 21 ue 2 y 21 z2 t y mi Adding up we get u z1 u zz y lxx zal yl ler y mi P lt 3i m2 yl Eventually choosing y u z1 u z2 6 we obtain lu z1 u z2 36 x1 za It is worth stressing the fact that the regularity property ensured by the previous corollary holds without any assumption on the measures 10 H1 Remark 2 25 A much simpler proof in the Euclidean case The fact that intermediate trans port maps are Lipschitz can be proved in the Euclidean case via the theory of monotone op erators Indeed if G R R is a possibly multivalued monotone
7. Id o 1 t Id tT for every t in the sense of distributions Indeed for o CX IR it holds foam 5 f older dquo f v omaem T a duo ive ve due Now the continuity equation describes the link between the motion of the continuum ji and the instantaneous velocity v R IR of every atom of ji It is therefore natural to think at the vector field v as the infinitesimal variation of the continuum pz From this perspective one might expect that the set of smooth curves on Z5 IR and more generally on Z M is somehow linked to the set of solutions of the continuity equation This is actually the case as we are going to discuss now In order to state the rigorous result we need to recall the definition of absolutely continuous curve on a metric space Definition 2 28 Absolutely continuous curve Let Y d be a metric space and let 0 1 5 t gt yt Y bea curve Then yi is said absolutely continuous if there exists a function f L1 0 1 such that d gi ys a f r dr Vt lt s 0 1 2 18 t We recall that if y is absolutely continuous then for a e t the metric derivative y exists given by 7 d j lim ten Ye 241 poo C Sed and that j L 0 1 and is the smallest L function up to negligible sets for which inequality 2 18 is satisfied see e g Theorem 1 1 2 of 6 for the simple proof The link between absolutely continuous curves in 4 2 M an
8. Plen zadrg alens yf s ri as ni lt J ater 22 d x x3 dy x1 22 23 lt 8mm f f Gs nra d 25 23 dey1 z1 22 8d Wa ua u2 W2 ua u3 Finally we need to prove that W is real valued Here we use the fact that we restricted the analysis to the space Z5 X from the triangle inequality we have Wa u v Wa p 054 Wa v 054 Pza e ovo oo A trivial yet very useful inequality is W2 Fast gu lt J d2 f 2 9 a du x 21 valid for any couple of metric spaces X Y any u Y X and any couple of Borel maps f g X Y This inequality follows from the fact that f g is an admissible plan for the measures Fn gap and its cost is given by the right hand side of 2 1 Observe that there is a natural isometric immersion of X d into 2 5 X W2 namely the map qo Op Now we want to study the topological properties of A X W2 To this aim we introduce the notion of 2 uniform integrability K C A2 X is 2 uniformly integrable provided for any gt 0 and xo X there exists Re gt 0 such that sup f d x xo dp lt LEK J X Br zo Remark 2 3 Let X dx Y dy be Polish and endow X x Y with the product distance d 1 31 2 y2 d 21 2 d yi y2 Then the inequality 2 20 dy 0 y i Bisnis J di 2 20 dy 0 9 Bn xo x Bn yo Br ao xY Bn xo x Bn vo J d amp a xo dp x J Rdy x y Br z0 Xx Bn vo
9. d ee gi V GERD 0 2 28 which is the same equation solved by yf It is possible to show that this fact together with the smoothness of the v s and the equality uj jg gives that jj ji for every t see Proposition 8 1 7 and Theorem 8 3 1 of 6 for a proof of this fact Conclude observing that 2 dig x S v T s cits ff v CE ran a loz T r gt eee a dr lt t f le IBaqedr lt sL W2 uf u lt fita T s x dug z and that by the characterization of convergence 2 4 W2 u y 0 as 0 for every t 0 1 48 2 4 Bibliographical notes To call the distance W2 the Wasserstein distance is quite not fair a much more appropriate would be Kantorovich distance Also the spelling Wasserstein is questionable as the original one was Vasershtein Yet this terminology is nowadays so common that it would be impossible to change it The equivalence 2 4 has been proven by the authors and G Savar in 6 In the same reference Remark 2 8 has been first made The fact that Z 5 X W2 is complete and separable as soon as X d is belongs to the folklore of the theory a proof can be found in 6 Proposition 2 4 was proved by C Villani in 79 Theorem 7 12 The terminology displacement interpolation was introduced by McCann 63 for probability mea sures in R4 Theorem 2 10 appears in this form here for the first time in 58 the th
10. gt 0 such that d z y d z O d O y Ve B A UB A y B B UB B 18 Now let f resp g be a smooth probability density everywhere positive and symmetric w r t the x y axes such that 5 uB A f dvol gt 4 resp Je ByuB B g dvol gt 3 and let T resp T be the optimal transport map from fvol to gvol resp from gvol to f vol We claim that either T or T is discontinuous and argue by contradiction Suppose that both are continuous and observe that by the symmetry of the optimal transport problem it must hold T x T l x forany x M Again by the symmetry of M f g the point T O must be invariant under the symmetries around the x and y axes Thus it is either T O O or T O O and similarly T O O O We claim that it must hold T O O Indeed otherwise either T O O and T O O or T O O and T O O In the first case the two couples O O and O O belong to the support of the optimal plan and thus by cyclical monotonicity it holds d O 0 d O O lt d 0 O d O O 0 which is absurdum In the second case we have T x 4 O for all x M which by continuity and compactness implies d T M O gt 0 This contradicts the fact that f is positive everywhere and T gvol f vol Thus it holds T O O Now observe that by construction there must be some mass transfer from B A U B A to B B U B B i e we can find x B
11. has a global maximum at x zo Use the smoothness and compactness of M to find r gt 0 such that d 2 x y d x y lt r R is C and satisfies V d y 2 gt cId for every y M with c gt 0 independent on y Now observe that since is semiconcave and real valued it is Lipschitz Thus for 9 gt 0 sufficiently small it holds o v lt r 3 for any v O x and any x M Also since y is bounded possibly decreasing the value of o we can assume that 2 T lt edle 5 Fix zo M v O y xo and let yo exp eov We claim that for o chosen as above the maximum of coy d yo 2 cannot lie outside B xo Indeed if d x xo gt r we have d x yo gt 2r 3 and thus d z y r 2r p y d 19 yo sues ug c cm m rr 2 Thus the maximum must lie in B a9 Recall that in this ball the function d yo is C and satisfies V d yo 2 gt cld thus it holds V sue lt 9A c Id where A R is such that V lt AId on the whole of M Thus decreasing if necessary the value of o We can assume that d Ve sue T lt 0 on B x9 which implies that eoy d yo 2 admits a unique point x B xo such that 0 9 i d yo 2 x which therefore is the unique maximum Since V d yo to amp ov O eop xo we conclude that xo is the unique global maximum as claimed 20 Now define the function v M RU oo
12. 0 h h 0 h Fu o T lo t o T t to shows that the total derivative is well defined for a e t and that is an L vector field in the sense that it holds l Uut s llus 9 T t S ullu lt i t dt oo Ht Notice also the inequality EEE T dr Ur Ur ar t An important property of the total derivative is the Leibnitz rule for any couple of absolutely contin uous vector fields uj u along the same regular curve p the map t uj Up is absolutely continuous and it holds d d d d ur up z Sulu ut Su F a e t 6 6 Ht Ht Indeed from the identity dt Ht ut uj ul o T to t u2 9 T to t dus it follows the absolute continuity and the same expression gives d d a uj up xr uj o T to t u o T to t Fut o T to t Jw 9 T to t d 1 1 d j ccu uj Uy UE m dt us Example 6 7 The smooth case Let x t amp x be a CX vector field on R4 u a regular curve and v its velocity vector field Then the inequality l amp s o T t 5 Sellue lt Mes amp l Hto d ut 9 T to t 2 a ue 9 T to t 2 Pto Pto Hs Il o T t s lu lt ga t SPOT TOS s m Id 91 with C sup z Or x C SUD z amp ac is absolutely continuous gives that is absolutely continuous along p Then a direct appli
13. 1 exists for every t A Also by 2 25 we know that for to A the linear functional Lj Vp o D gt R given by d Veo Ls Vo yl 1 pdp 47 satisfies Lig V9 Velle Hto and thus it can be uniquely extended to a linear and bounded functional on Tan Y2 R By the Riesz representation theorem there exists a vector field v Tan 2 IR such that d aliis J edi Li V9 Vee Vto Min Ve D 2 26 and whose norm in L u is bounded above by the metric derivative at t to It remains to prove that the continuity equation is satisfied in the sense of distributions This is a consequence of 2 26 see Theorem 8 3 1 of 6 for the technical details Part B Up to a time reparametrization argument we can assume that v r2 lt L for some L R for a e t Fix a Gaussian family of mollifiers p and define My Me p TE C Er E Ht It is clear that d ate V gt ems 0 Moreover from Jensen inequality applied to the map X z gt 2 X z X z X vm it follows that llvellz2que llvellzzqu L 2 27 This bound together with the smoothness of vf implies that there exists a unique locally Lipschitz map T 0 1 x R R4 t 0 1 satisfying d at cw 5z Va R a e t 0 1 Tria eu Va R4 t 0 1 A simple computation shows that the curve t gt jij T t 4416 solves
14. 9 94 2010 pp 107 130 125 38 39 40 41 A FIGALLI F MAGGI AND A PRATELLI A mass transportation approach to quantitative isoperimetric inequalities Invent Math 182 2010 pp 167 211 A FIGALLI AND L RIFFORD Mass transportation on sub Riemannian manifolds Geom Funct Anal 20 2010 pp 124 159 N Fusco F MAGGI AND A PRATELLI The sharp quantitative isoperimetric inequality Ann of Math 2 168 2008 pp 941 980 W GANGBO The Monge mass transfer problem and its applications in Monge Amp re equa tion applications to geometry and optimization Deerfield Beach FL 1997 vol 226 of Con temp Math Amer Math Soc Providence RI 1999 pp 79 104 W GANGBO AND R J MCCANN The geometry of optimal transportation Acta Math 177 1996 pp 113 161 N GIGLI On the geometry of the space of probability measures in R endowed with the quadratic optimal transport distance 2008 Thesis Ph D Scuola Normale Superiore Second order calculus on P2 M W2 Accepted by Memoirs of the AMS 2009 On the heat flow on metric measure spaces existence uniqueness and stability Calc Var Partial Differential Equations 2010 On the inverse implication of Brenier McCann theorems and the structure of M W2 accepted paper Meth Appl Anal 2011 R JORDAN D KINDERLEHRER AND F OTTO 77e variational formulation of the Fokker Planck equation SIAM J Math
15. Pj 0 Proof The first formula follows directly from Theorem 6 22 the second from the fact that O7 is the adjoint of O An important feature of equations 6 27 and 6 29 is that to express the derivatives of Ny Ut We Ov u and O7 u no new operators appear This implies that we can re cursively calculate derivatives of any order of the vector fields P u E u4 Ov ut and OX u provided of course that we make appropriate regularity assumptions on the vector field uz and on the velocity vector field v An example of result which can be proved following this direction is that the operator t P is analytic along the restriction of a geodesic Proposition 6 24 Analyticity of t P Let m be the restriction to 0 1 of a geodesic de fined in some larger interval 1 e Then the operator t P is analytic in the following sense For any to 0 1 there exists a sequence of bounded linear operators An MEL I such that the following equality holds in a neighborhood of to t to P u 5 C A u o T to t o T t to Vu Li 6 30 ncN 103 Proof From the fact that u is the restriction of a geodesic we know that L SUP e 0 1 Lip v lt co and that Zu 0 recall Example 6 9 In particular condition 6 28 is fulfilled Fix to 0 1 u E and define us uo T t to so that Zu 0 From equati
16. T x x dt pz x dx 1 f Vole TG 2 p w de Rem e fag Dr Rem where the remainder term Rem is bounded by Lip Vo Lip Ve men lt EPI furia adt p o de PEE p Since heuristically speaking W po p has the same magnitude of 7 we have Rem o 7 and the proof is complete 68 3 3 1 Elements of subdifferential calculus in R7 W2 Recall that we introduced a weak Riemannian structure on the space Z5 M W 5 in Subsec tion 2 3 2 Among others this weak Riemannian structure of Y2 W2 allows the development of a subdifferential calculus for geodesically convex functionals in the same spirit and with many formal similarities of the usual subdifferential calculus for convex functionals on an Hilbert space To keep the notation and the discussion simpler we are going to define the subdifferential of a geodesically convex functional only for the case Y2 R and for regular measures Definition 1 25 but everything can be done also on manifolds or Hilbert spaces and for general u Ao M Recall that for a convex functional F on an Hilbert space H the subdifferential O F x at a point x is the set of vectors v H such that A F x vy 2 5le yl lt Fu Wed Definition 3 26 Subdifferential in IR W2 Let E 4 R7 RU 00 be a geodesically convex and lower semicontinuous functional and u R4 be a regular measure such that E t lt oo The
17. converges in the Gromov Hausdorff topology to some space X d It is well known that in this situation there exists a compact space Y dy and a family of isometric embeddings fn supp m Y f X Y such that the Hausdorff distance between supp m and f X goes to 0 as n oo The space f supp m dy f m4 is isomorphic to Xn dn Mn by construction for every n N and f X dy is isometric to X d so we identify these spaces with the respective subspaces of Y dy Since Y dy is compact the sequence m admits a subsequence not relabeled which weakly converges to some m Z Y It is immediate to verify that actually m P X Also again by compactness weak convergence is equivalent to convergence w r t W2 which means that there exists plans y Y admissible for the couple rn Mn such that 4 amp 9 n gt 0 Therefore n dy 7y is a sequence of admissible couplings for X d m and Xn dn Mn whose cost tends to zero This concludes the proof Now we prove the HWI which relates the entropy often denoted by H the Wasserstein distance W and the Fisher information 7 and the log Sobolev inequalities To this aim we introduce the Fisher information functional 7 A X 0 oo on a general metric measure space X d m as the squared slope of the entropy vo 000 0 lim 2 if E50 oo I u 4
18. ii d zi y Vy X a e t 0 thus plugging this bound into the EVI we get x4 d axe y 5d ni y E zi Ey Vy X a e t gt 0 which implies E u E VEe tm Meo BW ye deny Fix an interval a b C 0 00 let L be the Lipschitz constant of x in a b and observe that for any y X it holds Xj aet gt 0 3 10 d ZP Cov 2 liides y 2 Ld sy aet a b Plugging this bound in the EVI we get A Ld zi y 3d ni y E x Ely a e t a b 53 and by the lower semicontinuity of t E x the inequality holds for every t a b Taking y x and then exchanging the roles of x s we deduce E zx E zs lt Ld a1 s 5d zi as lt L t s z Nie s1 Vt s a b thus the map t gt Ez is locally Lipschitz It is then obvious that it holds E E E E d 29 Eas qu EGO Ela qu EON 7 Beth deen 2i dt h0 h h0 d xt 4h Lt h E 1 T V E zi di 3l V El 2 gl a e t Thus to conclude we need only to prove the opposite inequality Integrate the EVI from f to t h to get d dl Peng P 6o 7 sos as f ita ds lt EQ Let y x to obtain d X 1 Cen lt E x E z ds Aran nf E x E zisa dr Al rane t 0 Now let A C 0 00 be the set of points of differentiability of t E a and where t exists choose t AN a b divide by the above inequality le
19. lt Q t d 2o y td z1 y t 1 t d zo 21 3 29 for every t 0 1 Observe that there is no compactness assumption of the sublevels of E If X is an Hilbert space and more generally a NPC space Definition 2 19 then the second inequality in 3 29 is satisfied by geodesics Hence A convex functionals are automatically compatible with the metric Following the same lines of the previous section it is possible to show that this assumption im plies both Assumption 3 8 and if the sublevels of E are boundedly compact Assumption 3 13 so that Theorem 3 14 holds Also it can be shown that formula 3 21 is true and thus that Proposition 3 19 holds also in this setting so that Theorem 3 20 can be proved as well However if Assumption 3 24 holds it is better not to follow the general theory as developed before but to restart from scratch indeed in this situation much stronger statements hold also at the level of discrete solutions which can be proved by a direct use of Assumption 3 24 We collect the main results achievable in this setting in the following theorem Theorem 3 25 Gradient Flows for compatible E and d EVI Assume that X E satisfy As sumption 3 24 Then the following hold e For every x D E and 0 lt T lt 1 A there exists a unique discrete solution x7 as in Definition 3 7 Let x D E and x1 any family of discrete solutions starting from it Then x7 converge locally uniform
20. AMBROSIO B KIRCHHEIM AND A PRATELLI Existence of optimal transport maps for crystalline norms Duke Mathematical Journal 125 2004 pp 207 241 L AMBROSIO AND S RIGOT Optimal mass transportation in the Heisenberg group J Funct Anal 208 2004 pp 261 301 K BACHER AND K T STURM Localization and tensorization properties of the curvature dimension condition for metric measure spaces J Funct Anal 259 2010 pp 28 56 J D BENAMOU AND Y BRENIER A numerical method for the optimal time continuous mass transport problem and related problems in Monge Amp re equation applications to geometry and optimization Deerfield Beach FL 1997 vol 226 of Contemp Math Amer Math Soc Providence RI 1999 pp 1 11 P BERNARD AND B BUFFONI Optimal mass transportation and Mather theory J Eur Math Soc JEMS 9 2007 pp 85 127 M BERNOT V CASELLES AND J M MOREL The structure of branched transportation networks Calc Var Partial Differential Equations 32 2008 pp 279 317 S BIANCHINI AND A BRANCOLINI Estimates on path functionals over Wasserstein spaces SIAM J Math Anal 42 2010 pp 1179 1217 A BRANCOLINI G BUTTAZZO AND F SANTAMBROGIO Path functionals over Wasserstein spaces J Eur Math Soc JEMS 8 2006 pp 415 434 124 17 18 19 20 21 22 23 24 31 32 33 L BRASCO G BUTTAZZO AND F SANTAMBROGIO A benamou
21. Anal 29 1998 pp 1 17 electronic N JUILLET On displacement interpolation of measures involved in brenier s theorem ac cepted paper Proc of the AMS 2011 L V KANTOROVICH On an effective method of solving certain classes of extremal problems Dokl Akad Nauk USSR 28 1940 pp 212 215 On the translocation of masses Dokl Akad Nauk USSR 37 1942 pp 199 201 English translation in J Math Sci 133 4 2006 1381D1382 L V KANTOROVICH AND G S RUBINSHTEIN On a space of totally additive functions Vestn Leningrad Univ 13 7 1958 pp 52 59 M KNOTT AND C S SMITH On the optimal mapping of distributions J Optim Theory Appl 43 1984 pp 39 49 K KUWADA N GIGLI AND S I OHTA Heat flow on alexandrov spaces preprint 2010 S LISINI Characterization of absolutely continuous curves in Wasserstein spaces Calc Var Partial Differential Equations 28 2007 pp 85 120 G LOEPER On the regularity of solutions of optimal transportation problems Acta Math 202 2009 pp 241 283 J LOTT Some geometric calculations on Wasserstein space Comm Math Phys 277 2008 pp 423 437 J LOTT AND C VILLANI Weak curvature conditions and functional inequalities J Funct Anal 2007 pp 311 333 J LOTT AND C VILLANI Ricci curvature for metric measure spaces via optimal transport Ann of Math 2 169 2009 pp 903 991 126 59 60 62 63
22. Figalli and L Rifford 39 e cost functions induced by sub Riemannian Lagrangians A Agrachev and P Lee 1 e Wiener spaces E H D Feyel A S st nel 36 Here E is a Banach space y Z E is Gaussian and H is its Cameron Martin space namely H hE E m yy lt V In this case J x uli c z y 2 00 otherwise ifr ycH The issue of regularity of optimal maps would nowadays require a lecture note in its own A rough statement that one should have in mind is that it is rare to have regular even just continuous optimal transport maps The key Theorem 1 27 is due to L Caffarelli 22 21 23 Example 1 36 is due to G Loeper 55 For the general case of cost squared distance on a com pact Riemannian manifold it turns out that continuity of optimal maps between two measures with smooth and strictly positive density is strictly related to the positivity of the so called Ma Trudinger Wang tensor 59 an object defined taking fourth order derivatives of the distance function The 23 understanding of the structure of this tensor has been a very active research area in the last years with contributions coming from X N Ma N Trudinger X J Wang C Villani P Delanoe R McCann A Figalli L Rifford H Y Kim and others A topic which we didn t discuss at all is the original formulation of the transport problem of Monge the case c z y x y on R2 The situation in this c
23. Now pick ys as in the hypothesis and define ju e i The equalities n fun eQo duQ f ey du x du y D pdu eor duly X x valid for any o Cy X show that m f n dula and therefore by 7 21 we have zu m Pt T8m B All these arguments can be repeated symmetrically with 1 t in place of t because the push forward of p via the map which takes y and gives the geodesic t gt y1 is m itself thus we obtain m m j 2Nm tNm B 1 t m B B mB vt 0 1 Ht S min To conclude it is sufficient to prove that u is concentrated on 2B for all t 0 1 But this is obvious as u is concentrated on B B and a geodesic whose endpoints lie on B cannot leave 2B As we said we will use this lemma together with the doubling property which is a consequence of the Bishop Gromov inequality to prove a local Poincar inequality For simplicity we stick to the case of Lipschitz functions and their local Lipschitz constant although everything could be equivalently stated in terms of generic Borel functions and their upper gradients For f X R Lipschitz the local Lipschitz constant V f X R is defined as 2g e fool 121 For any ball B such that m B gt 0 the number f p is the average value of f on B 1 f p zu m Proposition 7 20 Local Poincar inequality Assume that X d m is a non branching C D 0 N space Then for every ball B such t
24. On the bad side the entropy E is not geodesically convex in M2 Q W ba and this implies that it is not clear whether the strong properties of Gradient Flows w r t W as described in Section 3 3 Theorem 3 35 and Proposition 3 38 are satisfied also in this setting In particular it is not clear whether there is contractivity of the distance or not Open Problem 5 7 Let p p two solutions of the Heat equation with Dirichlet boundary condition pi e 1 in OQ for every t gt 0 i 1 2 Prove or disprove that Wbz pl p2 lt Whal pj Vt s The question is open also for convex and smooth open sets 5 4 Bibliographical notes The connection of branched transport and transport problem as discussed in Section 5 1 was first pointed out by Q Xia in 81 An equivalent model was proposed by F Maddalena J M Morel and S Solimini in 61 In 81 60 and 15 the existence of an optimal branched transport Theorem 5 2 was also provided Later this result has been extended in several directions see for instance the works A Brancolini G Buttazzo and F Santambrogio 16 and Bianchini Brancolini 15 The interior regularity result Theorem 5 3 has been proved By Q Xia in 82 and M Bernot V Caselles and J M Morel in 14 Also we remark that L Brasco G Buttazzo and F Santambrogio proved a kind of Benamou Brenier formula for branched transport in 17 The content of Section 5 2 comes from J Dolbeault B Nazaret
25. Then as in the proof of iii i of Theorem 1 13 we have J dandas 1 pl o y dyle y J p z du z n g y dv y and f cdy R Thus y L y and y L v which shows that p y is an admissible couple in the dual problem and gives the thesis Remark 1 18 Notice that a statement stronger than the one of Remark 1 15 holds namely under the assumptions of Theorems 1 13 and 1 17 for any c concave couple of functions y maximizing the dual problem and any optimal plan it holds supp y C Oy Indeed we already know that for some c concave o we have o L u y L v and supp y C 0 o for any optimal y Now pick another maximizing couple 4 Y for the dual problem 1 16 and notice that G x y clx y for any x y implies X g and therefore 6 is a maximizing couple as well The fact that L v follows as in the proof of Theorem 1 17 Conclude noticing that for any optimal plan it holds eau 6m otn f cera o ee Waray 2 f dz dy gt iis f ged so that the inequality must be an equality E 13 Definition 1 19 Kantorovich potential A c concave function q such that p pt is a maximizing pair for the dual problem 1 16 is called a c concave Kantorovich potential or simply Kantorovich potential for the couple u v A c convex function y is called c convex Kantorovich potential if p is a c concave Kantorovich potential Observe that c
26. and Uoo z zlog z Then given a metric measure space X d m we define the functionals n A X gt RU 00 by En Hu u m where amp is given by formula 7 6 with u wy similarly for amp The definitions of weak Ricci curvature bounds are the following Definition 7 6 Curvature gt K and no bound on dimension C D K oo We say that a metric measure space X d m has Ricci curvature bounded from below by K R provided the functional bx P X RU oo is K geodesically convex on 73 X W2 In this case we say that X d m satisfies the curvature dimension condition CD K oo or that X d m is a CD K oo space Definition 7 7 Curvature gt 0 and dimension lt N CD 0 N We say that a metric measure space X d m has nonnegative Ricci curvature and dimension bounded from above by N provided the functionals n P X gt RU 00 are geodesically convex on 2 X W2 for every N gt N In this case we say that X d m satisfies the curvature dimension condition C D 0 N or that X d m is a C D 0 N space Note that N gt 1 is not necessarily an integer Remark 7 8 Notice that geodesic convexity is required on Z supp mx and not on 22 X This makes no difference for what concerns C D K oo spaces as amp is 00 on measures having a singular part w r t m but is important for the case of CD 0 N spaces as the functional y has only real values and r
27. lt c N B ax is compact for any c R r 0 zcX What we want to prove is that for X E satisfying these assumptions there is existence of Gradient Flows in the formulation EDE Definition 3 4 Our first goal is to show that in this setting it is possible to recover the results of the previous section We start claiming that it holds cas EG EG Au yy V E x sup disi 5 4 3 21 59 so that the lim in the definition of the slope can be replaced by a sup Indeed we know that ze EQ ja a 28 E diea VE x lim I y d x y yzc d x y 2 x N To prove the opposite inequality fix y x and a constant speed geodesic y connecting x to y for which 3 20 holds Then observe that IVE 2 gt Tim 9w _ im E x n d x y tLo d x y G29 z E Ely _ E a Ely t 2 p es T Cete Using this representation formula we can show that all the assumptions 3 8 and 3 13 hold Proposition 3 18 Suppose that Assumption 3 17 holds Then Assumptions 3 8 and 3 13 hold as well Sketch of the Proof From the A geodesic convexity and the lower semicontinuity assumption it is possible to deduce we omit the details that E has at most quadratic decay at infinity i e there exists T X a b gt 0 such that E x gt a bd z z A d z z Vr c X Therefore from the lower semicontinuity again and the bounded compactness of the sublevels of E we immediately get that the minimiza
28. lt lim nto h nto In F E a 4 7 lt Tp E60 EO gc oth lt pea hto d zi Lt4 n nto valid for any t 0 1 gives 3 23 Define the functions f g 0 1 R by f t Etat ft fls O sup EO 0 sAt s t Let D be the diameter of the compact set 2 J c o 1 use the fact that x is 1 Lipschitz formula 3 21 and the trivial inequality at lt a b b valid for any a b R to get E a1 E zs 4 A g t sup lt VE zi 3 P szt d zs x Therefore the thesis will be proved if we show that gern gt f s f t i g r dr Vt s 3 24 t Fix M gt 0 and define f min f M Now fix e gt 0 pick a smooth mollifier pe IR R with support in e and define fM g e 1 R by f2 t f pelt M t M gy a2 sup f iz s sAt s t Since f is smooth and gM gt f2 it holds e OIS f rer 3 25 t From the trivial bound f h lt f h we get JG t 7 f s 7 pe r dr JG t 7 f s 7 pe r dr ge t sup lt sup S Ss ls t ls t r f s r sup ET prar lt J st oar g pe t 3 26 Thus the family of functions g is dominated in L 0 1 From 3 25 and 3 26 it follows that the family of functions f7 uniformly converge to some function f on 0 1 as 0 for which it holds f s OL lt iar 61 We
29. the optimal map T can be written as x gt exp V w x for some c concave function y M R 19 Proof ii gt i and the last statement Pick v Y M and observe that since d 2 is uniformly bounded condition 1 4 surely holds Thus from Theorem 1 13 and Remark 1 15 we get that any optimal plan y Opt j1 v must be concentrated on the c superdifferential of a c concave function q By Proposition 1 30 we know that y is semiconcave and thus differentiable jj a e by our as sumption on u Therefore x gt T x exp V q x is well defined ji a e and its graph must be of full measure for any y Opt u v This means that y is unique and induced by T i gt ii Argue by contradiction and assume that there exists a semiconcave function f whose set of points of non differentiability has positive u measure Use Lemma 1 34 below to find gt 0 such that y ef is c concave and satisfies v O x if and only exp v 0 x Then conclude the proof as in Theorem 1 26 Lemma 1 34 Let M be a smooth compact Riemannian manifold without boundary and M gt R semiconcave Then for gt Q sufficiently small the function ep is c concave and it holds v O ew x if and only exp v 8 Ey a Proof We start with the following claim there exists gt 0 such that for every zo M and every v O ro the function d a exp amp v paai ae Penk
30. y Vz c X 1 3 A direct consequence of the definition is that the c superdifferential of a c concave function is always a c cyclically monotone set indeed if x yi 0 q it holds ems yi Dole e y Doles Voi s diel c zi Yo i 4 for any permutation c of the indexes What is important to know is that actually under mild assumptions on c every c cyclically mono tone set can be obtained as the c superdifferential of a c concave function This result is part of the following important theorem Theorem 1 13 Fundamental theorem of optimal transport Assume that c X x Y Ris continuous and bounded from below and let u P X v P Y be such that c z y lt a x b y 1 4 for some a L p b L v Also let y Adm u v Then the following three are equivalent i the plan y is optimal ii the set supp y is c cyclically monotone iii there exists a c concave function q such that max 0 L u and supp y C 9 v Proof Observe that the inequality 1 4 together with J ets lt f atras adul f avt lt 00 vn atu implies that for any admissible plan Adm u v the function max c 0 is integrable This together with the bound from below on c gives that c L for any admissible plan i ii We argue by contradiction assume that the support of y is not c cyclically monotone Thus we can find N N zi yi i i v C supp and some permutation
31. 0 D x gt pele gt pile gt e vo exp 2 0 D x which gives v v and the thesis 41 Corollary 2 24 The intermediate transport maps are locally Lipschitz Let jj C P2 M a constant speed geodesic in Y2 M W2 Then for every t 0 1 and s 0 1 there exists only one optimal transport plan from u to us this transport plan is induced by a map and this map is locally Lipschitz Note clearly in a compact setting being locally Lipschitz means being Lipschitz We wrote locally because this is the regularity of transport maps in the non compact situation Proof Fix t 0 1 and without loss of generality let s 1 The fact that the optimal plan from is unique and induced by a map is known by Proposition 2 16 Now let v be the vector field defined on supp iz by v x Vy Vy we are using part iii of the above corollary with the same notation The fact that 7 is a c concave potential for the couple u 4o tells that the optimal transport map T satisfies T x at ila for j a e x Using Theorem 1 33 the fact that 7 is differentiable in supp u and taking into account the scaling properties of the cost we get that T may be written as T x exp v z Since the exponential map is C the fact that T is Lipschitz will follow if we show that the vector field v on supp j4 is when read in charts Lipschitz Thus passing to local coordinates and recalling that d
32. 02 9 On Pj 0 Proof The absolute continuity of P u follows from the fact that both u and P we are absolutely continuous Similarly the second formula in 6 26 follows immediately from the first one noticing that uy P u P uz yields fu SP u opi uz Thus we have only to prove the first equality in 6 26 To this aim let w be an arbitrary absolutely continuous vector field along u and observe that it holds d d d g Pint Wy Pelda Prater 3 z ut Pa e Ht d D 5 Ppa Gu Pye te EP p t Pye Ht Ht Since the left hand sides of these expression are equal the right hand sides are equal as well thus we get d D d D Ps ui m die ue we P ut dit m dil qn Ht d D Ppa te Pu Fiw BP Ht 9a Pu ut OF we ams Ov Py u Wt Ht pa so that the arbitrariness of w gives d D die ut din ut Ov Pu u4 and the conclusion follows from 6 25 Along the same lines the total derivative of AV uz w for given absolutely continuous vector fields uz w along the same regular curve u can be calculated The only thing the we must take care of is the fact that M is not defined on the whole ES so that we need to make some assump tions on uz w to be sure that A ue w is well defined and absolutely continuous Indeed 102 observe that from a purely formal point of view
33. 43 2007 pp 1 13 S T RACHEV AND L RUSCHENDORF Mass transportation problems Vol I Probability and its Applications Springer Verlag New York 1998 Theory R T ROCKAFELLAR Convex Analysis Princeton University Press Princeton 1970 L R SCHENDORF AND S T RACHEV A characterization of random variables with minimum L distance J Multivariate Anal 32 1990 pp 48 54 G SAVARE Gradient flows and diffusion semigroups in metric spaces under lower curvature bounds C R Math Acad Sci Paris 345 2007 pp 151 154 G SAVARE Gradient flows and evolution variational inequalities in metric spaces In prepa ration 2010 K T STURM On the geometry of metric measure spaces I Acta Math 196 2006 pp 65 131 On the geometry of metric measure spaces IT Acta Math 196 2006 pp 133 177 K T STURM AND M K VON RENESSE Transport inequalities gradient estimates entropy and Ricci curvature Comm Pure Appl Math 58 2005 pp 923 940 V N SUDAKOV Geometric problems in the theory of infinite dimensional probability distri butions Proc Steklov Inst Math 1979 pp i v 1 178 Cover to cover translation of Trudy Mat Inst Steklov 141 1976 N S TRUDINGER AND X J WANG On the Monge mass transfer problem Calc Var Partial Differential Equations 13 2001 pp 19 31 C VILLANI Topics in optimal transportation vol 58 of Graduate Studies in Mathematics American Mat
34. 64 X N MA N S TRUDINGER AND X J WANG Regularity of potential functions of the optimal transportation problem Arch Ration Mech Anal 177 2005 pp 151 183 F MADDALENA AND S SOLIMINI Transport distances and irrigation models J Convex Anal 16 2009 pp 121 152 F MADDALENA S SOLIMINI AND J M MOREL A variational model of irrigation pat terns Interfaces Free Bound 5 2003 pp 391 415 R J MCCANN A convexity theory for interacting gases and equilibrium crystals ProQuest LLC Ann Arbor MI 1994 Thesis Ph D Princeton University R J MCCANN A convexity principle for interacting gases Adv Math 128 1997 pp 153 179 Polar factorization of maps on riemannian manifolds Geometric and Functional Anal ysis 11 2001 pp 589 608 V D MILMAN AND G SCHECHTMAN Asymptotic theory of finite dimensional normed spaces vol 1200 of Lecture Notes in Mathematics Springer Verlag Berlin 1986 With an appendix by M Gromov G MONGE M moire sur la th orie des d eblais et des remblais Histoire de l Acad mie Royale des Sciences de Paris 1781 pp 666 704 F OTTO The geometry of dissipative evolution equations the porous medium equation Comm Partial Differential Equations 26 2001 pp 101 174 A PRATELLI On the equality between Monge s infimum and Kantorovich s minimum in opti mal mass transportation Annales de l Institut Henri Poincare B Probability and Statistics
35. A1 is defined as SUD oco d xo z1 if K lt 0 and as inf z5cao d zo 1 if 1 1 K gt Q eve iin ii If X d m is a CD 0 N space it holds m Ao Ai gt 1 t m Ao tm Ai N 7 14 Proof We start with i Suppose that Ao A are open satisfying rn Ao m A1 gt 0 Define the measures ju m Ai m for i 0 1 and find a constant speed geodesic u C A X such that amp ue lt 1 06 o toli 5 0 t W2 uo m Arguing as in the proof of the previous proposition it is immediate to see that u is concentrated on Ao Ai for any t 0 1 In particular m Ao A1 gt 0 otherwise amp 55 j14 would be oo and the convexity inequality would fail Now let v m Ao 4il m an application of Jensen inequality shows Ao Ait that amp ps gt amp 14 thus we have K amp 4 lt 1 1 Seo Ho t C 1 W3 uo m Notice that for a general u of the form m A m It holds Eo0 Ht log m A log m A and conclude using the trivial inequality inf d zo z1 W3 uo pi lt sup d zo 21 z9 Apo z9 AQ wy EA 1 EA 117 The case of Ao A1 compact now follows by a simple approximation argument by considering the e neighborhood A x d x Aj lt i 0 1 noticing that Ao A1 Neso A Ale for any t 0 1 and that m A gt 0 because A C supp m i 0 1 Part ii follows along the same lines taking int
36. B E Yn By Kolmogorov s theorem we get the existence of a measure 3 A X such that uS ND B for every n N The inequality Yo la ron Ye mM rox awa Hi Mit lt n 1 shows that n gt XN a X is a Cauchy sequence in L 3 X i e the space of maps f XN X such that T d f DEPT lt oo for some and thus every x9 X endowed with the distance d f g um y dB y Since X is complete L 8 X is complete as well and therefore there exists a d map 7 of the Cauchy sequence z Define u T B and notice that by 2 1 we have Wi us lt 48 0 so that ju is the limit of the Cauchy sequence un in 275 X W2 The fact that W X W2 is separable follows from 2 4 by considering the set of finite convex combinations of Dirac masses centered at points in a dense countable set in X with rational coefficients The last claim now follows Remark 2 8 On compactness properties of 2 X An immediate consequence of the above theorem is the fact that if X is compact then Z X W2 is compact as well indeed in this case the equivalence 2 4 tells that convergence in 5 X is equivalent to weak convergence It is also interesting to notice that if X is unbounded then 4 X is not locally compact Actu ally for any measure u 2 X and any r gt 0 the closed ball of radius r around ji is not compact To see this fix 7 X and find a sequence xn C X such that d n z o
37. DRE CEP et dC Dre ia ee PAS C P u N 1 Cul M s sillsi sina C lulls svp 6 15 1 2 With this result we can prove existence of the limit of P u as P varies in 9p 95 Theorem 6 12 For any u T M there exists the limit of P u as P varies in 9p Proof We have to prove that given gt 0 there exists a partition P such that P u Q u lt jule VQ gt P 6 16 In order to do so it is sufficient to find 0 to lt t lt ty 1 such that yj t 41 t lt e C and repeatedly apply equation 6 15 to all partitions induced by Q in the intervals t ti 1 Now for s lt t we can introduce the maps T T4 M Ty M which associate to the vector u T4 M the limit of the process just described taking into account partitions of s t instead of those of 0 1 Theorem 6 13 For any t lt t2 t3 0 1 it holds JT qe 6 17 Moreover for any u Ty M the curve t u T u T4 M is the parallel transport of u along Proof For the group property consider those partitions of t3 which contain t2 and pass to the limit first on t t2 and then on t2 t3 To prove the second part of the statement we prove first that u is absolutely continuous To see this pass to the limit in 6 15 with s to and sy ti U Ut to get Pe uto m Ut lt CO ue ta m to lt C ul ts m to 6 18 so that from 6 12a we ge
38. IR7 The concepts that now enter into play are Covariant Derivative Parallel Transport and Curvature To some extent the situation is similar to the one we discussed in Subsection 2 3 2 concerning the first order structure the metric space Z5 R7 W2 is not a Riemannian manifold but if we are careful in giving definitions and in the regularity requirements of the objects involved we will be able to perform calculations very similar to those valid in a genuine Riemannian context Again we are restricting the analysis to the Euclidean case only for simplicity all of what comes next can be generalized to the analysis over 2 for a generic Riemannian manifold M On a typical course of basic Riemannian geometry one of the first concepts introduced is that of Levi Civita connection which identifies the only natural natural here means compatible with the Riemannian structure way of differentiating vector fields on the manifold It would therefore be natural to set up our discussion on the second order analysis on 2 IR by giving the definition of Levi Civita connection in this setting However this cannot be done The reason is that we don t have a notion of smoothness for vector fields therefore not only we don t know how to covariantly differentiate vector fields but we don t know either which are the vector fields regular enough to be differentiated In a purely Riemannian setting this problem does not appear as a Riemannian ma
39. The first one is the most important open problem on the subject is the property of being a CD K N space a local notion That is suppose we have a metric measure space X d m and a finite open cover 9 such that Q d m Q m is a CD K N space for every i Can we deduce that X d m is a CD K N space as well One would like the answer to be affirmative as any notion of curvature should be local For K 0 or N oo this is actually the case at least under some technical assumptions The general case is still open and up to now we only know that the conjecture 30 34 in 80 is false being disproved by Deng and Sturm in 32 see also 11 The second and final thing we want to mention is the case of Finsler manifolds which are differentiable manifolds endowed with a norm possibly not coming from an inner product on each tangent space which varies smoothly with the base point A simple example of Finsler manifolds is the space IR where is any norm It turns out that for any choice of the norm the space R4 4 is a CD 0 N space Various experts have different opinion about this fact namely there is no agreement on the community concerning whether one really wants or not Finsler geometries to be included in the class of spaces with Ricci curvature bounded below In any case 123 it is interesting to know whether there exists a different more restrictive notion of Ricci curvature boun
40. The question now is which information is carried on v w from the properties of the polar factoriza tion At the level of v from the fact that V x Vqz 0 we deduce V x v 0 which means that v is the gradient of some function p On the other hand the fact that s is measure preserving implies that w satisfies V wxg 0 in the sense of distributions indeed for any smooth f R gt R it holds d d 0 Eeo f dona Elo fos duo J vro Then from the identity Vi o se Id e Vp w o we can conclude that u Vp w We now turn to the case X Y M with M smooth Riemannian manifold and c x y d x y 2 d being the Riemannian distance on M For simplicity we will assume that M is compact and with no boundary but everything holds in more general situations The underlying ideas of the foregoing discussion are very similar to the ones of the case X Y R4 the main difference being that there is no more the correspondence given by Proposition 1 21 between c concave functions and convex functions as in the Euclidean case Recall however that the concepts of semiconvexity i e second derivatives bounded from below and semiconcavity make sense also on manifolds since these properties can be read locally and changes of coordinates are smooth In the next proposition we will use the fact that on a compact and smooth Riemannian manifold the functions x d xz y are uniformly Lipschitz and uniformly semiconcave in y
41. X then y is optimal as well Proof Let un Un C A X be two sequences of measures narrowly converging to u v P X respectively Pick y Opt Hn Vn and use Remark 1 4 and Prokhorov theorem to get that Yn admits a subsequence not relabeled narrowly converging to some y Z X It is clear that TY p and 2 v thus it holds Wis lt f dvi a lt lim f eye Ga lim WIE vn n gt Co n oo Now we pass to the second part of the statement that is we need to prove that with the same notation just used it holds y Opt u v Choose a x b x d x xo for some zo X in the bound 1 4 and observe that since u v P2 X Theorem 1 13 applies and thus optimality is equivalent to c cyclical monotonicity of the support The same for the plans y Fix N N and pick z y supp y i 1 N From the fact that 7 narrowly converges to y it is not hard to infer the existence of z y supp 7 such that lim alai z a gt 30 Yis la N n gt o0 Thus the conclusion follows from the c cyclical monotonicity of supp 7 and the continuity of the cost function Now we are going to prove that 2 X W2 is a Polish space In order to enable some construc tions we will use a version of Kolmogorov s theorem which we recall without proof see e g 31 51 28 Theorem 2 6 Kolmogorov Let X be a Polish space and jy A X n N be a seq
42. Y RU 00 such that yet Observe that X RU oo is c concave if and only if y y This is a consequence of the fact that for any function Y gt RU 00 it holds ij w r indeed C4 C4 C4 infs inf u ec ele V x De na 2 9 2 9 c y ely and choosing x we get gt w while choosing y y we get q lt wt Similarly for functions on Y and for the c convexity Definition 1 10 c superdifferential and c subdifferential Let p X RU oo beac concave function The c superdifferential 0 ip C X x Y is defined as Op oy e XxY ola oe y el2 y The c superdifferential 0 x at x X is the set of y Y such that x y O p A symmetric definition is given for c concave functions v Y RU oo The definition of c subdifferential O of a c convex function y X 00 is analogous 9 ei Gy e X x Y glx y cn y Analogous definitions hold for c concave and c convex functions on Y Remark 1 11 The base case c z y z y Let X Y R and c z y x y Then a direct application of the definitions show that e asetis c cyclically monotone if and only if it is cyclically monotone e a function is c convex resp c concave if and only if it is convex and lower semicontinuous resp concave and upper semicontinuous e the c subdifferential of the c convex resp c superdifferenti
43. and G Savar 33 and 26 of J Carrillo S Lisini G Savar and D Slepcev Section 5 3 is taken from a work of the second author and A Figalli 37 6 More on the structure of 25 M W2 The aim of this Chapter is to give a comprehensive description of the structure of the Riemannian manifold 45 R7 W2 thus the content of this part of the work is the natural continuation of what we discussed in Subsection 2 3 2 For the sake of simplicity we are going to stick to the Wasserstein space on R4 but the reader should keep in mind that the discussions here can be generalized with only little effort to the Wasserstein space built over a Riemannian manifold 6 1 Duality between the Wasserstein and the Arnold Manifolds The content of this section is purely formal and directly comes from the seminal paper of Otto 67 We won t even try to provide a rigorous background for the discussion we will do here as we believe that dealing with the technical problems would lead the reader far from the geometric intuition Also we will not use the results presented here later on we just think that these concepts are worth of mention Thus for the purpose of this section just think that each measure is absolutely continuous with smooth density that each L function is C and so on Let us recall the definition of Riemannian submersion Let M N be Riemannian manifolds and let f M N a smooth map f is a submersion prov
44. as OT F x z because O F x is closed and convex For convex functions a natural generalization of Definition 3 1 of Gradient Flow is possible we say that x is a Gradient Flow for F starting from x H if it is a locally absolutely continuous curve in 0 00 such that z 0 F x4 for a e t gt 0 lima T 3 2 t0 We now summarize without proof the main existence and uniqueness results in this context Theorem 3 1 Gradient Flows in Hilbert spaces Brezis Pazy If F H RU 00 is convex and lower semicontinuous then the following statements hold i Existence and uniqueness for all z D F 3 2 has a unique solution x ii Minimal selection and Regularizing effects It holds lir V F x for every t gt 0 that is the right derivative of x always exists and realizes the element of minimal norm in O F z4 and SEP o a t V F x t for every t gt 0 Also 1 F a lt inf 4F zv z 22 s int Lr ze ai 1 2 2 12 EEDS int Ive le f iii Energy Dissipation Equality z VF z L 0 00 F xz ACioc 0 00 and the following Energy Dissipation Equality holds 1 f 1 ff Fx Flea 5 IVFGL dr f a dr 0 lt t lt s lt o t t iv Evolution Variational Inequality and contraction x is the unique solution of the system of differential inequalities id Aa 3g ul Fle 218 w S FQ Vy H a e t 50 among all loc
45. by d x y dy inf eog x if y exp amp ov for some x M v OT y z and v y oo otherwise By definition we have 2 y d z y eps lt EY Lp vey eM and the claim proved ensures that if yo exp ovo for zo M vo O v xo the inf in the definition of yo is realized at x xo and thus d eds Yo 7 t yo o xo Hence egy w and therefore is c concave Along the same lines one can easily see that for y exp amp 00 q x it holds d s y 2 eop y amp o x ie y coy zo Thus we have 0 9 D exp t ep Since the other inclusion has been proved in Proposition 1 30 the proof is finished Remark 1 35 With the same notation of Theorem 1 33 recall that we know that the c concave func tion y whose c superdifferential contains the graph of any optimal plan from u to v is differentiable p a e for regular jz Fix xo such that Vo zo exists let yo exp Vp Xo 0 xo and observe that from d x yo d Xo yo 2 on XE gt p x e xo we deduce that Vy zo belongs to the subdifferential of d yo 2 at zo Since we know that d yo 2 always have non empty superdifferential we deduce that it must be differentiable at xo In particular there exists only one geodesic connecting xo to yo Therefore if u is regular not only there exists a unique optimal transport map T but also for jj a e x
46. covariant derivative of the vector field u Vy along the vector field jj Vez However in order to give a precise meaning to the above formula we should be sure at least that the derivatives we are taking exist Such an approach is possible but heavy indeed consider that we should define what are Ct and C vector fields and in doing so we cannot just consider derivatives along curves Indeed we would need to be sure that the partial derivatives have the right symmetries otherwise there won t be those cancellations which let the above operator be a tensor Instead we adopt the following strategy e First we calculate the curvature tensor for some very specific kind of vector fields for which we are able to do and justify the calculations Specifically we will consider vector fields of the kind u Vy where the function y C M does not depend on the measure p e Then we prove that the object found is actually a tensor i e that its value depends only on the ps a e value of the considered vector fields and not on the fact that we obtained the formula assuming that the functions y s were independent on the measure 104 e Finally we discuss the minimal regularity requirements for the object found to be well defined Pick o C R and observe that a curve of the kind t Id tVy 4 is a regular geodesic on an interval T T for T sufficiently small Remark 1 22 and Proposition 6 3 It is then immedia
47. d Ur y T d sz g d 27 5 y F lt EQ B Adding up these two inequalities and observing that E x7 2 lt du 27 we obtain d 27 y n d z y T lt 2 E y E z7 65 2 On the other hand equation 3 33 with x y and y x reads as d yt 27 P 17 T lt 2 E a2 E y1 Adding up these last two inequalities we get d ce GET yr 2 d z y lt 2 E y Nu E y 3 34 T Discrete estimates Pick t nt lt mr s write inequality 3 33 for x 2 5 n m 1 and add everything up to get PL dy Xr ox aes BH acc UB x dGLy sE 3 Bei Ga Similarly pick t nt write inequality 3 34 for z atl add everything up to get and y y for i 0 n 1 and Now let y z to get d 27 aT lt 2r E E a1 lt 2rE 3 36 having used the fact that E gt 0 Conclusion of passage to the limit Putting 7 2 instead of 7 in 3 36 we get P 1 a7 lt E g therefore 2 a7 X1 7 lt 7 22 2 jEG Vn lt meN which tells that n gt ofl is a Cauchy sequence for any t gt 0 Also choosing n 0 and letting m oo we get the error estimate 3 32 We pass to the EVI Letting 7 0 in 3 35 it is immediate to verify that we get P zy d zs y sonf d x Ely f E a 2 s t s t iid mdi r which is precisely the EVI 3 30 written in integral for
48. for a 0 this problem reduces to the classical Steiner problem while for a 1 it reduces to the classical optimal transport problem for cost distance It is not hard to show the existence of a minimizer for this problem What is interesting is that a continuous formulation is possible as well which allows to discuss the minimization problem for general initial and final measure in Z IR Definition 5 1 Admissible continuous dynamical transfer Let u v A IR An admissible continuous dynamical transfer from p to v is given by a countably 44 rectifiable set T an orientation on it T T gt S4 anda weight function w T 0 oo such that the R valued measure Jr z w defined by Itc wr 80 satisfies V JD rw V hp which is the natural generalization of the Kirchoff rule Given a 0 1 the cost function associated to T 7 w is defined as Ps des I wd r Theorem 5 2 Existence Let jj v IR with compact support Then for all 0 1 there exists a minimizer of the cost in the set of admissible continuous dynamical transfers connecting p to v If w and v m the minimal cost is finite if and only if a gt 1 1 d The fact that 1 1 d is a limit value to get a finite cost can be heuristically understood by the following calculation Suppose we want to move a Delta mass 6 into the Lebesgue measure on a unit cube whose center is z Then the first thing one wants to
49. geodesic y supp 4 it holds y y Y Since Y is totally convex this implies that Y for any t and any y supp p ie p ei P Y Therefore u is a geodesic connecting 4o to 41 in Y d Conclude noticing that for any jj 4 Y it holds du du v f 3 du a d l d diy g a my log m og m j 8 a du NT 7 ut du EZ f rmn a where we wrote my for mY mj ii Fix a gt 0 and let d ad and W2 be the Wasserstein distance on A X induced by the 116 distance d It is clear that a plan y Adm 1 is optimal for the distance W2 if and only if it is optimal for W5 thus W2 aW Now pick uo p A X and let uj C A X be a constant speed geodesic connecting them such that Be lt 1 1 8 uo t8 u1 5 10 t W2 uo m then it holds Boalt lt 1 8 Ho tE u1 zt OVE i and the proof is complete A similar argument applies for the case CD 0 N For Ao A1 C X we define Ao A1 C X as Ao Aile to yis a constant speed geodesic such that 0 Ao y 1 Ay Observe that if Ao Ai are open resp compact Ao A1 is open resp compact hence Borel Proposition 7 14 Brunn Minkowski Let X d m be a metric measure space and Ao Ay C supp m compact subsets Then i if X d m isa CD K oo space it holds log m Ao A4i e gt 1 log m 4o tlog m 41 1 t D 4o 41 7 13 where D y Ao
50. geodesics Let u be a constant speed geodesic on 0 1 Then its re striction to any interval E 1 with gt 0 is regular In general however the whole curve qu may be not regular on 0 1 Proof To prove that p may be not regular just consider the case of jj dz and ij dy dy it is immediate to verify that for the velocity vector field v it holds Lip v t For the other part recall from Remark 2 25 see also Proposition 2 16 that for t 0 1 and s 0 1 there exists a unique optimal map 7 from y to jus It is immediate to verify from formula 2 11 that these maps satisfy HUM Lu cM deed ae Tod s t s t Thus thanks to Proposition 2 32 we have that v is given by T Id Id T v lim 6 5 sot s t t Now recall that Remark 2 25 gives Lip T lt 1 t to obtain 2 t t 1 1 Lip w zt 1 1 1 2 Thus Lip v is integrable on any interval of the kind e 1 e gt 0 Definition 6 4 Vector fields along a curve A vector field along a curve u is a Borel map t a gt u x such that u L for a e t It will be denoted by ut Observe that we are considering also non tangent vector fields that is we are not requiring uz Tanp Z5 R for a e t To define the time smoothness of a vector field u defined along a regular curve u we will make an essential use of the flow maps n
51. gradient flows npa a 49 3 2 The theory of Gradient Flows in a metric setting o oo llle 51 3 2 4 The framework x uode na we om eR a a a EL Rede 51 3 2 2 Generall s c functionals and EDI llle 55 3 2 3 The geodesically convex case EDE and regularizing effects 59 3 2 4 The compatibility of Energy and distance EVI and error estimates 63 3 3 Applications to the Wasserstein case len 67 3 3 4 Elements of subdifferential calculus in P3 R W2 69 3 3 2 Threeclassical functionals sss s ee 70 34 Bibhographucalnotes 2 24 Roo RR Ro RR RUE RR BRE x ERES 76 l_ambrosio sns it tnicola gigli unice fr 4 Geometric and functional inequalities 77 4 1 Brunn Minkowskiinequality 2 2 es TI 4 2 Isoperimetric inequality ee 78 4 3 Sobolev Inequality 0255 2 85 ces be eG See ee RO eee eee Y 78 4 4 Bibliographical notes ee 79 5 Variants of the Wasserstein distance 80 5 1 Branched optimal transportation eee 80 5 4 Dafferentactionfunctiondl e lt p sepeser 5 be ErkyXY s 81 5 3 An extension to measures with unequal mass 0 82 5 4 Bibliographical notes es 84 6 More on the structure of Z5 M W2 84 6 1 Duality between the Wasserstein and the Arnold Manifolds 84 6 2 Onthe notion of tangent Space eA 87 6 3 Second order calculus si s e ss oo ke SR ee we Ee a 88 6 4 Bibliographi
52. gua Vo Id zy for appropriate convex functions o which therefore satisfy Vy o Vo Id p a e Define s VQ o S Then syfie uo and thus s S Q Also S Vy o s which proves the existence of the polar factorization The identity le gdy 5 9 J j SP dun J IVGoS SPdun V tad min J le ydo y eAdn n v 17 shows inequality lt in 1 7 and the uniqueness of the optimal plan ensures that s is the unique minimizer To conclude we need to show uniqueness of the polar factorization Assume that S V oF is another factorization and notice that VP Ho VS 4a v Thus the map Vg is a transport map from ugo to v and is the gradient of a convex function By Proposition 1 21 and Theorem 1 13 we deduce that V is the optimal map Hence VY V o and the proof is achieved Remark 1 29 Polar factorization vs Helmholtz decomposition The classical Helmoltz decom position of vector fields can be seen as a linearized version of the polar factorization result which therefore can be though as a generalization of the former To see why assume that Q and all the objects considered are smooth the arguments hereafter are just formal Let u Q IR be a vector field and apply the polar factorization to the map S Id eu with e small Then we have S Vy o s and both Vy and s will be perturbation of the identity so that Vg Id ev o e Se Id ew o
53. i m is a geodesic in P2 M W2 ii there exists a plan y 9 T M T M being the tangent bundle of M such that J Pese iG i Exp t P pt 2 17 Exp t TM M being defined by x v gt exp tv Also for any u v Pa M such that p is a regular measure Definition 1 32 the geodesic con necting u to v is unique Notice that we cannot substitute the first equation in 2 17 with exp s y Opt uo p1 be cause this latter condition is strictly weaker it may be that the curve t gt exp tv is not a globally minimizing geodesic from x to exp v for some z v supp 7 40 Proof The implication i ii follows directly from Theorem 2 10 by taking into account the fact that t is a constant speed geodesic on M implies that for some r v TM it holds i exp tv and in this case d o 1 v For the converse implication just observe that from the second equation in 2 17 we have W3 ue us lt d exp tv exp sv d z v lt t s J Ivf dv a v 5 WE uo m having used the first equation in 2 17 in the last step To prove the last claim just recall that by Remark 1 35 we know that for j4 a e x there exists a unique geodesic connecting x to T x T being the optimal transport map Hence the conclusion follows from ii of Theorem 2 10 Now we discuss the regularity properties of Kantorovich potentials which follows from Theorem 2 18 Cor
54. if M has non negative Ricci curvature and dim M N Sketch of the Proof We will give only a formal proof neglecting all the issues which arise due to the potential non regularity of the objects involved We start with i Assume that Ric v v gt K v for any v Pick a geodesic pm C 2 M and assume that p C for any t 0 1 By Theorem 1 33 we know that there exists a function p M R differentiable porm a e such that exp V q is the optimal transport map from pom to pim and p m exp tV pom Assume that y is C Then by Lemma 7 10 with u uso we know that d 2 42 2 qi Oo om Iv e Ric Vy V po dm gt K V po dm 114 Since f Vy podm W2 po pi the claim is proved The converse implication follows by an explicit construction if Ric v v lt K v for some x M and v T M then fore lt 6 1 define uo com p 2 co being the normalizing constant and juz T1 uo where T y exp t6Vy y and p C is such that Vg z v and V o z 0 Using Lemma 7 10 again and the hypothesis Ric v v lt K v it is not hard to prove that amp is not A geodesically convex along u We omit the details Now we turn to ii Let pm and o as in the first part of the argument above Assume that M has non negative Ricci curvature and that dim M lt N Observe that for u uy Lemma 7 10 gives 2 1i 1 Thao o J x pl N Ay pl ap _ V24 sRic Ve
55. in the figure Set also w v the rotation by 7 2 of v in Q and w 0 out of Q Notice that V wp 0 Set ju Id tv Mo0 and observe that for positive t the support Q of jj is made of 4 connected components each one the translation of one of the sets T and that uz xo 2 X Ho Hi V It is immediate to check that u is a geodesic in 0 00 so that from 6 3 we know that the restriction of u to any interval e 1 with gt 0 is regular Fix gt 0 and note that by construction the flow maps of u in e 1 are given by T t s Id sv o Id tv Vt s e 1 Now set ww w o T t 0 and notice that w is tangent at u because w is constant in the connected components of the support of u so we can define a C7 function to be affine on each connected component and with gradient given by w and then use the space between the components themselves to rearrange smoothly the function Since wi o T t t h wi we have Jw 0 and a fortiori D 0 Thus w is a parallel transport in e 1 Furthermore since V wyo 0 we have wo w Tan Z IR Therefore there is no way to extend w to a continuous tangent vector field on the whole 0 1 In particular there is no way to extend the parallel transport up to t 0 a Now we pass to the calculus of total and covariant derivatives Let 4 be a fixed regular curve and let v be its velocity vector field Start
56. is Ls c with respect to narrow convergence This is true because our assumptions on c guarantee that there exists an increasing sequence of functions cn X x Y R continuous an bounded such that c x y sup cs x y so that by monotone convergence it holds om f om Since by construction y f cn dy is narrowly continuous the proof is complete We will denote by Opt u v the set of optimal plans from p to v for the Kantorovich formulation of the transport problem i e the set of minimizers of Problem 1 2 More generally we will say that a plan is optimal if it is optimal between its own marginals Observe that with the notation Opt u v we are losing the reference to the cost function c which of course affects the set itself but the context will always clarify the cost we are referring to Once existence of optimal plans is proved a number of natural questions arise e are optimal plans unique e is there a simple way to check whether a given plan is optimal or not e do optimal plans have any natural regularity property In particular are they induced by maps e how far is the minimum of Problem 1 2 from the infimum of Problem 1 1 This latter question is important to understand whether we can really consider Problem 1 2 the re laxation of Problem 1 1 or not It is possible to prove that if c is continuous and pu is non atomic then inf Monge min Kantorovich 1 2 so that transporting with pla
57. j k 1 5 Wa Mian H i 1 2 Wo tj 2 Hkj27 i j 2 9 Therefore it holds ej 2m k 2 yH e Opt I jon Mk an s Vj k 10 neag 2 Also since the inequalities in 2 9 are equalities it is not hard to see that for jz a e y the points Jijzn 0 2 must lie along a geodesic and satisfy d y j2 y 41 2 d vyo 1 2 i 0 2 1 Hence y a e yis a constant speed geodesic and thus u Z Geod X Now suppose for a moment that u narrowly converges up to pass to a subsequence to some u 4 Geod X Then the continuity of the evaluation maps e yields that for any t 0 1 the 32 sequence n ey 4p narrowly converges to e and this together with the uniform bound 2 9 easily implies that u satisfies 2 7 Thus to conclude it is sufficient to show that some subsequence of 14 has a narrow limit We will prove this by showing that u 2 Geod X for every n N and that some subsequence is a Cauchy sequence in 2 Geod X W2 W2 being the Wasserstein distance built over Geod X endowed with the sup distance so that by Theorem 2 7 we conclude We know by Remark 1 4 Remark 2 3 and Theorem 2 7 that for every n N the set of plans o X 1 such that m a pia for i 0 2 is compact in Z3 X Therefore a diagonal argument tells that possibly passing to a subsequence not relabeled we may assume that for every n N the sequenc
58. jm Wis if amp u lt oo Too otherwise 119 The functional J is called Fisher information because its value on R4 4 is given by V 2 I p J Wel gea p and the object on the right hand side is called Fisher information on R It is possible to prove that a formula like the above one is writable and true on general CD K oo spaces see 7 but we won t discuss this topic Proposition 7 18 HWI inequality Let X d m be a metric measure space satisfying the condi tion C D K oo Then K In particular choosing v m it holds K E u lt Wo u m vI u Wz u m Vu e P X 7 19 Finally if K gt 0 the log Sobolev inequality with constant K holds I lt 2K Proof Clearly to prove 7 18 it is sufficient to deal with the case amp v l u lt oo Let u be a constant speed geodesic from jz to v such that bx 7 20 amp uc lt 1 0 0 tol A WA v Then from J T u gt lim jo amp u Soo ui Wa p ui we get the thesis Equation 7 20 now follows from 7 19 and the trivial inequality 1 ab xo i b 1 2 valid for any a b gt 0 The log Sobolev inequality is a notion of global Sobolev type inequality and it is known that it implies a global Poincar inequality we omit the proof of this fact When working on metric measure spaces however it is often important to have at disposal a local Poincar inequality see e g
59. m dae m d us x t d y T d zz y i 2rd z y 27d z y lt dc 2 dy m 2T Taking the limsup as y x we get the thesis 56 By Theorem 3 9 and Lemma 3 10 it is natural to introduce the following variational interpo lation in the Minimizing Movements scheme as opposed to the classical piecewise constant affine interpolations used in other contexts Definition 3 11 Variational interpolation Let X E be satisfying Assumption 3 8 x D E and 0 r T We define the map 0 00 3 t x in the following way e d 3 T nT e Tin 1 7 is chosen among the minimizers of 3 12 with amp replaced by x e x witht nt n 1 7 is chosen among the minimizers of 3 12 with x and T replaced by Tar and t nr respectively For a7 defined in this way we define the discrete speed Dsp 0 00 0 00 and the Discrete slope Dsl 0 00 0 00 by QF ys Lint tyr Dsp eRe t nr n 1s 3 15 d T T Dsl dlet thr t nt n 1 7 t nt Although the object Dsl does not look like a slope we chose this name because from 3 14 we know that V E z7 lt Dsl and because in the limiting process Dsl will produce the slope term in the EDI see the proof of Theorem 3 14 With this notation we have the following result Corollary 3 12 EDE for the discrete solutions Let X E be satisfying Assumption 3 8 T D E 0 lt T lt F and a7 defined
60. o T p 0 vector fields w vw of uoT dp 0 Vu s t V uo T p o vector fields w wo T7 Vy for some o CY R Now pick w Tanz Pf p let e C R be such that w o T7 V and observe that d Pf T tw di lio Id twoT 4 Typ Id tVy 4p d dt lio _ d T iw 4 lot d a lio which means by definition of Tan 4 IR7 and the action of tangent vectors that the differential dPf T w of Pf calculated at T along the direction w is given by Vy The fact that this map is an isometry follows once again by the change of variable formula J lufdp J wT i d J IV of dp 86 6 2 On the notion of tangent space Aim of this section is to quickly discuss the definition of tangent space of A IR at a certain measure u from a purely geometric perspective We will see how this perspective is related to the discussion made in Subsection 2 3 2 where we defined tangent space as 9 RR Tan Z5 R4 ve pE Cg RO Recall that this definition came from the characterization of absolutely continuous curves on 4 IR Theorem 2 29 and the subsequent discussion Yet there is a completely different and purely geometrical approach which leads to a definition of tangent space at jj The idea is to think the tangent space at u as the space of directions or which is the same as the set of constant speed geodesics emanating from u More precisely let the set Geod be defined by
61. o of 1 N such that N N 5 c zi Yi gt gt c zi Yo i i 1 i 1 By continuity we can find neighborhoods U 3 x V 2 yj with e t Ust clui vi lt 0 V u vi Ui xWM 1xiczN M i l Our goal is to build a variation y y 7 of y in such a way that minimality of y is violated To this aim we need a signed measure r with A n lt y so that is nonnegative B null first and second marginal so that Adm p v C f edn lt 0 so that y is not optimal Let Q II U x V and P A Q be defined as the product of the measures where m y U x V Denote by 74 x the natural projections of Q to U and V moe and define N que Blum Ga ia up aio It is immediate to verify that 77 fulfills A B C above so that the thesis is proven ii gt iii We need to prove that if T C X x Y isa c cyclically monotone set then there exists a c concave function x such that 0 D T and max y 0 L y Fix z y T and observe that since we want to be c concave with the c superdifferential that contains I for any choice of xi yi D 1 N we need to have p z c z y1 e 1 e z y1 m1 9i e1 lt exin i n eG v yo RU cr e 21 y1 lar 2 2 42 e2 lt ete yi ey el s y2 e n2 a eo 9 0 9 9 It is therefore natural to define as
62. observe the quantity at uy flew eto voar i YEM4 X xY 12 If p x v y c x y for any x y then the integrand is non negative and the infimum is 0 achieved when y is the null measure Conversely if x v y gt c z y for some x y X x Y then choose y n6 x with n large to get that the infimum is oo Thus we proved that int f eratis sup f odua viiv y Adm p v ew where the supremum is taken among continuous and bounded functions vy 7 satisfying 1 5 We now give the rigorous statement and a proof independent of the min max principle Theorem 1 17 Duality Let u A X v P Y adc X x Y gt Ra continuous and bounded from below cost function Assume that 1 4 holds Then the minimum of the Kantorovich problem 1 2 is equal to the supremum of the dual problem 1 16 Furthermore the supremum of the dual problem is attained and the maximizing couple qw is of the form p q for some c concave function qp Proof Let y Adm p v and observe that for any couple of functions o L u and Y L v satisfying 1 5 it holds e z y dy s y gt J aa J p du 2 J paida This shows that the minimum of the Kantorovich problem is gt than the supremum of the dual prob lem To prove the converse inequality pick y Opt ju v and use Theorem 1 13 to find a c concave function y such that supp y C 0 o max p 0 L u and max q 0 L v
63. of O and O7 is bounded by Lip v Observe that in writing O u O u we are losing the reference to the base measure u which certainly plays a role in the definition this simplifies the notation and hopefully should create no confusion as the measure we are referring to should always be clear from the context Notice that if v C9 R2 R7 these operators read as Oy u Pi Vo u O u Vv Px u The introduction of the operators and O allows to give a precise meaning to formula 6 23 for general regular curves Theorem 6 20 Covariant derivative of P u Let u be a regular curve vi its velocity vec tor field and let uz be an absolutely continuous vector field along it Then P u is absolutely continuous as well and for a e t it holds D d Pulu Pu Se O w 6 25 101 Proof The fact that P u is absolutely continuous has been proved with inequality 6 21 To get the thesis start from equation 6 22 and conclude noticing that for a e t it holds Lip v lt oo and thus Pj V ve N Vp w N plue Ve Or V E Corollary 6 21 Total derivatives of P u and Ph uz Let u be a regular curve let v4 be its velocity vector field and let uy be an absolutely continuous vector field along it Then P uz is absolutely continuous and it holds d d SP pats Py Sue Pu 5 6 Ove Putu 6 26 d d Iph u Ph Fue Pre
64. one of the integrals exists possibly attaining the value coo if and only if the other one exists and in this case the values are equal Now fix a Borel cost function c X x Y RU co The Monge version of the transport problem is the following Problem 1 1 Monge s optimal transport problem Let u P X v P Y Minimize T f c x T x du x X among all transport maps T from p to v i e all maps T such that Typ v a Regardless of the choice of the cost function c Monge s problem can be ill posed because e no admissible T exists for instance if u is a Dirac delta and v is not e the constraint T4 u v is not weakly sequentially closed w r t any reasonable weak topology As an example of the second phenomenon one can consider the sequence f x f na where f R R is 1 periodic and equal to 1 on 0 1 2 and to 1 on 1 2 1 and the measures f Lio 1 and v 6_ 01 2 It is immediate to check that fn u v for every n N and yet fn weakly converges to the null function f 0 which satisfies fu o Z v A way to overcome these difficulties is due to Kantorovich who proposed the following way to relax the problem Problem 1 2 Kantorovich s formulation of optimal transportation We minimize ve f ean XxY in the set Adm u v of all transport plans y A X xY from u to v i e the set of Borel Probability measures on X x Y such that y AxY p A YAE Z X Xx B 0 B Y
65. set OW E u C Tany Z R2 is the set of vector fields v L u R such that EQ rz 14 du W30 v lt BW vv e A R where here and in the following T will denote the optimal transport map from the regular measure u to v whose existence and uniqueness is guaranteed by Theorem 1 26 Observe that the subdifferential of a A geodesically convex functional E has the following mono tonicity property which closely resembles the analogous valid for A convex functionals on an Hilbert space f e Id du ri w T Id dv lt AW2 p v 3 42 for every couple of regular measures ju v in the domain of E and v OW E u w OW E v To prove 3 42 just observe that from the definition of subdifferential we have EQ rz 14 0 du Wii lt BV BQ f T 14 0 dv ZW lt Et and add up these inequalities The definition of subdifferential leads naturally to the definition of Gradient Flow it is sufficient to transpose the definition given with the system 3 2 Definition 3 27 Subdifferential formulation of Gradient Flow Let E be a A geodesically con vex functional on IR and u A R Then p is a Gradient Flow for E starting from u provided it is a locally absolutely continuous curve uj p as t 0 wrt the distance Wa ju is regular for t gt 0 and it holds v OW E u a e t where v is the vector field uniquely identified by the curve p via d dit V
66. the infimum of the above expression as x yi i 1 y Vary among all N ples in I and N varies in N Also since we are free to add a constant to p we can neglect the addendum Z and define p x inf c x 1 c x n ci y2 c z y2 deeds ce 7 c Z 7 the infimum being taken on N gt 1 integer and z y LT i 1 N Choosing N 1 and zi yi x y we get x lt 0 Conversely from the c cyclical monotonicity of T we have p T gt 0 Thus T 0 Also it is clear from the definition that is c concave Choosing again N 1 and a1 y1 Z 7 using 1 3 we get p z lt x y c r y lt a x bY cv y which together with the fact that a L y yields max y 0 Ll u Thus we need only to prove that 0 y contains I To this aim choose 4 9 I let 71 y1 4 9 and observe that by definition of p x we have p x lt elz j c inf e 2 y2 cf22 92 eo 2 3 c x 9 c 2 9 By the characterization 1 3 this inequality shows that Z y 0 y as desired iii gt i Let y Adm p v be any transport plan We need to prove that f cdy lt f cd y Recall that we have plx v y c z y Y x y supp y p x y y lt cay YxexX yey and therefore f etian e 9 ware emnt g y dv y fe e mese lt emi n Remark 1 14 Condition 1 4 is natura
67. v By Lemma 2 11 be low and classical measurable selection theorems we know that there exists a Borel map GeodSel X Geod X such that for any z y X the curve GeodSel z y is a constant speed geodesic connecting x to y Define the Borel probability measure u Y Geod X by H GeodSels y and the measures ju P X by p er 4M We claim that t p is a constant speed geodesic connecting ji to u Consider indeed the map eo 61 Geod X X and observe that from eo ex GeodSel z y x y we get eo e1 H v 2 8 In particular po eo u TY p and similarly jj ji so that the curve t p connects u to ut The facts that the measures ju have finite second moments and p is a constant speed geodesic follow from 2 7 2 1 W2 ts bs lt J dene ean 2 5 5 t sy f es du C s f a ydy y t s WRH i The fact that ii implies i follows from the same kind of argument just used So we turn to i gt ii For n gt 0 we use iteratively the gluing Lemma 2 1 and the Borel map GeodSel to build a measure u P C 0 1 X such that Cian e 41 2 4 Opt Miso H i 1 2 Vee a E and u a e yis a geodesic in the intervals i 2 i 1 2 i 0 2 1 Fix n and observe that for any 0 j lt k lt 2 it holds k 1 k 1 d ej 2 e l rs quny lt Y eias eia L j lt X lla ei27 eno loque y i j H i
68. veu 0 v Tan Po R a e t 69 recall Theorem 2 29 and Definition 2 31 Thus we have a total of 4 different formulations of Gradient Flows of A geodesically convex func tionals on Z5 IR based respectively on the Energy Dissipation Inequality the Energy Dissipation Equality the Evolution Variational Inequality and the notion of subdifferential The important point is that these 4 formulations are equivalent for geodesically convex func tionals Proposition 3 28 Equivalence of the various formulation of GF in the Wasserstein space Let E be a geodesically convex functional on 5 IR and u a curve made of regular measures Then for u the 4 definitions of Gradient Flow for E EDI EDE EVI and the Subdifferential one are equivalent Sketch of the Proof We prove only that the EVI formulation is equivalent to the Subdifferential one Recall that by Proposition 2 34 we know that ld 3g 2 ne y ve Ty Id dun a e t where T is the optimal transport map from i to v Then we have V E u a e t A E q v T7 Id dua 3 Wa ue v lt E v Vv Z R a e t gt ld E u aga Qus v 5 Wu v lt Ev Vv Pa R a e t 3 3 2 Three classical functionals We now pass to the analysis of 3 by now classical examples of Gradient Flows in the Wasserstein space Recall that in terms of strength the best theory to use is the one
69. via the variational interpolation as in Definition 3 11 above Then it holds 1 S 1 S E zi 4 gt Dsp dr 7 DsI7 dr E 27 3 16 t t for every t nr s mr n m EN Proof It is just a restatement of equation 3 13 in terms of the notation given in 3 15 Thus at the level of discrete solutions it is possible to get a discrete form of the Energy Dissipa tion Equality under the quite general Assumptions 3 8 Now we want to pass to the limit as 7 0 In order to do this we need to add some compactness and regularity assumptions on the functional Assumption 3 13 Coercivity and regularity assumptions Assume that E X RU co satisfies e E is bounded from below and its sublevels are boundedly compact i e E lt c N B x is compact for any c R r gt Qand x X e the slope VE D E gt 0 00 is lower semicontinuous e FE has the following continuity property In gt T sup VE an E 2s oo gt E t gt E x Under these assumptions we can prove the following result Theorem 3 14 Gradient Flows in EDI formulation Let X d be a metric space and let E X RU 4 06 be satisfying the Assumptions 3 8 and 3 13 Also letz D E andfor0 lt T lt T define the discrete solution via the variational interpolation as in Definition 3 11 Then it holds 57 e the set of curves a7 is relatively compact in the set of curves in X w rt local uniform con
70. we get that v lt and for its density it holds i p x AT Set zy gt the assumption gt 0 is necessary to have the last inequality in 3 46 If A lt 0 A convexity of V along interpolating curves is not anymore true so that we cannot apply directly the results of Subsection 3 2 4 Yet adapting the arguments it possible to show that all the results which we will present hereafter are true for general A R 72 where we wrote T for 1 t To tT Thus ev fuaa fu sR acer act Therefore the proof will be complete if we show that A u ss det A is convex on the set of positively defined symmetric matrices for any x supp 1 Observe that this map is the composition of the convex and non increasing map z gt 2 u p 2 z with the map A det A Thus to conclude it is sufficient to show that A det A is concave To this aim pick two symmetric and positive definite matrices Ap and Aj notice that det 1 Ao A1 det Ao det Id B where B Ao A1 Ao V Ao and conclude by d dt 2 L det Id tB 4 au B Id tB ge Id tB lt 0 det Id 4 tB 4 5 det rd tB Veg d tB where in the last step we used the inequality tr C lt dtr C for C B Id tB Important examples of functions u satisfying 3 44 and 3 45 are gtz 1 s 1 ue aa Cear 3 47 u x zlog z Remark 3 34 A d
71. 0 j44 The thesis will be proved if we show that u depends only on Observe that from Theorem 2 10 and its proof we know that eo ei H Opt Uo Ito and thus eo e 444 y By the non branching hypothesis we know that eo e Geod X gt X is injective Thus it it invertible on its image letting F the inverse map we get w Fy and the thesis is proved Theorem 2 10 tells us not only that geodesics exists but provides also a natural way to interpo late optimal plans once we have the measure u A Geod X satisfying 2 7 an optimal plan from ju to us is simply given by c s Now we know that the transport problem has a natural dual problem which is solved by the Kantorovich potential It is then natural to ask how to inter polate potentials In other words if p are c conjugate Kantorovich potentials for 110 141 is there a simple way to find out a couple of Kantorovich potentials associated to the couple Ht Hs The answer is yes and it is given shortly said by the solution of an Hamilton Jacobi equation To see this we first define the Hopf Lax evolution semigroup H which in R produces the viscosity solution of the Hamilton Jacobi equation via the following formula x y l i3 c PEOR Hg y z 4 w z it ts 2 12 2 ip Lau ift s yEex s t To fully appreciate the mechanisms behind the theory it is better to introduce the rescaled costs c defined by
72. 1 f E x 7 at dr 5J VE a dr lt E zi aet lt s 3 5 t t which we call Energy Dissipation Inequality EDI in the following Since the inequality 3 4 shows that 4E y lt 3 V E y 4 y never holds the system 3 2 may be also written in form of Energy Dissipation Equality EDE in the following as EE fa de f E zi 7 la dr 7 VE 2 dr E x VO t s 3 6 t t 51 Notice that the convexity of E does not play any role in this formulation A completely different way to rewrite 3 2 comes from observing that if x solves 3 2 and y H is a generic point it holds ld 2 dt Jae yl n yt y es VE ae lt Ely Eas Flee yl where in the last inequality we used the fact that E is convex Since the inequality 2 y 2 0 lt Ey E Sle vf WEH characterizes the elements v of the subdifferential of E at x we have that an absolutely continuous curve x solves 3 2 if and only if d 1 ggl yl 5b yl En lt Ey a e t gt 0 3 7 holds for every y H We will call this system of inequalities the Evolution Variational Inequality EVI Thus we got three different characterizations of Gradient Flows in Hilbert spaces the EDI the EDE and the EVI We now want to show that it is possible to formulate these equations also for functionals E defined on a metric space X d The object z appearing in EDI and EDE can be naturall
73. 1 446 Opt uo H1 and y is a c convex Kantorovich potential for Ho H1 we have from Theorem 1 13 that py m 0 1 70 thus p z sup z y p y gt e 2 1 e m yEx 2 91 9 1 e 9o Plugging this inequality in the definition of i we get Psl inf c x 76 e z gt inf x Ys c n 1 e Qo m 70 2o e Ys Y1 e 40 11 0 eP 00 Ys 70 Step 3 We know that an optimal transport plan from ji to jus is given by ez s p thus to conclude the proof we need to show that s Ps Ys Ps w c 14 78 Yy supp y ts where y is the c conjugate of the c concave function ys The inequality lt follows from the definition of c conjugate To prove opposite inequality start observing that es y inf x y ply e yo y evo lt c 59 y h Qs y pao and conclude by cos vs t inf e y es y Qo plo c9 Y0 Ys c Yt Ys ve no 2 13 4 hS n ys es ns 38 We conclude the section studying some curvature properties of Y2 X W2 We will focus on spaces positively non positively curved in the sense of Alexandrov which are the non smooth analogous of Riemannian manifolds having sectional curvature bounded from below above by 0 Definition 2 19 PC and NPC spaces A geodesic space X d is said to be positively curved PC in the sense
74. 21 22 d yi y2 We say that a couple d y is an admissible coupling between X dx mx and Y dy my we write d y Adm dx mx dy my if e d is a pseudo distance on suppm x U suppmy i e it may be zero on two different points which coincides with dx resp dy when restricted to supp My x supp mx resp supp my x supp my e a Borel w rt the Polish structure given by Dxy measure y on supp My x supp my such that tyy mx andy my It is not hard to see that the set of admissible couplings is always non empty The cost C d y of a coupling is given by Giusy mM Bay dy ery supp Uscriptsizem xsupp Uscriptsizem The distance D X dx mx Y dy my is then defined as D X dx mx Y dy my inf J C d v 7 4 the infimum being taken among all couplings d y of X dx mx and Y dy my A trivial consequence of the definition is that if X dx mx and X d g mg resp Y dy my and Y d m are isomorphic then X dx mx Y dy my D dg mg d my 109 so that ID is actually defined on isomorphism classes of metric measure spaces In the next proposition we collect without proof the main properties of Proposition 7 3 Properties of D The inf in 7 4 is realized and a coupling realizing it will be called optimal Also let X be the set of isomorphism classes of metric measure spaces satisfying Ass
75. A U Bz A and y B B U B B such that x y is in the support of the optimal plan Since O O is the support of the optimal plan as well by cyclical monotonicity it must hold d x y d 0 O lt d z O 4 d O y which contradicts 1 8 E 1 5 Bibliographical notes G Monge s original formulation of the transport problem 66 was concerned with the case X Y R and c z y x y and L V Kantorovich s formulation appeared first in 49 The equality 1 2 saying that the infimum of the Monge problem equals the minimum of Kan torovich one has been proved by W Gangbo Appendix A of 41 and the first author Theorem 2 1 in 4 in particular cases and then generalized by A Pratelli 68 22 In 50 L V Kantorovich introduced the dual problem and later L V Kantorovich and G S Rubinstein 51 further investigated this duality for the case c r y d x y The fact that the study of the dual problem can lead to important informations for the transport problem has been investigated by several authors among others M Knott and C S Smith 52 and S T Rachev and L R schendorf 69 71 The notions of cyclical monotonicity and its relation with subdifferential of convex function have been developed by Rockafellar in 70 The generalization to c cyclical monotonicity and to c sub super differential of c convex concave functions has been studied among others by R schendorf 71 The chara
76. A user s guide to optimal transport Luigi Ambrosio Nicola Gigli Abstract This text is an expanded version of the lectures given by the first author in the 2009 CIME summer school of Cetraro It provides a quick and reasonably account of the classical theory of optimal mass transportation and of its more recent developments including the metric theory of gradient flows geometric and functional inequalities related to optimal transportation the first and second order differential calculus in the Wasserstein space and the synthetic theory of metric measure spaces with Ricci curvature bounded from below Contents 1 The optimal transport problem 4 1 1 Monge and Kantorovich formulations of the optimal transport problem 4 1 2 Necessary and sufficient optimality conditions ooo 6 1 3 Thed alproblem 5 5 2 52 4 bey bee ea Se ee be ee E eR Se pb RE Ys 11 1 4 Existence of optimal maps lees 14 15 Bibliographical notes 2o o oc SAR REE 4 or Ee RE ES 22 2 The Wasserstein distance W 24 2 X Polish space s ss s rage 445 6454 RO EU X NO De x y 3X ee eRe 24 2 4 A geodesicspace 5 dme doe ce wg qe EU SHAR SER EE bY eH vod os 31 2 3 A Riemannian manifold 000000020000 40 2 3 1 Regularity of interpolated potentials and consequences 40 2 3 0 The weak Riemannian structure of Y2 M W2 004 43 2 4 Bibliographical notes 2 eA 49 3 Gradient flows 49 3 1 Hilbertian theory of
77. B Z Y Equivalently THY pi THY v where nX nY are the natural projections from X x Y onto X and Y respectively BH Transport plans can be thought of as multivalued transport maps y f Y du x with Y Z x x Y Another way to look at transport plans is to observe that for y Adm p v the value of y A x B is the amount of mass initially in A which is sent into the set B There are several advantages in the Kantorovich formulation of the transport problem e Adm v is always not empty it contains u X v e the set Adm u v is convex and compact w r t the narrow topology in A X x Y see below for the definition of narrow topology and Theorem 1 5 and y gt f c d y is linear e minima always exist under mild assumptions on c Theorem 1 5 e transport plans include transport maps since Tyu v implies that y Id x T zu belongs to Adm p v In order to prove existence of minimizers of Kantorovich s problem we recall some basic notions concerning analysis over a Polish space We say that a sequence un C A X narrowly converges to u provided etn feu Vo C X C X being the space of continuous and bounded functions on X It can be shown that the topology of narrow convergence is metrizable A set K C V X is called tight provided for every gt 0 there exists a compact set Ke C X such that WX Ke lt e VueK It holds the following important result Theorem 1 3 Prokhor
78. By compactness we can extract a subsequence converging to some y M Then from the continuity of d z 2 and q it is immediate to verify that y 0 p x Remark 1 31 The converse implication in the previous proposition is false if one doesn t assume y to be differentiable at x i e it is nof true in general that exp OT y x C O y x E From this proposition and following the same ideas used in the Euclidean case we give the following definition Definition 1 32 Regular measures in Z M We say that u P M is regular provided it van ishes on the set of points of non differentiability of i for any semiconvex function ip M R The set of points of non differentiability of a semiconvex function on M can be described as in the Euclidean case by using local coordinates For most applications it is sufficient to keep in mind that absolutely continuous measures w r t the volume measure and even measures vanishing on Lipschitz hypersurfaces are regular By Proposition 1 30 we can derive a result about existence and characterization of optimal trans port maps in manifolds which closely resembles Theorem 1 26 Theorem 1 33 McCann Let M be a smooth compact Riemannian manifold without boundary and u Y M Then the following are equivalent i for every v Y M there exists only one transport plan from p to v and this plan is induced byamap T ii jis regular If either i or ii hold
79. N Apart from this the works are strictly related and the differences are mostly on the technical side We mention only one of these In giving the definition of C D 0 N space we followed Sturm and asked only the functionals pm N f p p N dm 122 N gt N to be geodesically convex Lott and Villani asked for something more restrictive namely they introduced the displacement convexity classes DC as the set of functions u 0 o0 R continuous convex and such that z gt zug is convex Notice that u z N z z N belongs to DC Then they say that a space is C D 0 N provided pm futpidm with the usual modifications for a measure which is not absolutely continuous is geodesically con vex for any u DCy This notion is still compatible with the Riemannian case and stable un der convergence The main advantage one has in working with this definition is the fact that for a CD 0 N space in this sense for any couple of absolutely continuous measures there exists a geodesic connecting them which is made of absolutely continuous measures The distance D that we used to define the notion of convergence of metric measure spaces has been defined and studied by Sturm in 74 This is not the only possible notion of convergence of metric measure spaces Lott and Villani used a different one see 58 or Chapter 27 of 80 A good property of the distance D is that it pleasantly reminds the Wasserstein distanc
80. R7 W2 Let us recall that for a smooth function F M IR on a Riemannian manifold a gradient flow x starting from M is a differentiable curve solving x VF z 3 1 to T Observe that there are two necessary ingredients in this definition the functional F and the metric on M The role of the functional is clear The metric is involved to define V F it is used to identify the cotangent vector dF with the tangent vector V F 3 1 Hilbertian theory of gradient flows In this section we quickly recall the main results of the theory of Gradient flow for A convex func tionals on Hilbert spaces This will deserve as guideline for the analysis that we will make later on of the same problem in a purely metric setting 49 Let H be Hilbert and A R A A convex functional F H RU 00 is a functional satisfying F 1 t r ty 1 t F z tF y ae tc yl Vz y H this corresponds to V F gt Ad for functionals on IR We denote with D F the domain of F i e D F x F x lt oo The subdifferential O F x of F ata point x D F is the set of v H such that A F x v y 2 5le y SF VyeH An immediate consequence of the definition is the fact that the subdifferential of F satisfies the monotonicity inequality v w 2 y gt Ale yl Vv OF x w O Fly We will denote by VF x the element of minimal norm in OF x which exists and is unique as soon
81. Vvi Ve2 V va Vea Nu Ver Ves Nu V vn Vea Nu Vez Ves Nu V1 Vea 6 36 2 Nu Vti V 93 NV va Vo for any o4 CZ M From this formula it follows immediately that the operator R is actually a tensor Proposition 6 26 Let u Y2 IR The curvature operator given by formula 6 36 is a tensor on V 5 i e its value depends only on the ui a e value of the 4 vector fields Proof Clearly the left hand side of equation 6 36 is a tensor w r t the fourth entry The conclusion follows from the symmetries of the right hand side 105 We remark that from 6 36 it follows that R has all the expected symmetries Concerning the domain of definition of the curvature tensor the following statement holds whose proof follows from the properties of the normal tensor NV Proposition 6 27 Let jj Y2 R Then the curvature tensor thought as map from Vy to R given by 6 36 extends uniquely to a sequentially continuous map on the set of 4 ples of vector fields in r in which at least 3 vector fields are Lipschitz where we say that v v2 v3 v4 is converging n n to v1 v 4 vt if there is convergence in E on each coordinate and sup Lip v lt oo TL for at least 3 indexes 1 Thus in order for the curvature tensor to be well defined we need at least 3 of the 4 vector fields involved to be Lipschitz However for some related notion of curvature the situation simplifies O
82. Vy dm Using the hypothesis on M and the fact that Ay lt N V o we get sr gn Pt gt 0 i e the geodesic convexity of amp y For the converse implication it is possible to argue as above we omit the details also in this case Now we pass to the stability Theorem 7 12 Stability of weak Ricci curvature bound Assume that Xn dn Mn B X d m and that for every n N the space Xn dn m4 is CD K oo resp CD 0 N Then X d m isa CD K oo resp CD 0 N space as well Sketch of the Proof Pick po p P3 X and assume they are both absolutely continuous with bounded densities say 4 pim i 0 1 Choose dn Yn Opt dn Mn d m Define uz Yz ghi PS Xn i 0 1 Then by assumption there is a geodesic u C 2 X such that K Exo ut lt 1 1 So0 H t ut FE 0W ug n 7 12 Now let o y s ut YS X t 0 1 From Proposition 7 5 and its proof we know that W pi 07 0 as n oo i 0 1 Also from 7 12 ad Lemma 7 4 we know that o7 is uniformly bounded in n t Thus for every fixed t the sequence n o is tight and we can extract a subsequence not relabeled such that o narrowly converges to some o 2 supp m for every rational t By an equicontinuity argument it is not hard to see that then o narrowly converges to some a for any t 0 1 we omit the details We claim that o is a geodesic and that the K convexit
83. ability measures on a given metric space X d This amounts to study the transport problem with cost function c z y d x y An important characteristic of the Wasserstein distance is that it inherits many interesting geo metric properties of the base space X d For this reason we split the foregoing discussion into three sections on which we deal with the cases in which X is a general Polish space a geodesic space and a Riemannian manifold A word on the notation when considering product spaces like X with z X X we intend the natural projection onto the i th coordinate i 1 n Thus for instance for u v A X and y Adm p v we have ty u and 74 7 v Similarly with z X X we intend the projection onto the i th and j th coordinates And similarly for multiple projections 2 1 X Polish space Let X d be a complete and separable metric space The distance W is defined as Wa p v inf fee y dy x y YE Adm pv f Game Yr ona The natural space to endow with the Wasserstein distance W3 is the space 2 X of Borel 24 Probability measures with finite second moment P X u P X ee zo dji x lt oo for some and thus any xo x Notice that if either u or v is a Dirac delta say v r then there exists only one plan y in Adm p v the plan u x so which therefore is optimal In particular it holds I d z z dp z W2 11 eo that is the second mo
84. al of the c concave function is the classical subdifferential resp superdifferential e the c_ transform is the Legendre transform Thus in this situation these new definitions become the classical basic definitions of convex analysis Oo Remark 1 12 For most applications c concavity is sufficient There are several trivial relations between c convexity c concavity and related notions For instance y is c concave if and only if 1q is c convex y y and 0 q 0 q Therefore roughly said every statement concerning c concave functions can be restated in a statement for c convex ones Thus choosing to work with c concave or c convex functions is actually a matter of taste Our choice is to work with c concave functions Thus all the statements from now on will deal only with these functions There is only one important part of the theory where the distinction between c concavity and c convexity is useful in the study of geodesics in the Wasserstein space see Section 2 2 and in particular Theorem 2 18 and its consequence Corollary 2 24 We also point out that the notation used here is different from the one in 80 where a less symmetric notion but better fitting the study of geodesics of c concavity and c convexity has been preferred An equivalent characterization of the c superdifferential is the following y 0 y if and only if it holds or equivalently if p x c z y gt v z c z
85. alculats the i bracket of two vector fields To this aim let jjj 1 2 be two feu curves such that pj ue p and let ui Tan pl Po R be two C vector fields satisfying u v2 u2 vd where vi are the velocity vector fields of ui We assume that the velocity fields vi of jii are continuous in time in the sense that the map t vt u is continuous in the set of vector valued measure with the weak topology and te pv yi is continuous as well to be sure that 6 7 holds for all t with v vi and the initial condition makes sense With these hypotheses it makes sense to consider the covariant derivative 2u along u2 at t 0 for this derivative we write V uiui Similarly for uf 92 T Cae X Y e 4 L3 Let us consider vector fields as derivations and the functional u gt F u f pdp for given p C IR By the continuity equation the derivative of F along u is equal to Vo uz ui therefore the compatibility with the metric 6 9 gives d ul PG 5 Werte lezo e bd Ve Vat Vp up Ud Ve Vau Subtracting the analogous term u u F i and using the symmetry of V ip we get ju u F u Ve Vau Vau Given that the set Vy ece is dense in Tan 4 R the above equation characterizes u u as u u Vazut Vagur 6 10 which proves the torsion free identity for the covariant derivative Example 6 9 The velocity vector f
86. allel transport along a smooth curve in an embedded Riemannian manifold then we will see how this proof can be adapted to the Wasserstein case this approach should help highlighting what s the geometric idea behind the construction Thus say that M is a given smooth Riemannian manifold embedded on RP t gt 4 Ma smooth curve on 0 1 and u T4 M is a given tangent vector Our goal is to prove the existence of an absolutely continuous vector field t uz T M such that ug u and d Py 2 0 a e t For any t s 0 1 let tr T R T R be the natural translation map which takes a vector with base point y tangent or not to the manifold and gives back the translated of this vector with base point y Notice that an effect of the curvature of the manifold and the chosen embedding on RP is that tr u may be not tangent to M even if u is Now define P T R T4 M by P u Py tr u Vu T4 RP 94 An immediate consequence of the smoothness of M and are the two inequalities trz u P u C u s t Vt s 0 1 and u T4 M 6 122 P u Clul s t Vt s 0 1 and u TM 6 12b where Ty M i is the orthogonal complement of T M in TL RD These two inequalities are all we need to prove existence of the parallel transport The proof will be constructive and is based on the identity Vy Pilu 0 Yu TM 6 13 i o which tells th
87. ally absolutely continuous curves 44 in 0 00 converging to T as t 0 Furthermore if yz is a solution of 3 2 starting from y it holds y e z g v Asymptotic behavior Zf A gt 0 then there exists a unique minimum Xmin of F and it holds F aj Fran F E F si e In particular the pointwise energy inequality F z ES F Zmin z7 E sal Vz e H gives EZ Cata SS 3 2 The theory of Gradient Flows in a metric setting Here we give an overview of the theory of Gradient Flows in a purely metric framework 3 2 1 The framework The first thing we need to understand is the meaning of Gradient Flow in a metric setting Indeed the system 3 2 makes no sense in metric spaces thus we need to reformulate it so that it has a metric analogous There are several ways to do this below we summarize the most important ones For the purpose of the discussion below we assume that H R and that E H R is A convex and of class C1 Let us start observing that 3 2 may be written as t 2 is locally absolutely continuous in 0 00 converges to 7 as t 0 and it holds d 1 1 Indeed along any absolutely continuous curve y it holds d qt EQ VEU ut gt VE ye ly if and only if y is a positive multiple of VE y 3 4 1 1 gt AIVEP u Sb C if and only if Iy VEC Thus in particular equation 3 3 may be written in the following integral form 1 f
88. ase is much more complicated than the one with c z y x y 2 as it is typically not true that optimal plans are unique or that optimal plans are induced by maps For example consider on R any two probability measures ju v such that jj is concentrated on the negative numbers and v on the positive ones Then one can see that any admissible plan between them is optimal for the cost c z y x y Still even in this case there is existence of optimal maps but in order to find them one has to use a sort of selection principle A successful strategy which has later been applied to a number of different situation has been proposed by V N Sudakov in 77 who used a disintegration principle to reduce the d dimensional problem to a problem on IR The original argument by V N Sudakov was flawed and has been fixed by the first author in 4 in the case of the Euclidean distance Meanwhile different proofs of existence of optimal maps have been proposed by L C Evans W Gangbo 34 Trudinger and Wang 78 and L Caffarelli M Feldman and R McCann 24 Later existence of optimal maps for the case c z y a v being any norm has been established at increasing levels of generality in 9 28 27 containing the most general result for any norm and 25 2 The Wasserstein distance W The aim of this chapter is to describe the properties of the Wasserstein distance W on the space of Borel Prob
89. at max y 0 L u implies max p 0 L v and thus it holds f en em fole y y dy z y J eere f ses supply cae f ole eevee oda f ea Thus the inequality must be an equality which is true if and only if for a e x y it holds x y 0 7 c hence by the continuity of c we conclude supp 7 C 8 y H 13 The dual problem The transport problem in the Kantorovich formulation is the problem of minimizing the linear func tional f cdy with the affine constraints T y p ny v and y gt 0 It is well known that problems of this kind admit a natural dual problem where we maximize a linear functional with affine constraints In our case the dual problem is Problem 1 16 Dual problem Let jj P X v P Y Maximize the value of J euo f tovg among all functions p L 1 v L v such that e x p y c z y VrcX yc Y 1 5 11 The relation between the transport problem and the dual one consists in the fact that int Jean a sup f padna f voto ye Adn n v gw where the supremum is taken among all v 4 as in the definition of the problem Although the fact that equality holds is an easy consequence of Theorem 1 13 of the previous section taking Y q as we will see we prefer to start with an heuristic argument which shows why duality works The calculations we are going to do are very common in linear programming and are based on the min max princ
90. at the vectors P w are a first order approximation at t 0 of the parallel transport Taking 6 11 into account 6 13 is equivalent to P trb u P u o t u To M 6 14 Equation 6 14 follows by applying inequalities 6 12 note that tr u Pj u TM X P tro u Po u Ct tro u Po u lt Ce lul Now let 8 be the direct set of all the partitions of 0 1 where for P Oc B P gt QifP isa refinement of Q For P 0 to lt t lt lt ty 1 Pand u T4 M define P u T M as tn P u Pin Pena C Po u Our first goal is to prove that the limit P u for P 58 exists This will naturally define a curve t uz T3 M by taking partitions of 0 t instead of 0 1 the final goal is to show that this curve is actually the parallel transport of u along the curve y The proof is based on the following lemma Lemma 6 11 Let 0 lt s lt s9 lt s3 lt 1 be given numbers Then it holds P33 u P PE u lt C ul si s2 s2 53 Yu Ty M Proof From P u Ps tr 2 u Py tr 3 ts u we get Psd u PZ P3 u Pop 2 u P3 u Since u T M and tr2 u P u D M the proof follows applying inequalities 6 12 From this lemma an easy induction shows that for any 0 lt s lt lt sy lt landu Ty M we have PEN u Fee Paya P P32 0I SN 1 SN 2 E ENG E ESL
91. ation of 3 2 It is then natural to introduce the rescaled curve t x7 by Ler where denotes the integer part and to ask whether the curves t x converge in some sense to a limit curve x which solves 3 2 as 7 0 This is the case and this procedure is actually the heart of the proof of Theorem 3 1 What is important for the discussion we are making now is that the minimization procedure just described can be naturally posed in a metric setting for a general functional E X RU 00 it is sufficient to pick z E lt oo 7 gt 0 define Tio T and then recursively d z zt Ling1 argmin 4 E x f 3 11 We this give the following definition Definition 3 7 Discrete solution Let X d be a metric space E X RU 00 lower semi continuous X E lt oo and T gt 0 A discrete solution is a map 0 00 5 t af defined by T4 jn where T o T and Tin41 satisfies 3 11 Clearly in a metric context it is part of the job the identification of suitable assumptions that ensure that the minimization problem 3 11 admits at least a minimum so that discrete solutions exist We now divide the discussion into three parts to see under which conditions on the functional and the metric space X it is possible to prove existence of Gradient Flows in the EDI EDE and EVI formulation 3 2 2 General l s c functionals and EDI In this section we will make mi
92. ature bound we should also known what is the dimension of the metric measure space we are working with consider for instance the Brunn Minkowski and the Bishop Gromov inequalities above both make sense if we know the di mension of M and not just that its Ricci curvature is bounded from below This tells that the natural notion of bound on the Ricci curvature should be a notion speaking both about the curvature and the dimension of the space Such a notion exists and is called CD K N condition K being the bound from below on the Ricci curvature and NV the bound from above on the dimension Let us tell in advance that we will focus only on two particular cases the curvature dimension condition C D K oo where no upper bound on the dimension is specified and the curvature dimension con dition C D 0 N where the Ricci curvature is bounded below by 0 Indeed the general case is much more complicated and there are still some delicate issues to solve before we can say that the theory is complete and fully satisfactory Before giving the definition let us highlight which are the qualitative properties that we expect from a weak notion of curvature dimension bound Intrinsicness The definition is based only on the property of the space itself that is is not something like if the space is the limit of smooth spaces Compatibility If the metric measure space is a Riemannian manifold equipped with the volume measure then the bound provide
93. brenier approach to branched transport Accepted paper at SIAM J of Math Anal 2010 Y BRENIER D composition polaire et r arrangement monotone des champs de vecteurs C R Acad Sci Paris S r I Math 305 1987 pp 805 808 Polar factorization and monotone rearrangement of vector valued functions Comm Pure Appl Math 44 1991 pp 375 417 D BURAGO Y BURAGO AND S IVANOV A course in metric geometry vol 33 of Graduate Studies in Mathematics American Mathematical Society Providence RI 2001 L A CAFFARELLI Boundary regularity of maps with convex potentials Comm Pure Appl Math 45 1992 pp 1141 1151 The regularity of mappings with a convex potential J Amer Math Soc 5 1992 pp 99 104 Boundary regularity of maps with convex potentials II Ann of Math 2 144 1996 pp 453 496 L A CAFFARELLI M FELDMAN AND R J MCCANN Constructing optimal maps for Monge s transport problem as a limit of strictly convex costs J Amer Math Soc 15 2002 pp 1 26 electronic L CARAVENNA A proof of sudakov theorem with strictly convex norms Math Z to appear J A CARRILLO S LISINI G SAVAR AND D SLEPCEV Nonlinear mobility continuity equations and generalized displacement convexity J Funct Anal 258 2010 pp 1273 1309 T CHAMPION AND L DE PASCALE The Monge problem in R4 Duke Math J The Monge problem for strictly convex norm
94. calnotes so sies soe seca es 106 7 Ricci curvature bounds 107 7 1 Convergence of metric measure spaces o oo eA 109 7 2 Weak Ricci curvature bounds definition and properties 112 7 3 Bibliographical notes socs oo ede ok Rok HR OR A RR a Ge A 122 Introduction The opportunity to write down these notes on Optimal Transport has been the CIME course in Cetraro given by the first author in 2009 Later on the second author joined to the project and the initial set of notes has been enriched and made more detailed in particular in connection with the differentiable structure of the Wasserstein space the synthetic curvature bounds and their analytic implications Some of the results presented here have not yet appeared in a book form with the exception of 44 It is clear that this subject is expanding so quickly that it is impossible to give an account of all developments of the theory in a few hours or a few pages A more modest approach is to give a quick mention of the many aspects of the theory stimulating the reader s curiosity and leaving to more detailed treatises as 6 mostly focused on the theory of gradient flows and the monumental book 80 for a much broader overview on optimal transport In Chapter 1 we introduce the optimal transport problem and its formulations in terms of trans port maps and transport plans Then we introduce basic tools of the theory namely the duality formula the c monotonicity and disc
95. cation of the definition gives that its total derivative is given by t 5 L tt d ae O amp VEE Vt a e t 6 7 which shows that the total derivative is nothing but the convective derivative well known in fluid dynamics E For p Z R we denote by P L Tan 4 R the orthogonal projection and we put p i Id P Definition 6 8 Covariant derivative Let u be an absolutely continuous and tangent vector field along the regular curve u Its covariant derivative is defined as D d D cp Zu 6 8 shows that the covariant derivative is an L vector field In order to prove that the covariant derivative we just defined is the Levi Civita connection we need to prove two facts compatibiliy with the metric and torsion free identity Recall that on a standard Riemannian manifold these two conditions are respectively given by The trivial inequality Me Ht E XY WX Ye Y 14 OX 90 VY X Y VxY Vy X where X Y are smooth vector fields and y is a smooth curve on M The compatibility with the metric follows immediately from the Leibnitz rule 6 6 indeed if ul u2 are tangent absolutely continuous vector fields we have d d d E2 1 2 1 2 di uj up Zulu ut Su Ht Ht d d 505 Gt ut uh Pa Su 4 6 9 D Jj 1 D 2 Ug Ut Up U n 3 u dt m To prove the torsion free identity we need first to understand how to c
96. concave Kantorovich potentials are related to the transport problem in the follow ing two different but clearly related ways e as c concave functions whose superdifferential contains the support of optimal plans accord ing to Theorem 1 13 e as maximizing functions together with their c tranforms in the dual problem 1 4 Existence of optimal maps The problem of existence of optimal transport maps consists in looking for optimal plan y which are induced by a map T X Y i e plans y which are equal to Id T p for p THY and some measurable map T As we discussed in the first section in general this problem has no answer as it may very well be the case when for given u A X v P Y there is no transport map at all from js to v Still since we know that 1 2 holds when sz has no atom it is possible that under some additional assumptions on the starting measure ju and on the cost function c optimal transport maps exist To formulate the question differently given u v and the cost function c is that true that at least one optimal plan is induced by a map Let us start observing that thanks to Theorem 1 13 the answer to this question relies in a natural way on the analysis of the properties of c monotone sets to see how far are they from being graphs Indeed Lemma 1 20 Let Adm g v Then y is induced by a map if and only if there exists a y measurable set U C X x Y where y is concentrated such t
97. cterization of the set of non differentiability of convex functions is due to Zaji ek 83 see also the paper by G Alberti 2 and the one by G Alberti and the first author 3 Theorem 1 26 on existence of optimal maps in IR for the cost distance squared is the celebrated result of Y Brenier who also observed that it implies the polar factorization result 1 28 18 19 Brenier s ideas have been generalized in many directions One of the most notable one is R Mc Cann s theorem 1 33 concerning optimal maps in Riemannian manifolds for the case cost squared distance 64 R McCann also noticed that the original hypothesis in Brenier s theorem which was u 1 can be relaxed into u gives 0 mass to Lipschitz hypersurfaces In 42 W Gangbo and R McCann pointed out that to get existence of optimal maps in R with c z y x y 2 it is sufficient to ask to the measure u to be regular in the sense of the Definition 1 25 The sharp version of Brenier s and McCann s theorems presented here where the necessity of the regularity of u is also proved comes from a paper of the second author of these notes 46 Other extensions of Brenier s result are e Infinite dimensional Hilbert spaces the authors and Savar 6 e cost functions induced by Lagrangians Bernard Buffoni 13 namely 1 cary int f 20 30 00 1 7 v 0 e Carnot groups and sub Riemannian manifolds c do 2 the first author and S Rigot 10 A
98. d is Polish and Z5 X W2 is geodesic then X d is geodesic as well Indeed given x y X and a geodesic u connecting to y we can build a measure u Y Geod X satisfying 2 7 Then every y supp u is a geodesic connecting rto y E Definition 2 15 Non branching spaces A geodesic space X d is said non branching if for any t 0 1 a constant speed geodesic y is uniquely determined by its initial point yo and by the point n In other words X d is non branching if the map Geod X gt y 0 X is injective for any t 0 1 34 Non branching spaces are interesting from the optimal transport point of view because for such spaces the behavior of geodesics in A X is particularly nice optimal transport plan from inter mediate measures to other measures along the geodesic are unique and induced by maps it is quite surprising that such a statement is true in this generality compare the assumption of the proposition below with the ones of Theorems 1 26 1 33 Examples of non branching spaces are Riemannian manifolds Banach spaces with strictly convex norms and Alexandrov spaces with curvature bounded below Examples of branching spaces are Banach spaces with non strictly convex norms Proposition 2 16 Non branching and interior regularity Let X d be a Polish geodesic non branching space Then 2 X W2 is non branching as well Furthermore if m C P2 X isa constant speed geodesic t
99. d by the abstract definition coincides with the lower bound on the 108 Ricci curvature of the manifold equipped with the Riemannian distance and the volume measure Stability Curvature bounds are stable w r t the natural passage to the limit of the objects which define it Interest Geometrical and analytical consequences on the space can be derived from curvature dimension condition In the next section we recall some basic concepts concerning convergence of metric measure spaces which are key to discuss the stability issue while in the following one we give the definition of curvature dimension condition and analyze its properties All the metric measure spaces X d rn that we will consider satisfy the following assumption Assumption 7 1 X d is Polish the measure m is a Borel probability measure and m 2 X 7 1 Convergence of metric measure spaces We say that two metric measure spaces X dx mx and Y dy my are isomorphic provided there exists a bijective isometry f supp mx supp my such that fymx my This is the same as to say that we don t care about the behavior of the space X dx where there is no mass This choice will be important in discussing the stability issue Definition 7 2 Coupling between metric measure spaces Given two metric measure spaces X dx mx Y dy my we consider the product space X x Y Dxy where Dxy is the distance defined by Dxy 1 41 22 y2 y d amp
100. d the continuity equation is given by the following theorem Theorem 2 29 Characterization of absolutely continuous curves in Z5 M W 5 Let M bea smooth complete Riemannian manifold without boundary Then the following holds A For every absolutely continuous curve p C A2 M there exists a Borel family of vector fields v on M such that vi r2 u lt iel for a e t and the continuity equation d g tV vette 0 2 20 holds in the sense of distributions B If mv satisfies the continuity equation 2 20 in the sense of distributions and i lvc r2 u dt lt 00 then up to redefining t p on a negligible set of times u is an ab solutely continuous curve on P2 M and fu4 vi r2 u for ae t 0 1 Note that we are not assuming any kind of regularity on the ju s We postpone the sketch of the proof of this theorem to the end of the section for the moment we analyze its consequences in terms of the geometry of Z M The first important consequence is that the Wasserstein distance which was defined via the static optimal transport problem can be recovered via the following dynamic Riemannian like formula 44 Proposition 2 30 Benamou Brenier formula Let 1 ut 2 M Then it holds 1 Wa u8 uw int joldi 2 21 0 where the infimum is taken among all weakly continuous distributional solutions of the continuity equation u v4 such that po p and p pg P
101. d which rules out the Finsler case Progresses in this direction have been made in 8 where the notion of spaces with Riemannian Ricci bounded below is introduced shortly said these spaces are the subclass of CD K N spaces where the heat flow studied in 45 53 7 is linear References 1 10 11 12 13 14 16 A AGRACHEV AND P LEE Optimal transportation under nonholonomic constraints Trans Amer Math Soc 361 2009 pp 6019 6047 G ALBERTI On the structure of singular sets of convex functions Calc Var and Part Diff Eq 2 1994 pp 17 27 G ALBERTI AND L AMBROSIO A geometrical approach to monotone functions in R Math Z 230 1999 pp 259 316 L AMBROSIO Lecture notes on optimal transport problem in Mathematical aspects of evolv ing interfaces CIME summer school in Madeira Pt P Colli and J Rodrigues eds vol 1812 Springer 2003 pp 1 52 L AMBROSIO AND N GIGLI Construction of the parallel transport in the Wasserstein space Methods Appl Anal 15 2008 pp 1 29 L AMBROSIO N GIGLI AND G SAVAR Gradient flows in metric spaces and in the space of probability measures Lectures in Mathematics ETH Z rich Birkhauser Verlag Basel sec ond ed 2008 Calculus and heat flows in metric measure spaces with ricci curvature bounded below preprint 2011 Spaces with riemannian ricci curvature bounded below preprint 2011 L
102. e gn m c lItezz u i 0 converges to some plan w r t the distance W on X Now fix n N and notice that for t 2 i 1 2 and y 5 Geod X it holds d e Y d orugos 0 1 22 d d 5o 5 2n and therefore squaring and then taking the sup over t 0 1 we get 27 1 1 HM gd mn 2 3 d Yaar Wit1 2 zi d 0 0 d 30 53 2 10 Choosing 5 to be a constant geodesic and using 2 9 we get that y Z Geod X for every m N Now for any given v P Y Geod X by a gluing argument Lemma 2 12 below with v i in place of v v Y Geod X Z X t we can find a plan 8 Z Geod X such that TaB v TaB v 2 27 estas or CTN on BE opt eu2 v cio 2 i 0 i 0 where optimality between In ison v and IL ts jon 3 U is meant w r t the Wasserstein dis tance on Y2 X Using 2 to bound from above W v and using 2 10 we get that for every couple of measures v v Y2 Geod X it holds 2 2 Wi p lt aw lever Teo i 0 i 1 cu Je Yo V1 dv y fe Jo 51 dv 3 as for Theorem 2 7 everything is simpler if closed balls in X are compact Indeed observe that a geodesic connecting two points in Ba xo lies entirely on the compact set D2n xo and that the set of geodesics lying on a given compact set is itself compact in Geod X so that the tightness of 14 follows directly from the one of 1o p1
103. e W to some extent the relation of D to Wz is the same relation that there is between Gromov Hausdorff distance and Hausdorff distance between compact subsets of a given metric space A bad property is that it is not suitable to study convergence of metric measure spaces which are endowed with infinite reference measures well the definition can easily be adapted but it would lead to a too strict notion of convergence very much like the Gromov Hausdorff distance which is not used to discuss convergence of non compact metric spaces The only notion of convergence of Polish spaces endowed with c finite measures that we are aware of is the one discussed by Villani in Chapter 27 of 80 Definition 27 30 It is interesting to remark that this notion of convergence does not guarantee uniqueness of the limit which can be though of as a negative point of the theory yet bounds from below on the Ricci curvature are stable w r t such convergence which in turn is a positive point as it tells that these bounds are even more stable The discussion on the local Poincar inequality and on Lemma 7 19 is extracted from 57 There is much more to say about the structure and the properties of spaces with Ricci curvature bounded below This is an extremely fast evolving research area and to give a complete discussion on the topic one would probably need a book nowadays Two things are worth to be quickly mentioned
104. e density that is given a smooth function A 0 oo 0 oo we define 1 WE E age int f iones opas adt 5 1 0 81 where the infimum is taken among all the distributional solutions of the non linear continuity equa tion d gaty v h pr 0 5 2 with po p and p pl The key assumption that leads to the existence of an action minimizing curve is the concavity of h since this leads to the joint convexity of J o D Malen h p so that using this convexity with J vh p one can prove existence of minima of 5 1 Particularly important is the case given by h z z for lt 1 from which we can build the distance Wa defined by 1 a Watt int foto 65 3 0 the infimum being taken among all solutions of 5 2 with p o and p pl The following theorem holds Theorem 5 4 Let a gt 1 i Then the infimum in 5 3 is always reached and if it is finite the minimizer is unique Now fix a measure p Y R The set of measures v with Waf v lt oo endowed with Wa is a complete metric space and bounded subsets are narrowly compact We remark that the behavior of action minimizing curves in this setting is in some very rough sense dual of the behavior of the branched optimal transportation discussed in the previous section Indeed in this problem the mass tends to spread out along an action minimizing curve rather than to glue together 5 3 An extension to measures with unequal
105. e need only to check the convexity of the functionals Let v be an interpolating curve with base the regular measure ju and To T the optimal trans port maps from u to vo and v4 respectively The only if part of i follows simply considering interpolation of deltas For the if observe tha V r f voina fva t To x tT x du x lt 1 8 f Vedea t VT due 5a 0 f trs Pda lt 1 t V vo tV 4 A tW2 vo wi 3 46 For ii we start claiming that W2 u x p v x v 2W2 u v for any p v R2 To prove this it is enough to check that if y Opt u v then nt nt 17 m7 uy Opt x p v x v To see this let y R RU 00 be a convex function such that supp y C O y and define the convex function 2 on R by G x y v x y It is immediate to verify that supp y C 07 so that is optimal as well This argument also shows that if v is an interpolating curve with base u then t gt v X v is an interpolating curve from vo x vo to v4 x vj with base u x u Also 21 22 W z4 z2 is A convex if W is The conclusion now follows from case i We pass to iii We will make the simplifying assumption that y and that Tp and T are smooth and satisfy det VTo r Z 0 det VTi r A 0 for every x supp j up to an approximation argument it is possible to reduce to this case we omit the details Then writing u pL from the change of variable formula
106. en by the distance squared on IR7 Then Theorem 2 10 and Remark 2 13 tell that the unique geodesic u connecting u to v is given by 1 pi 5 Su peren E so that the geodesic produces a V shaped path For some applications this is unnatural for instance in real life networks when one wants to transport the good located in x to the destinations y and y it is preferred to produce a branched structure where first the good it is transported on a single truck to some intermediate point and only later split into two parts which are delivered to the 2 destinations This produces a Y shaped path If we want to model the fact that it is convenient to ship things together we are lead to the following construction due to Gilbert Say that the starting distribution of mass is given by u gt J 45x and that the final one is v 5 bj y with 5 a 7 b 1 An admissible dynamical transfer is then given by a finite oriented weighted graph G where the weight is a function w set of edges of G gt R satisfying the Kirchoff s rule NS w e 5 w e ai Vi edges e outgoing from edges e incoming in x gt we 5 w e b VJ edges e outgoing from yj edges e incoming in yj 1 w e X w e 0 for any internal node z of G edges e outgoing from z edges e incoming in z Then for o 0 1 one minimizes w e length e edges e of G among all admissible graphs G Observe that
107. en these three statements holds in a much more general context more general underlying spaces cost functions measures Key concepts that are needed in the analysis are the generalizations of the concepts of cyclical monotonicity convexity and subdifferential which fit with a general cost function c The definitions below make sense for a general Borel and real valued cost Definition 1 7 c cyclical monotonicity We say that T C X x Y is c cyclically monotone if xi yi ET 1 i N implies N 5 c zi Yi 5 c zxi yo forall permutations a of 1 N i 1 i l Definition 1 8 c transforms Let y Y RU 00 be any function Its c transform wt X RU o0 is defined as Y x inf e x y vy ye Similarly given o X RU 00 its c4 transform is the function p Y RRU Eoo defined by qt y inf c z y p z The c_ transform Y X RU 00 ofa function w on Y is given by po x sup c x y YY ycY and analogously for c_ transforms of functions ip on X Definition 1 9 c concavity and c convexity We say that y X IRU oo is c concave if there exists ip Y RU oo such that p w Similarly Y Y RU 00 is c concave if there exists p Y RU o0 such that y pt Symmetrically p X RU 00 is c convex if there exists Y gt RU 00 such that p 4 and Y RU 00 is c convex if there exists p
108. energy Let u 0 20 RU 00 be a convex function bounded from below such that u 0 0 and u z lim 2l gt 0oo for some a gt z 0 z d 4 2 3 44 let u co limz ss u z z The internal energy functional E associated to u is E u yet oo R where u pL u is the decomposition of p in absolutely continuous and singular parts w r t the Lebesgue measure 71 Condition 3 44 ensures that the negative part of u p is integrable for uj 4 IR so that E is well defined possibly 00 Indeed from 3 44 we have u z az bz for some a lt 1 satisfying 2o 1 a gt d and it holds J etin e ea latae lt f oae lapractay fa Eeo lt o Under appropriate assumptions on V W and e the above defined functionals are compatible with the distance W2 As said before we will use as interpolating curves those given in Definition 3 29 Proposition 3 33 Let gt 0 The following holds i The functional V is A convex along interpolating curves in 4 R7 W2 if and only if V is A convex ii The functional W is convex along interpolating curves 2 IR W2 if W is A convex iii The functional is convex along interpolating curves 4 IR W2 provided u satisfies z 24u z is convex and non increasing on 0 00 3 45 Proof Since the second inequality in 3 29 is satisfied by the interpolating curves that we are con sidering inequality 3 43 w
109. entia tion so that in particular we have Jo Id 7 8 I Vw is 113 The fact that Jacobi fields are the differential of the exponential map reads in our case as VTi 2 v I x v therefore we have D det Jz 7 9 Also Jacobi fields satisfy the Jacobi equation which we write as Jj ApS 0 7 10 where A x Tox Ypa M gt T exp A x i RW v e5 tVy a M is the map given by where exp tVy a Recalling the rule det B det B tr B B7 valid for a smooth curve of linear operators we obtain from 7 9 the validity of D Ditr J J7 7 11 Evaluating this identity at t 0 and using 7 8 we get the first of 7 7 Recalling the rule B B B B7 valid fora smooth curve of linear operators and differentiating in time equation 7 11 we obtain D D E HI D IT IIT HIT Da HIT te At Se 1 having used the Jacobi equation 7 10 Evaluate this expression at t 0 use 7 8 and observe that tr Ao tr v e R V v vel Ric Vy Vt to get the second of 7 7 Theorem 7 11 Compatibility of weak Ricci curvature bounds Let M be a compact Riemannian manifold d its Riemannian distance and m its normalized volume measure Then i the functional amp is K geodesically convex on P2 M W2 if and only if M has Ricci curvature uniformly bounded from below by K ii the functional Ey is geodesically convex on 2 M W2 if and only
110. eorem was proved in the compact case in 80 Theorem 7 21 this has been extended to locally compact structures and much more general forms of interpolation The main source of difficulty when dealing with general Polish structure is the potential lack of tightness the proof presented here is strongly inspired by the work of S Lisini 54 Proposition 2 16 and Theorem 2 18 come from 80 Corollary 7 32 and Theorem 7 36 respec tively Theorem 2 20 and the counterexample 2 21 are taken from 6 Theorem 7 3 2 and Example 7 3 3 respectively The proof of Corollary 2 24 is taken from an argument by A Fathi 35 the paper being inspired by Bernand Buffoni 13 Remark 2 27 is due to N Juillet 48 The idea of looking at the transport problem as dynamical problem involving the continuity equa tion is due to J D Benamou and Y Brenier 12 while the fact that 75 IR W2 can be viewed as a sort of infinite dimensional Riemannian manifold is an intuition by F Otto 67 Theorem 2 29 has been proven in 6 where also Propositions 2 32 2 33 and 2 34 were proven in the case M R4 the generalization to Riemannian manifolds comes from Nash s embedding theorem 3 Gradient flows The aim of this Chapter is twofold on one hand we give an overview of the theory of Gradient Flows in a metric setting on the other hand we discuss the important application of the abstract theory to the case of geodesically convex functionals on the space 5 I
111. equiring geodesic convexity on the whole 42 X would lead to a notion not invariant under isomorphism of metric measure spaces Also for the C D 0 N condition one requires the geodesic convexity of all y to ensure the following compatibility condition if X is a CD 0 N space then it is also a CD 0 N space for any N N Using Proposition 2 16 it is not hard to see that such compatibility condition is automatically satisfied on non branching spaces a Remark 7 9 How to adapt the definitions to general bounds on curvature the dimension It is pretty natural to guess that the notion of bound from below on the Ricci curvature by K c R and bound from above on the dimension by N can be given by requiring the functional y to be K geodesically convex on Z X W2 However this is wrong because such condition is not compatible with the Riemannian case The hearth of the definition of CD K N spaces still con cerns the properties of y but a different and more complicated notion of convexity is involved Oo 112 Let us now check that the definitions given have the qualitative properties that we discussed in the introduction of this chapter Intrinsicness This property is clear from the definition Compatibility To give the answer we need to do some computations on Riemannian manifolds Lemma 7 10 Second derivative of the internal energy Let M be a compact and smooth Rieman nian manifold m its normalized volume measure
112. er 5 is devoted to the presentation of three recent variants of the optimal transport problem which lead to different notions of Wasserstein distance the first one deals with variational problems giving rise to branched transportation structures with a Y shaped path opposed to the V shaped one typical of the mass splitting occurring in standard optimal transport problems The second one involves modification in the action functional on curves arising in the Benamou Brenier formula this leads to many different optimal transportation distances maybe more difficult to describe from the Lagrangian viepoint but still with quite useful implications in evolution PDE s and functional inequalities The last one deals with transportation distance between measures with unequal mass a variant useful in the modeling problems with Dirichlet boundary conditions Chapter 6 deals with a more detailed analysis of the differentiable structure of 45 R4 besides the analytic tangent space arising from the Benamou Brenier formula also the geometric tangent space based on constant speed geodesics emanating from a given base point is introduced We also present Otto s viewpoint on the duality between Wasserstein space and Arnold s manifolds of measure preserving diffeomorphisms A large part of the chapter is also devoted to the second order differentiable properties involving curvature The notions of parallel transport along sufficiently regular geode
113. er by R Jordan D Kinderlehrer and F Otto 47 where it was proved that the minimizing movements procedure for the functional pL plogp VpdL on the space Z7 IR W2 produce solutions of the Fokker Planck equation Later F Otto in 67 showed that the same discretization applied to pee pal a 1 with the usual meaning for measures with a singular part produce solutions of the porous medium equation The impact of Otto s work on the community of optimal transport has been huge not only he was able to provide concrete consequences in terms of new estimates for the rate of convergence 76 of solutions of the porous medium equation out of optimal transport theory but he also clearly described what is now called the weak Riemannian structure of 2 R W2 see also Chapter 6 and Subsection 2 3 2 Otto s intuitions have been studied and extended by many authors The rigorous description of many of the objects introduced by Otto as well as a general discussion about gradient flows of A geodesically convex functionals on 5 IR W2 has been done in the second part of 6 the discussion made here is taken from this latter reference 4 Geometric and functional inequalities In this short Chapter we show how techniques coming from optimal transport can lead to simple proofs of some important geometric and functional inequalities None of the results proven here are new in the sense that they all were wel
114. ere exists a constant speed geodesic connecting them i e a constant speed geodesic such that Yo x and 4 y Before entering into the details let us describe an important example Recall that X 5 xz gt 6 P X is an isometry Therefore if t is a constant speed geodesic on X connecting x to y the curve t d is a constant speed geodesic on 2 X which connects 6 to y The important thing to notice here is that the natural way to interpolate between 6 and y is given by this so called displacement interpolation Conversely observe that the classical linear interpolation t uy 1 t toy produces a curve which has infinite length as soon as x y because W ut Hs v t s d x y and thus is unnatural in this setting We will denote by Geod X the metric space of all constant speed geodesics on X endowed with the sup norm With some work it is possible to show that Geod X is complete and separable as soon as X is we omit the details The evaluation maps e Geod X X are defined for every t 0 1 by ey o 2 6 Theorem 2 10 Let X d be Polish and geodesic Then Y2 X W2 is geodesic as well Further more the following two are equivalent i t gt m Po X is a constant speed geodesic ii There exists a measure p 2 Geod X such that eo e1 4H Opt uo p and pi eig Hh 2 7 31 Proof Choose u u A X and find an optimal plan Opt u
115. es and just focus on the main concepts For an interpolating curve as in the definition it holds W32 u w 1 HW pu vo tW2 u vi t HW vo 1i 3 43 Indeed the map 1 t To tT is optimal from p to v because we know that To and T are the gradients of convex functions qo p respectively thus 1 To tT is the gradient of the convex function 1 go tv and thus is optimal and we know by inequality 2 1 that W2 vo 14 lt To Ti Z2 gt thus it holds W3 u ve 1 t To M Z2 1 t To Id 72 tlT1 Idl720 t0 lT Ti lieg 1 WS m vo tW2 p v1 t 1 0W3 vo v1 We now pass to the description of the three functionals we want to study Definition 3 30 Potential energy Let V R RU 00 be lower semicontinuous and bounded from below The potential energy functional V P2 R R U 00 associated to V is defined by vu Vap Definition 3 31 Interaction energy Let W R RU 00 be lower semicontinuous even and bounded from below The interaction energy functional W IR RU 00 associated to W is defined by 1 W 5 f Wii zajdu x uis m Observe that the definition makes sense also for not even functions W however replacing if neces sary the function W x with W x W 2 2 we get an even function leaving the value of the functional unchanged Definition 3 32 Internal
116. es n such that Lip u lt L and fix two smooth vectors ij 0 CV R1 R4 100 Notice that for n m I it holds Nu un Un m Num Um lu lt lA un Un lu IAN un Um 0 lu lA Um EX Um lu Lun Lip un unllu Llvm Olle and thus EN lim Nu un Un Nu m vm llu 2Ll v Olly n m oo n mcl this expression being vacuum if J is finite If n J and m I we have Lip v L and IN Cun Un Nu um Um llu S Nu us v Dll NS Qum 8 Dla Nu CI 9 Um Mle Nu um oma lt L v llu Lip un Gly Lip o Olly Lllum llu which gives J Nu Quas Nus tola lt Lilo llu Lipa llu n lI m I Exchanging the roles of the u s and the v s in these inequalities for the case in which n I we can conclude lim Ns un vn Nu um mln S 2L v vll 2L u aly n m oo Since are arbitrary we can let gt u and 6 v in L and conclude that n gt Na Un Un is a Cauchy sequence as requested The other claims follow trivially by the sequential continuity Definition 6 19 The operators O and O Let p Pa R and v L with Lip v lt oo Then the operator u gt O u is defined by O u Na v u The operator u O u is the adjoint of O i e it is defined by O7 u w u Ov w Vw p It is clear that the operator norm
117. f particular relevance is the case of sectional curvature Example 6 28 The sectional curvature If we evaluate the curvature tensor R on a 4 ple of vectors of the kind u v u v and we recall the antisymmetry of M we obtain R u v u v 31A u oll Thanks to the simplification of the formula the value of R u v u v is well defined as soon as either u or v is Lipschitz That is R u v u v is well defined for u v LNL In analogy with the Riemannian case we can therefore define the sectional curvature K u v at the measure u along the directions u v by R 3 Qus v A R uv uv IN ua vl V u v M n 27 2 felled use lulio uv This expression confirms the fact that the sectional curvatures of 2 IR are positive coherently with Theorem 2 20 and provides a rigorous proof of the analogous formula already appeared in 67 and formally computed using O Neill formula 6 4 Bibliographical notes The idea of looking at the Wasserstein space as a sort of infinite dimensional Riemannian manifold is due to F Otto and given in his seminal paper 67 The whole discussion in Section 6 1 is directly taken from there The fact that the tangent space made of gradients Tan IR7 was not sufficient to study all the aspects of the Riemannian geometry of Z IR7 W2 has been understood in 6 in con nection with the definition of subdifferential of a geodesically convex functional
118. from 7 16 that v is also left continuous Thus it is continuous and in particular the volume of the spheres y d y x r is 0 for any r gt 0 In particular m y 0 for any y X and the proof is concluded An interesting geometric consequence of the Brunn Minkowski inequality in conjunction with the non branching hypothesis is the fact that the cut locus is negligible Proposition 7 16 Negligible cut locus Assume that X d m is a C D 0 N space and that it is non branching Then for every x supp m the set of y s such that there is more than one geodesic from x to y is m negligible In particular form x m a e x y there exists only one geodesic from x to y and the map X gt x y y Geod X is measurable Proof Fix x supp m R gt 0 and consider the sets A Bn z Fix t lt 1 and y A We claim that there is only one geodesic connecting it to x By definition we know that there is some z Bg x and a geodesic y from z to x such that 7 y Now argue by contradiction and assume that there are 2 geodesics 41 4 from y to x Then starting from z following y for time 1 t and 118 then following each of y y for the rest of the time we find 2 different geodesics from z to x which agree on the non trivial interval 0 1 t This contradicts the non branching hypothesis Clearly A C A C Br x for t lt s thus t m A is non decreasing By 7 14 a
119. function p which is identically 0 on OQ such that supp y C 0 here c z y x y Observe that M2 Q Wb2 is always a geodesic space while from Theorem 2 10 and Remark 2 14 we know that Y Q W2 is geodesic if and only if is that is if and only if Q is convex It makes perfectly sense to extend the entropy functional to the whole M2 Q the formula is still E u f plog p for u pli and E u oo for measures not absolutely continuous The Gradient Flow of the entropy w r t Wb2 produces solutions of the Heat equation with Dirichlet boundary conditions in the following sense Theorem 5 6 Let y M2 Q be such that E u lt oo Then e for every T gt 0 there exists a unique discrete solution p starting from p and constructed via the Minimizing Movements scheme as in Definition 3 7 e As T 0 the measures p converge to a unique measure p in M2 Q Wb2 for any t gt 0 e The map x t gt pi x is a solution of the Heat equation So Api in x 0 00 pt gt H weakly as t 0 subject to the Dirichlet boundary condition p x e in OQ for every t gt 0 that is py e belongs to Hj Q for every t gt 0 83 The fact that the boundary value is given by e can be heuristically guessed by the fact that the entropy has a global minimum in M 2 Q such minimum is given by the measure with constant density e i e the measure whose density is everywhere equal to the minimum of z z log z
120. g the Riemannian distance Let us start with the case X Y IR and c z y x y 2 In this case there is a simple characterization of c concavity and c superdifferential Proposition 1 21 Let o R gt RU oo Then o is c concave if and only if x G x z 2 p x is convex and lower semicontinuous In this case y 0 q x if and only if y O P x Proof Observe that 2 2 2 a int EZL iy e gle int EL o ay yy y y eet E ciatis y B vw which proves the first claim For the second observe that x y 2 p y 6 p z yea ole lt z y 2 2 y y Yz R4 p z a a 2 x y lyl 2 p y p z z 2 z y ly 2 p y Vz c R7 e e z Iz 2 lt e z lx 2 z 2 y Vz eR y t p 2 a Syed Pz Therefore in this situation being concentrated on the c superdifferential of a c concave map means being concentrated on the graph of the subdifferential of a convex function Remark 1 22 Perturbations of the identity via smooth gradients are optimal An immediate consequence of the above proposition is the fact that if v C IR then there exists gt 0 such that Id V is an optimal map for any e lt Z Indeed it is sufficient to take such that Id lt V w lt Id With this choice the map x r 2 amp v z is convex for any e
121. hat for 1 a e x there exists only one y T x Y such that x y T In this case y is induced by the map T Proof The if part is obvious For the only if let I be as in the statement of the lemma Possibly removing from I a product N x Y with N j4 negligible we can assume that T is a graph and denote by T the corresponding map By the inner regularity of measures it is easily seen that we can also assume I U T to be c compact Under this assumption the domain of T i e the projection of I on X is c compact hence Borel and the restriction of T to the compact set 7 x Tn is continuous It follows that T is a Borel map Since y T x a e in X x Y we conclude that J d z y dyle y 7 ae F a dy ae y J ae T 2 du z so that y Id x T np Thus the point is the following We know by Theorem 1 13 that optimal plans are concentrated on c cyclically monotone sets still from Theorem 1 13 we know that c cyclically monotone sets are obtained by taking the c superdifferential of a c concave function Hence from the lemma above what we need to understand is how often the c superdifferential of a c concave function is single valued There is no general answer to this question but many particular cases can be studied Here we focus on two special and very important situations 14 e X Y R and c z y x y 2 e X Y M where M is a Riemannian manifold and c z y d z y 2 d bein
122. hat m B gt 0 and any Lipschitz function f X R it holds 92N41 r being the radius Mon Proof Notice that x te Jimla lt ae o M Fy lam a dm y L m B Jpxp _ If 40 f duly Geod X where yz is defined as in the statement of Lemma 7 19 Observe that for any geodesic y the map t f is Lipschitz and its derivative is bounded above by d o 1 V f 7 for a e t Hence since any geodesic y whose endpoints are in B satisfies d yo 1 2r we have Ju 0 fev dey ysa f a IV fl ve du Cy a f f IV f d er 4 udt By Lemma 7 19 we obtain f Nidedsna lt Z Iv fiim By the Bishop Gromov inequality we know that m 2B lt 2 m B and thus em 92N 1 V f dmm lt V f dmrn m B T m 2B which is the conclusion 7 3 Bibliographical notes The content of this chapter is taken from the works of Lott and Villani on one side 58 57 and of Sturm 74 75 on the other The first link between K geodesic convexity of the relative entropy functional in Z 5 M W2 and the bound from below on the Ricci curvature is has been given by Sturm and von Renesse n 76 The works 74 75 and 58 have been developed independently The main difference between them is that Sturm provides the general definition of CD K N bound which we didn t speak about with the exception of the quick citation in Remark 7 9 while Lott and Villani focused on the cases C D K oo and CD 0
123. he unique optimal transport map from u to v which exists and is unique by Theo rem 1 26 due to our assumptions on p We conclude the section with a sketch of the proof of Theorem 2 29 Sketch of the Proof of Theorem 2 29 Reduction to the Euclidean case Suppose we already know the result for the case IR and we want to prove it for a compact and smooth manifold M Use the Nash embedding theorem to get the existence of a smooth map i M RP whose differential provides an isometry of Tp M and its image for any x M Now notice that the inequality i x i y lt d x y valid for any x y M ensures that Wa igu ipv lt Wo p v for any u v 45 M Hence given an absolutely continuous curve u C Y2 M the curve is C 5 IRP is absolutely continuous as well and there exists a family vector fields v such that 2 20 is fulfilled with 7441 in place of ju 46 and vzl z2 4u S lips lt 4 for a e t Testing the continuity equation with functions constant on i M we get that for a e t the vector field v is tangent to i M for i4 p4 a e point Thus the v s are the isometric image of vector fields on M and part A is proved Viceversa let uj C Ao M be a curve and the vps vector fields in M such that J v r2 u dt lt oo and assume that they satisfy the continuity equation Then the measures fiz ig p and the vector fields 9 di v satisfy the continuity equation on RP Therefore f is a
124. hematical Society Providence RI 2003 127 80 Optimal transport old and new Springer Verlag 2008 81 Q XIA Optimal paths related to transport problems Commun Contemp Math 5 2003 pp 251 279 Interior regularity of optimal transport paths Calc Var Partial Differential Equations 20 2004 pp 283 299 83 L ZAJ CEK On the differentiability of convex functions in finite and infinite dimensional spaces Czechoslovak Math J 29 1979 pp 340 348 82 128
125. hen for every t 0 1 there exists only one optimal plan in Opt po pt and this plan is induced by a map from ju Finally the measure p Geod X associated to u via 2 7 is unique Proof Let pj C P2 X be a constant speed geodesic and fix to 0 1 Pick yt Opt uo Lto and y Opt ju H1 We want to prove that both yt and are induced by maps from u To this aim use the gluing lemma to find a 3 plan P X such that 12 wl Ty Oe 23 2 Ty a and observe that since u is a geodesic it holds d x 9l zz lt drt m d x T lt ldir r lle ldir 79 22 dr roni dir m II cya W2 Ho teo Wa uto 1 W po pa so that 7 73 za Opt uo u1 Also since the first inequality is actually an equality we have that d x y d y z d x z for o a e x y z which means that x y z lie along a geodesic Furthermore since the second inequality is an equality the functions x y z d x y and x y z d y z are each a positive multiple of the other in supp q It is then immediate to verify that for every x y z supp a it holds d x y 1 to d z z d y z tod a z We now claim that for x y z z y 2 supp q it holds a y z x y 2 if and only if y y Indeed pick x y z 2 y z supp a and assume for instance that z Z z Since 11 1 4 0 is an optimal plan by the cyclical monoto
126. hus the problem is to show the existence There is an important analogy which helps under standing the proof that we want to point out we already know that the space IR W2 looks like a Riemannian manifold but actually it has also stronger similarities with a Riemannian manifold M embedded in some bigger space say on some Euclidean space R indeed in both cases e we have a natural presence of non tangent vectors elements of L V Tan 45 R for Z R4 and vectors in R non tangent to the manifold for the embedded case e The scalar product in the tangent space can be naturally defined also for non tangent vectors scalar product in L5 for the space 2 IR and the scalar product in RP for the embedded case This means in particular that there are natural orthogonal projections from the set of tangent and non tangent vectors onto the set of tangent vectors P L7 gt Tan 22 IR4 for Z5 R7 and P RP T M for the embedded case e The Covariant derivative of a tangent vector field is given by projecting the time derivative onto the tangent space Indeed for the space 4 IR we know that the covariant derivative is given by formula 6 8 while for the embedded manifold it holds d Vau Py Zu 6 11 where t gt Y is a smooth curve and t u T4 M is a smooth tangent vector field Given these analogies we are going to proceed as follows first we give a proof of the existence of the par
127. ided the map df Ker df z Tr N 84 is a surjective isometry for any x M A trivial example of submersion is given in the case M N x L for some Riemannian manifold L with M endowed with the product metric and f M N is the natural projection More generally if f is a Riemannian submersion for each y N the set f y C M is a smooth Riemannian submanifold The duality between the Wasserstein and the Arnold Manifolds consists in the fact that there exists a Big Manifold BM which is flat and a natural Riemannian submersion from BM to 4 IR whose fibers are precisely the Arnold Manifolds Let us define the objects we are dealing with Fix once and for all a reference measure p Z IR recall that we are assuming that all the measures are absolutely continuous with smooth densities so that we will use the same notation for both the measure and its density The Big Manifold BM is the space L p of maps from R to R which are L w r t the reference measure p The tangent space at some map T BM is naturally given by the set of vector fields belonging to L p where the perturbation of T in the direction of the vector field u is given by t T tu The target manifold of the submersion is the Wasserstein manifold 4 IR We recall that the tangent space Tan 2 IR7 at the measure p is the set Tan P2 R Ve p CPR endowed with the scalar product of L p we
128. identity tells us that p is a first order approximation of the distributional solution of the Heat equation starting from po and evaluated at time 7 67 To prove 3 38 fix y CS IR and perturb p in the following way Id eV pr The density of p can be explicitly expressed by pz x p e eVe z det Id eV2g x Observe that it holds e _ E E E P Pr E p f log p f prlog f o Id eV f ones A E p Z log det Id eV y E pr e f oho ot where we used the fact that det Id A 1 etr A o e To evaluate the first variation of the distance squared let T be the optimal transport map from p to po which exists because of Theorem 1 26 and observe that from Typ po Id EV p p and inequality 2 1 we have W3 po p T Id EVel 3 39 therefore from the fact that equality holds at e 0 we get W3 po W2 po pr IT Id eVell22 T Idll22 2e r Id V pr of From the minimality of p for the problem 3 37 we know that W3 0 po W2 pr po 2T 2T so that using 3 39 and 3 40 dividing by e rearranging the terms and letting 0 and amp f 0 we get following Euler Lagrange equation for p T Id ode e pr 0 3 41 Now observe that from Typ po we get Jer epo _ f p T 2 e o a dz T 3 40 E p gt E p Ve iff Vo 1 tha tT a
129. ield of a geodesic Let u be the restriction to 0 1 of a geodesic defined in some larger interval e 1 and let v be its velocity vector field Then we know by Proposition 6 3 that u is regular Also from formula 6 5 it is easy to see that it holds vs o T t s vi Vt s 0 1 and thus v is absolutely continuous and satisfies do 0 and a fortiori Du 0 Thus as expected the velocity vector field of a geodesic has zero convariant derivative in analogy with the standard Riemannian case Actually it is interesting to observe that not only the covariant derivative is 0 in this case but also the total one L Now we pass to the question of parallel transport The definition comes naturally Definition 6 10 Parallel transport Let u be a regular curve A tangent vector field uz along it is a parallel transport if it is absolutely continuous and u 0 a e t dt 93 It is immediate to verify that the scalar product of two parallel transports is preserved in time indeed the compatibility with the metric 6 9 yields diya 2 3 1 2 D 3 uj uz QUU Ut Ut 0 a t dt u dt dt T for any couple of parallel transports In particular this fact and the linearity of the notion of parallel transport give uniqueness of the parallel transport itself in the sense that for any u Tanpo 5 R7 there exists at most one parallel transport u along p satisfying uo u T
130. imension free condition on u We saw that a sufficient condition on u to ensure that is convex along interpolating curves is the fact that the map z gt ztu z7 is convex and non increasing so the dimension d of the ambient space plays a role in the condition The fact that the map is non increasing follows by the convexity of u together with u 0 0 while by simple computations we see that its convexity is equivalent to z lu z u z zu z gt x J zu 2 3 48 Notice that the higher d is the stricter the condition becomes For applications in infinite dimensional spaces it is desirable to have a condition on u ensuring the convexity of in which the dimension does not enter As inequality 3 48 shows the weakest such condition for which is convex in any dimension is z lu z u z zu z gt 0 and some computations show that this is in turn equivalent to the convexity of the map z c e u e A key example of map satisfying this condition is z zlog z Therefore we have the following existence and uniqueness result 73 Theorem 3 35 Let gt 0 and F be either Y W E or a linear combination of them with positive coefficients and A convex along interpolating curves Then for every Ti IR there exists a unique Gradient Flow 14 for F starting from ji in the EVI formulation The curve qu satisfies is locally absolutely continuous on 0 00 pt T as t 0 and if u is reg
131. in particular con cerning the issue of having a closed subdifferential In the appendix of 6 the concept of Geometric Tangent space discussed in Section 6 2 has been introduced Further studies on the properties of Tan Z M have been made in 43 Theorem 6 1 has been proved in 46 The first work in which a description of the covariant derivative and the curvature tensor of Z5 M W2 M being a compact Riemannian manifold has been given beside the formal calculus of the sectional curvature via O Neill formula done already in 67 is the paper of J Lott 56 rigorous formulas are derived for the computation of such objects on the submanifold Zc M 106 made of absolutely continuous measures with density C and bounded away from 0 In the same paper Lott shows that if M has a Poisson structure then the same is true for Yow M atopic which has not been addressed in these notes Independently on Lott s work the second author built the parallel transport on 45 R7 W2 in his PhD thesis 43 along the same lines provided in Section 6 3 The differences with Lott s work are the fact that the analysis was carried out on IR rather than on a compact Riemannian manifold that no assumptions on the measures were given and that both the existence Theorem 6 15 for the parallel transport along a regular curve and counterexamples to its general existence the Example 6 16 were provided These results have been published by the a
132. iple Observe how the constraint y Adm p v becomes the functional to maximize in the dual problem and the functional to minimize f cd y becomes the constraint in the dual problem Start observing that inf c z y dy x y inf c z y dy 1 6 veal f cleuddr a u _ int ECOLE 1 6 where x y is equal to 0 if y Adm u v and 00 if y Adm u v and M X x Y is the set of non negative Borel measures on X x Y We claim that the function x may be written as xy sur oodua f vari f o oae P where the supremum is taken among all y Y C X x Ca Y Indeed if y Adm u v then x y 0 while if y Adm u v we can find Y Y Cy X x C Y such that the value between the brackets is different from 0 thus by multiplying y Y by appropriate real numbers we have that the supremum is 00 Thus from 1 6 we have inf J c x y dy x y YEAdm u v spd J denaren f odua f voava e evayhsy inf yYEM4 X xY yw Call the expression between brackets F p Y Since y gt F 7 v v is convex actually linear and p Y F v y w is concave actually linear the min max principle holds and we have inf sup F y 9 v jm P595 V e inf y Adm p v p p ab Y MO xY Thus we have inf EC y dy z y YE Adm pv sup inf yw LL CX xY f ni f oodua ovt ole ive supd f oana f Gyr e ut f eto e voava d pw Cy MLL X xY Now
133. ith Dirichlet boundary conditions This is actually doable as we briefly discuss now Let Q C R7 be open and bounded Consider the set M2 Q defined by M2 Q measures pon Q such that ee OQ dpi a lt and for any u v Ma Q define the set of admissible transfer plans Adm u v by y Adm p v if and only if y is a measure on Q such that THY h TA Yy g 7 v Notice the difference w r t the classical definition of transfer plan here we are requiring the first respectively second marginal to coincide with u respectively v only inside the open set Q This means that in transferring the mass from p to v we are free to take put as much mass as we want from to the boundary Then one defines the cost C y of a plan y by y J lz yd y and then the distance W bz by Wal v inf VOO where the infimum is taken among all y Adm y v The distance W bg shares many properties with the Wasserstein distance W2 Theorem 5 5 Main properties of Wb2 The following hold e Wb isa distance on M and the metric space M2 Q Wb2 is Polish and geodesic e A sequence Hn C Me Q converges to u wrt Wb if and only if un converges weakly to p in duality with continuous functions with compact support in and f d x 0Q dpin gt f x 0Q dp as n oo e Finally a plan y Admy p v is optimal i e it attains the minimum cost among admissible plans if and only there exists a c concave
134. know that f f on some set A C 0 1 such that Z 0 1 V A 0 and we want to prove that they actually coincide everywhere Recall that f is s c and f is continuous hence lt fM in 0 1 If by contradiction it holds V to lt c lt C lt f to for some to c C we can find 6 gt 0 such that f t gt C int to to Thus f t gt C fort to 6 to N A and the contradiction comes from 1 C n g t dt gt g t dt gt dh o 0 to 5 to 6 NA to d to s n lt tol Thus we proved that if g L 0 1 it holds Vineet s na f sw Vt lt s 0 1 M gt 0 t Letting M oo we prove 3 24 and hence the thesis This proposition is the key ingredient to pass from existence of Gradient Flows in the EDI for mulation to the one in the EDE formulation Theorem 3 20 Gradient Flows in the EDE formulation Let X E be satisfying Assumption 3 17 and T X be such that E Z lt oo Then all the results of Theorem 3 14 hold Also any Gradient Flow in the EDI sense is also a Gradient Flow in the EDE sense Definition 3 4 Proof The first part of the statement follows directly from Proposition 3 18 By Theorem 3 14 we know that the limit curve is absolutely continuous and satisfies 1 f 1 f E t 5 2dr 7 V E x dr lt ET Ys gt 0 3 27 0 0 2 loc In particular the functions t gt d and t VE x belong to L Proposition 3 19 we kno
135. l Lebesgue measure and for each t A we can find a subsequence Tn O such that Tnk sup VE x lt oo Then the third assumption in 3 13 guarantees that E a E x and the lower semicontinuity of E that E z lt lim o E x5 for every s gt t Thus passing to the limit in 3 16 as T 0 and using 3 18 and 3 19 we get 1 f If E t 5 f l dr 5 VEP ar dr lt E m Wee A Vs gt t t t We conclude with an example which shows why in general we cannot hope to have equality in the EDI Shortly said the problem is that we don t know whether t gt E x is an absolutely continuous map Example 3 15 Let X 0 1 with the Euclidean distance C C X a Cantor type set with null Lebesgue measure and f 0 1 1 00 a continuous integrable function such that f a 00 58 for any x C which is smooth on the complement of C Also let g 0 1 0 1 be a Devil staircase built over C i e a continuous non decreasing function satisfying g 0 0 g 1 1 which is constant in each of the connected components of the complement of C Define the energies E 0 1 gt R by E a g z f Fly ay ie f fG dy It is immediate to verify that E E satisfy all the Assumptions 3 8 3 13 the choice of f guarantees that the slopes of E E are continuous Now build a Gradient Flow starting from 0 with some work it is possible to check that the Minimizing Movement
136. l in some but not all problems For instance problems with constraints or in Wiener spaces infinite dimensional Gaussian spaces include oo valued costs with a large set of points where the cost is not finite We won t discuss these topics L Animportant consequence of the previous theorem is that being optimal is a property that depends only on the support of the plan y and not on how the mass is distributed in the support itself if is an optimal plan between its own marginals and y is such that supp C supp then is optimal as well between its own marginals of course We will see in Proposition 2 5 that one of the important consequences of this fact is the stability of optimality Analogous arguments works for maps Indeed assume that T X Y is a map such that T x x for some c concave function x for all x Then for every y A X such that condition 1 4 is satisfied for v Tq the map T is optimal between u and Tz Therefore it makes sense to say that T is an optimal map without explicit mention to the reference measures Remark 1 15 From Theorem 1 13 we know that given uj A X v A Y satisfying the assumption of the theorem for every optimal plan there exists a c concave function o such that supp y C O y Actually a stronger statement holds namely if supp y C O for some optimal y then supp C O q for every optimal plan y Indeed arguing as in the proof of 1 13 one can see th
137. l known before the proofs coming from optimal transport appeared Still it is interesting to observe how the tools described in the previous sections allow to produce proofs which are occasionally simpler and in any case providing new informations when compared to the standard ones 4 1 Brunn Minkowski inequality Recall that the Brunn Minkowski inequality in IR is e 8 emm and is valid for any couple of compact sets A B C R To prove it let A B C IR be compact sets and notice that without loss of generality we can assume that Z A Z B gt 0 Define 1 Ho sam d 1 ZA lA d M1 ZBI B and let u be the unique geodesic in Z5 R7 W2 connecting them Recall from 3 47 that for u z d z 4 z the functional p f u p dL4 is geodesi cally convex in 45 R4 W2 Also simple calculations show that juo d Z4 A 4 1 u1 dC Z4 B 4 1 Hence we have Elja lt 24a Gram 4 Now notice that Theorem 2 10 see also Remark 2 13 ensures that 44 2 is concentrated on thus letting fi 2 CZ A B 2 7 24 a ay 9 and applying Jensen s inequality to the convex function u we get E u1 2 2 E f1j2 d e which concludes the proof n 77 4 2 Isoperimetric inequality On RR the isoperimetric inequality can be written as P E dLUB t where E is an arbitrary open set P E its perimeter and B the unitary ball We wi
138. let x be a function with bounded support values in 0 1 and identically 1 on B and notice that for every n N it holds J mm f us f ra ans x f xau f NELLE J ats 2a a being given by 2 2 Since fx is continuous and bounded we have f fxdun f fxd and therefore im Hie f Pies i fap 2ae Since gt 0 was arbitrary this part of the statement is proved ii gt iii Obvious iii gt i Argue by contradiction and assume that there exist gt 0 and 39 X such that for every R gt 0 it holds suppen Fx Bn Go d io dp gt Then it is easy to see that it holds lim d 29 dpin gt 2 3 noo X Br 20 27 For every R gt 0 let yp be a continuous cutoff function with values in 0 1 supported on Br xo and identically 1 on Bg ro Since d xo x is continuous and bounded we have J 4c ooa lim d xo x ndum n oo lim d zo du 4c xn dun 2 J dione Die dC zo xi dins n oo lt f 20 du lim f P se0 dpin n gt JX Br 20 f e zo du lim d zo dun n oo X Br ao lt J c having used 2 3 in the last step Since J conn o sip PC ox lt f Gan e we got a contradiction Proposition 2 5 Stability of optimality The distance W is lower semicontinuous w rt narrow convergence of measures Furthermore if y C P X is a sequence of optimal plans which narrowly converges to y amp
139. ll prove this inequality via Brenier s theorem 1 26 neglecting all the smoothness issues Let p FT win n y 7T wip lp and T E B be the optimal transport map w r t the cost given by the distance squared The change of variable formula gives 1 1 det VT z Va E PUB et 7 Dace T Since we know that T is the gradient of a convex function we have that VT x is a symmetric matrix with non negative eigenvalues for every x E Hence the arithmetic geometric mean inequality ensures that E det VI a 1 4 lt E Va E Coupling the last two equations we get 1 T 1 lt va Va E ZE d Bj Integrating over E and applying the divergence theorem we get Hee o l1 ER z v x ar LEY lt Sra V Tode gays TO aH where v OE IR is the outer unit normal vector Since T x B for every x E we have T x 1 for x E and thus T x v z 1 We conclude with 1 d i P E 4 3 Sobolev Inequality E ZI E The Sobolev inequality in IR reads as q ui cap vm vf e WR where 1 lt p d p qu and C d p is a constant which depends only on the dimension d and the exponent p We will prove it via a method which closely resemble the one just used for the isoperimetric inequality Again we will neglect all the smoothness issues Fix d p and observe that without loss of generality we can assume f gt 0 and f f 1 so that our aim is to p
140. lt and thus its gradient is an optimal map a Proposition 1 21 reduced the problem of understanding when there exists optimal maps reduces to the problem of convex analysis of understanding how the set of non differentiability points of a convex function is made This latter problem has a known answer in order to state it we need the following definition Definition 1 23 c c hypersurfaces A set E C IR is called c c hypersurface if in a suitable system of coordinates it is the graph of the difference of two real valued convex functions i e if there exists convex functions f g R R such that E y t eR ger tER t f y 9 y here c c stands for convex minus convex and has nothing to do with the c we used to indicate the cost function 15 Then it holds the following theorem which we state without proof Theorem 1 24 Structure of sets of non differentiability of convex functions Let A C R Then there exists a convex function Y IR R such that A is contained in the set of points of non differentiability of Y if and only if A can be covered by countably many c c hypersurfaces We give the following definition Definition 1 25 Regular measures on IR A measure u IR is called regular provided u E 0 for any c c hypersurface E C R4 Observe that absolutely continuous measures and measures which give 0 mass to Lipschitz hy persurfaces are automatically regular becau
141. lutely continuous w r t the Lebesgue measure To give a meaning to formula 6 23 we need to introduce a new tensor Definition 6 17 The Lipschitz non Lipschitz space Let p Z R The set LNL C L7 is the set of couples of vector fields u v such that min Lip u Lip v lt oo i e the set of couples of vectors such that at least one of them is Lipschitz We say that a sequence Un Un LNL converges to u v LNL provided u u 0 lun vll 0 and sup min Lip u Lip v4 lt oo The following theorem holds Theorem 6 18 The Normal tensor Let y R4 The map N u v CX R4 R4 Tan P2 R4 u v gt Pi Vut v extends uniquely to a sequentially continuous bilinear and antisymmetric map still denoted by N from LNL in Tan 43 IR7 for which the bound IN us v S min Lip w loll Lip lull 6 24 holds Proof For u v CX R R we have V u v Vut v Vot u so that taking the projections on Tan 43 R we get Nu u v N v u Vu v C R4 R In this case the bound 6 24 is trivial To prove existence and uniqueness of the sequentially continuous extension it is enough to show that for any given sequence n tn Un C IR R7 converging to some u v LNL the sequence n gt Na Un Un Tan 2 R is a Cauchy sequence Fix such a sequence tn Un let L sup min Lip u Lip un Z C N be the set of index
142. ly to a limit curve x as T 0 so that the limit curve is unique Furthermore x is the unique solution of the system of differential inequalities ld zyl ev zd Gy BG S EU ae t gt 0 Vy e X 3 30 among all locally absolutely continuous curves 4 converging to Y as t 0 Le x isa Gradient Flow in the EVI formulation see Definition 3 5 e Let T J D E and x yi be the two Gradient Flows in the EVI formulation Then there is A exponential contraction of the distance i e d isy qn lt e ey 3 31 Suppose that A gt 0 that x D E and build xf x4 as above Then the following a priori error estimate holds sup d xz 27 lt 8y T E T E 2z 3 32 t gt 0 Sketch of the Proof We will make the following simplifying assumptions E gt 0 A gt 0 and T D E Also we will prove just that the sequence of discrete solutions n gt at 2 converges to a limit curve as n oo for any given 7 gt 0 Existence and uniqueness of the discrete solution Pick x X We have to prove that there exists a unique minimizer of 3 12 Let J gt 0 be the infimum of 3 12 Let n be a minimizing 64 sequence for 3 12 fix n m N and let y 0 1 X bea curve satisfying 3 29 for zo n 1 m and y x Using the inequalities 3 29 at t 1 2 we get d 91 2 a IzcE lt E y1 2 27 1 d an d m d 1 AT o lt
143. m Uniqueness and contractivity It remains to prove that the solution to the EVI is unique and the contractivity 3 31 The heuristic argument is the following pick x and y solutions of the EVI starting from 7 y respectively Choose y y in the EVI for x to get ld as eet d z yi 5d Go ye E x E y Symmetrically we have ld p 3 eli d ui ys 54 zc yt Ely lt Elz 66 Adding up these two inequalities we get Sd Go lt 92Ad 4 Yt a e t The rigorous proof follows this line and uses a doubling of variables argument la Kruzkhov Uniqueness and contraction then follow by the Gronwall lemma 3 3 Applications to the Wasserstein case The aim of this section is to apply the abstract theory developed in the previous one to the case of functionals on 5 IR W2 As we will see various diffusion equations may be interpreted as Gradient Flows of appropriate energy functionals w r t to the Wasserstein distance and quantitive analytic properties of the solutions can be derived by this interpretation Most of what we are going to discuss here is valid in the more general contexts of Riemannian manifolds and Hilbert spaces but the differences between these latter cases and the Euclidean one are mainly technical thus we keep the discussion at a level of IR to avoid complications that would just obscure the main ideas The secton is split in two subsections in the first o
144. mal transport map from ji to v and recall that T is the gradient of a convex function Assume that is smooth and define v x x v 2 The geodesic u from jz to v can then be written as pi 0 014 tT pu Q0 01d tV9 n Id tVo ju From the A convexity hypothesis we know that d i2 FU 2 F u quao n 5 Wa oy therefore since we know that 4 lcu uz f v Vp du from the arbitrariness of v we deduce v OV F p Proposition 3 36 Subdifferential of V Let V R IR be A convex and Ot let V be as in Definition 3 30 and let u Po IR be regular and satisfying V u lt oo Then OW Y u is non empty if and only if VV L u and in this case VV is the only element in the subdifferential of V at u 74 Therefore if u is a Gradient Flow of V made of regular measures it solves d urs V VV ut dt Mt Mt in the sense of distributions in R x 0 00 Sketch of the Proof Fix p CX R and observe that jim VA TEYA ABIE VU uu ved We Ya VV Vo du 30 30 Conclude using the equivalence 3 50 Proposition 3 37 Subdifferential of W Let W R R be A convex even and Ct let W be defined by 3 31 and y be regular and satisfying W u lt oo Then 0OWW y 0 if and only if VW u belongs to L u and in this case VW u is the only element in the subdifferential of W at p Therefore if u is a Gradient Flow of W made of regula
145. map i e satisfies y 92 21 12 gt O for every zi z2 R y G z i 1 2 then the operator 42 1 t Id tG is single valued Lipschitz with Lipschitz constant bounded above by 1 1 t To prove this pick z1 2 R y G z1 y2 G x2 and observe that 1 t ay tyi 1 t zo tya 1 tf zi zal tly yol 2t 1 t z1 22 41 yo gt 1 8 21 22 which is our claim Now pick uo y1 IR an optimal plan y Opt uo 41 and consider the geodesic t gt pi 1 t n tr yy recall Remark 2 13 From Theorem 1 26 we know that there exists a convex function o such that supp C 9 v Also we know that the unique optimal plan from jo to u is given by the formula nt 1 te tr 3 which is therefore supported in the graph of 1 t Id t07 y Since the subdifferential of a convex function is a monotone operator the thesis follows from the previous claim Considering the case in which ju is a delta and jio is not we can easily see that the bound 1 t on the Lipschitz constant of the optimal transport map from p to jo is sharp An important consequence of Corollary 2 24 is the following proposition Proposition 2 26 Geodesic convexity of the set of absolutely continuous measures Let M be a Riemannian manifold ui C P2 M a geodesic and assume that uo is absolutely continuous w r t the volume measure resp gives O ma
146. mass Let us come back to the Heat equation seen as Gradient Flow of the entropy functional E p J o log p with respect to the Wasserstein distance W as discussed at the beginning of Section 3 3 and in Subsection 3 3 2 We discussed the topic for arbitrary probability measures in IR but actually everything could have been done for probability measures concentrated on some open bounded set Q C R with smooth boundary that is consider the metric space A Q W2 and the entropy functional E p f plog p for absolutely continuous measures and E u 00 for measures with a singular part Now use the Minimizing Movements scheme to build up a family of discrete solutions p starting from some given measure p Q It is then possible to see that these discrete families converge as 7 0 to the solution of the Heat equation with Neumann boundary condition Jp Apt in Q x 0 Foo pt p weakly as t 0 Vpi 9 0 in OQ x 0 00 where n is the outward pointing unit vector on Q The fact that the boundary condition is the Neumann s one can be heuristically guessed by the fact that working in Y Q enforces the mass to be constant with no flow of the mass through the boundary It is then natural to ask whether it is possible to modify the transportation distance in order to take into account measures with unequal masses and such that the Gradient Flow of the entropy 82 functional produces solutions of the Heat equation in Q w
147. measure space i e a metric space where a reference non negative measure is also given When looking to the Riemannian case this fact is somehow hidden as a natural reference measure is given by the volume measure which is a function of the distance There are several viewpoints from which one can see the necessity of a reference measure which can certainly be the Hausdorff measure of appropriate dimension if available A first cheap one is the fact that in most of identities inequalities where the Ricci curvature appears also the reference measures appears e g equations 7 1 7 2 and 7 3 above A more subtle point of view comes from studying stability issues consider a sequence Mn gn of Riemannian manifolds and assume that it converges to a smooth Riemannian manifold M g in the Gromov Hausdorff sense Assume that the Ricci curvature of Mn gn is uniformly bounded below by some K R Can we deduce that the Ricci curvature of M g is bounded below by K The answer is no while the same question with sectional curvature in place of Ricci one has affirmative answer It is possible to see that when Ricci bounds are not preserved in the limiting process it happens that the volume measures of the approximating manifolds are not converging to the volume measure of the limit one Another important fact to keep in mind is the following if we want to derive useful ana lytic geometric consequences from a weak definition of Ricci curv
148. ment is nothing but the squared Wasserstein distance from the corresponding Dirac mass We start proving that W is actually a distance on 4 X In order to prove the triangle inequal ity we will use the following lemma which has its own interest Lemma 2 1 Gluing Let X Y Z be three Polish spaces and let y P X xY y P Y x Z be such that ny Ty Then there exists a measure y P X x Y x Z such that NY S cud Tau VEY Y Z Ty y Y Proof Let p thy mL Yy and use the disintegration theorem to write d y r y du y dy x and dy y z du y d y2 z Conclude defining y by d z y z du y d vy x v a 2 Theorem 2 2 W2 is a distance W is a distance on P2 X Proof It is obvious that W2 y u 0 and that W2 u v W v u To prove that W2 u v 0 implies u v just pick an optimal plan y Opt u v and observe that f d x y d y v y 0 implies that y is concentrated on the diagonal of X x X which means that the two maps 7 and 7 coincide a e and therefore tyy n2 y For the triangle inequality we use the gluing lemma to compose two optimal plans Let Ui H2 H3 X and let y Opt m1 u2 Y3 Opt u2 u3 By the gluing lemma we know that there exists y X such that 1 2 42 Tye Y Y 2 3 3 Tu Y Ya Since THY u and THY u3 we have may Adm 1 u3 and therefore from the triangle 25 inequality in L it holds Walis lt q
149. mong the minimizers of 3 12 Then the map TH E a PE is locally Lipschitz in 0 7 and it holds d Pezi d a dr ze a UAE t Q 6 T 0 7 3 13 2 2 Proof Observe that from E x4 Dn lt E zn Taa we deduce d Er T E z x o 279 d m 2 1 1 T pa gai a MS ANN Pa sies Bos m 2n C Ex In 25 2 eem ma zz x Arguing symmetrically we see that tre x 2T0 d x 2 Ti m diis x Tor Elan d o 271 2T0T1 ui E x7 ay The last two inequalities show that 7 E a Pose is locally Lipschitz and that equation 3 13 holds Lemma 3 10 With the same notation and assumptions as in the previous theorem T gt d T x is non decreasing and T gt E 2 is non increasing Also it holds d r T vajej 802 3 14 T Proof Pick 0 lt To lt 7 lt T From the minimality of and z we get d z4 T d x7 T ap C Mera lt g Er rit E 25 270 E 7 9m 2 d gt E z d zz 2 lt E z d 0 2 d 2n Adding up and using the fact that gt 0 we get d T 79 lt d T x7 The fact that r E z is non increasing now follows from a 455 d r m a gum E z d y lt E z ds lt E z4 dT 271 274 274 For the second part of the statement observe that from d 27 7 d y T E x lt Vy EX a Be lt py 4 ye we get E x Ey lt d y T i d 7 T d y T
150. n ifold borns as smooth manifold on which we define a scalar product on each tangent space but the space 4 IR7 does not have a smooth structure there is no diffeomorphism of a small ball around the origin in Tan Z3 R onto a neighborhood of u in Z5 IR7 Thus we have to proceed in a different way which we describe now Regular curves first of all we drop the idea of defining a smooth vector field on the whole mani fold We will rather concentrate on finding an appropriate definition of smoothness for vector fields defined along curves We will see that to do this we will need to work with a particular kind of curves which we call regular see Definition 6 2 Smoothness of vector fields We will then be able to define the smoothness of vector fields defined along regular curves Definition 6 5 Among others a notion of smoothness of particular relevance is that of absolutely continuous vector fields for this kind of vector fields we have a natural notion of total derivative not to be confused with the covariant one see Definition 6 6 Levi Civita connection At this point we have all the ingredients we need to define the covariant derivative and to prove that it is the Levi Civita connection on 2 IR Definiton 6 8 and discussion thereafter Parallel transport This is the main existence result on this subject we prove that along regular curves the parallel transport always exists Theorem 6 15 We will also discuss a c
151. n absolutely continuous curve and it holds 7 lt ell 22 a4 l vcl r2 u for a e t Notice that i is bilipschitz and therefore u is absolutely continuous as well Hence to conclude it is sufficient to show that ji a e t To prove this one can notice that the fact that 4 is bilipschitz and validity of d x lim sup L y 1 r gt 0 evem i a i y d x y lt r give that W p v lim sup MESH l T O uve23 M Walizn inv Wa u v lt r We omit the details Part A Fix o C9 IR and observe that for every yf Opt ju Hs it holds pds J pdu i e y dy x y J etu f ou ent IT Vola A y 2 y 2 tits 2 25 f Voe a arte v EEE lt J Vola Pay x y 1 s lye y E Rm 654 nd Vell 52 4 Was Ls F Rem y t 8 where the remainder term Rem q t s can be bounded by by Lip Vo Lip Vo Reno t lt POP fla yaniy PO WF n Thus 2 25 implies that the map t f pdp is absolutely continuous for any y C R4 Now let D C C R be a countable set such that Vip Y D is dense in Tan Z IR for every t 0 1 the existence of such D follows from the compactness of 14 rejo 1 C IR we omit the details The above arguments imply that there exists a set A C 0 1 of full Lebesgue measure such that t gt f pdy is differentiable at t A for every y D we can also assume that the metric derivative
152. nd the fact that m x 0 proved in Proposition 7 15 we know that lim m A m Bn x which means that m a e point in B x is connected to x by a unique geodesic Since R and x are arbitrary uniqueness is proved The measurability of the map x y y7 is then a consequence of uniqueness of Lemma 2 11 and classical measurable selection results which ensure the existence of a measurable selection of geodesics in our case there is m x m almost surely no choice so the unique geodesic selection is measurable Corollary 7 17 Compactness Let N D lt co Then the family X N D of isomorphism classes of metric measure spaces X d m satisfying the condition C D 0 N with diameter bounded above by D is compact w r t the topology induced by Sketch of the Proof Using the Bishop Gromov inequality with R D we get that m Bz 2 gt S V X d m X N D supp mx 7 17 Thus there exists n N D which does not depend on X N D such that we can find at most n N D e disjoint balls of radius in X Thus supp m x can be covered by at most n N D amp balls of radius 2e This means that the family A N D is uniformly totally bounded and thus it is compact w r t Gromov Hausdorff convergence see e g Theorem 7 4 5 of 20 Pick a sequence Xn dn Mn X N D By what we just proved up to pass to a subsequence not relabeled we may assume that supp m dn
153. ndowed with a scalar product the one of L u This fact Theorem 2 29 and Proposition 2 30 are the bases of the so called weak Riemannian structure of Y2 1 W2 We now state without proof some other properties of 27 M W2 which resemble those of a Riemannian manifold For simplicity we will deal with the case M IR only and we will assume that the measures we are dealing with are regular Definition 1 25 but analogous statements hold for general manifolds and general measures In the next three propositions is an absolutely continuous curve in Z5 IR such that p is regular for every t Also v is the unique up to a negligible set of times family of vector fields such that the continuity equation holds and v Tan 4 R for a e t Proposition 2 32 v can be recovered by infinitesimal displacement Let u and v as above Also let T be the optimal transport map from u to s which exists and is unique by Theorem 1 26 due to our assumptions on ju Then for a e t 0 1 it holds T Id v lim sot s t the limit being understood in L p Proposition 2 33 Displacement tangency Let u and vi as above Then for a e t 0 1 it holds lim Wo uten 1d hoi g qu lim z 0 2 24 Proposition 2 34 Derivative of the squared distance Let u and v as above and v P R Then for a e t 0 1 it holds d qW ue v 2 f on Id dit where T is t
154. ne we discuss the definition of subdifferential of a geodesicaly convex functional on Z IR7 which is based on the interpretation of 4 IR7 as a sort of Riemannian manifold as discussed in Subsection 2 3 2 In the second one we discuss three by now classical applications for which the full power of the abstract theory can be used i e we will have Gradient Flows in the EVI formulation Before developing this program we want to informally discuss a fundamental example Let us consider the Entropy functional E A R RU 00 defined by d d Ei J eios if u pLl Foo otherwise We claim that the Gradient Flow of the Entropy in Z R7 W2 produces a solution of the Heat equation This can be proved rigorously see Subsection 3 3 2 but for the moment we want to keep the discussion at the heuristic level By what discussed in the previous section we know that the Minimizing Movements scheme produces Gradient Flows Let us apply the scheme to this setting Fix an absolutely continuous measure po here we will make no distinction between an absolutely continuous measure and its density fix 7 gt 0 and minimize 2 E Ei Heo 3 37 It is not hard to see that the minimum is attained at some absolutely continuous measure p actually the minimum is unique but this has no importance Our claim will be proved if we show that for any y C3 IR it holds Sorp J Peo _ f deste 3 38 T because this
155. neglect to take the closure in L p because we want to keep the discussion at a formal level The perturbation of a measure p in the direction of a tangent vector Vy is given by t gt Id tVy xp The Arnold Manifold Arn p associated to a certain measure p 4 IR7 is the set of maps S R R which preserve p Arn p s R gt R Sgp p We endow Arn p with the L distance calculated w r t p To understand who is the tangent space at Arn p at a certain map S pick a vector field v on R and consider the perturbation t S tv of S in the direction of v Then v is a tangent vector if and only if diesel tv 4p 0 Observing that S tv zp Id tvoS Sp Id tvoS 4p V voS p d ai lt 0 arl d dt t 0 we deduce TangArn p vector fields v on R such that V vo S 1p o which is naturally endowed with the scalar product in L p We are calling the manifold Arn p an Arnold Manifold because if p is the Lebesgue measure restricted to some open smooth and bounded set Q this definition reduces to the well known definition of Arnold manifold in fluid mechanics the geodesic equation in such space is formally the Euler equation for the motion of an incompressible and inviscid fluid in Q Finally the Riemannian submersion Pf from BM to Z5 IR is the push forward map Pf BM gt 42 R T gt TP 85 We claim that Pf is a Riemannian submersion and that the fiber Pf
156. nes constant speed geodesics starting from ju Jr Geod 1 and defined on some interval of the kind 0 T ibi where we say that u u4 provided they coincide on some right neighborhood of 0 The natural distance D on Geod is ry gc Wolke it D ut ut lim 7777 6 1 The Geometric Tangent space Tan IR is then defined as the completion of Geod u Wirt the distance D The natural question here is what is the relation between the space of gradients Tan 4 IR7 and the space of directions Tan 3 R Recall that from Remark 1 22 we know that given o C9 R2 the map t Id tVy uu is a constant speed geodesic on a right neighborhood of 0 This means that there is a natural map t from the set Vy y E CZ into Geod and therefore into Tan 4 IR which sends V into the equivalence class of the geodesic t gt Id fVq u The main properties of the Geometric Tangent space and of this map are collected in the following theorem which we state without proof Theorem 6 1 The tangent space Let jj Z IR4 Then e the lim in 6 1 is always a limit e the metric space Tan 4 R D is complete and separable e the map t Vp gt Tan A IR is an injective isometry where on the source space we put the L distance wrt u Thus t always extends to a natural isometric embedding of Tan IR7 into Tan P2 R Furthermore the followi
157. ng statements are equivalent i the space Tan R4 D is an Hilbert space ii the map t Tan 5 IR gt Tan Z IR is surjective iii the measure p is regular definition 1 25 We comment on the second part of the theorem The first thing to notice is that the space of di rections Tan 45 IR can be strictly larger than the space of gradients Tan 45 IR7 This is actually not surprising if one thinks to the case in which p is a Dirac mass Indeed in this situ ation the space Tan 45 IR7 D coincides with the space 45 R W2 this can be checked 87 directly from the definition however the space Tan 5 IR7 is actually isometric to R itself and is therefore much smaller The reason is that geodesics are not always induced by maps that is they are not always of the form t Id tu for some vector field u L To some extent here we are facing the same problem we had to face when starting the study of the optimal transport problem maps are typically not sufficient to produce optimal transports From this perspective it is not surprising that if the measure we are considering is regular that is if for any v 4 IR there exists a unique optimal plan and this plan is induced by a map then the space of directions coincides with the space of directions induced by maps 6 3 Second order calculus Now we pass to the description of the second order analysis over 4
158. nicity of its support we know that d z z d a z d v z d 2 z d x y d y zy d a y d y z 1 to d z z tod 2 z 1 to d a z tod z z which after some manipulation gives d x z d z z D Again from the cyclical mono tonicity of the support we have 2D lt d x z d x z thus either d x z or d x z is gt than D Say d x z gt D so that it holds D lt d x z d z y d y z 1 to D toD D which means that the triple of points x y z lies along a geodesic Since x y z lies on a geodesic as well by the non branching hypothesis we get a contradiction 35 Thus the map supp 2 x y z y is injective This means that there exists two maps f g X X such that x y z supp a if and only if x f y and z g y This is the same as to say that y is induced by f and is induced by g To summarize we proved that given o 0 1 every optimal plan y Opt 10 Hto is induced by a map from jj Now we claim that the optimal plan is actually unique Indeed if there are two of them induced by two different maps say f and f then the plan 1 b CF Id bye F Id byt would be optimal and not induced by a map It remains to prove that Y2 X is non branching Choose u 2 Geod X such that 2 7 holds fix to 0 1 and let y be the unique optimal plan in Opt 4
159. nimal assumptions on the functional and show how it is possible starting from them to prove existence of Gradient Flows in the EDI sense Basically there are two independent sets of assumptions that we need those which ensure the existence of discrete solutions and those needed to pass to the limit To better highlight the structure of the theory we first introduce the hypotheses we need to guarantee the existence of discrete solution and see which properties the discrete solutions have Then later on we introduce the assumptions needed to pass to the limit We will denote by D E C X the domain of E i e D E E lt oo Assumption 3 8 Hypothesis for existence of discrete solutions X d is a Polish space and E X RU co be al s c functional bounded from below Also we assume that there exists T gt 0 such that for every 0 lt T lt FT and X D E there exists at least a minimum of d x Z 2r zo c E e 3 12 55 Thanks to our assumptions we know that discrete solutions exist for every starting point z for T sufficiently small The big problem we have to face now is to show that the discrete solutions satisfy a discretized version of the EDI suitable to pass to the limit The key enabler to do this is the following result due to de Giorgi Theorem 3 9 Properties of the variational interpolation Let X E be satisfying the Assumption 3 8 Fix Y X and for any 0 lt T lt T choose x a
160. ns can t be strictly cheaper than transporting with maps We won t detail the proof of this fact 1 2 Necessary and sufficient optimality conditions To understand the structure of optimal plans probably the best thing to do is to start with an example Let X Y R and c z y x y 2 Also assume that jj v P R are supported on finite sets Then it is immediate to verify that a plan y Adm u v is optimal if and only if it holds x les yil a Voc l 2 3 52 3 i 1 i l for any N N xi yi supp and c permutation of the set 1 N Expanding the squares we get N N Doa i 1 i 1 6 which by definition means that the support of y is cyclically monotone Let us recall the following theorem Theorem 1 6 Rockafellar A set T C R x IR is cyclically monotone if and only if there exists a convex and lower semicontinuous function p R RU 00 such that T is included in the graph of the subdifferential of q We skip the proof of this theorem because later on we will prove a much more general version What we want to point out here is that under the above assumptions on u and v we have that the following three things are equivalent e y Adm u v is optimal e supp is cyclically monotone e there exists a convex and lower semicontinuous function y such that y is concentrated on the graph of the subdifferential of q The good news is that the equivalence betwe
161. o Now define the 3again if closed balls in X are compact the argument simplifies Indeed from the uniform bound on the second moments and the inequality R u X Bn zo lt Jasse d o du we get the tightness of the sequence Hence up to pass to a subsequence we can assume that yn narrowly converges to a limit measure jz and then using the lower semicontinuity of W2 w r t narrow convergence we can conclude lim Wo 4 Hn lt lim lim W2 Um in 0 30 measures Hn 1 En Endx where en is chosen such that end T n r To bound from above W2 u Hn leave fixed 1 n u move n p to T and then move 67 into this gives Wis us lt en f f 2 dule sm so that lim Wa p Hn lt r Conclude observing that lim pe x duy lim 1 n fe z Z dp End n 2 eands n oo n oo thus the second moments do not converge Since clearly yn weakly converges to u we proved that there is no local compactness E 2 2 X geodesic space In this section we prove that if the base space X d is geodesic then the same is true also for P2 X W2 and we will analyze the properties of this latter space Let us recall that a curve y 0 1 X is called constant speed geodesic provided d 7 s t s d o 71 Vt s 0 1 2 5 or equivalently if lt always holds Definition 2 9 Geodesic space A metric space X d is called geodesic if for every x y X th
162. o account that for a general u of the form m A m it holds En u NQ m A and that as before if m Ao m A1 gt 0 it cannot be m Ao A1 0 or we would violate the convexity inequality A consequence of Brunn Minkowski is the Bishop Gromov inequality Proposition 7 15 Bishop Gromov Let X d m be a CD 0 N space Then it holds m B x ys m Bn z 2 Va supp m 7 15 R In particular supp m d m is a doubling space Proof Pick x supp m and assume that m x 0 Let u r m B x Fix R gt 0 and apply the Brunn Minkowski inequality to Ag x A1 Br x observing that Ao A1 C Bir x to get v GR gt m Ao Aij gt te N R WO lt t lt 1 Now let r tR and use the arbitrariness of R t to get the conclusion It remains to deal with the case m z Z 0 We can also assume supp m x otherwise the thesis would be trivial under this assumption we will prove that m x 0 for any x X A simple consequence of the geodesic convexity of amp y tested with delta measures is that supp m is a geodesically convex set therefore it is uncountable Then there must exist some x supp m such that m a 0 Apply the previous argument with x in place of x to get that v r r N nec WO lt r lt R 7 16 where now v r is the volume of the closed ball of radius r around x By definition v is right continuous letting r T R we obtain
163. oT P VeoT lt Vell Lip T Id Vp CX R 6 19 Let us suppose first that T Id C R In this case the map y o T is in C9 R too and therefore V y o T VT Vy o T belongs to Tan 2 R From the minimality properties of the projection we get IVeoT P VeoeT l IVeoT VT Vo oTll fia vro VATE Pana i 1 2 lt J IVe T x IIvaa TG dao lt Vell Lip T Id where I is the identity matrix and V Zd T x o is the operator norm of the linear functional from R to R given by v V Id T z v Now turn to the general case and we can certainly assume that T is Lipschitz Then it is not hard to see that there exists a sequence T Id C Ce R7 such that Ta T uniformly on compact sets and lim Lip T Id Lip T Id It is clear that for such a sequence it holds IT Ta 0 and we have IVeoT P VeoT S IVe oT V eoT l lt VeeT Veo Tally Veo T V eo T lt Lip Vo lIT Talla Vie o Tall Lip Tn Id Letting n 00 we get the thesis For the second inequality just notice that P woT l sup woT v sup w voT v Tan 29 Rd v Tany 22 Rd lvl 1 lol ml sup u voT P vo T 7 lt w Lip T Id veTany Z3 Rd K lvla 1 From this lemma and the inequality Lip T s t J 1d lt elf Lira 1c Vt s 0 1 EAr t whose
164. observing that if u is absolutely continuous along 98 u then P u is absolutely continuous as well as it follows from the inequality Pu us T t 8 Pua Pu us e T t 8 P Pu us o T t s P us o T t s y Py us o T E s Pu ue P 0 T 5 Pj ue lt Ht Ht Ht ll ge us o T t s well 6 20 s Sil qd lt asc f Lip v jar f t t d dr Ur Hr 6 21 valid for any t lt s where S sup u Thus P u has a well defined covariant derivative for a e t The question is can we find a formula to express this derivative To compute it apply the Leibniz rule for the total and covariant derivatives 6 6 and 6 9 to get that for a e t 0 1 it holds d D 5 Paleo Ve CP tu Ve P ut D pa Ut T V Ht dt Ht d d u Vo u Vo dt dt Since Vi Tan Z5 R for any t it holds P uj Vip ut Vp for any t 0 1 and thus the left hand sides of the previous equations are equal for a e t Recalling formula 6 7 we have 2Vo V o v and DVo P V o uz thus from the equality of the right hand sides we obtain D d 2r ve eve E ut V vi Pu ut Py V p i vi d gs va bt d E Sune Pi ue Pr V v9 Me 6 22 This formula characterizes the scalar product of DP u u
165. of Alexandrov if for every constant speed geodesic y 0 1 X and every z X the following concavity inequality holds d w 2 1 0d yo 2 td 0 2 t 0d 00 1 2 14 Similarly X is said to be non positively curved NPC in the sense of Alexandrov if the converse inequality always holds Observe that in an Hilbert space equality holds in 2 14 The result here is that 275 X W2 is PC if X d is while in general it is not NPC if X is Theorem 2 20 Z5 X W2 is PC if X d is Assume that X d is positively curved Then P X W2 is positively curved as well Proof Let u be a constant speed geodesic in Y2 X and v W X Let y Yo Geod X be a measure such that p e sn Vte 0 1 as in Theorem 2 10 Fix to 0 1 and choose y Opt uto v Using a gluing argument we omit the details it is possible to show the existence a measure Y Geod X x X such that Geod X Tg P RECON 2 15 eu 7 9 7 v where 99930 y x y Geod X amp y x X ande 7 2 Yt X Then a satisfies also eo o Adm uo v 2 16 e1 r a Adm u1 v and therefore it holds a Uto Y j fe en y z do x 2 14 2 2 gt 1 to d 40 z tod m1 z to 1 to d yo 11 do v x C 1 to J d o z do y to J d 51 z da y x to 1 to f d dn 2 16 2 1 to W3 uo v toW2 ui v to 1
166. of Subsection 3 2 4 be cause the compatibility in Energy and distance ensures strong properties both at the level of discrete solutions and for the limit curve obtained Once we will have a Gradient Flow the Subdifferential formulation will let us understand which is the PDE associated to it Let us recall Example 2 21 that the space 7 R7 W2 is not Non Positively Curved in the sense of Alexandrov this means that if we want to check whether a given functional is compatible with the distance or not we cannot use geodesics to interpolate between points because we would violate the second inequality in 3 29 A priori the choice of the interpolating curves may depend on the functional but actually in what comes next we will always use the ones defined by Definition 3 29 Interpolating curves Let u vo v IR and assume that p is regular Def inition 1 25 The interpolating curve 1 from vo to vi with base u is defined as v 0 t To fT agp where To and T are the optimal transport maps from u to vo and v4 respectively Observe that if U vo the interpolating curve reduces to the geodesic connecting it to v 70 Strictly speaking in order to apply the theory of Section 3 2 4 we should define interpolating curves having as base any measure pp 5 IR and not just regular ones This is actually possible and the foregoing discussion can be applied to the more general definition but we prefer to avoid technicaliti
167. ollary 2 23 Regularity properties of the interpolated potentials Let i be a c convex po tential for uo p and let p H wv Define v Hi h pt Ht p and choose a geodesic u from uo to ui Then for every t 0 1 it holds i we gt p and both the functions are real valued ii y Yr on supp H iii Y and p are differentiable in the support of p and on this set their gradients coincide Proof For i we have p Hie Hi o Hg v Hi o H og Ho vs Id Now observe that by definition x lt 00 and y x gt oo for every x M thus it holds Foo gt U x gt plx gt oo Va M To prove iz let ys be the unique plan associated to the geodesic u via 2 7 recall Proposi tion 2 16 for uniqueness and pick y supp z Recall that it holds pilye ct no Ye F Wyo pilt c ys e Thus from y1 c 40 71 Y Y0 we get that Y t plyt Since we er 4p the compactness of M gives supp ji Vt yesupp u SO that 22 follows Now we turn to iii With the same choice of t gt as above recall that it holds Vie e Qo w o vi a lt cO ae x v o Vr M and that the function x c 2o a v 5o is superdifferentiable at y Thus the function z gt w is superdifferentiable at x y Similarly p is subdifferentiable at Choose v Oyly vg O qi 1 and observe that ve vi exp 2
168. ons 6 26 and 6 29 and by induction it follows that P u is C Also J P uz is the sum of addends each of which is the composition of projections onto the tangent or normal space and up to n operators O and O7 applied to the vector u Since the operator norm of O and OY is bounded by L we deduce that d ut lt url L lulu L Vn N tE 0 1 Ht Defining the curve t U P ut o T to t s the above bound can be written as de io lt Utollu 2 Yn N te 0 1 Hto which implies that the curve t gt U Ls is analytic This means that for close to to it holds t to d Py ut e T to t J 2 nb dt lt to Pu uc n20 Now notice that equations 6 26 and 6 29 and the fact that Ju 0 ensure that a ese Pu uj An u where A D gt Ds is bounded Thus the thesis follows by the arbitrariness of u Lo Now we have all the technical tools we need in order to study the curvature tensor of the mani fold 4 IR4 Following the analogy with the Riemannian case we are lead to define the curvature tensor in the following way given three vector fields ju gt Vol Tan Z R7 i 1 3 the curvature tensor R calculated on them at the measure u is defined as R Voj Ven Ven Vve Voor Ven Vyer Vve Von Vivei vez Ve where the objects like Vy V Yp are heuristically speaking the
169. otice that the main problem in considering the smoothness of uz is that for different times the vectors belong to different spaces To overcome this obstruction we will define the smoothness of t gt uz 5 in terms of the smoothness of t gt us o T to t Los Definition 6 5 Smoothness of vector fields Let u be a regular curve T t s its flow maps and u a vector field defined along it We say that u is absolutely continuous or O or C or C or analytic provided the map te uro T to t Lj is absolutely continuous or C or C or C or analytic for every to 0 1 Since u o T t1 t uro T to t o T t to and the composition with T to provides an isometry from Ls to Lh it is sufficient to check the regularity of t uw o T to t for some to 0 1 to be sure that the same regularity holds for every to 90 Definition 6 6 Total derivative With the same notation as above assume that u is an absolutely continuous vector field Its total derivative is defined as li Ut h o T t t h Ut u lim di h 0 h f where the limit is intended in D Observe that we are not requiring the vector field to be tangent and that the total derivative is in general a non tangent vector field even if uz is The identity din Urtno T t t h ui zd m Ur T fo t h uto T to t o T t to d h
170. ounterexample to the existence of parallel transport along a non regular geodesic Example 6 16 This will show that the definition of regular curve is not just operationally needed to provide a definition of smoothness 88 of vector fields but is actually intrinsically related to the geometry of 2 IR Calculus of derivatives Using the technical tools developed for the study of the parallel transport we will be able to explicitly compute the total and covariant derivatives of basic examples of vector fields Curvature We conclude the discussion by showing how the concepts developed can lead to a rigor ous definition of the curvature tensor on Z R4 We will write v and v w for the norm of the vector field v and the scalar product of the vector fields v w in the space L u which we will denote by Lh respectively We now start with the definition of regular curve All the curves we will consider are defined on 0 1 unless otherwise stated Definition 6 2 Regular curve Let u be an absolutely continuous curve and let v be its ve locity vector field that is vz is the unique vector field up to equality for a e t such that vi Tan Z5 R7 for a e t and the continuity equation a V 0 att V vim 0 holds in the sense of distributions recall Theorem 2 29 and Definition 2 31 We say that qu is regular provided 1 f Vedi at lt on 62 and 1 J Lip v dt lt oo 6 3 0 Observe
171. ov Let X d be a Polish space Then a family K C P X is relatively compact w r t the narrow topology if and only if it is tight Notice that if K contains only one measure one recovers Ulam s theorem any Borel probability measure on a Polish space is concentrated on a o compact set Remark 1 4 The inequality Y X x Y K x Ko n XN Ki v Y Ko 1 1 valid for any y Adm p v shows that if K C A X and K2 C A Y are tight then so is the set fy P XxY THEY Ki THY Ka Existence of minimizers for Kantorovich s formulation of the transport problem now comes from a standard lower semicontinuity and compactness argument Theorem 1 5 Assume that c is lower semicontinuous and bounded from below Then there exists a minimizer for Problem 1 2 Proof Compactness Remark 1 4 and Ulam s theorem show that the set Adm v is tight in A X x Y and hence relatively compact by Prokhorov theorem To get the narrow compactness pick a sequence 7 C Adm u v and assume that 7 gt y narrowly we want to prove that y Adm u v as well Let y be any function in C X and notice that x y x is continuous and bounded in X x Y hence we have f eei y x dy x y lim f ote dy 2 y lim n vari odn so that by the arbitrariness of p C X we get THY p Similarly we can prove THY which gives y Adm u v as desired Lower semicontinuity We claim that the functional y f cdy
172. p is isometric to the manifold Arn p We start considering the fibers Fix p 4 IR7 Observe that Pr r BM Typ e and that the tangent space Tanz Pf p is the set of vector fields u such that 4 b T tu gp 0 so that from d a ad B 3v d _ Glo THAP loit tuoT e Tp glod t tuo Typ V uoT p we have TanpPf p vector fields u on R such that V uo T 1p o and the scalar product between two vector fields in Tanz Pf a p is the one inherited by the one in BM i e is the scalar product in L p Now choose a distinguished map T c Pf p and notice that the right composition with T provides a natural bijective map from Arn p into Pf p because Suyp p SoT up p We claim that this right composition also provides an isometry between the Riemannian manifolds Arn p and Pf p indeed if v TangArn p then the perturbed maps S tv are sent to S o T tv o T which means that the perturbation v of S is sent to the perturbation u v o T of S o T by the differential of the right composition The conclusion follows from the change of variable formula which gives f Pao fur Clearly the kernel of the differential dPf of Pf at T is given by TanpPf Pr T thus it remains to prove that its orthogonal is sent isometrically onto Tanpr 7 5 R7 by dPf Fix T BM let p Pf T Tgp and observe that Tanj Pf p vector fields w fw dp 0 Yu s t V u
173. proach The fact that for distributional solutions of the continuity equation the vector field v acts only on gradients of smooth functions suggests that the v s should be taken in the set of gradients as well or more rigorously v should belong to L p 2 22 ve p CX M for a e t 0 1 Variational approach The fact that the continuity equation is linear in v and the L norm is strictly convex implies that there exists a unique up to negligible sets in time family of vector fields v L u4 t 0 1 with minimal norm for a e t among the vector fields compatible with the curve u via the continuity equation In other words for any other vector field 0 compatible with the curve u in the sense that 2 20 is satisfied it holds 9 r2 vellz2 u for a e t It is immediate to verify that v is of minimal norm if and only if it belongs to the set v L t fow du 0 Vw L p s t V wH o 2 23 The important point here is that the sets defined by 2 22 and 2 23 are the same as it is easy to check Therefore it is natural to give the following 45 Definition 2 31 The tangent space Let jj 2 M Then the tangent space Tan A2 M at M in u is defined as r Tan P2 M ve pE ce M v L y f e du 0 Vw L p s t V wu o Thus we now have a definition of tangent space for every y 2 M and this tangent space is natu rally e
174. r measures it solves the non local evolution equation d 47 V VW pt ue in the sense of distributions in R x 0 00 Sketch of the Proof Fix p CX R9 let u Id eVy 4 and observe that 1 1 X pes W a y du a du y 5 I W x y e Vo a Ve y du z du y 7 f W x u du a du y 5 VW z y Vela Vey du z du v o c Now observe that vis 9 VoCe autant C vw sata VoCe aut f Sw n x Vole auta and similarly VW z y Ve y du z du y i VW u y Voly duly J VW u z Vola dula Thus the conclusion follows by applying the equivalence 3 50 Proposition 3 38 Subdifferential of Let u 0 4 oo R be convex C on 0 4 oo bounded from below and satisfying conditions 3 44 and 3 45 Let u pL IR be an absolutely continuous measure with smooth density Then V u p is the unique element in OW E u Therefore if u is a Gradient Flow for E and p is absolutely continuous with smooth density px for every t gt 0 then t p solves the equation V p Vu ps 75 Note this statement is not perfectly accurate because we are neglecting the integrability issues Indeed a priori we don t know that V u o belongs to L y Sketch of the Proof Fix p CX R4 and define u Id eV 4 For e sufficiently small ju is absolutely continuous and its density p satisfies by
175. ransport map but the so called Knothe s map Such a map has the property that its gradient has non negative eigenvalues at every point and the reader can easily check that this is all we used of Brenier s map in our proof so that the argument of Gromov is the same we used here The use of Brenier s map instead of Knothe s one makes the difference when studying the quantitative version of the isoperimetric problem Figalli Maggi and Pratelli in 38 using tools coming from optimal transport proved the sharp quantitative isoperi metric inequality in R7 endowed with any norm the sharp quantitative isoperimetric inequality for the Euclidean norm was proved earlier by Fusco Maggi and Pratelli in 40 by completely different means The approach used here to prove the Sobolev inequality has been generalized by Cordero Erasquin Nazaret and Villani in 30 to provide a new proof of the sharp Gagliardo Nirenberg Sobolev inequality together with the identification of the functions realizing the equality 79 5 Variants of the Wasserstein distance In this chapter we make a quick overview of some variants of the Wasserstein distance W2 together with their applications No proofs will be reported our goal here is only to show that concepts coming from the transport theory can be adapted to cover a broader range of applications 5 1 Branched optimal transportation Consider the transport problem with u y and v 2 dy dy for the cost giv
176. rmulation or the rate of dissipation of the energy the EDE and EDI formulations For all these formulations there is a corresponding discrete version of the gradient flow formulation given by the implicit Euler scheme We will then show that there is convergence of the scheme to the continuous solution as the time discretization parameter tends to 0 The EVI formulation is the stronger one in terms of uniqueness contraction and regularizing effects On the other hand this formulation depends on a compatibility condition between energy and distance this condition is fulfilled in Non Positively Curved spaces in the sense of Alexandrov if the energy is convex along geodesics Luckily enough the compatibility condition holds even for some important model functionals in ZZ IR sum of the so called internal potential and interaction energies even though the space is Positively Curved in the sense of Alexandrov In Chapter 4 we illustrate the power of optimal transportation techniques in the proof of some classical functional geometric inequalities the Brunn Minkowski inequality the isoperimetric in equality and the Sobolev inequality Recent works in this area have also shown the possibility to prove by optimal transportation methods optimal effective versions of these inequalities for instance we can quantify the closedness of F to a ball with the same volume in terms of the vicinity of the isoperimetric ratio of E to the optimal one Chapt
177. roof We start with inequality lt Let H v be a solution of the continuity equation Then if J Vell 2u 00 there is nothing to prove Otherwise we may apply part B of Theorem 2 29 to get that u is an absolutely continuous curve on 2 The conclusion follows from 1 1 Wa u9 u lt f li dt lt n lel quad 0 0 where in the last step we used part B of Theorem 2 29 again To prove the converse inequality it is enough to consider a constant speed geodesic u connect ing u to ut and apply part A of Theorem 2 29 to get the existence of vector fields v such that the continuity equation is satisfied and vi r2 X fel Wa u u for a e t 0 1 Then we have 1 Walut u gt eel tagged 0 as desired This proposition strongly suggests that the scalar product in L should be considered as the metric tensor on Z M at u Now observe that given an absolutely continuous curve u C gt M in general there is no unique choice of vector field v such that the continuity equation 2 20 is satisfied Indeed if 2 20 holds and w is a Borel family of vector fields such that V wife 0 for a e t then the continuity equation is satisfied also with the vector fields v w It is then natural to ask whether there is some natural selection principle to associate uniquely a family of vector fields v to a given absolutely continuous curve There are two possible approaches Algebraic ap
178. rove that 1 p wr gt C 4 1 78 for some constant C not depending on f Fix once and for all a smooth non negative function g R R satisfying J 9 1 define the probability measures fP Lt v gg and let T be the optimal transport map from p to v w r t the cost given by the distance squared The change of variable formula gives __ Jf a det VT a for foh famem aseo ys As for the case of the isoperimetric inequality we know that T is the gradient of a convex function thus VT x is a symmetric matrix with non negative eigenvalues and the arithmetic geometric mean inequality gives det VT a 4 lt XTi Thus we get fos fv mm E 1 1 rur where E 1 Finally by Holder inequality we have g T x Va c R Hence we have so a7 1 f rin fony F 1 J owwa five Since g was a fixed given function 4 1 is proved 4 4 Bibliographical notes The possibility of proving Brunn Minkowski inequality via a change of variable is classical It has been McCann in his PhD thesis 62 to notice that the use of optimal transport leads to a natural choice of reparametrization It is interesting to notice that this approach can be generalized to curved and non smooth spaces having Ricci curvature bounded below see Proposition 7 14 The idea of proving the isoperimetric inequality via a change of variable argument is due to Gro mov 65 in Gromov s proof it is not used the optimal t
179. rove the stability of Ricci curvature bounds see Theorem 7 12 Proof For the first statement we just notice that by Lemma 7 4 we have E uma gt E n 4 un m and the conclusion follows from the narrow lower semicontinuity of amp m For the second one we define un y iji Then applying Lemma 7 4 twice we get u m gt amp u ma gt amp Cy gun m from which the T lim inequality follows Thus to conclude we need to show that Wa Yn Ln Lt 0 To check this we use the Wassertein space built over the pseudo metric space Xn LI X d let uu pmx and for any n N define the plan 5 A X x X by d y y x p x d y y x and notice that 4 Adm un u Thus Wa tins H lt J d z y d y 2 lt J di z y e z dv y 2 lt VM V Olin Ta where M is the essential supremum of p By definition it is immediate to check that the density 7 of fn is also bounded above by M Introduce the plan 7 by d y x mn y d y y x and notice that y Adm I Yn s 1n so that as before we have Wa tins In attin lt i d z yd y 2 lt J d s ym yd v2 lt VM v ds Ta In conclusion we have W u Yn Hn lt W2 LUn Yn bn W2 LUn u lt 2 M y Cds Vidi which gives the thesis 111 7 2 Weak Ricci curvature bounds definition and properties Define the functions uy N gt 1 and us on 0 00 as un z N z z1 N
180. s in IR J Eur Math Soc JEMS 12 2010 pp 1355 1369 J CHEEGER Differentiability of Lipschitz functions on metric measure spaces Geom Funct Anal 9 1999 pp 428 517 D CORDERO ERAUSQUIN B NAZARET AND C VILLANI A mass transportation approach to sharp Sobolev and Gagliardo Nirenberg inequalities Adv Math 182 2004 pp 307 332 C DELLACHERIE AND P A MEYER Probabilities and potential vol 29 of North Holland Mathematics Studies North Holland Publishing Co Amsterdam 1978 Q DENG AND K T STURM Localization and tensorization properties of the curvature dimension condition for metric measure spaces ii Submitted 2010 J DOLBEAULT B NAZARET AND G SAVAR On the Bakry Emery criterion for linear diffusions and weighted porous media equations Comm Math Sci 6 2008 pp 477 494 L C EVANS AND W GANGBO Differential equations methods for the Monge Kantorovich mass transfer problem Mem Amer Math Soc 137 1999 pp viii 66 A FATHI AND A FIGALLI Optimal transportation on non compact manifolds Israel J Math 175 2010 pp 1 59 D FEYEL AND A S USTUNEL Monge Kantorovitch measure transportation and Monge Amp re equation on Wiener space Probab Theory Related Fields 128 2004 pp 347 385 A FIGALLI AND N GIGLI A new transportation distance between non negative measures with applications to gradients flows with Dirichlet boundary conditions J Math Pures Appl
181. same time with c concave and c convex potentials Theorem 2 18 Interpolation of potentials Let X d be a Polish geodesic space ui C Y2 X a constant speed geodesic in X W2 and yp a c c 1 convex Kantorovich potential for the couple uo p1 Then the function ps Hj q is a c concave Kantorovich potential for the couple us ut for any t lt s Similarly if is a c concave Kantorovich potential for pi po then H Q is a c 5 convex Kantorovich potential for ju ps for any t lt s Observe that that for t 0 s 1 the theorem reduces to the fact that Hi p 4 is a c concave Kantorovich potential for 44 4o a fact that was already clear by the symmetry of the dual problem discussed in Section 1 3 Proof We will prove only the first part of the statement as the second is analogous Step 1 We prove that H3 w is a c concave function for any t lt s and any X RU 00 This is a consequence of the equality c x y inf e z y x 2 37 from which it follows HE f 0 s i f t s inf t o Y z dne z y Yy inf c x z inte z y vy Step 2 Let y A Geod X be a measure associated to the geodesic u via equation 2 7 We claim that for every y supp jt and s 0 1 it holds s Ys evo c 9o Ys 2 13 Indeed the inequality lt comes directly from the definition by taking x yo To prove the opposite one observe that since e9 e
182. scheme converges in both cases to absolutely continuous curves x and respectively satisfying x V E z a e t i VE ae t Now notice that VE x VE x f x for every x 0 1 therefore the fact that f gt 1 is smooth on 0 1 V C gives that each of these two equations admit a unique solution Therefore this is the key point of the example x and must coincide In other words the effect of the function g is not seen at the level of Gradient Flow It is then immediate to verify that there is Energy Dissipation Equality for the energy E but there is only the Energy Dissipation Inequality for the energy E 3 2 5 The geodesically convex case EDE and regularizing effects Here we study gradient flows of so called geodesically convex functionals which are the natural metric generalization of convex functionals on linear spaces Definition 3 16 Geodesic convexity Let E X RU 00 be a functional and A R We say that E is A geodesically convex provided for every x y X there exists a constant speed geodesic y 0 1 X connecting x to y such that Bl lt HEE tE y 240 ts y 3 20 In this section we will assume that Assumption 3 17 Geodesic convexity hypothesis X d is a Polish geodesic space E X RU 00 is lower semicontinuous geodesically convex for some A R Also we assume that the sublevels of E are boundedly compact i e the set E
183. se convex functions are locally Lipschitz thus a c c hypersurface is a locally Lipschitz hypersurface Now we can state the result concerning existence and uniqueness of optimal maps Theorem 1 26 Brenier Let y IR be such that f x d x is finite Then the following are equivalent i for every v P R with f v dv x lt oo there exists only one transport plan from p to v and this plan is induced by a map T ii uis regular If either i or ii hold the optimal map T can be recovered by taking the gradient of a convex function Proof ii gt i and the last statement Take a x b x x in the statement of Theorem 1 13 Then our assumptions on u v guarantees that the bound 1 4 holds Thus the conclusions of Theorems 1 13 and 1 17 are true as well Using Remark 1 18 we know that for any c concave Kantorovich po tential y and any optimal plan y Opt u v it holds supp y C 0 y Now from Proposition 1 21 we know that 2 w is convex and that 0 O v Here we use our assumption on u Since Y is convex we know that the set E of points of non differentiability of Y is jj negligible Therefore the map V R R4 is well defined ju a e and every optimal plan must be concen trated on its graph Hence the optimal plan is unique and induced by the gradient of the convex function ii i We argue by contradiction and assume that there is some convex function Y R R such tha
184. sics and Levi Civita connection in the Wasserstein space are discussed in detail Finally Chapter 7 is devoted to an introduction to the synthetic notions of Ricci lower bounds for metric measure spaces introduced by Lott amp Villani and Sturm in recent papers This notion is based on suitable convexity properties of a dimension dependent internal energy along Wasserstein geodesics Synthetic Ricci bounds are completely consistent with the smooth Riemannian case and stable under measured Gromov Hausdorff limits For this reason these bounds and their analytic implications are a useful tool in the description of measured GH limits of Riemannian manifolds Acknowledgement Work partially supported by a MIUR PRIN2008 grant 1 The optimal transport problem 1 1 Monge and Kantorovich formulations of the optimal transport problem Given a Polish space X d i e a complete and separable metric space we will denote by A X the set of Borel probability measures on X By support supp j of a measure jj Z X we intend the smallest closed set on which p is concentrated If X Y are two Polish spaces T X Y is a Borel map and u A X a measure the measure Tyu P Y called the push forward of u through T is defined by Tau E p T E VE C Y Borel The push forward is characterized by the fact that J tn f 1570 for every Borel function f Y RU 00 where the above identity has to be understood in the following sense
185. simple proof we omit where C efo Lip ur dr _ 1 it is immediate to verify that it holds 1 Lip v dr t Lip v dr t These inequalities are perfectly analogous to the 6 12 well the only difference is that here the bound on the angle is L in t s while for the embedded case it was L but this does not really change anything Therefore the arguments presented before apply also to this case and we can derive the existence of the parallel transport along regular curves lu 9 T s t 2 m FP u llu lt Cl ul E uc Tan Fa R 6 20 Fee Ins S Chullu uc Tanz PaRI 97 Theorem 6 15 Parallel transport along regular curves Let u be a regular curve and u Tanpo 45 R7 Then there exists a parallel transport uz along u such that uo u Now we know that the parallel transport exists along regular curves and we know also that regular curves are dense it is therefore natural to try to construct the parallel transport along any absolutely continuous curve via some limiting argument However this cannot be done as the fol lowing counterexample shows Example 6 16 Non existence of parallel transport along a non regular geodesic Let Q 0 1 x 0 1 be the unit square in R and let T i 1 2 3 4 be the four open trian gles in which Q is divided by its diagonals Let po xqY and define the function v Q R as the gradient of the convex map max x y as
186. spect to X starting at T provided it is a locally absolutely continuous curve in 0 00 x T as t 0 and d A E a d zi y 5d v V lt Ely Vy X a e t 7 0 i 1 2 dt There are two basic and fundamental things that one needs understand when studying the problem of Gradient Flows in a metric setting 1 Although the formulations EDI EDE and EVI are equivalent for A convex functionals on Hilbert spaces they are not equivalent in a metric setting Shortly said it holds EVI gt EDE gt EDI and typically none of the converse implication holds see Examples 3 15 and 3 23 below Here the second implication is clear for the proof of the first one see Proposition 3 6 below 2 Whatever definition of Gradient Flow in a metric setting we use the main problem is to show existence The main ingredient in almost all existence proofs is the Minimizing Movements scheme which we describe after Proposition 3 6 Proposition 3 6 EVI implies EDE Let E X RU 00 be a lower semicontinuous func tional X X a given point A R and assume that x is a Gradient Flow for E starting from T in the EVI sense w rt A Then equation 3 9 holds Proof First we assume that x is locally Lipschitz The claim will be proved if we show that t gt E x is locally Lipschitz and it holds d 1 1 4 2609 zlil 5IV EP a a e t gt 0 Let us start observing that the triangle inequality implies ld 5g 24 9 gt
187. sport map T belongs to C9 Q1 for some a lt 1 In addition the following implication holds pec n e C Na T C 0 The convexity assumption on Q is needed to show that the convex function y whose gradient provides the optimal map T is a viscosity solution of the Monge Ampere equation pl x p Ve x det V e a and then the regularity theory for Monge Ampere developed by Caffarelli and Urbas applies As an application of Theorem 1 26 we discuss the question of polar factorization of vector fields on R7 Let Q C IR be a bounded domain denote by uo the normalized Lebesgue measure on and consider the space S Q Borel map s Q 40 syuo po The following result provides a nonlinear projection on the nonconvex space Proposition 1 28 Polar factorization Let S L uo R be such that v Say is regular Definition 1 25 Then there exist unique s S Q and V with p convex such that S Vp os Also s is the unique minimizer of 5 seas Proof By assumption we know that both uo and v are regular measures with finite second moment We claim that among all S Q inf S adu i J y dy x y 1 7 at fI S du UN x yl d Y v y 1 7 To see why associate to each 5 S Q the plan y 3 4q which clearly belongs to Adm uo v This gives inequality gt Now let be the unique optimal plan and apply Theorem 1 26 twice to get that F7 Id Vv
188. ss to Lipschitz hypersurfaces of codimension 1 Then p is absolutely continuous w rt the volume measure resp gives O mass to Lipschitz hypersurfaces of codimension 1 for every t 1 In particular the set of absolutely continuous measures is geodesi cally convex and the same for measures giving 0 mass to Lipschitz hypersurfaces of codimension 1 Proof Assume that po is absolutely continuous let A C M be of 0 volume measure t 0 1 and let T be the optimal transport map from u to uo Then for every Borel set A C M it holds T T A D A and thus pl A i T7 03 A wo Ti A The claims follow from the fact that T is locally Lipschitz Remark 2 27 The set of regular measures is not geodesically convex It is natural to ask whether the same conclusion of the previous proposition holds for the set of regular measures Definitions 1 25 and 1 32 The answer is not there are examples of regular measures jo p1 in 2 IR such that the middle point of the geodesic connecting them is not regular 2 3 The weak Riemannian structure of 2 W2 In order to introduce the weak differentiable structure of W X W2 we start with some heuristic considerations Let X IR and u be a constant speed geodesic on W2 R induced by some optimal map 7 i e pi 1 t Hd tT uo Then a simple calculation shows that u satisfies the continuity equation d dit V up 0 43 with v T
189. t tris Weg un lt tri uto Pa Uto Ped uto un S Clullti tol 1 Clts tol which shows the absolute continuity Finally due to 6 17 it is sufficient to check that the covariant derivative vanishes at 0 To see this put t 0 and t t in 6 18 to get Pt u u C u t so that the thesis follows from 6 13 Now we come back to the Wasserstein case To follow the analogy with the Riemannian case keep in mind that the analogous of the translation map tr is the right composition with T s t and the analogous of the map P is Zr u P u 2 T s t which maps L onto Tan 27 R We saw that the key to prove the existence of the parallel transport in the embedded Riemannian case are inequalities 6 12 Thus given that we want to im itate the approach in the Wasserstein setting we need to produce an analogous of those inequalities This is the content of the following lemma We will denote by Tan 43 R the orthogonal complement of Tan R7 in L2 Lemma 6 14 Control of the angles between tangent spaces Let u v 3 IR4 and T R gt IR be any Borel map satisfying Typ v Then it holds lvo T Ppl o T ll lvllcLip T Id Wu Tan 2 R and if T is invertible it also holds P wo Tl lwll Lip T 14 Vw Tant Z2 R 96 Proof We start with the first inequality which is equivalent to lVe
190. t with any Vx when varies on C R Since the set Vip is dense in Tan Z IR7 for any t 0 1 the formula actually identifies DP uz However from this expression it is unclear what is the value of DP m Ut w for a general w Tan 5 IR7 because some regularity of Vip seems required to compute Vp v In order to better understand what the value of DP u Cut is fix t 0 1 and assume for a moment that v CX IR Then compute the gradient of x V x v 2 to obtain V Vo v Vp v Vu Vo and consider this expression as an equality between vector fields in L5 Taking the projection onto the Normal space we derive PL V o w Pi Vot Vy 0 99 m Rs us 9 T t 5 l Pu P us 9 Ti Ht Plugging the expression for p V ve into the formula for the covariant derivative we get D d L L as ve COON NA Pi ut P Vot d Vo t d l gre TA vul which identifies P u as D d Spee ue Py Zu Vu PL u 6 23 We found this expression assuming that v was a smooth vector field but given that we know that DP n ut exists for a e t it is realistic to believe that the expression makes sense also for general Lipschitz v s The problem is that the object Vv may very well be not defined j a e for arbitrary u and Lipschitz v Rademacher s theorem is of no help here because we are not assuming the measures u to be abso
191. t h 0 and use the dominated convergence theorem to get E x thy d ld i 2 lt t t hr ELE LE T E SET im f ar q E09 f rdr 54 09 Recalling 3 10 we conclude with Elg gt i gt sl IIVE a e t gt 0 Finally we see how the local Lipschitz property of x can be achieved It is immediate to verify that the curve t x44 is a Gradient Flow in the EVI sense starting from z for all h gt 0 We now use the fact that the distance between curves satisfying the EVI is contractive up to an exponential factor see the last part of the proof of Theorem 3 25 for a sketch of the argument and Corollary 4 3 3 of 6 for the rigorous proof We have das ien lt e 6579 d T viua Vs gt t Dividing by h letting h 0 and calling B C 0 00 the set where the metric derivative of x exists we obtain t lt dde7 979 Vs t B s gt t This implies that the curve x is locally Lipschitz in 0 00 Let us come back to the case of a convex and lower semicontinuous functional F on an Hilbert space Pick z D F fix 7 gt 0 and define the sequence n gt Tn recursively by setting T6 F and defining 27 as a minimizer of n4 1 T 2 z 20 x ex F a 2T 54 It is immediate to verify that a minimum exists and that it is unique thus the sequence n gt Lin is z i well defined The Euler Lagrange equation of T5411 Is zr r n1 n 9 T which is a time discretiz
192. t the set E of points of non differentiability of Y has positive jj measure Possibly modifying Y outside a compact set we can assume that it has linear growth at infinity Now define the two maps T x the element of smallest norm in O P x S x the element of biggest norm in 07 P x and the plan 1 Y g 0d T un Id 5 4H The fact that Y has linear growth implies that v T y has compact support Thus in particular J Ix dv x lt oo The contradiction comes from the fact that y Adm pi v is c cyclically mono tone because of Proposition 1 21 and thus optimal However it is not induced by a map because T x S ona set of positive u measure Lemma 1 20 16 The question of regularity of the optimal map is very delicate In general it is only of bounded variation BV in short since monotone maps always have this regularity property and disconti nuities can occur just think to the case in which the support of the starting measure is connected while the one of the arrival measure is not It turns out that connectedness is not sufficient to prevent discontinuities and that if we want some regularity we have to impose a convexity restriction on supp v The following result holds Theorem 1 27 Regularity theorem Assume Q1 Q2 C R4 are two bounded and connected open sets u PL lo y nL lo with 0 lt c p n C for some c C R Assume also that Qo is convex Then the optimal tran
193. te to verify that a vector field of the kind V7 along it is C Its covariant derivative calculated at t 0 is given by P V w Vo Thus we write VvoVU P Q V v Ve Ve CX R 6 31 Proposition 6 25 Let jj P2 R and 1 p2 o3 C R4 The curvature tensor R in p calculated for the 3 vector fields V qi i 1 2 3 is given by Riv Via Ved os Ns Vio Via 6 32 Ot Ns Von Vea 2050 Nu Vipi Vea Proof We start computing the value of Vyy Vvy Vos Let u Id tV 92 and observe as just recalled that u is a regular geodesic in some symmetric interval T T The vector field V 03 Vi is clearly C along it thus by Proposition 6 24 also the vector field wz P V 43 Viq1 Vvo Voa pi is C The covariant derivative at t 0 of uz along u is by definition the value of Vy Vy Vos at u Applying formula 6 25 we get Vvo Ve V3 Pu V V us V1 V2 V ga Pi V V3 V1 6 33 Symmetrically it holds Voor Vvo Vs Pu V V ps Va V1 V7 41 PL V ga Vy 6 34 Finally from the torsion free identity 6 10 we have Vy Vo P V pi Vp V p2 Ver and thus Vivo vei Ves Pu Ves PV Ves Vh Ve 35 Subtracting 6 35 and 6 34 from 6 33 and observing that V V os V1 Vea V V7 93 Va V1 V ps V7 91 Va V pa V pa Vq we get the thesis Observe that equation 6 32 is equivalent to R
194. that the validity of 6 3 is independent on the parametrization of the curve thus if it is fulfilled it is always possible to reparametrize the curve e g with constant speed in order to let it satisfy also 6 2 Now assume that p is regular Then by the classical Cauchy Lipschitz theory we know that there exists a unique family of maps T t s supp ut supp is satisfying ds 6 4 d T t s z vs T t s 2 Vt 0 1 x supp u a e s 0 1 T t t 2 Vt 0 1 supp ue Also it is possible to check that these maps satisfy the additional properties T r s o T t r T t 8 Vt r s 0 1 T t s J4 ht Hs Vt s 0 1 We will call this family of maps the flow maps of the curve u Observe that for any couple of times t s 0 1 the right composition with T t s provides a bijective isometry from L7 to T Also notice that from condition 6 2 and the inequalities K 7 IT t 8 s T t s JII2 lt J vr T t r aar dpa x lt s 4f lor 2 I2 ey dr 89 we get that for fixed t 0 1 the map s T t s D is absolutely continuous It can be proved that the set of regular curves is dense in the set of absolutely continuous curves on IR with respect to uniform convergence plus convergence of length We omit the technical proof of this fact and focus instead on the important case of geodesics Proposition 6 3 Regular
195. the analysis done by Cheeger in 29 Our final goal is to show that in non branching C D 0 N spaces a local Poincar inequality holds The importance of the non branching assumption is due to the following lemma Lemma 7 19 Let X d m be a non branching C D 0 N space B C X a closed ball of positive measure and 2B the closed ball with same center and double radius Define the measures ju m B m and u yg p x u P Geod X where x y gt 7 is the map which associates to each x y the unique geodesic connecting them such a map is well defined for m x m a e x y by Proposition 7 16 Then 9N er eu lt mB By lae Vt 0 1 120 Proof Fix x B t 0 1 and consider the homothopy map B 3 y Hom y yp By Proposition 7 16 we know that this map is well defined for rn a e y and that using the characteriza tion of geodesics given in Theorem 2 10 t gt u Hom 4p is the unique geodesic connecting x to u We have m Homz E VE C X Borel m B C ore ut E n Hom E The non branching assumption ensures that Hom is invertible therefore from the fact that z Hom E Hom Hom E E the Brunn Minkowski inequality and the fact that m x 0 we get m E gt t m Hom E and therefore pu E lt a Given that E was arbitrary we deduce n m Ht S tNm B 7 21 Notice that the expression on the right hand side is independent on z
196. the change of variable formula the identity 7 p x PUES NOE EVERY Using the fact that 4 p det Id eV y z Av x we have d ae Ge lexo H de le o z pu og x u p Ap V pu p u p V f V v p Ve p E d p x 02 fw y dy Sees uA det Id eV o xz dx and the conclusion follows by the equivalence 3 50 As an example let u z zlog x and let V be a A convex smooth function on R Since u z log z 1 we have pV u p Ap thus a gradient flow p of F E V solves the Fokker Plank equation d di Api V VV pc Also the contraction property 3 31 in Theorem 3 25 gives that for two gradient flows pz pz it holds the contractivity estimate Wa pt jx Wa po po 3 4 Bibliographical notes The content of Section 3 2 is taken from the first part of 6 we refer to this book for a detailed bibliographical references on the topic of gradient flows in metric spaces with the only exception of Proposition 3 6 whose proof has been communicated to us by Savar see also 72 73 The study of geodesically convex functionals in Z7 IR W2 has been introduced by R Mc Cann in 63 who also proved that conditions 3 44 and 3 45 were sufficient to deduce the geodesic convexity called by him displacement convexity of the internal energy functional The study of gradient flows in the Wasserstein space began in the seminal pap
197. there is only one geodesic connecting x to T x a The question of regularity of optimal maps on manifolds is much more delicate than the cor responding question on R4 even if one wants to get only the continuity We won t enter into the details of the theory we just give an example showing the difficulty that can arise in a curved setting The example will show a smooth compact manifold and two measures absolutely continuous with positive and smooth densities such that the optimal transport map is discontinuous We remark that similar behaviors occur as soon as M has one point and one sectional curvature at that point which is strictly negative Also even if one assumes that the manifold has non negative sectional curvature everywhere this is not enough to guarantee continuity of the optimal map what comes into play in this setting is the Ma Trudinger Wang tensor an object which we will not study Example 1 36 Let M C IR be a smooth surface which has the following properties e M is symmetric w r t the x axis and the y axis e M crosses the line x y 0 0 at two points namely O O 21 e the curvature of M at O is negative These assumptions ensure that we can find a b gt 0 such that for some za zp the points A a 0 2a A a 0 Za B 0 b zi B 0 b zb belong to M and d A B gt d A O d O B d being the intrinsic distance on M By continuity and symmetry we can find
198. tion problem 3 12 admits a solution if 7 lt 1 A7 The lower semicontinuity of the slope is a direct consequence of 3 21 and of the lower semi continuity of E Thus to conclude we need only to show that Ln x sup VE an E an lt oo gt lim E r lt E x 3 22 n oo From 3 21 with x y replaced by n x respectively we get E x gt E x4 VE an d a n P e 4 and the conclusion follows by letting n oo Thus Theorem 3 14 applies directly also to this case and we get existence of Gradient Flows in the EDI formulation To get existence in the stronger EDE formulation we need the following result which may be thought as a sort of weak chain rule observe that the validity of the proposition below rules out behaviors like the one described in Example 3 15 Proposition 3 19 Let E be a geodesically convex and l s c functional Then for every absolutely continuous curve x C X such that E x lt oo for every t it holds E zs E a1 lt T VE dr Vt s 3 23 t Proof We may assume that the right hand side of 3 23 is finite for any t s 0 1 and by a reparametrization argument we may also assume that z 1 for a e t in particular x is 60 1 Lipschitz so that t VE x is an L function Notice that it is sufficient to prove that t E zi is absolutely continuous as then the inequality z E at n E zi E z i E zi n I
199. to W2 uo u1 and by the arbitrariness of t we conclude 39 Example 2 21 2 X W2 may be not NPC if X d is Let X R with the Euclidean dis tance We will prove that 275 R W2 is not NPC Define 1 1 1 Ho 300 c3 i 3 0c 6 5 3 v 5 50 0 d 0 4 then explicit computations show that W2 uo p1 40 and W2 uo v 30 W2 ui v The unique constant speed geodesic u from 4o to u1 is given by 1 pi 5 a 6t 1 21 5 6t 3 20 and simple computations show that 30 30 40 40 W3 my2 v gt uw tuu 2 3 X Riemannian manifold In this section X will always be a compact smooth Riemannian manifold M without boundary endowed with the Riemannian distance d We study two aspects the first one is the analysis of some important consequences of Theorem 2 18 about the structure of geodesics in Z M the second one is the introduction of the so called weak Riemannian structure of Pa M W2 Notice that since M is compact Y2 M M Yet we stick to the notation Y2 M because all the statements we make in this section are true also for non compact manifolds although for simplicity we prove them only in the compact case 2 3 1 Regularity of interpolated potentials and consequences We start observing how Theorem 2 10 specializes to the case of Riemannian manifolds Corollary 2 22 Geodesics in 2 M W2 Let m C M Then the following two things are equivalent
200. try is divide the cube into 27 cubes of side length 1 2 then split the delta into 27 masses and let them move onto the centers of these 27 cubes Repeat the process by dividing each of the 27 cubes into 27 cubes of side length 1 4 and so on The total cost of this dynamical transfer is proportional to oo oo X gid 1 1 ES 9i d 1 ad Sn Ji Qaid m i i l numberof segments SH i 1 at the step length of each weighted mass on each segment at the step i segment at the step 2 which is finite if and only if d 1 ad lt 0 that is if and only if a gt 1 i A regularity result holds for a 1 1 d 1 which states that far away from the supports of the starting and final measures any minimal transfer is actually a finite tree Theorem 5 3 Regularity Let u v A R with compact support a 1 1 n 1 and let T r w be a continuous tree with minimal a cost between u and v Then Y is locally a finite tree in IR X supp 4 U supp v 5 2 Different action functional Let us recall that the Benamou Brenier formula Proposition 2 30 identifies the squared Wasserstein distance between u 90 274 u pl 74 IR by 1 Wig m f eG ous t nd 0 where the infimum is taken among all the distributional solutions of the continuity equation d gaty vpt 0 with po p and p pt A natural generalization of the distance W2 comes by considering a new action modified by putting a weight on th
201. ts yields that the sequence n jin is tight Thus to prove narrow convergence it is sufficient to check that f fdus f fdu for every f Ce X Since Lipschitz functions are dense in C X w r t uniform convergence it is sufficient to check the convergence of the integral only for Lipschitz f s This follows from the inequality J ran f tam fto fes rs lt furto ronis s arate f d z y dv T x Lip f f d v y d y 2 4 Lip f Wa us pa 29 Since the argument does not depend on the subsequence chosen the claim is proved We pass to the converse implication in 2 4 Pick y Opt ju Hn and use Remark 1 4 to get that the sequence 7 is tight hence up to passing to a subsequence we can assume that it narrowly converges to some y By Proposition 2 5 we know that y Opt u u which forces J a y dy x y 0 By Proposition 2 4 and our assumption on 1 p we know that un is 2 uniformly integrable thus by Remark 2 3 again we know that 7 is 2 uniformly integrable as well Since the map x y d a y has quadratic growth in X it holds lim Wu p lim rs os irs Yan 0 Now we prove that 275 X W2 is complete Pick a Cauchy sequence un and assume with out loss of generality that X W2 Un 4541 lt oo For every n N choose y Opt Hn Mn 1 and use repeatedly the gluing lemma to find for every n N a measure 3 4 X such that n n4 1 E T
202. u 0 00 be convex continuous and C on 0 00 with u 0 0 and define the pressure p 0 20 gt R by p z zu z u z Vz gt 0 and p 0 0 Also let u pm Y M with p C M pick p CS M and define T M gt M by Ti x exp V q a Then it holds Salf Ty t oA n CA V Rie Ve ve dm where by V2y 2 we mean the trace of the linear map V7p x Te M T M in coordi nates this reads as gt 0 9 z Proof Computation of the second derivative Let D x det VTi x pt Ti piVol By compactness for t sufficiently small T is invertible with smooth inverse so that Dz p C M For small the change of variable formula gives pt pa p Ta Delay Thus we have all the integrals being w r t m d _d p 0 pD p f BOX aca a 9 3 5 5 pp DS ADU and Salo f 00 fo D J vo p p Do having used the fact that Do 1 Evaluation of D and Dj We want to prove that Di z Ay 2 Dj 2 Av z V y a Rie Ve z Vy 2 For t gt 0 and x M let J x be the operator from Ty M to Ti evy x M given by 7 7 the value at s t of the Jacobi field js along the geodesic s exp sV q x having the initial conditions jo v jj V q v JIi x v where here and in the following the apex on a vector tensor field stands for covariant differ
203. u oo lim z 00 Zz and for every compact metric space X d define the functional amp A X RU 00 by 8 ulv J u g dv ul co u X 7 6 where u pv u is the decomposition of u in absolutely continuous py and singular part ji w r t tov Lemma 7 4 amp decreases under y Let X dx mx and Y dy my be two metric measure space and d y a coupling between them Then it holds Ysu my umx Vu e Z3 X yg vmx lt amp v my Vv P Y 110 Proof Clearly it is sufficient to prove the first inequality Let jj pm x and yyy nmy with 7 given by 7 5 By Jensen s inequality we have Eran f ulay dmy y fu f muti amt lt f uo e ery o amy y f totos n f wola dmx 2 mx Proposition 7 5 Mosco convergence of internal energy functionals Let Xn dn Mn EA X d m and dn Yn Opt dn Mn d m Then the following two are true Weak T lim For any sequence n un P Xn such that n gt Yn Hn narrowly converges to some u P X it holds lim amp u ma gt amp u m n gt o0 Strong T lim For any u P9 X with bounded density there exists a sequence n gt in PS Xn such that W2 Y Hn H 0 and lim amp u ma amp u m n gt o0 Note we put the apexes in Mosco because we prove the I lim inequality only for measures with bounded densities This will be enough to p
204. uence of measures such that mih pna m22 mu Uy Yn N Theorem 2 7 Basic properties of the space 2 5 X W2 Let X d be complete and separable Then Un gt H narrowly Wat di 0 con gt J c for some zo X 2 4 Furthermore the space X W2 is complete and separable Finally K C A X is relatively compact w r t the topology induced by W if and only if it is tight and 2 uniformly integrable Proof We start showing implication in 2 4 Thus assume that W j1 p 0 Then J 2C 20 dun f ec oon W2 un zo Wa u s Wa un u 0 To prove narrow convergence for every n N choose y Opt u Hn and use repeatedly the gluing lemma to find for every n N a measure a A X x X such that 0 n uu Tu an Yn gt 0 1 n 1 Ty An An 1 Then by Kolmogorov s theorem we know that there exists a measure a A X x XN such that m ub o Qn Yn N By construction we have lld x9 ll rox xm o ld n n l ra x2 u Wau Un 0 Thus up to passing to a subsequence not relabeled we can assume that 7 x 7 x for a almost any x X x XN Now pick f C X and use the dominated convergence theorem to get lim fdun lim fomda f fonaa f fay noo n oo if closed balls in X are compact the proof greatly simplifies Indeed in this case the inequality R u X NV Br xo lt f d xo d and the uniform bound on the second momen
205. ular for every t gt 0 it holds v OV F u a e t 0 00 3 49 where v is the velocity vector field associated to u characterized by d ght T V Qua 0 v Tan Z5 R a e t Proof Use the existence Theorem 3 25 and the equivalence of the EVI formulation of Gradient Flow and the Subdifferential one provided by Proposition 3 28 It remains to understand which kind of equation is satisfied by the Gradient Flow u By equation 3 49 this corresponds to identify the subdifferentials of V W ata generic pp 4 IR This is the content of the next three propositions For simplicity we state and prove them only under some unneeded smoothness assumptions The underlying idea of all the calculations we are going to do is the following equivalence v OV X u amp lim F Id Wee F u f eva Yo CHR 3 50 valid for any A geodesically convex functional where we wrote amp to intend that this equivalence holds only when everything is smooth To understand why 3 50 holds start assuming that v OV F u fix o CS IR and recall that for sufficiently small the map Jd eV is optimal Remark 1 22 Thus by definition of subdifferential we have Flu e f o Ve du PRIR S Fd even Subtracting F u on both sides dividing by gt 0 ande lt 0 and letting e 0 we get the implication To prove the converse one pick v 4 IR let T be the opti
206. umption 7 1 Then D is a distance on X and in particular D is 0 only on couples of isomorphic metric measure spaces Finally the space X D is complete separable and geodesic Proof See Section 3 1 of 74 We will denote by Opt dx mx dy my the set of optimal couplings between X dx mx and Y dy my i e the set of couplings where the inf in 7 4 is realized Given a metric measure space X d m we will denote by Z2 X C A X the set of measures which are absolutely continuous w r t m To any coupling d y of two metric measure spaces X dx mx and Y dy my it is natu rally associated a map yy P3 X 4 Y defined as follows fb pmx 41 nmy where 7 is defined by n y oar a 7 5 where t5 is the disintegration of y w r t the projection on Y Similarly there is a natural map yg PSY gt P X given by v nmy Vey pmx where pis defined by p x f to where obviously y is the disintegration of y w r t the projection on X Notice that yzmx my and yg mv m x and that in general Ya ral p Also if y is induced by a map T X Y i e if y Id T ym x then yy u Typ for any p Z3 X Our goal now is to show that if Xn dn Mn D X d m of the internal energy kind on 42 X4 W2 Mosco converge to the corresponding functional on 272 X W2 Thus fix a con vex and continuous function u 0 00 gt R define u z
207. us it is not true that sublevels of amp w are tight and therefore boundedly compact Then the inequality R u supp m Br xo EXE to dp shows that the set of ws in X with bounded second moment is tight Hence the conclusion follows as before using this narrow compactness together with the lower semicontinuity of amp w w r t narrow convergence It remains to discuss the interest from now on we discuss some of the geometric and analytic properties of spaces having a weak Ricci curvature bound Proposition 7 13 Restriction and rescaling Let X d m be a CD K oo space resp C D 0 N space Then i Restriction f Y C X is a closed totally convex subset i e every geodesic with endpoints in Y lies entirely inside Y such that m Y gt 0 then the space Y d m Y m isa C D K oo space resp C D 0 N space ii Rescaling for every o gt 0 the space X od m is a C D o K oo space resp C D 0 N space Proof i Pick fo y P Y C A X and a constant speed geodesic u C Y X connecting them such that K amp x 4 1 6 uo t amp H1 FEC WA Ho 11 resp satisfying the convexity inequality for the functional amp y N gt N We claim that supp j4 C Y for any t 0 1 Recall Theorem 2 10 and pick a measure u Y Geod X such that pi eo where e is the evaluation map defined by equation 2 6 Since supp uo supp u1 C Y we know that for any
208. uss the problem of existence of optimal maps in the model case cost distance In Chapter 2 we introduce the Wasserstein distance W2 on the set 4 X of probability measures with finite quadratic moments and X is a generic Polish space This distance naturally arises when considering the optimal transport problem with quadratic cost The connections between geodesics in 2 X and geodesics in X and between the time evolution of Kantorovich potentials and the Hopf Lax semigroup are discussed in detail Also when looking at geodesics in this space and in particular when the underlying metric space X is a Riemannian manifold M one is naturally lead to the so called time dependent optimal transport problem where geodesics are singled out by an action minimization principle This is the so called Benamou Brenier formula which is the first step in the interpretation of Z M as an infinite dimensional Riemannian manifold with W as Riemannian distance We then further exploit this viewpoint following Otto s seminal work 67 In Chapter 3 we make a quite detailed introduction to the theory of gradient flows borrowing almost all material from 6 First we present the classical theory for A convex functionals in Hilbert spaces Then we present some equivalent formulations that involve only the distance and therefore are applicable at least in principle to general metric space They involve the derivative of the distance from a point the EVI fo
209. uthors of these notes in 5 Later on after having beed aware of Lott s results the second author generalized the construction to the case of Wasserstein space built over a manifold in 44 Not all the results have been reported here we mention that it is possible to push the analysis up show the differentiability properties of the exponential map and the existence of Jacobi fields 7 Ricci curvature bounds Let us start recalling what is the Ricci curvature for a Riemannian manifold M which we will always consider smooth and complete Let R be the Riemann curvature tensor on M x M and u v T M Then the Ricci curvature Ric u v R is defined as Ric u v 2 R u ei v ei where e is any orthonormal basis of TaM An immediate consequence of the definition and the symmetries of R is the fact that Ric u v Ric v u Another more geometric characterization of the Ricci curvature is the following Pick x M a small ball B around the origin in TyM and let u be the Lebesgue measure on B The exponential map exp B M is injective and smooth thus the measure exp 4 4 has a smooth density w r t the volume measure Vol on M For any u B let f u be the density of exp 4 0 w r t Vol at the point exp u Then the function f has the following Taylor expansion 1 f u 14 jRic u u o ul 7 1 It is said that the Ricci curvature is bounded below by A R provided Ric u u gt Alul for e
210. vergence e any limit curve x is a Gradient Flow in the EDI formulation Definition 3 3 Sketch of the Proof Compactness By Corollary 3 12 we have T 2 T d 7 T lt n lt m Dspz dr lt 2T E z inf E yes 7 0 0 for any T nr Therefore for any T gt 0 the set x7 lt r is uniformly bounded in 7 As this set is also contained in E lt E Z it is relatively compact The fact that there is relative compactness w r t local uniform convergence follows by an Ascoli Arzel type argument based on the inequality s 2 d x7 7 lt Psp lar lt 2 s t E z inf E Vi nr s mr n m c N t 3 17 Passage to the limit Let 7 0 be such that 17 converges to a limit curve z locally uniformly Then by standard arguments based on inequality 3 17 it is possible to check that t gt x is abso lutely continuous and satisfies r dr lt lim J Dsp7 dr vV0xt s 3 18 t t n oo By the lower semicontinuity of V E and 3 14 we get VE z lt lim VE zi lt lim Dsl Vt n oo Nn Oo thus Fatou s lemma ensures that for any t lt s it holds VEP Gd lt f lim VEl zz dr lim Dsl dr 2T E z inf E 3 19 t t n oo no Jt Now passing to the limit in 3 16 written for t 0 we get the first inequality in 3 8 Also from 3 19 we get that the L norm of f t lim VE z on 0 oo is finite Thus A f lt oo has ful
211. very x M and u T M Several important geometric and analytic inequalities are related to bounds from below on Ricci curvature we mention just two of them e Brunn Minkowski Suppose that M has non negative Ricci curvature and for any Ao A C M compact let At I 7 is a constant speed geodesic s t yo Ao 1 Ai Vt 0 1 Then it holds Vol Ao gt 1 Vol Ao t Vol Ay Vt 0 1 02 where n is the dimension of M 107 e Bishop Gromov Suppose that M has Ricci curvature bounded from below by n 1 k where n is the dimension of M and k a real number Let M be the simply connected n dimensional space with constant curvature having Ricci curvature equal to n 1 k so that Misa sphere if k gt 0 a Euclidean space if k 0 and an hyperbolic space if k lt 0 Then for every x M and i M the map Vol B x rd V l B 8 7 3 is non increasing where Vol and Vol are the volume measures on M M respectively A natural question is whether it is possible to formulate the notion of Ricci bound from below also for metric spaces analogously to the definition of Alexandrov spaces which are a metric analogous of Riemannian manifolds with bounded either from above or from below sectional curvature What became clear over time is that the correct non smooth object where one could try to give a notion of Ricci curvature bound is not a metric space but rather a metric
212. w that for any s gt 0 it holds 0 00 Now we use s 1 s 1 S EG EES f rllVEl nar lt 5 f Parti f EPE 62 0 0 0 Therefore t F a is locally absolutely continuous and it holds A f L4 Es 5 f nart IV E z dr E z Vs gt 0 0 0 Subtracting from this last equation the same equality written for s t we get the thesis Remark 3 21 It is important to underline that the hypothesis of A geodesic convexity is in general of no help for what concerns the compactness of the sequence of discrete solutions E The geodesic convexity hypothesis ensures various regularity results for the limit curve which we state without proof Proposition 3 22 Let X E be satisfying Assumption 3 17 and let x be any limit of a sequence of discrete solutions Then 62 the limit A xt4h Xt li thst lee a exists for every t gt 0 ii the equation d d Feo VEl z i ltf V Elo is satisfied at every t gt 0 iii the map t e E z is convex the map t e VE az is non increasing right continuous and satisfies t SIVE a1 lt e E e0 Eo VE x lt 1 24 te Evo inf E where E X gt R is defined as d a y 2t E x inf E y iv pns if A gt 0 then E admits a unique minimum min and it holds 2dP 2 tmin lt E10 Bltmin lt E 2o min Observe that we didn t state an
213. we expect that the total derivative of N uz we is something like d d d n ut Wt su en TN v Wt as the derivative of V applied to the couple uz w some tensor which we may think di di E Forget about the last object and look at the first two addends given that the domain of definition of N a is not the whole L2 in order for the above formula to make sense we should ask that in each of the couples u w and uz mu t at least one vector is Lipschitz Under the assumption that Us Lip u dt lt oo and d Lip Zu dt lt oo it is possible to prove the following theorem whose proof we omit Theorem 6 22 Let u be an absolutely continuous curve let vi be its velocity vector field and let us w be two absolutely continuous vector fields along it Assume that J Lip u dt lt oo and i Lip Z u dt lt 00 Then Nn ui we is absolutely continuous and it holds d d gNm uz wi aN en TN u San 6 27 Ov Np us We Pp O3 Niue ues we Corollary 6 23 Let u be a regular curve and assume that its velocity vector field v satisfies d f Lip Zu dt lt 6 28 Then for every absolutely continuous vector field uz both Oy uz and O o u4 are absolutely continuous and their total derivatives are given by d d 49 us Ody uz T Ov Su xs On On u4 F Pu en O u 6 29 50 ue O us 01 S 02 61 ue O On
214. y inequality is satisfied along it To check that it is a geodesic just notice that for any partition of 0 1 we have Wa vo p1 lim Wa eg o1 lim 5 Walo otya 2 5 lim Walk Ohy z 2 Wales Otiga LU p n gt o0 Passing to the limit in 7 12 recalling Proposition 7 5 to get that amp u7 Go fi i 0 1 and that lim amp n 2 lim 4 amp 0 o 2 Es0 o4 we conclude To deal with general uo p1 we start recalling that the sublevels of are tight indeed using first the bound z log z gt i and then Jensen s inequality we get 1 m X E UE 0 gt TA ESQ 2 plog p dm gt i E log 25 115 for any u pm such that amp i lt C and any Borel E C X This bound gives that if mm E gt 0 then u En 0 uniformly on the set of ws such that 6 4 lt C This fact together with the tightness of m gives the claimed tightness of the sublevels of z Now the conclusion follows by a simple truncation argument using the narrow compactness of the sublevels of and the lower semicontinuity of amp 55 w r t narrow convergence For the stability of the C D 0 N condition the argument is the following we first deal with the case of 4o p with bounded densities with exactly the same ideas used for o Then to pass to the general case we use the fact that if X d m is a CD 0 N space then supp m d m is a doubling space Proposition 7 15 below notice that n lt N and th
215. y interpreted as the metric speed of the absolutely continuous curve x as defined in 2 19 The metric analogous of V E x is the slope of E defined as Definition 3 2 Slope Let E X RU 20 and x X be such that E x lt oo Then the slope V E x of E at x is V E z m Pt Fy max f d x y The three definitions of Gradient Flows in a metric setting that we are going to use are 2 d x y o Definition 3 3 Energy Dissipation Inequality definition of GF EDI Let E X gt RU 00 and let x X be such that E T lt oo We say that 0 o0 3 t z X is a Gradient Flow in the EDI sense starting at T provided it is a locally absolutely continuous curve xo T and 1 f 1 f E x 3 li Adr 3 VE a dr lt E z Vs gt 0 3 8 1 f 1 f E as 7 e dr 7 IVE z dr lt E zi a e t gt 0 Vs gt t t t Definition 3 4 Energy Dissipation Equality definition of GF EDE Let E X RU co and let x X be such that E x lt oo We say that 0 o0 3 t x X is a Gradient Flow in the EDE sense starting at T provided it is a locally absolutely continuous curve xo x and 1 f 1 f E as 7 tr dr gt VE z dr E z VO lt t lt s 3 9 t t 52 Definition 3 5 Evolution Variation Inequality definition of GF EVI Let E X RU oo T E lt oo and A R We say that 0 00 3 t a X is a Gradient Flow in the EVI sense with re
216. y result concerning the uniqueness nor about contractivity of the curve x satisfying the Energy Dissipation Equality 3 9 The reason is that if no further assumptions are made on either X or E in general uniqueness fails as the following simple example shows Example 3 23 Lack of uniqueness Let X R endowed with the L norm E X R be defined by E x x x and z 0 0 Then it is immediate to verify that V E 1 and that any Lipschitz curve t gt x x1 22 satisfying a t Vt gt 0 x al a e t gt 0 satisfies also E x t e 1 This implies that any such x satisfies the Energy Dissipation Equality 3 9 3 2 4 The compatibility of Energy and distance EVI and error estimates As the last example of the previous section shows in general we cannot hope to have uniqueness of the limit curve x obtained via the Minimizing Movements scheme for a generic geodesically convex functional If we want to derive properties like uniqueness and contractivity of the flow we need to have some stronger relation between the Energy functional E and the distance d on X in this section we will assume the following 63 Assumption 3 24 Compatibility in Energy and distance X d is a Polish space E X gt R U 00 is a lower semicontinuous functional and for any xo x1 y X there exists a curve t q t such that 2 E w 1 t E zo tE 21 Put t d zo 21 d 4 4

A user's guide to optimal transport

Contents

Download Pdf Manuals

Related Search

Related Contents