Flow Coupling and Semidiscrete Couplings

Flow models parameterized as time-dependent velocity fields can generate data with noise by integrating the ODE. These models are often trained using flow simulation, i.e. by randomly sampling noise and target points. (x0,x1)(mathbf{x}_0, mathbf{x}_1)(x0.,x1.) and to ensure that the velocity field corresponds, on average, to x1−x0mathbf{x}_1 - mathbf{x}_0x1.−x0. when tested with segment connectivity x0mathbf{x}_0x0. to x1mathbf{x}_1x1.. While these pairs are sampled independently by default, they can also be selected more carefully by cluster matching nnn sound in nnn target points using an optimal transport (OT) solver. Although promising in theory, the OT flow matching (OT-FM) method is not widely used in practice. Zhang et al. (2025) showed recently that OT-FM really starts to pay off when the batch size nnn increases significantly, which can only be done by the multi-GPU implementation of the Sinkhorn algorithm. Unfortunately, the cost of using Sinkhorn can balloon quickly, which it needs O(n2/ε2)O(n^2/varepsilon^2)O(n2/ε2) everyone's jobs nnn pairs are used to fit the velocity field, where εvarepsilonε the normalization parameter should be smaller in general to produce better results. To fulfill the theoretical promises of OT-FM, we propose to move away from batch-OT and instead rely on a semidiscrete architecture that exploits the fact that the target dataset distribution is often of finite size. NNN. The SD-OT problem is solved by estimating a two-dimensional vector using SGD; using that vector, the newly generated audio samples during the train can be matched to the data points at the cost of internal product search (MIPS). Semidiscrete FM (SD-FM) removes the quadratic dependence n/εn/varepsilonn/ε that shuts down OT-FM. SD-FM outperforms both FM and OT-FM on all training metrics and target budget constraints, on all multiple data sets, in unconditional/conditional generation, or when using slow-flow models. ** Work done while at Apple Source link