Dynamic Flows on Curved Space Generated by Labeled Data

Dynamic Flows on Curved Space Generated by Labeled Data

Xinru Hua, Truyen Nguyen, Tam Le, Jose Blanchet, Viet Anh Nguyen

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Main Track. Pages 3803-3811. https://doi.org/10.24963/ijcai.2023/423

The scarcity of labeled data is a long-standing challenge for many machine learning tasks. We propose our gradient flow method to leverage the existing dataset (i.e., source) to generate new samples that are close to the dataset of interest (i.e., target). We lift both datasets to the space of probability distributions on the feature-Gaussian manifold, and then develop a gradient flow method that minimizes the maximum mean discrepancy loss. To perform the gradient flow of distributions on the curved feature-Gaussian space, we unravel the Riemannian structure of the space and compute explicitly the Riemannian gradient of the loss function induced by the optimal transport metric. For practical applications, we also propose a discretized flow, and provide conditional results guaranteeing the global convergence of the flow to the optimum. We illustrate the results of our proposed gradient flow method on several real-world datasets and show our method can improve the accuracy of classification models in transfer learning settings.
Keywords:
Machine Learning: ML: Optimization
Machine Learning: ML: Multi-task and transfer learning
Computer Vision: CV: Machine learning for vision
Machine Learning: ML: Few-shot learning