MMFP

Motion Manifold Flow Primitives for Task-Conditioned Trajectory Generation under Complex Task-Motion Dependencies

RA-Letters 2025: IEEE Robotics and Automation Letters

¹Massachusetts Institute of Technology, ²Seoul National University

TL;DR: MMFP can generate motions from text inputs, capturing complex text dependencies in motion distributions while simultaneously addressing the challenges posed by high-dimensional trajectory data and small dataset sizes.

Example 1: SE(3) Pouring Motion

Can I have a drink, whatever you've got?

Give me some wine, please.

May I have some water from the very left?

Example 2: 7-DoF Robot Waving Motion

Wave your hand.

Look to the very left and wave your hand.

Look to the front and wave your hand in a small gesture.

Real Robot Experiments

Give me some wine.

Give me some water and pour it from the very left.

Turn to the front and wave your hand in a very big gesture.

Turn to the very right and wave your hand.

Abstract

Developing text-based robot trajectory generation models is made particularly difficult by the small dataset size, the high dimensionality of the trajectory space, and the inherent complexity of the text-conditional motion distribution. Recent manifold learning-based methods have partially addressed the dimensionality and dataset size issues, but struggle with the complex text-conditional distribution. In this paper, we propose a text-based trajectory generation model that attempts to address all three challenges while relying on only a handful of demonstration trajectories. Our key idea is to leverage recent flow-based models capable of capturing complex conditional distributions, not directly in the high-dimensional trajectory space, but rather in the low-dimensional latent coordinate space of the motion manifold, with deliberately designed regularization terms to ensure smoothness of motions and robustness to text variations. We show that our Motion Manifold Flow Primitive (MMFP) framework can accurately generate qualitatively distinct motions for a wide range of text inputs, significantly outperforming existing methods.
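As a rough illustration of the key idea above (sample in a low-dimensional latent space with a text-conditioned flow, then decode to a trajectory), the following toy sketch may help. It is not the authors' implementation: the velocity field, the decoder, the dimensions, and every name below are hypothetical stand-ins, and fixed random linear maps take the place of trained networks. In practice the text embedding would come from a sentence encoder such as the Sentence-BERT model named in the summary outline; here it is just a plain vector.

```python
import numpy as np

# All sizes below are assumptions chosen for illustration only.
LATENT_DIM = 2               # latent coordinates of the motion manifold
EMBED_DIM = 4                # size of the (toy) text embedding
TRAJ_LEN, TRAJ_DIM = 50, 3   # decoded trajectory: 50 time steps, 3 values each

_rng = np.random.default_rng(0)
# Stand-ins for trained networks: fixed random linear maps.
W_COND = _rng.normal(size=(EMBED_DIM, LATENT_DIM))
W_DEC = _rng.normal(size=(LATENT_DIM, TRAJ_LEN * TRAJ_DIM))


def velocity(z, t, text_emb):
    """Toy text-conditioned velocity field in latent space.

    A trained flow network would replace this; the toy field simply pulls
    latent points toward a target that depends on the text embedding and
    ignores the time argument t.
    """
    target = text_emb @ W_COND
    return target - z


def decode(z):
    """Toy decoder from latent coordinates to trajectories."""
    return (z @ W_DEC).reshape(-1, TRAJ_LEN, TRAJ_DIM)


def sample_trajectories(text_emb, n_samples=4, n_steps=20, seed=0):
    """Draw latent noise, integrate the flow with Euler steps, then decode."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=(n_samples, LATENT_DIM))  # samples from the prior
    dt = 1.0 / n_steps
    for k in range(n_steps):
        z = z + dt * velocity(z, k * dt, text_emb)
    return decode(z)


emb_a = np.ones(EMBED_DIM)    # placeholder embedding for one instruction
emb_b = -np.ones(EMBED_DIM)   # placeholder embedding for another instruction
traj_a = sample_trajectories(emb_a)
traj_b = sample_trajectories(emb_b)
print(traj_a.shape, traj_b.shape)
```

The point of the sketch is only the structure: the flow is integrated in the small latent space, so the high-dimensional trajectory is touched once, at decoding time.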

Citation

@article{lee2025motion,
  title={Motion manifold flow primitives for task-conditioned trajectory generation under complex task-motion dependencies},
  author={Lee, Yonghyeon and Lee, Byeongho and Kim, Seungyeon and Park, Frank C},
  journal={IEEE Robotics and Automation Letters},
  year={2025},
  publisher={IEEE}
}

Summary

1 Dataset: SE(3) Pouring Trajectories

2 Three Key Challenges

3 Curse of Dimensionality

4 Previous Works: Manifold-based Models

5 Solution: Motion Manifold Flow Primitives

* Language-Guided Trajectory Generation with MMFP and Sentence-BERT
