MotionFix: Text-Driven 3D Human Motion Editing

AI-generated keywords: 3D motion editing

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors focus on 3D motion editing based on textual descriptions
  • Challenges addressed include scarcity of training data and accurate editing of source motion
  • Methodology introduced for collecting dataset consisting of triplets: source motion, target motion, and edit text
  • Conditional diffusion model named TMED trained on MotionFix dataset shows superior performance over baseline models
  • New retrieval-based metrics introduced for evaluating motion editing
  • Code and models to be made publicly available for future research in fine-grained motion generation
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Nikos Athanasiou, Alpár Ceske, Markos Diomataris, Michael J. Black, Gül Varol

arXiv v1

Abstract: The focus of this paper is 3D motion editing. Given a 3D human motion and a textual description of the desired modification, our goal is to generate an edited motion as described by the text. The challenges include the lack of training data and the design of a model that faithfully edits the source motion. In this paper, we address both these challenges. We build a methodology to semi-automatically collect a dataset of triplets in the form of (i) a source motion, (ii) a target motion, and (iii) an edit text, and create the new MotionFix dataset. Having access to such data allows us to train a conditional diffusion model, TMED, that takes both the source motion and the edit text as input. We further build various baselines trained only on text-motion pairs datasets, and show superior performance of our model trained on triplets. We introduce new retrieval-based metrics for motion editing and establish a new benchmark on the evaluation set of MotionFix. Our results are encouraging, paving the way for further research on finegrained motion generation. Code and models will be made publicly available.

Submitted to arXiv on 01 Aug. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2408.00712v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

, , , , In their paper "MotionFix: Text-Driven 3D Human Motion Editing," authors Nikos Athanasiou, Alpár Ceske, Markos Diomataris, Michael J. Black, and Gül Varol focus on the challenging task of 3D motion editing. Their goal is to generate an edited motion based on a textual description of the desired modification provided alongside a 3D human motion. The primary challenges they address include the scarcity of training data and the development of a model capable of accurately editing the source motion in accordance with the text description. To tackle these challenges, the authors introduce a methodology for semi-automatically collecting a dataset consisting of triplets: (i) a source motion, (ii) a target motion, and (iii) an edit text. This dataset creation process results in the establishment of the MotionFix dataset. By leveraging this curated dataset, they train a conditional diffusion model named TMED that takes both the source motion and edit text as input. In their study, various baseline models trained solely on text-motion pairs datasets are compared with their proposed model trained on triplets. The results demonstrate superior performance of their TMED model over these baselines. Additionally, new retrieval-based metrics for evaluating motion editing are introduced, leading to the establishment of a new benchmark using the evaluation set from MotionFix. The promising outcomes presented in this paper pave the way for further advancements in fine-grained motion generation research. The authors plan to make their code and models publicly available for future exploration and utilization by other researchers in this field.
Created on 26 Mar. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.