SMPLR: Deep SMPL reverse for 3D human pose and shape recovery

AI-generated keywords: 3D human pose

AI-generated Key Points

  • Significant advancements in 3D human pose and shape recovery using deep neural networks and statistical morphable body models like SMPL
  • Introduction of SMPLR method to address issues with SMPL-based solutions, involving embedding SMPL within a deep model for accurate 3D pose and shape estimation from single RGB images
  • Use of CNN-based 3D joint predictions as an intermediate representation similar to an autoencoder
  • Advantage of SMPLR method in eliminating complex constraints on pose and shape compared to traditional approaches
  • Introduction of denoising autoencoder component for datasets lacking accurate 3D annotations, lifting 2D joints to 3D without paired annotations
  • Significant improvements over existing methods shown in experiments on SURREAL and Human3.6M datasets with error reductions of approximately 4 and 25 millimeters respectively
  • End-to-end training approach applied by initially training all networks independently before fine-tuning collectively, with ablation study conducted to analyze effects of different module combinations
  • UP-3D dataset comprising labeled images from various sources fitted with a gender-neutral SMPL model, while SURREAL dataset consisted of synthetic images generated with realistic poses under diverse conditions
  • Promising results demonstrated by the proposed SMPLR method in improving accuracy and efficiency in 3D human pose and shape recovery tasks compared to existing techniques
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Meysam Madadi, Hugo Bertiche, Sergio Escalera

License: CC BY-NC-SA 4.0

Abstract: Current state-of-the-art in 3D human pose and shape recovery relies on deep neural networks and statistical morphable body models, such as the Skinned Multi-Person Linear model (SMPL). However, regardless of the advantages of having both body pose and shape, SMPL-based solutions have shown difficulties to predict 3D bodies accurately. This is mainly due to the unconstrained nature of SMPL, which may generate unrealistic body meshes. Because of this, regression of SMPL parameters is a difficult task, often addressed with complex regularization terms. In this paper we propose to embed SMPL within a deep model to accurately estimate 3D pose and shape from a still RGB image. We use CNN-based 3D joint predictions as an intermediate representation to regress SMPL pose and shape parameters. Later, 3D joints are reconstructed again in the SMPL output. This module can be seen as an autoencoder where the encoder is a deep neural network and the decoder is SMPL model. We refer to this as SMPL reverse (SMPLR). By implementing SMPLR as an encoder-decoder we avoid the need of complex constraints on pose and shape. Furthermore, given that in-the-wild datasets usually lack accurate 3D annotations, it is desirable to lift 2D joints to 3D without pairing 3D annotations with RGB images. Therefore, we also propose a denoising autoencoder (DAE) module between CNN and SMPLR, able to lift 2D joints to 3D and partially recover from structured error. We evaluate our method on SURREAL and Human3.6M datasets, showing improvement over SMPL-based state-of-the-art alternatives by about 4 and 25 millimeters, respectively.

Submitted to arXiv on 27 Dec. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1812.10766v2

, , , , The field of 3D human pose and shape recovery has seen significant advancements with the use of deep neural networks and statistical morphable body models like the Skinned Multi-Person Linear model (SMPL). However, these SMPL-based solutions often struggle to accurately predict 3D bodies due to their unconstrained nature, resulting in unrealistic body meshes. To address this issue, a new approach called SMPLR has been proposed in this paper. The SMPLR method involves embedding SMPL within a deep model to estimate 3D pose and shape from a single RGB image, using CNN-based 3D joint predictions as an intermediate representation. This process can be likened to an autoencoder, with the encoder being a deep neural network and the decoder being the SMPL model. One key advantage of the SMPLR method is its ability to eliminate complex constraints on pose and shape that are typically required in traditional SMPL-based approaches. Additionally, for datasets lacking accurate 3D annotations, a denoising autoencoder (DAE) component has been introduced to lift 2D joints to 3D without paired annotations. Experiments on popular datasets SURREAL and Human3.6M showed significant improvements over existing methods, with error reductions of approximately 4 and 25 millimeters respectively. During training, an end-to-end approach was applied by initially training all networks independently (SHN, DAE, Ω and Ψ) before fine-tuning the entire network collectively. An ablation study was also conducted to analyze the individual effects of different combinations of modules in training. The UP-3D dataset comprised labeled images from various sources such as LSP, LSP-extended, and MPII-HumanPose datasets after fitting a gender-neutral SMPL model into them. The SURREAL dataset consisted of synthetic images of humans generated with realistic poses under diverse conditions. Overall, the proposed SMPLR method demonstrates promising results in improving accuracy and efficiency in 3D human pose and shape recovery tasks compared to existing techniques.
Created on 30 Mar. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.