SMPLR: Deep SMPL reverse for 3D human pose and shape recovery
AI-generated Key Points
- Current state-of-the-art 3D human pose and shape recovery relies on deep neural networks and statistical morphable body models such as SMPL (Skinned Multi-Person Linear model)
- SMPLR embeds SMPL within a deep model to estimate 3D pose and shape from a single RGB image, addressing the unconstrained nature of SMPL, which can generate unrealistic body meshes
- CNN-based 3D joint predictions serve as an intermediate representation from which SMPL pose and shape parameters are regressed; the module acts as an autoencoder whose encoder is a deep neural network and whose decoder is SMPL
- The encoder-decoder design avoids the complex constraints on pose and shape that direct regression of SMPL parameters often requires
- A denoising autoencoder (DAE) placed between the CNN and SMPLR lifts 2D joints to 3D without pairing 3D annotations with RGB images, which suits in-the-wild datasets that lack accurate 3D annotations, and partially recovers from structured error
- Experiments on the SURREAL and Human3.6M datasets show improvements over SMPL-based state-of-the-art alternatives of about 4 and 25 millimeters, respectively
- Training is staged: all networks are first trained independently and then fine-tuned together end to end; an ablation study analyzes the effects of different module combinations
- The UP-3D dataset comprises labeled images from various sources fitted with a gender-neutral SMPL model, while the SURREAL dataset consists of synthetic images generated with realistic poses under diverse conditions
- Overall, the results indicate that SMPLR improves the accuracy of 3D human pose and shape recovery over existing SMPL-based techniques
Authors: Meysam Madadi, Hugo Bertiche, Sergio Escalera
Abstract: The current state of the art in 3D human pose and shape recovery relies on deep neural networks and statistical morphable body models, such as the Skinned Multi-Person Linear model (SMPL). However, despite the advantages of having both body pose and shape, SMPL-based solutions have had difficulty predicting 3D bodies accurately. This is mainly due to the unconstrained nature of SMPL, which may generate unrealistic body meshes. Because of this, regression of SMPL parameters is a difficult task, often addressed with complex regularization terms. In this paper we propose to embed SMPL within a deep model to accurately estimate 3D pose and shape from a still RGB image. We use CNN-based 3D joint predictions as an intermediate representation to regress SMPL pose and shape parameters. Later, 3D joints are reconstructed again in the SMPL output. This module can be seen as an autoencoder where the encoder is a deep neural network and the decoder is the SMPL model. We refer to this as SMPL reverse (SMPLR). By implementing SMPLR as an encoder-decoder we avoid the need for complex constraints on pose and shape. Furthermore, given that in-the-wild datasets usually lack accurate 3D annotations, it is desirable to lift 2D joints to 3D without pairing 3D annotations with RGB images. Therefore, we also propose a denoising autoencoder (DAE) module between the CNN and SMPLR, able to lift 2D joints to 3D and partially recover from structured error. We evaluate our method on the SURREAL and Human3.6M datasets, showing improvement over SMPL-based state-of-the-art alternatives by about 4 and 25 millimeters, respectively.
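The autoencoder view described in the abstract can be illustrated with a deliberately tiny sketch. Everything in it is an assumption made for illustration and is not the authors' implementation: the fixed matrix DECODER is a hypothetical linear stand-in for the (nonlinear, much larger) SMPL model, the encoder is a single linear layer instead of a deep network, and the only training signal is joint reconstruction error. What it shows is the idea that a learned encoder can recover body parameters when a fixed, non-learned decoder maps those parameters back to joints.

```python
# Toy illustration of "learned encoder + fixed decoder" (hypothetical setup).
# DECODER stands in for SMPL: it maps 2 parameters to 3 joint coordinates
# and is never updated during training.
DECODER = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]


def decode(params):
    """Fixed decoder: parameters -> joints."""
    return [sum(w * p for w, p in zip(row, params)) for row in DECODER]


def encode(weights, joints):
    """Learned linear encoder: joints -> parameters."""
    return [sum(w * j for w, j in zip(row, joints)) for row in weights]


def reconstruction_loss(weights, batch):
    """Mean squared joint reconstruction error over a batch."""
    total = 0.0
    for joints in batch:
        recon = decode(encode(weights, joints))
        total += sum((r - j) ** 2 for r, j in zip(recon, joints))
    return total / len(batch)


def train_encoder(batch, steps=500, lr=0.05):
    """Full-batch gradient descent on the encoder weights only."""
    weights = [[0.0] * 3 for _ in range(2)]
    for _ in range(steps):
        grad = [[0.0] * 3 for _ in range(2)]
        for joints in batch:
            recon = decode(encode(weights, joints))
            residual = [r - j for r, j in zip(recon, joints)]
            # Backpropagate the residual through the fixed decoder.
            back = [sum(DECODER[k][p] * residual[k] for k in range(3))
                    for p in range(2)]
            for p in range(2):
                for k in range(3):
                    grad[p][k] += 2.0 * back[p] * joints[k] / len(batch)
        for p in range(2):
            for k in range(3):
                weights[p][k] -= lr * grad[p][k]
    return weights


# Joints generated from known parameters, so recovery can be checked.
true_params = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
joint_batch = [decode(p) for p in true_params]

untrained = [[0.0] * 3 for _ in range(2)]
loss_before = reconstruction_loss(untrained, joint_batch)
encoder = train_encoder(joint_batch)
loss_after = reconstruction_loss(encoder, joint_batch)
recovered = encode(encoder, joint_batch[0])
```

In this toy setting the reconstruction loss falls from its initial value to nearly zero, and `recovered` approaches the parameters that generated the first sample. Because the decoder only produces joints that the body model can represent, no extra regularization term on the parameters is needed here, which mirrors the motivation given in the abstract.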