Extracting 3D human pose and body shape details from a single monocular image is a significant challenge in computer vision. Traditional methods use RGB images, constrained by varying lighting and ...