HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

Han, Bing; Huang, Yuhua; Gao, Pan

计算机科学 > 计算机视觉与模式识别

arXiv:2508.14431 (cs)

[提交于 2025年8月20日 ]

标题： HyperDiff：超图引导的扩散模型用于3D人体姿态估计

标题： HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

Authors:Bing Han, Yuhua Huang, Pan Gao

摘要：单目3D人体姿态估计（HPE）在从2D到3D的提升过程中常常遇到深度模糊和遮挡等挑战。此外，传统方法在利用骨骼结构信息时可能忽略多尺度骨骼特征，这可能会对姿态估计的准确性产生负面影响。为了解决这些挑战，本文引入了一种新颖的3D姿态估计方法，HyperDiff，该方法将扩散模型与HyperGCN相结合。扩散模型有效地捕捉数据不确定性，缓解深度模糊和遮挡。同时，作为去噪器的HyperGCN采用多粒度结构，准确建模关节之间的高阶相关性。这提高了模型的去噪能力，特别是在复杂姿态的情况下。实验结果表明，HyperDiff在Human3.6M和MPI-INF-3DHP数据集上达到了最先进的性能，并能灵活适应不同的计算资源，以平衡性能和效率。

摘要： Monocular 3D human pose estimation (HPE) often encounters challenges such as depth ambiguity and occlusion during the 2D-to-3D lifting process. Additionally, traditional methods may overlook multi-scale skeleton features when utilizing skeleton structure information, which can negatively impact the accuracy of pose estimation. To address these challenges, this paper introduces a novel 3D pose estimation method, HyperDiff, which integrates diffusion models with HyperGCN. The diffusion model effectively captures data uncertainty, alleviating depth ambiguity and occlusion. Meanwhile, HyperGCN, serving as a denoiser, employs multi-granularity structures to accurately model high-order correlations between joints. This improves the model's denoising capability especially for complex poses. Experimental results demonstrate that HyperDiff achieves state-of-the-art performance on the Human3.6M and MPI-INF-3DHP datasets and can flexibly adapt to varying computational resources to balance performance and efficiency.

主题：	计算机视觉与模式识别 (cs.CV)
引用方式：	arXiv:2508.14431 [cs.CV]
	(或者 arXiv:2508.14431v1 [cs.CV] 对于此版本)
	https://doi.org/10.48550/arXiv.2508.14431

提交历史

来自： Bing Han [查看电子邮件]
[v1] 星期三， 2025 年 8 月 20 日 05:03:55 UTC (5,416 KB)

计算机科学 > 计算机视觉与模式识别

标题： HyperDiff：超图引导的扩散模型用于3D人体姿态估计

标题： HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 计算机视觉与模式识别

标题： HyperDiff：超图引导的扩散模型用于3D人体姿态估计 显示英文标题

标题： HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题： HyperDiff：超图引导的扩散模型用于3D人体姿态估计