Spine-Aware Multi-Person 2D and 3D Pose Estimation


Open-source inference library for non-commercial and research use on Linux, Windows and macOS.

Note: SpinePose v2.1.0 is now available! This includes 3D pose estimation capabilities from a single camera, along with improved accuracy and robustness. See the release notes for more details.

SpinePose is an inference library for spine-aware 2D human pose estimation in the wild. It provides a simple CLI and Python API for running inference on images and videos. Our models predict the SpineTrack skeleton hierarchy comprising 37 keypoints, including 9 directly along the spine chain in addition to the standard body joints.

For installation instructions and complete API documentation, see the SpinePose GitHub repository.


SpinePose can robustly estimate spine-aware full-body 2D poses for multiple people in unconstrained images and videos.

Coronal view of two people

In addition to body and feet keypoints, we track 9 points along the spine chain, which roughly coincide with the centroids of S1, L1, L3, L5, T8, T3, C7, C4, and C1 vertebral bodies1.

Sagittal view of a person doing seated cable rows in a gym

Spine-aware pose estimation provides additional keypoints along the spine chain, enabling better analysis of torso and spine motion. This is particularly beneficial for applications in fitness, sports, and ergonomics.

[1] Inferred spine keypoints represent a learned mapping from whole-body visual cues to a spine-aware skeleton representation, instead of directly corresponding to unobserved anatomical landmarks.
One person from front-side view going from standing to sitting on the floor positions

Our models are specifically designed to work in unconstrained settings, including challenging poses and occlusions. They can be used for real-time applications on consumer-grade hardware.

Warning: The default SpinePose models are not intended for clinical use. Use with caution and do not rely on them for critical decisions. For medical applications, please consult with healthcare professionals and use clinically validated tools.
Skating demo (dynamic, spine-only, 2D+3D)

You can either track the whole body or only spine keypoints, with robust performance during dynamic movements.

Single-person dancing demo (front-view, 2D + 3D)

Our monocular 3D pose estimation model can predict 3D spine-aware poses in either camera coordinates or automatically estimated metric world coordinates, without requiring any additional sensors or multi-view setups.

Boxing Demo

SpinePose is also robust to lighting changes and challenging scenarios, and works well in low-light conditions and high motion blur, making it suitable for a wide range of real-world applications.

Model Zoo

2D Pose Estimation

Method Training Data SpineTrack SIMSPINE Usage
APB ARB APS ARS AUC
spinepose_v2_smallSpineTrack
+ SIMSPINE
0.7880.8150.9200.9290.790--mode small --model-version v2
spinepose_v2_medium0.8210.8460.9280.9370.798--mode medium --model-version v2
spinepose_v2_large0.8400.8620.9170.9270.803--mode large --model-version v2
spinepose_v1_smallSpineTrack0.7920.8210.8960.9080.611--mode small --model-version v1
spinepose_v1_medium0.8400.8640.9140.9260.633--mode medium --model-version v1
spinepose_v1_large0.8540.8770.9100.9220.633--mode large --model-version v1
spinepose_v1_xlarge0.7590.8010.8930.910---mode xlarge --model-version v1

Citation

If you use SpinePose in your research, please consider citing the following papers:

Preferred citations can be found on the respective paper pages.

Maintained by saifkhichi96 on GitHub.

The website is distributed under different open-source licenses. For more details, see the notice at the bottom of the page.