3D Full-body pose estimation playground

owen · May 10, 2023, 6:44pm

I ported a 3D pose detection demo into a Babylon Playground. The pre-trained model uses MediaPipe. The detector takes images from your webcam as input and then returns an array of landmarks with x,y,z coordinates with a confidence score. I massaged these points a bit and draw red or green boxes for each landmark to distinguish left from right. Then I use the detector’s confidence score to set the box visibility. To have your whole body fit in the video, you have to stand pretty far back.

Playground:
https://playground.babylonjs.com/#ZZUZEG

Original Source: 3D Pose Detection with MediaPipe BlazePose GHUM and TensorFlow.js — The TensorFlow Blog

Would love to see if someone can drive a rigged model using this data next.

carolhmj · May 12, 2023, 1:23pm

That’s SOOO COOL! I agree it would be super nice to see a rigged model from the pose

owen · May 12, 2023, 4:33pm

Thanks!

I’m wondering if I can get some form of pose estimation just from the WebXR head and hand locations. For example, take the data that is streaming out of the webcam → pose detector, while simultaneously recording head and hand controller positions from WebXR, then train a model that uses the controllers as input (x) and the pose detection output data as the supervised output (y).

One immediate obstacle I see is “normalizing” these two sets of data so that they fit on top of each other. The raw data are in completely different “spaces” and orientations and scale. Does anyone have any expertise in this area?

sebavan · May 12, 2023, 7:08pm

If anybody it would be @RaananW

waverider404 · May 12, 2023, 9:22pm

Here hoping when some of this pose estimation performance will get better🤞🏾…perhaps offscreen canvas + webgpu will help in the future implementations

RaananW · May 15, 2023, 10:28am

the hand / controller data is being transposed by the underlying system, based on your current space (in XR). Babylon offers the viewer space (in the XR session manager), that you can use to get a “normalized” position of those (in world coordinates). Those will not change even if the user teleports itself somewhere else. You will, however, need to move them to the right position if you want to change the position in which they are rendered.

Topic		Replies	Views
Mediapipe Face Tracking Playground Demos and projects	7	2011	June 30, 2023
Babylon.js and Mediapipe pose detection Questions	3	1008	May 10, 2023
Babylon with mediapipe Questions	1	1276	July 11, 2023
Single camera, in browser motion tracking/rendering with babylonjs Demos and projects	3	1240	February 9, 2022
Rendering models based on MediaPipe Hand tracking model data Questions ar	3	2008	May 18, 2022

3D Full-body pose estimation playground

Related topics