Pose estimation

zhec
https://github.com/ZheC/Realtime_Multi-Person_Pose_Estimation OpenPose is freely available for free non-commercial use, and may be redistributed under these conditions. Please, see the license for further details. Interested in a commercial license? Check this FlintBox link. For commercial queries, use the Directly Contact Organization section from the FlintBox link and also send a copy of that message to Yaser Sheikh.

This is why https://aws.amazon.com/rekognition/ is expensive, they have to pay lots of money for using open source proprietary code. In South africa you do whatever you want, if you have assets, then setup a Fronting company: your BEE empowerment partner can't go to jail if he loses the court case.

https://github.com/CMU-Perceptual-Computing-Lab/openpose used by priya dwivedi

https://arxiv.org/abs/1611.08050

densepose
http://densepose.org/

https://www.youtube.com/watch?v=EMjPqgLX14A Can machine vision map humans from videos to 3D Models? Yes! DensePose is a new architecture by the team at Facebook AI research that does just that. It uses a convolutional network with some special features like region of interest pooling and cascading to make this happen. It was also trained on a newly created labeled dataset that mapped human poses to 3D models. The team open sourced the dataset but not the code, but using the details in the paper we can recreate their results. I'll explain how it works in this video. https://github.com/llSourcell/3D_Pose_Estimation

https://github.com/n1ckfg/OpenPoseRig

https://github.com/facebookresearch/VideoPose3D

https://github.com/facebookresearch/pythia

akanazawa
https://github.com/akanazawa/hmr End-to-end Recovery of Human Shape and Pose. Indicates for example if a person is carefully sneaking around a corner, it can even flag the same person by his unique gait. Human Mesh Recovery (HMR): End-to-end adversarial learning of human pose and shape. We present a real time framework for recovering the 3D joint angles and shape of the body from a single RGB image. Bottom row shows results from a model trained without using any coupled 2D-to-3D supervision. We infer the full 3D body even in case of occlusions and truncations. Note that we capture head and limb orientations.

We present Human Mesh Recovery (HMR), an end-to-end framework for reconstructing a full 3D mesh of a human body from a single RGB image. In contrast to most current methods that compute 2D or 3D joint locations, we produce a richer and more useful mesh representation that is parameterized by shape and 3D joint angles. The main objective is to minimize the reprojection loss of keypoints, which allow our model to be trained using images in-the-wild that only have ground truth 2D annotations. However, reprojection loss alone is highly under constrained. In this work we address this problem by introducing an adversary trained to tell whether a human body parameter is real or not using a large database of 3D human meshes. We show that HMR can be trained with and without using any paired 2D-to-3D supervision. We do not rely on intermediate 2D keypoint detection and infer 3D pose and shape parameters directly from image pixels. Our model runs in real-time given a bounding box containing the person. We demonstrate our approach on various images in-the-wild and out-perform previous optimization-based methods that output 3D meshes and show competitive results on tasks such as 3D joint location estimation and part segmentation. from Neural papers with code.

nvidia labs
https://github.com/NVlabs/Deep_Object_Pose This is the official DOPE ROS package for detection and 6-DoF pose estimation of known objects from an RGB camera. The network has been trained on the following YCB objects: cracker box, sugar box, tomato soup can, mustard bottle, potted meat can, and gelatin box. For more details, see our CoRL 2018 paper and video.
 * https://www.youtube.com/watch?v=yVGViBqWtBI&feature=youtu.be For the first time, an algorithm trained only on synthetic data is able to beat a state-of-the-art network trained on real images for object pose estimation on several objects of a standard benchmark. Learn more here: https://nvda.ws/2CvO2Jy

https://arxiv.org/abs/1809.10790 Using synthetic data for training deep neural networks for robotic manipulation holds the promise of an almost unlimited amount of pre-labeled training data, generated safely out of harm's way. One of the key challenges of synthetic data, to date, has been to bridge the so-called reality gap, so that networks trained on synthetic data operate correctly when exposed to real-world data. We explore the reality gap in the context of 6-DoF pose estimation of known objects from a single RGB image. We show that for this problem the reality gap can be successfully spanned by a simple combination of domain randomized and photorealistic data. Using synthetic data generated in this manner, we introduce a one-shot deep neural network that is able to perform competitively against a state-of-the-art network trained on a combination of real and synthetic data. To our knowledge, this is the first deep network trained only on synthetic data that is able to achieve state-of-the-art performance on 6-DoF object pose estimation. Our network also generalizes better to novel environments including extreme lighting conditions, for which we show qualitative results. Using this network we demonstrate a real-time system estimating object poses with sufficient accuracy for real-world semantic grasping of known household objects in clutter by a real robot.

timctho
https://github.com/timctho/VNect-tensorflow

http://gvv.mpi-inf.mpg.de/projects/VNect/

https://github.com/timctho/convolutional-pose-machines-tensorflow

dwivedi
Priya dwivedi pose estimation allows 25 key points on a person, indicating normal and anomalous behavior.

links
Priya dwivedi

Semantic segmentation