Ear keypoints#
Overview#
The ground truth exposed in this modality is a set of ear landmarks that conforms to the standard created by the Intelligent Behavior Understanding Group (iBUG).
This modality consists of the following files:
Relevant file | Location |
---|---|
ears_key_points.json | key_points folder |
all_key_points.json | key_points folder |
This modality is not available in the Humans in Context platform. However, the basic location of the ears can be found as part of the Body keypoints modality.
ears_key_points.json#
This file contains the locations of 55 landmarks for each ear. These keypoints conform to the iBUG ear landmark standard.

The locations of the 55 ear landmarks defined by iBUG. Source: https://ibug.doc.ic.ac.uk/resources/ibug-ears/#
The file has the following format:
{
  "version": "2.0.0",
  "ears": {
    "left": {
      // 55 keypoint objects
    },
    "right": {
      // 55 keypoint objects
    }
  }
}
Objects and fields:
version: String. Version tracking for this file. Whenever you access this file in a datapoint, check that the version matches the one you expect; otherwise your code may not recognize its format and fields.
left: Object. This object contains 55 objects, each identifying a specific keypoint in the left ear. The keypoint objects are named “0” through “54”, and each one has the same format:
"7": { "global_3d": { "x": 0.0649380311369896, "y": 0.04291728511452675, "z": 0.11093330383300781 }, "pixel_2d": { "x": 91.42204655475659, "y": 31.024237959018187 }, "is_visible": "true" },
global_3d: Object. Contains a set of three Floats giving the location of the keypoint in global coordinates: “x”, “y”, and “z”. See About our coordinate systems for details.
pixel_2d: Object. Contains a set of two Floats giving the x and y coordinates of the keypoint in the rendered image itself. This provides not only which pixel contains the keypoint but also precisely where the keypoint is inside the pixel. See About our coordinate systems for details.
is_visible: Boolean. Indicates whether the keypoint is visible in the images produced by this camera. The value of this field is false if the keypoint is outside the frame or is on the other side of the body from the camera. Otherwise the value is true, even if there is another object between the keypoint and the camera.
right: Object. This object has the same structure as “left” above, except it identifies the 55 keypoints in the right ear.
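As an illustration of the fields described above, here is a minimal Python sketch that checks the version and flattens each ear into an ordered list of keypoints. The helper names (`parse_ear_keypoints`, `load_ear_keypoints`, `_as_bool`) are hypothetical and not part of any provided tooling. The sketch also assumes that is_visible may arrive either as a JSON boolean or as the string "true" shown in the sample keypoint, and accepts both.

```python
import json

# Version string taken from the format example above; adjust to the version you expect.
EXPECTED_VERSION = "2.0.0"


def _as_bool(value):
    """Accept a JSON boolean or the string form ("true") shown in the sample keypoint."""
    if isinstance(value, bool):
        return value
    return str(value).strip().lower() == "true"


def parse_ear_keypoints(doc, expected_version=EXPECTED_VERSION):
    """Hypothetical helper: validate the version and flatten each ear into a list."""
    version = doc.get("version")
    if version != expected_version:
        raise ValueError(f"unexpected version {version!r}, expected {expected_version!r}")

    ears = {}
    for side in ("left", "right"):
        side_obj = doc["ears"][side]
        points = []
        # Keypoint objects are named "0" through "54"; sort them numerically.
        for name in sorted(side_obj, key=int):
            kp = side_obj[name]
            g, p = kp["global_3d"], kp["pixel_2d"]
            points.append({
                "index": int(name),
                "global_3d": (g["x"], g["y"], g["z"]),
                "pixel_2d": (p["x"], p["y"]),
                "is_visible": _as_bool(kp["is_visible"]),
            })
        ears[side] = points
    return ears


def load_ear_keypoints(path, expected_version=EXPECTED_VERSION):
    """Read ears_key_points.json from disk and parse it."""
    with open(path, encoding="utf-8") as f:
        return parse_ear_keypoints(json.load(f), expected_version)
```

Raising an error on a version mismatch is one possible design choice; a pipeline that must tolerate several versions could dispatch to a different parser instead.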
Using this ground truth, you can train your model to recognize the shape and location of the human ear.
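One common way to turn landmarks into a localization target is a 2D bounding box. The sketch below is an assumption about how you might do this, not a prescribed workflow: the hypothetical function `ear_bounding_box` takes one ear object ("left" or "right") as loaded from the JSON and returns the tight box around the keypoints whose is_visible value is true. Keep in mind that is_visible remains true for keypoints hidden behind another object, so the box can cover occluded regions.

```python
def ear_bounding_box(side_obj):
    """Return (x_min, y_min, x_max, y_max) in pixel coordinates, or None.

    side_obj is the raw "left" or "right" object: a dict mapping keypoint
    names ("0" through "54") to keypoint objects.
    """
    xs, ys = [], []
    for kp in side_obj.values():
        visible = kp["is_visible"]
        # Assumption: tolerate both a JSON boolean and the string "true".
        if not isinstance(visible, bool):
            visible = str(visible).strip().lower() == "true"
        if visible:
            xs.append(kp["pixel_2d"]["x"])
            ys.append(kp["pixel_2d"]["y"])
    if not xs:
        # No keypoint of this ear is marked visible for this camera.
        return None
    return (min(xs), min(ys), max(xs), max(ys))
```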
This file is not provided in the Humans in Context platform.
all_key_points.json#
For convenience, this file includes the same set of keypoints as ears_key_points.json above, located at this position in the hierarchy:
{
  "ear": {
    "left": {
      // 55 keypoint objects
    },
    "right": {
      // 55 keypoint objects
    }
  }
}
For details on the structure of the keypoint objects, see ears_key_points.json above.
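A short sketch of reading the ear entry from an already loaded all_key_points.json document follows. Note that the key shown in the hierarchy above is "ear" (singular), unlike "ears" in ears_key_points.json. The function name `ears_from_all_key_points` is hypothetical, and the completeness check is an optional safeguard based on the keypoint names "0" through "54".

```python
# Keypoint names expected in each ear object: "0" through "54".
EXPECTED_NAMES = {str(i) for i in range(55)}


def ears_from_all_key_points(doc):
    """Return the "ear" object and verify both sides hold all 55 keypoints."""
    ear = doc["ear"]
    for side in ("left", "right"):
        missing = EXPECTED_NAMES - set(ear[side])
        if missing:
            # Sort numerically so the error message is easy to read.
            names = sorted(missing, key=int)
            raise ValueError(f"{side} ear is missing keypoints: {names}")
    return ear
```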
On this platform, the data in this file is not relevant to this modality.