Ear keypoints#


The ground truth exposed in this modality is a set of ear landmarks that conforms to the standard created by the Intelligent Behavior Understanding Group (iBUG).

This modality consists of the following file:

Relevant file



key_points folder


This file contains the locations of each of 55 landmarks for each ear. These keypoints conform to the iBUG ear landmark standard.


The locations of the 55 facial landmarks developed by iBUG. Source: https://ibug.doc.ic.ac.uk/resources/ibug-ears/#

The file uses the following format:

    "version": "2.0.0",
    "ears": {
    "left": {
         // 55 keypoint objects
    "right": {
         // 54 keypoint objects

Objects and fields:#

  • version: String. Version tracking for this file. Whenever you access this file in a datapoint, make sure to check that the version matches what you expect it to be; otherwise its format and fields may not be recognized.

  • left: Object. This object contains 55 objects, each identifying a specific keypoint in the left ear. The keypoint objects are named “0” through “54”, and each one has the same format:

    "7": {
       "global_3d": {
          "x": 0.0649380311369896,
          "y": 0.04291728511452675,
          "z": 0.11093330383300781
       "pixel_2d": {
          "x": 547,
          "y": 264
       "is_visible": "true"
    • global_3d: Object. Contains a set of three Floats giving the location of the keypoint in global coordinates: “x”, “y”, and “z”. See About our coordinate systems for details.

    • pixel_2d: Object. Contains a set of two Ints giving the x and y coordinates of the keypoint in the images produced by this camera. See About our coordinate systems for details.

    • is_visible: Boolean. Indicates whether the keypoint is visible in the images produced by this camera. The value of this field is false if the keypoint is outside the frame, is on the wrong side of the face, or is blocked from the camera’s view by another object. Otherwise the value is true.

  • right: Object. This object has the same structure as “left” above, except it identifies the 55 keypoints in the right ear.