a richly grounded dataset inspired by child language acquisition
Spatial parameters are recorded every frame for the head, hands, and each object in the scene.
|pos (xyz)||absolute cartesian position of object center|
|rot (xyzw)||absolute quaternion rotation of object|
|vel (xyz)||absolute velocity of object center|
|relPos (xyz)||position relative to head|
|relRot (xyzw)||rotation relative to head|
|relVel (xyz)||velocity from frame of reference of the head|
|bound (xyz)||distance from object center to edge of bounding box|
|inView (bool)||whether object is in the participant's field of view|
example of y-position data (height) when picking up an apple:
Images at each timestep are available for download, or can be viewed as videos below.