Download
Please send an e-mail to yanmnn@stu.xmu.edu.cn, including contact details (title, full name, organization, and country) and the purpose for downloading the dataset. By sending the e-mail you accept the following License.
Data structure
# the data structure for XMU sequence
├── root_folder
├── sequence_img/ # Downsample video to 20fps and align to point clouds
├── sequence_pos.csv
├── sequence_rot.csv # Original IMU Data
├── sequence.bvh # Original Human Motion 100fps
├── Sparse_scene_mesh.ply
├── Sparse_scene_points.pcd
├── LiDAR2Cam_e.txt # Extrinsic of LiDAR to Camera
├── lidar_p.txt # Extrinsic of LiDAR to Mocap
├── Dense_scene_mesh.ply
├── Dense_scene_points.pcd
├── sequence.pkl
| | └── 'beta' # Shape of subject
| | └── 'gt_pose_3D'
| | └── 'gt_pose_2D'
| | └── 'gt_trans'
| | └── 'human_point_clouds'
| | └── 'IMU_pose'
| | └── 'IMU_trans'
| | └── 'frame_num'
├── others...
# the data structure for ChangSha sequence
├── root_folder
├── sequence_pos.csv
├── sequence_rot.csv # Original IMU Data
├── sequence.bvh # Original Human Motion 100fps
├── Sparse_scene_points.pcd
├── sequence.json # Sequence Human Info
├── sequence_label.h5py
| | └── 'beta' # Shape of subject
| | └── 'gt_pose_3D'
| | └── 'gt_pose_2D'
| | └── 'gt_trans'
| | └── 'human_point_clouds'
| | └── 'IMU_pose'
| | └── 'IMU_trans'
| | └── 'frame_num'
├── others...
Tips:
- We mentioned in the paper that in order to unify the coordinate system conversion problem, we first converted the mocap coordinate system and LiDAR coordinate system to the world coordinate system.
In order to facilitate the calibration of the initial image and initial radar, we register the point cloud scenes in the RGB and LiDAR coordinate systems.
So you also need to transfer the human body in the world coordinate system to the LiDAR coordinate system, and then to the camera coordinate system.
The RT matrix from the LiDAR coordinate system to the world coordinate system is in the lidar_p.txt file, and the extrinsic parameter matrix and camera intrinsic parameter matrix from the LiDAR coordinate system to the camera coordinate system are in LiDAR2Cam_e.txt. - The x, y, z of trans are global trajectories.
*tips: The unit of T in the external parameter matrix in lidar_p.txt is mm. You need to divide T by 1000 to convert it to m. - bbox is the center point coordinates of bbox + w and h.
Citation
@inproceedings{yan2023cimi4d,
title={CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions},
author={Yan, Ming and Wang, Xin and Dai, Yudi and Shen, Siqi and Wen, Chenglu and Xu, Lan and Ma, Yuexin and Wang, Cheng},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
pages={12977--12988},
month={June},
year={2023}
}
Commercial licensing
Please email us:
siqishen@xmu.edu.cn
clwen@xmu.edu.cn
cwang@xmu.edu.cn