KITTI Dataformat

Heaea 2023. 2. 13. 10:35

KITTI 데이터 셋은 비전 태스크에 사용되는 데이터 셋이다.

전체 이미지 셋에 대하여 하나의 json 파일로 레이블을 표시하는 coco data set과 달리

이미지 데이터 한 장 당 레이블 txt 파일을 가진다. yolo나 자율 주행 등에서 활용된다.

KITTI 형식의 데이터 셋은 위와 같이 생겼다.

GitHub - bostondiditeam/kitti: Boston Team for the Udacity/Didi Challenge

Boston Team for the Udacity/Didi Challenge. Contribute to bostondiditeam/kitti development by creating an account on GitHub.

github.com

위의 레포에서 kitti 레이블 포맷에 대해서 자세히 설명해주고 있다.

#Values    Name      Description
----------------------------------------------------------------------------
   1    type         Describes the type of object: 'Car', 'Van', 'Truck',
                     'Pedestrian', 'Person_sitting', 'Cyclist', 'Tram',
                     'Misc' or 'DontCare'
   1    truncated    Float from 0 (non-truncated) to 1 (truncated), where
                     truncated refers to the object leaving image boundaries
   1    occluded     Integer (0,1,2,3) indicating occlusion state:
                     0 = fully visible, 1 = partly occluded
                     2 = largely occluded, 3 = unknown
   1    alpha        Observation angle of object, ranging [-pi..pi]
   4    bbox         2D bounding box of object in the image (0-based index):
                     contains left, top, right, bottom pixel coordinates
   3    dimensions   3D object dimensions: height, width, length (in meters)
   3    location     3D object location x,y,z in camera coordinates (in meters)
   1    rotation_y   Rotation ry around Y-axis in camera coordinates [-pi..pi]
   1    score        Only for results: Float, indicating confidence in
                     detection, needed for p/r curves, higher is better.

이 설명에 따라 위의 데이터 셋의 구조를 보면,

객체 하나당 한 줄로 나타나있고, type, truncated가 차례로 나와있는 것을 해석할 수 있다.

위의 예시 사진에 나와있는 데이터 셋에는 객체의 bbox 좌표만 나타나 있다.

그리고 해당 데이터는 학습데이터 셋이므로 score에 대한 값은 빠져있음을 알 수 있다.

https://github.com/eweill/convert-datasets/blob/master/datasets/kitti.py

GitHub - eweill/convert-datasets: Convert object detection datasets to different formats

Convert object detection datasets to different formats - GitHub - eweill/convert-datasets: Convert object detection datasets to different formats

github.com