As expected, the YOLO4D models outperform the frame stacking models. Frame stacking encodes the temporal information only through the reshaping of inputs, while YOLO4D …

Yolo4d: A spatio-temporal approach for real-time multi-object detection and classification from lidar point clouds. Conference on Neural Information Processing Systems.
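To make the remark about frame stacking concrete, the following is a minimal sketch of what "encoding temporal information through the reshaping of inputs" can look like. It assumes, purely for illustration, that each LiDAR frame has already been projected to a grid of shape (C, H, W) and that T consecutive frames are available; the function name `stack_frames` and all sizes are hypothetical and are not taken from the paper.

```python
import numpy as np


def stack_frames(frames: np.ndarray) -> np.ndarray:
    """Fold the time axis into the channel axis: (T, C, H, W) -> (T*C, H, W).

    The detector then sees one "wide" input and has no explicit notion
    of time; temporal order survives only as channel order.
    """
    if frames.ndim != 4:
        raise ValueError("expected an array of shape (T, C, H, W)")
    t, c, h, w = frames.shape
    return frames.reshape(t * c, h, w)


# Hypothetical sizes: 4 frames, 2 channels each, on an 8x8 grid.
frames = np.arange(4 * 2 * 8 * 8, dtype=np.float32).reshape(4, 2, 8, 8)
stacked = stack_frames(frames)
print(stacked.shape)
```

In this sketch, channel `t * C + c` of the stacked input is channel `c` of frame `t`, so any temporal reasoning has to be learned implicitly from channel position rather than from a dedicated temporal component.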