Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving 
Scenes

Wu, Yu-Huan; Zhang, Da; Zhang, Le; Zhan, Xin; Dai, Dengxin; Liu, Yun; Cheng, Ming-Ming

Local TagsRelease HistoryDetailsSummary

Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving Scenes

Wu, Y.-H., Zhang, D., Zhang, L., Zhan, X., Dai, D., Liu, Y., et al. (2022). Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving Scenes. Retrieved from https://arxiv.org/abs/2208.08621.

Item is Released

show all

Basic

hide

Item Permalink: https://hdl.handle.net/21.11116/0000-000C-1BA0-1 Version Permalink: https://hdl.handle.net/21.11116/0000-000C-1BA1-0

Genre: Paper

Files

hide Files

:

arXiv:2208.08621.pdf (Preprint), 761KB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-000C-1BA2-F

Name:
arXiv:2208.08621.pdf

Description:
File downloaded from arXiv at 2023-01-02 15:07

OA-Status:
Not specified

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://creativecommons.org/licenses/by-nc-sa/4.0/

Locators

show

Creators

hide

Creators:
Wu, Yu-Huan¹, Author
Zhang, Da¹, Author
Zhang, Le¹, Author
Zhan, Xin¹, Author
Dai, Dengxin², Author
Liu, Yun¹, Author
Cheng, Ming-Ming¹, Author

Affiliations:
1External Organizations, ou_persistent22
2Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society, ou_1116547

Content

hide

Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV,Computer Science, Artificial Intelligence, cs.AI

Abstract: Current efficient LiDAR-based detection frameworks are lacking in exploiting
object relations, which naturally present in both spatial and temporal manners.
To this end, we introduce a simple, efficient, and effective two-stage
detector, termed as Ret3D. At the core of Ret3D is the utilization of novel
intra-frame and inter-frame relation modules to capture the spatial and
temporal relations accordingly. More Specifically, intra-frame relation module
(IntraRM) encapsulates the intra-frame objects into a sparse graph and thus
allows us to refine the object features through efficient message passing. On
the other hand, inter-frame relation module (InterRM) densely connects each
object in its corresponding tracked sequences dynamically, and leverages such
temporal information to further enhance its representations efficiently through
a lightweight transformer network. We instantiate our novel designs of IntraRM
and InterRM with general center-based or anchor-based detectors and evaluate
them on Waymo Open Dataset (WOD). With negligible extra overhead, Ret3D
achieves the state-of-the-art performance, being 5.5% and 3.2% higher than the
recent competitor in terms of the LEVEL 1 and LEVEL 2 mAPH metrics on vehicle
detection, respectively.

Details

hide

Language(s): eng - English

Dates: Created: 2022-08-17Published Online: 2022

Publication Status: Published online

Pages: 9 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2208.08621
BibTex Citekey: Wu2208.08621
URI: https://arxiv.org/abs/2208.08621

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show