Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object Detection

Authors

  • Thang Doan Bosch Research North America & Bosch Center for Artificial Intelligence
  • Xin Li Bosch Research North America & Bosch Center for Artificial Intelligence
  • Sima Behpour Bosch Research North America & Bosch Center for Artificial Intelligence
  • Wenbin He Bosch Research North America & Bosch Center for Artificial Intelligence
  • Liang Gou Bosch Research North America & Bosch Center for Artificial Intelligence
  • Liu Ren Bosch Research North America & Bosch Center for Artificial Intelligence

DOI:

https://doi.org/10.1609/aaai.v38i2.27921

Keywords:

CV: Object Detection & Categorization, CV: Representation Learning for Vision, ML: Representation Learning, ML: Applications

Abstract

Open World Object Detection (OWOD) is a challenging and realistic task that extends beyond the scope of standard Object Detection task. It involves detecting both known and unknown objects while integrating learned knowledge for future tasks. However, the level of "unknownness" varies significantly depending on the context. For example, a tree is typically considered part of the background in a self-driving scene, but it may be significant in a household context. We argue that this contextual information should already be embedded within the known classes. In other words, there should be a semantic or latent structure relationship between the known and unknown items to be discovered. Motivated by this observation, we propose Hyp-OW, a method that learns and models hierarchical representation of known items through a SuperClass Regularizer. Leveraging this representation allows us to effectively detect unknown objects using a similarity distance-based relabeling module. Extensive experiments on benchmark datasets demonstrate the effectiveness of Hyp-OW, achieving improvement in both known and unknown detection (up to 6 percent). These findings are particularly pronounced in our newly designed benchmark, where a strong hierarchical structure exists between known and unknown objects.

Published

2024-03-24

How to Cite

Doan, T., Li, X., Behpour, S., He, W., Gou, L., & Ren, L. (2024). Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 1555-1563. https://doi.org/10.1609/aaai.v38i2.27921

Issue

Section

AAAI Technical Track on Computer Vision I