TY - GEN
T1 - Object Detection with a Unified Label Space from Multiple Datasets
AU - Zhao, Xiangyun
AU - Schulter, Samuel
AU - Sharma, Gaurav
AU - Tsai, Yi Hsuan
AU - Chandraker, Manmohan
AU - Wu, Ying
N1 - Funding Information:
Acknowledgements. This work was supported in part by National Science Foundation grant IIS-1619078, IIS-1815561.
Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - Given multiple datasets with different label spaces, the goal of this work is to train a single object detector predicting over the union of all the label spaces. The practical benefits of such an object detector are obvious and significant—application-relevant categories can be picked and merged form arbitrary existing datasets. However, naïve merging of datasets is not possible in this case, due to inconsistent object annotations. Consider an object category like faces that is annotated in one dataset, but is not annotated in another dataset, although the object itself appears in the latter’s images. Some categories, like face here, would thus be considered foreground in one dataset, but background in another. To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo labeling approach that we adapt for our specific case. We propose loss functions that carefully integrate partial but correct annotations with complementary but noisy pseudo labels. Evaluation in the proposed novel setting requires full annotation on the test set. We collect the required annotations (Project page: http://www.nec-labs.com/~mas/UniDet This work was part of Xiangyun Zhao’s internship at NEC Labs America.) and define a new challenging experimental setup for this task based on existing public datasets. We show improved performances compared to competitive baselines and appropriate adaptations of existing work.
AB - Given multiple datasets with different label spaces, the goal of this work is to train a single object detector predicting over the union of all the label spaces. The practical benefits of such an object detector are obvious and significant—application-relevant categories can be picked and merged form arbitrary existing datasets. However, naïve merging of datasets is not possible in this case, due to inconsistent object annotations. Consider an object category like faces that is annotated in one dataset, but is not annotated in another dataset, although the object itself appears in the latter’s images. Some categories, like face here, would thus be considered foreground in one dataset, but background in another. To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo labeling approach that we adapt for our specific case. We propose loss functions that carefully integrate partial but correct annotations with complementary but noisy pseudo labels. Evaluation in the proposed novel setting requires full annotation on the test set. We collect the required annotations (Project page: http://www.nec-labs.com/~mas/UniDet This work was part of Xiangyun Zhao’s internship at NEC Labs America.) and define a new challenging experimental setup for this task based on existing public datasets. We show improved performances compared to competitive baselines and appropriate adaptations of existing work.
UR - http://www.scopus.com/inward/record.url?scp=85097055151&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85097055151&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-58568-6_11
DO - 10.1007/978-3-030-58568-6_11
M3 - Conference contribution
AN - SCOPUS:85097055151
SN - 9783030585679
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 178
EP - 193
BT - Computer Vision – ECCV 2020 - 16th European Conference, 2020, Proceedings
A2 - Vedaldi, Andrea
A2 - Bischof, Horst
A2 - Brox, Thomas
A2 - Frahm, Jan-Michael
PB - Springer Science and Business Media Deutschland GmbH
T2 - 16th European Conference on Computer Vision, ECCV 2020
Y2 - 23 August 2020 through 28 August 2020
ER -