PennSyn2Real: Training Object Recognition Models without Human Labeling

Nguyen, Ty; Miller, Ian D.; Cohen, Avi; Thakur, Dinesh; Prasad, Shashank; Taylor, Camillo J.; Chaudrahi, Pratik; Kumar, Vijay

Computer Science > Computer Vision and Pattern Recognition

arXiv:2009.10292 (cs)

[Submitted on 22 Sep 2020 (v1), last revised 16 Oct 2020 (this version, v2)]

Title:PennSyn2Real: Training Object Recognition Models without Human Labeling

Authors:Ty Nguyen, Ian D. Miller, Avi Cohen, Dinesh Thakur, Shashank Prasad, Camillo J. Taylor, Pratik Chaudrahi, Vijay Kumar

View PDF

Abstract:Scalable training data generation is a critical problem in deep learning. We propose PennSyn2Real - a photo-realistic synthetic dataset consisting of more than 100,000 4K images of more than 20 types of micro aerial vehicles (MAVs). The dataset can be used to generate arbitrary numbers of training images for high-level computer vision tasks such as MAV detection and classification. Our data generation framework bootstraps chroma-keying, a mature cinematography technique with a motion tracking system, providing artifact-free and curated annotated images where object orientations and lighting are controlled. This framework is easy to set up and can be applied to a broad range of objects, reducing the gap between synthetic and real-world data. We show that synthetic data generated using this framework can be directly used to train CNN models for common object recognition tasks such as detection and segmentation. We demonstrate competitive performance in comparison with training using only real images. Furthermore, bootstrap** the generated synthetic data in few-shot learning can significantly improve the overall performance, reducing the number of required training data samples to achieve the desired accuracy.

Comments:	7 pages, 9 figures, 3 tables. Submitted to R-AL and ICRA 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2009.10292 [cs.CV]
	(or arXiv:2009.10292v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2009.10292

Submission history

From: Ty Nguyen [view email]
[v1] Tue, 22 Sep 2020 02:53:40 UTC (6,849 KB)
[v2] Fri, 16 Oct 2020 04:58:40 UTC (15,746 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PennSyn2Real: Training Object Recognition Models without Human Labeling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PennSyn2Real: Training Object Recognition Models without Human Labeling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators