In the past few years, object detection has attracted a lot of attention in the context of human–robot collaboration and Industry 5.0 due to enormous quality improvements in deep learning technologies. In many applications, object detection models have to be able to quickly adapt to a changing environment, i.e., to learn new objects. A crucial but challenging prerequisite for this is the automatic generation of new training data which currently still limits the broad application of object detection methods in industrial manufacturing. In this work, we discuss how to adapt state-of-the-art object detection methods for the task of automatic bounding box annotation in a use case where the background is homogeneous and the object’s label is provided by a human. We compare an adapted version of Faster R-CNN and the Scaled-YOLOv4-p5 architecture and show that both can be trained to distinguish unknown objects from a complex but homogeneous background using only a small amount of training data. In contrast to most other state-of-the-art methods for bounding box labeling, our proposed method neither requires human verification, a predefined set of classes, nor a very large manually annotated dataset. Our method outperforms the state-of-the-art, transformer-based object discovery method LOST on our simple fruits dataset by large margins.
Gromit-MPX is an on-screen annotation tool that works with any Unix desktop environment under X11 as well as Wayland. - GitHub - bk138/gromit-mpx: Gromit-MPX is an on-screen annotation tool that works with any Unix desktop environment under X11 as well as Wayland.
@SafeVarargs
Is a cure for the warning: [unchecked] Possible heap pollution from parameterized vararg type Foo.
Is part of the method's contract, hence why the annotation has runtime retention.
Is a promise to the caller of the method that the method will not mess up the heap using the generic varargs argument.
J. Kurhila, M. Miettinen, P. Nokelainen, P. Flor&\#233;en, and H. Tirri. ITiCSE '03: Proceedings of the 8th annual conference on Innovation and technology in computer science education, 35, page 173--177. New York, NY, USA, ACM Press, (September 2003)
C. Wang, and G. Chen. ITiCSE '04: Proceedings of the 9th annual SIGCSE conference on Innovation and technology in computer science education, 36, page 132--136. New York, NY, USA, ACM Press, (September 2004)
B. Brush, D. Bargeron, J. Grudin, and A. Gupta. CHI '02: Proceedings of the SIGCHI conference on Human factors in computing systems, page 89--96. New York, NY, USA, ACM Press, (2002)
C. Marshall, and A. Brush. JCDL '04: Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries, page 349--357. New York, NY, USA, ACM, (2004)
A. Brush, D. Bargeron, A. Gupta, and J. Cadiz. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, page 285--292. New York, NY, USA, ACM, (2001)
D. Fogli, G. Fresta, and P. Mussio. AVI '04: Proceedings of the working conference on Advanced visual interfaces, page 98--102. New York, NY, USA, ACM Press, (2004)
J. Domingue, and M. Dzbor. IUI '04: Proceedings of the 9th international conference on Intelligent user interface, page 191--197. New York, NY, USA, ACM Press, (2004)
N. Kaplan, and Y. Chisik. JCDL '05: Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries, page 235--243. New York, NY, USA, ACM Press, (2005)