Data Annotation At Scale: Pitfalls and Solutions

Wednesday, May 22, 2:10 PM - 2:40 PM
Summit Track: 
Technical Insights II
Mission City M1-M3

In many real-world use cases, deep learning algorithms work well if you have enough high-quality data to train them. Obtaining that data is a critical limiting factor in the development of effective artificial intelligence. In this talk, we’ll identify common pitfalls encountered in obtaining and using public and private data for training and evaluating deep neural networks for visual AI – and we’ll present techniques to overcome these pitfalls. We’ll also present the open source Computer Vision Annotation Tool (CVAT -, illustrating techniques we have implemented to streamline annotation of visual data at scale. We’ll discuss challenges we faced in developing CVAT, how we addressed them and our plans for further improvements.


Nikita Manovich

Senior Software Engineer, Intel

Nikita is an experienced Senior Software Engineer with a demonstrated history of working in the semiconductor industry. He is a strong engineering professional with a master's degree focused in applied mathematics and cybernetics from State University of Nizhny Novgorod named after N.I. Lobachevsky. He was a computer vision research engineer at Itseez company. At Intel he is building infrastructure to analyze, visualize and annotate data for computer vision algorithms. Nikita is the architect of the Computer Vision Annotation Tool (CVAT -

See you at the Summit! May 20-23 in Santa Clara, California!
Register today and reserve your hotel room!