Scaling up Recognition in Expert Domains with Crowd-source Annotations
- Wang, Pei
- Advisor(s): Vasconcelos, Nuno
Abstract
The success of deep learning in image recognition is substantially driven by large-scale, well-curated data. For visual recognition of common objects, the data can be annotated at scale on online crowd-sourcing platforms because labeling requires no prior knowledge. This does not hold for images in expert domains, such as biological or medical imaging, where labeling requires background knowledge. Although data collection is usually still easy, annotation is difficult. Existing self-supervised and semi-supervised solutions train a model on a small amount of labeled data and a large amount of unlabeled data. These solutions perform well on common object recognition but have been found not to work effectively on fine-grained expert domains.
In this thesis, we propose a new solution that uses crowd-sourced annotations to address the problem. Motivated by the premise that supervised learning performs better with more labeled data, our method tries to scale up the annotation. This is implemented through two different approaches: machine teaching and human filtering. Machine teaching first teaches humans the required expert knowledge through a short, carefully designed course so that they can then label the data. Human filtering simplifies the process to a binary selection procedure that requires no prior training. Beyond these two approaches, a unified explanation framework is developed to generate visualizations that are integrated into both approaches, enabling easier and more accurate annotation. Experiments show that both methods significantly outperform various alternative approaches on several benchmarks. They are also found to be versatile and can benefit from more advanced machine learning techniques in the future. Overall, we believe that this thesis opens up a new direction for thinking about the expert-domain classification problem in general.