Domain Specific Machine Learning - A No-Free-Lunch Perspective
Skip to main content
eScholarship
Open Access Publications from the University of California

UC Santa Barbara

UC Santa Barbara Electronic Theses and Dissertations bannerUC Santa Barbara

Domain Specific Machine Learning - A No-Free-Lunch Perspective

Abstract

This work explores Domain Specific Machine Learning (DSML) problems that arise from a workflow in a semiconductor company. The focus is on understanding what characteristics make them different from a typical machine learning problem. We first discuss a general setup for a DSML problem and provide two postulates to describe the scope of the DSML considered in this research. We examine the theories of machine learning, including four different schools of thinking for articulation of what machine learning is. We review the No Free Lunch Theory for machine learning and discuss the incompleteness of the four theories in view of the No Free Lunch.

Based on the theoretical understanding, we then point out the essence of DSML.Specifically, a DSML problem considered in this work follows an iterative process. In each iteration, the learning goal is to attain a ``surprise'' for the next iteration. Essentially, the unpredictability in the data appears in the next iteration represents the value of the learning. From this point of view, we argue that the underlying problem in DSML is local No Free Lunch.

An important consequence of this view is that machine learning will no longer be used in making a decision for the user, as that traditionally used. Instead, it is used to help a data summarization process to facilitate user to make a decision. In this thesis, we investigate two types of applications, outlier screening in production testing and wafer map pattern recognition for yield monitoring and improvement. We use these two applications to illustrate our view of DSML as well as various considerations associated with the view. Based on experience of implementing a machine learning software solution in a company, we explain the lessons learned for deploying a successful machine learning solution. If we consider traditional machine learning as largely focusing on the learning algorithms, we conclude that DSML is largely focusing on all other aspects trying to leverage the capability of a learning algorithm (if possible). From this angle, DSML is solving a problem complementary to machine learning (Co-ML). In other words, DSML shall not be taken as another direct extension of ML. Instead, it would be more appropriate to view DSML (in the semiconductor industry) as Co-ML.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View