Utility Learning, Non-Markovian Planning, and Task-Oriented Programming Language
- Author(s): Shukla, Nishant
- Advisor(s): Zhu, Song-Chun
- et al.
We formulate a domain-independent language for representing stochastic tasks, and show how this representation can be synthesized from few training observations. In one experiment, we study how a physical robot may learn to fold clothes from a small number of visual observations, in contrast to big data techniques. Under the same framework, we also show how a virtual chat-bot may learn dialogue policies from few example transcripts, resulting in an interpretable dialogue model, outperforming current statistical techniques. Central to both examples in the concept of utility, why it's essential for generalizability, and how to learn it from small data.