Controlling Long-Form Large Language Model Outputs
- Yang, Kevin
- Advisor(s): Klein, Dan
Abstract
As large language models have greatly increased in capability in recent years, it has become increasingly important to improve our ability to exert control over their outputs. In this thesis, I discuss several such control schemes I have developed, ranging from pure inference-time control to finetuning-based alignment methods. I will first discuss highly general methods that apply to unstructured natural language generation, including both an inference-time control scheme called FUDGE and a reinforcement-learning-based finetuning approach called RLCD. I will next discuss more specialized methods that can be used for control in more structured domains such as molecule design, program synthesis, and semantic parsing. Finally, I will show how many of these ideas can be used in conjunction with structured planning via prompting to extend our control to much longer outputs, in the range of thousands of words, in an automatic story generation application.