- Main
CorreGram: Using Corpus Data to Develop Student-Adaptable Automated Corrective Feedback for the L2 Spanish Language Classroom
- Davidson, Samuel Stewart
- Advisor(s): Sagae, Kenji
Abstract
Automated corrective feedback (ACF), in which a computer system helps language learners identify and correct errors in their writing or speech, is considered an important tool for language instruction by many researchers. Such systems allow learners to correct their own mistakes, thereby reducing teacher workload and potentially preventing issues related to grammatical error fossilization. Research in this area has led to the development and widespread adoption of tools such as Grammarly for English learners. However, research in grammatical error correction (GEC) and other forms of ACF in languages other than English has been much more limited. This dearth of research is in part due to the large demand for English instruction, but is also driven by the limited training data available for non-English languages. However, a new corpus of learner Spanish collected at UC Davis, COWS-L2H, provided me with an opportunity to explore development of ACF for students studying Spanish. In my dissertation work, I explore the error patterns present in writing by students of Spanish in COWS-L2H, and use this information to inform a novel data augmentation technique to generate synthetic data for training language models capable of correcting learner errors in Spanish text. I then use this synthetic data, along with learner data from COWS-L2H, to train an AI-based GEC model for Spanish learners that is adaptable to learner L1 and proficiency level. Finally, I explore how this automatically corrected writing can be used to present feedback to learners in a pedagogically motivated way. To that end, I combine the GEC model trained using data from COWS-L2H with hand-written templates and feedback produced by generative LLMs to craft appropriate feedback for learners using the system. The end goal is a grammar-checker that is able to not only explain why something a student wrote is potentially incorrect, but is also able to guide the student to make the correction themselves. I demonstrate this novel system, CorreGram, and further discuss details of its implementation and proposals for how the system may be effectively utilized in the language classroom.
Main Content
Enter the password to open this PDF file:
-
-
-
-
-
-
-
-
-
-
-
-
-
-