Skip to main content
eScholarship
Open Access Publications from the University of California

UC San Diego

UC San Diego Electronic Theses and Dissertations bannerUC San Diego

Part of speech tagging of Levantine

Abstract

The goal for this project is to explore strategies in adapting a Part of Speech (POS) tagger that was trained on Modern Standard Arabic sentences for tagging Levantine sentences, a dialect of Modern Standard Arabic, leveraged by methods of morphological analysis. I propose a tagging model that supports an explicit representation of the root -template patterns of Arabic. I will analyze the functionality and performance of the algorithms, and will compare the results. In leveraging the MSA POS tagger for tagging Levantine data, I achieved a peak accuracy of 73.28% which is 6% higher than the baseline for a standard Hidden Markov Model based tagger

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View