Skip to main content
eScholarship
Open Access Publications from the University of California

Building bilingual semantic representations based on a corpus-based statisticallearning algorithm

Abstract

In the current study, we applied a corpus-based statistical learning algorithm to derive semantic representations ofwords under bilingual situations (English and Chinese). The algorithm relies on the analyses of contextual information extractedfrom a text corpus, specifically, analyses of word co-occurrences in a large-scale electronic database of text. Particularly, weexamined how the semantic structure of L2 words can be built based on and influenced by the semantic representations of L1words in a sequential L2 learning situation. We got the semantic representations under various conditions and the results wereprocessed and illustrated on self-organizing maps, an unsupervised neural network model that projects the statistical structureof the context onto a 2-D space. We further discussed a couple of factors that affected the validity of the representations.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View