Building resources for MT: What the user hasn’t got we have to provide
Skip to main content
eScholarship
Open Access Publications from the University of California

UC Irvine

UC Irvine Previously Published Works bannerUC Irvine

Building resources for MT: What the user hasn’t got we have to provide

Creative Commons 'BY' version 4.0 license
Abstract

The greatest sources of language data for natural language processing are held by the machine translation development community. That data is potentially more in demand than the MT-systems themselves. The defensive attitude of not making these data available for further development is damaging the natural evolution in the field. Activation generates users and those in turn the number of systems to be bought. However, that activation is stalled primarily by the cost of building an MT-system, i.e. the lack of language data available, and secondly by the fact that the potential buyers of machine translation systems lack the knowledge needed for tuning the system to fit the in-house environment.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View