eScholarship
Open Access Publications from the University of California

A New Class Of Proximity Data Obtained From Dictionary Networks

Abstract

Background. Proximity data is a notion that indicates the degree of psychological closeness of concepts. It includes, among others, judgments of similarity, relatedness and cause-effect. Obtaining proximity data is challenging because it involves experts, corpora and people. On the other hand, dictionaries are fair representations made by experts (and thus, good proxies) of the lexicon and linguistic heritage of people.

Methods. We present a method to automatically obtain proximity data from dictionaries. We construct a network representation of a dictionary; exploit classical techniques on networks to build a similarity matrix; extract parameterized clouds of lexical proximity; and test them with native speakers.

Results. Preliminary evaluations show that the method captures word associations significant to humans. Although the research was done in Spanish, the methods are easily reproducible in other languages.

Conclusions. Dictionaries are good sources of proximity data. We conjecture that dictionary networks are good proxies for semantic associations in the human mind.
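The pipeline named under Methods (dictionary network, similarity matrix, parameterized clouds) can be illustrated with a minimal sketch. The abstract does not say which network techniques or parameters the authors use, so everything below is an assumption made for illustration: the toy English entries, the rule that links a headword to any headword appearing in its definition, Jaccard overlap of neighborhoods as the similarity measure, and the `threshold` and `size` parameters of the cloud.

```python
from itertools import combinations

# Toy dictionary: hypothetical entries, not taken from the paper.
TOY_DICTIONARY = {
    "cat": "small domestic animal that hunts mouse",
    "dog": "domestic animal that guards house",
    "mouse": "small animal hunted by cat",
    "animal": "living being",
    "house": "building where people live",
}


def build_network(dictionary):
    """Link each headword to the headwords that occur in its definition."""
    headwords = set(dictionary)
    neighbors = {word: set() for word in headwords}
    for word, definition in dictionary.items():
        for token in definition.lower().split():
            if token in headwords and token != word:
                # Assumption: links are treated as undirected.
                neighbors[word].add(token)
                neighbors[token].add(word)
    return neighbors


def similarity_matrix(neighbors):
    """Jaccard overlap of closed neighborhoods (an assumed measure)."""
    closed = {word: nbrs | {word} for word, nbrs in neighbors.items()}
    sim = {word: {word: 1.0} for word in closed}
    for a, b in combinations(sorted(closed), 2):
        score = len(closed[a] & closed[b]) / len(closed[a] | closed[b])
        sim[a][b] = sim[b][a] = score
    return sim


def proximity_cloud(sim, word, threshold=0.3, size=5):
    """Return up to `size` words whose similarity to `word` is >= threshold."""
    candidates = [
        (score, other)
        for other, score in sim[word].items()
        if other != word and score >= threshold
    ]
    # Highest similarity first; ties broken alphabetically.
    candidates.sort(key=lambda pair: (-pair[0], pair[1]))
    return [other for _, other in candidates[:size]]


network = build_network(TOY_DICTIONARY)
sim = similarity_matrix(network)
print(proximity_cloud(sim, "cat"))
```

Raising `threshold` or lowering `size` shrinks the cloud, which is one plausible reading of "parameterized"; a real dictionary would also need tokenization, lemmatization and stop-word handling, none of which is attempted here.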
