eScholarship
Open Access Publications from the University of California

Quantifying the Socio-semantic Representations of Words

Abstract

Quantifying the meaning of a word is a complex challenge. Humans can encode semantic information along a large and diverse range of semantic dimensions for any given word. Whilst a number of studies have applied a range of techniques to quantify word meaning along specific dimensions, little work has focussed on the socio-semantic dimensions of meaning. Here, we present data that quantify the socio-semantic representations of 2,700 Czech words along five dimensions: gender, location, politics, valence and age. We also demonstrate the utility of the data set by calculating an estimate of socio-semantic similarity between all pairs of words, which can be used to identify words that are close or distant in socio-semantic space.
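
As a rough illustration of the similarity estimate described above, the sketch below treats each word as a vector of five ratings and compares words by Euclidean distance. The abstract does not state which similarity measure was used, so the choice of Euclidean distance is an assumption, as are the 0 to 1 rating scale, the placeholder word labels and all rating values, which are invented for illustration and are not taken from the data set.

```python
import math

# Hypothetical ratings, invented for illustration only (not from the data set).
# Assumed dimension order: gender, location, political, valence, age.
ratings = {
    "word_a": [0.2, 0.5, 0.1, 0.8, 0.3],
    "word_b": [0.3, 0.4, 0.2, 0.7, 0.3],
    "word_c": [0.9, 0.1, 0.8, 0.2, 0.9],
}


def distance(u, v):
    """Euclidean distance between two equal-length rating vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def pairwise_distances(table):
    """Return {(w1, w2): distance} for every unordered pair of words."""
    words = sorted(table)
    return {
        (w1, w2): distance(table[w1], table[w2])
        for i, w1 in enumerate(words)
        for w2 in words[i + 1:]
    }


def nearest_and_farthest(table, word):
    """Return the closest and the most distant other word for `word`."""
    others = [w for w in table if w != word]
    ranked = sorted(others, key=lambda w: distance(table[word], table[w]))
    return ranked[0], ranked[-1]


if __name__ == "__main__":
    for pair, d in pairwise_distances(ratings).items():
        print(pair, round(d, 3))
    print(nearest_and_farthest(ratings, "word_a"))
```

A smaller distance here stands for greater socio-semantic similarity. Another measure, such as cosine similarity, could be substituted by changing only the `distance` function.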
