eScholarship
Open Access Publications from the University of California

Department of Linguistics

Open Access Policy Deposits, UCLA

This series is automatically populated with publications deposited by UCLA Department of Linguistics researchers in accordance with the University of California’s open access policies. For more information see Open Access Policy Deposits and the UC Publication Management System.

How to vocode: Using channel vocoders for cochlear-implant research.

(2024)

The channel vocoder has become a useful tool to understand the impact of specific forms of auditory degradation, particularly the spectral and temporal degradation that reflect cochlear-implant processing. Vocoders have many parameters that allow researchers to answer questions about cochlear-implant processing in ways that overcome some logistical complications of controlling for factors in individual cochlear implant users. However, there is such a large variety in the implementation of vocoders that the term vocoder is not specific enough to describe the signal processing used in these experiments. Misunderstanding vocoder parameters can result in experimental confounds or unexpected stimulus distortions. This paper highlights the signal processing parameters that should be specified when describing vocoder construction. The paper also provides guidance on how to determine vocoder parameters within perception experiments, given the experimenter's goals and research questions, to avoid common signal processing mistakes. Throughout, we will assume that experimenters are interested in vocoders with the specific goal of better understanding cochlear implants.

Disentangling the Role of Biphone Probability From Neighborhood Density in the Perception of Nonwords

(2024)

In six experiments we explored how biphone probability and lexical neighborhood density influence listeners' categorization of vowels embedded in nonword sequences. We found independent effects of each. Listeners shifted categorization of a phonetic continuum to create a higher probability sequence, even when neighborhood density was controlled. Similarly, listeners shifted categorization to create a nonword from a denser neighborhood, even when biphone probability was controlled. Next, using a visual world eye-tracking task, we determined that biphone probability information is used rapidly by listeners in perception. In contrast, task complexity and irrelevant variability in the stimuli interfere with neighborhood density effects. These results support a model in which both biphone probability and neighborhood density independently affect word recognition, but only biphone probability effects are observed early in processing.

The everyday speech environments of preschoolers with and without cochlear implants.

(2024)

Children who receive cochlear implants develop spoken language on a protracted timescale. The home environment facilitates speech-language development, yet it is relatively unknown how the environment differs between children with cochlear implants and typical hearing. We matched eighteen preschoolers with implants (31-65 months) to two groups of children with typical hearing: by chronological age and hearing age. Each child completed a long-form, naturalistic audio recording of their home environment (approx. 16 hours/child; >730 hours of observation) to measure adult speech input, child vocal productivity, and caregiver-child interaction. Results showed that children with cochlear implants and typical hearing were exposed to and engaged in similar amounts of spoken language with caregivers. However, the home environment did not reflect developmental stages as closely for children with implants, or predict their speech outcomes as strongly. Home-based speech-language interventions should focus on the unique input-outcome relationships for this group of children with hearing loss.

Information conveyed by voice quality

(2024)

The problem of characterizing voice quality has long caused debate and frustration. The richness of the available descriptive vocabulary is overwhelming, but the density and complexity of the information voices convey lead some to conclude that language can never adequately specify what we hear. Others argue that terminology lacks an empirical basis, so that language-based scales are inadequate a priori. Efforts to provide meaningful instrumental characterizations have also had limited success. Such measures may capture sound patterns but cannot at present explain what characteristics, intentions, or identity listeners attribute to the speaker based on those patterns. However, some terms continually reappear across studies. These terms align with acoustic dimensions accounting for variance across speakers and languages and correlate with size and arousal across species. This suggests that labels for quality rest on a bedrock of biology: We have evolved to perceive voices in terms of size/arousal, and these factors structure both voice acoustics and descriptive language. Such linkages could help integrate studies of signals and their meaning, producing a truly interdisciplinary approach to the study of voice.

Constraints on Acceleration in Bilingual Development: Evidence from Word Segmentation by Spanish Learning Infants

(2024)

We have previously shown that bilingual Spanish and English-learning infants can segment English iambs, two-syllable words with final stress (e.g., guiTAR), earlier than their monolingual peers. This is consistent with accelerated development in bilinguals and was attributed to bilingual infants' increased exposure to iambs through Spanish; about 10% of English content words start with an unstressed syllable, compared to 40% in Spanish. Here, we evaluated whether increased exposure to a stress pattern alone is sufficient to account for acceleration in bilingual infants. In English, 90% of content words start with a stressed syllable (e.g., KINGdom), compared to 60% in Spanish. However, we found no evidence for accelerated segmentation of Spanish trochees by Spanish-English bilingual infants compared to their monolingual Spanish-learning peers. Based on this finding, we argue that merely increased exposure to a linguistic feature in one language does not result in accelerated development in the other. Instead, only the acquisition of infrequent patterns in one language may be accelerated due to the additive effects of the other language.

A meta-analytic review of morphological priming in Semitic languages

(2023)

Two types of discontinuous morphemes are thought to be the basic building blocks of words in Semitic languages: roots and templates. However, the role of these morphemes in lexical access and representation is debated. Priming experiments, where reaction times to target words are predicted to be faster when preceded by morphologically-related primes compared to unrelated control primes, provide conflicting evidence bearing on this debate. We used meta-analysis to synthesise the findings from 229 priming experiments on 4710 unique Semitic speakers. With Bayesian modelling of the aggregate effect sizes, we found credible root and template priming in both nouns and verbs in Arabic and Hebrew. Our results show that root priming effects can be distinguished from the effects of overlap in form and meaning. However, more experiments are needed to determine if template priming effects can be distinguished from overlap in form and morphosyntactic function.

Acoustic voice variation in spontaneous speech

(2022)

This study replicates and extends the recent findings of Lee, Keating, and Kreiman [J. Acoust. Soc. Am. 146(3), 1568-1579 (2019)] on acoustic voice variation in read speech, which showed remarkably similar acoustic voice spaces for groups of female and male talkers and the individual talkers within these groups. Principal component analysis was applied to acoustic indices of voice quality measured from phone conversations for 99/100 of the same talkers studied previously. The acoustic voice spaces derived from spontaneous speech are highly similar to those based on read speech, except that unlike read speech, variability in fundamental frequency accounted for significant acoustic variability. Implications of these findings for prototype models of speaker recognition and discrimination are considered.

The puzzling nuanced status of who free relative clauses in English: a follow-up to Patterson and Caponigro (2015)

(2022)

This squib challenges Patterson & Caponigro's (2015, this journal) claim that there are few acceptable free relative clauses with who. We show that free relatives with who are generally acceptable when they are ‘transparent’ free relatives or complements of a copula, and add further nuance to their findings concerning how the degree of acceptability of free relatives with who varies according to positional factors.

Speaker discrimination performance for "easy" versus "hard" voices in style-matched and -mismatched speech.

(2022)

This study compares human speaker discrimination performance for read speech versus casual conversations and explores differences between unfamiliar voices that are "easy" versus "hard" to "tell together" versus "tell apart." Thirty listeners were asked whether pairs of short style-matched or -mismatched, text-independent utterances represented the same or different speakers. Listeners performed better when stimuli were style-matched, particularly in read speech-read speech trials (equal error rate, EER, of 6.96% versus 15.12% in conversation-conversation trials). In contrast, the EER was 20.68% for the style-mismatched condition. When styles were matched, listeners' confidence was higher when speakers were the same versus different; however, style variation caused decreases in listeners' confidence for the "same speaker" trials, suggesting a higher dependency of this task on within-speaker variability. The speakers who were "easy" or "hard" to "tell together" were not the same as those who were "easy" or "hard" to "tell apart." Analysis of speaker acoustic spaces suggested that the difference observed in human approaches to "same speaker" and "different speaker" tasks depends primarily on listeners' different perceptual strategies when dealing with within- versus between-speaker acoustic variability.

Effects of Laryngeal Vibratory Asymmetry and Neuromuscular Compensation on Voice Quality

(2022)

Introduction

Vibratory asymmetry and neuromuscular compensation are often seen in laryngeal neuromuscular pathology. However, the ramifications of these findings on voice quality are unclear. This study investigated the effects of varying levels of vibratory asymmetry and neuromuscular compensation on cepstral peak prominence (CPP), an analog of voice quality.

Study design

In vivo canine phonation model.

Methods

Varying degrees of vocal fold vibratory asymmetry were achieved by stimulating one recurrent laryngeal nerve (RLN) over 11 levels from threshold to maximal muscle activation. For each of these levels, phonation was induced at systematically varied combinations of neuromuscular compensation: three levels each of contralateral RLN stimulation (80%, 90%, and 100% of maximal), superior laryngeal nerve (SLN) activation (0%, 50%, and 100% of maximal), and airflow levels (500, 700, and 900 mL/s). Vocal fold symmetry was determined by assessing the opening phase of the vibratory cycle in high-speed video recordings. Voice quality was estimated acoustically by calculating CPP for each voice sample.

Results

Eight hundred twenty-two phonatory conditions with varying degrees of vibratory asymmetry were evaluated. CPP was highest at vibratory symmetry. Increasing levels of asymmetry resulted in significant decreases in CPP. CPP increased significantly with increasing contralateral RLN activation. CPP was significantly higher at 50% SLN activation than 0% or 100% SLN activation.

Conclusion

Voice quality, as approximated by CPP, is best at vibratory symmetry and deteriorates with increasing degrees of asymmetry. Voice quality may be improved with neuromuscular compensation by increased adduction of the contralateral vocal fold or increased vocal fold tension at mid-levels of SLN activation.

Level of evidence

NA, Basic Science. Laryngoscope, 132:130-134, 2022.