Scholarly Works (9 results)

Article
Peer Reviewed

Forward entrainment: Psychophysics, neural correlates, and function

UC Irvine Previously Published Works (2023)

We define forward entrainment as that part of behavioral or neural entrainment that outlasts the entraining stimulus. In this review, we examine conditions under which one may optimally observe forward entrainment. In Part 1, we review and evaluate studies that have observed forward entrainment using a variety of psychophysical methods (detection, discrimination, and reaction times), different target stimuli (tones, noise, and gaps), different entraining sequences (sinusoidal, rectangular, or sawtooth waveforms), a variety of physiological measures (MEG, EEG, ECoG, CSD), in different modalities (auditory and visual), across modalities (audiovisual and auditory-motor), and in different species. In Part 2, we describe those experimental conditions that place constraints on the magnitude of forward entrainment, including an evaluation of the effects of signal uncertainty and attention, temporal envelope complexity, signal-to-noise ratio (SNR), rhythmic rate, prior experience, and intersubject variability. In Part 3, we theorize on potential mechanisms and propose that forward entrainment may instantiate a dynamic auditory afterimage that lasts a fraction of a second to minimize prediction error in signal processing.

Thesis
Peer Reviewed

Relative contribution of amplitude and phase spectra to the perception of complex sounds

Broussard, Sierra Noel
Advisor(s): Saberi, Kourosh

UC Irvine Electronic Theses and Dissertations (2017)

Speech processing involves analysis of complex cues in both spectral and temporal domains. This dissertation describes a set of studies that explore how speech and music, the two most complex and ecologically important types of sound, are affected by spectral degradation using a method that orthogonally and parametrically decorrelates their amplitude and phase spectra. The first study investigates how amplitude and phase information differentially contribute to speech intelligibility. Listeners performed a word identification task after hearing spectrally degraded sentences that were segmented into temporal units of varying lengths (e.g., phoneme and syllable durations) before the decorrelation process. Results showed that for intermediate spectral correlation values, segment length is generally inconsequential to intelligibility, and that intelligibility overall is more adversely affected by phase-spectrum decorrelation than by amplitude-spectrum decorrelation. The second study investigates how amplitude and phase information differentially contribute to melody discrimination and speech intelligibility to better characterize processing differences between music and speech. Listeners heard spectrally degraded melodies and performed a same-different judgement in a psychophysical discrimination task. Melody recognition was relatively unaffected by partial decorrelation of the amplitude spectrum and more resilient to loss of phase-spectrum cues for both short- and long-duration analysis segments. The third study examines the effects of speaking rate and spectral degradation on speech intelligibility. Consistent with prior findings, phase-spectrum cues were most useful to intelligibility at longer temporal windows of analysis, and amplitude-spectrum cues at short windows. For normal-rate speech, the crossover point between these two cues occurred at an estimated window size of 120 ms; i.e., amplitude-spectrum cues were more useful to intelligibility below this value and phase-spectrum cues were more useful above this window size. Increasing the speaking rate to twice the normal rate surprisingly seemed to have little to no effect on this crossover point. However, slowing down the speaking rate shifted this crossover point to significantly longer temporal window sizes (~230 ms). Implications of these findings for cues critical to intelligibility of speech at different speaking rates, and in particular the importance of preserving narrowband temporal envelope cues, are discussed.

Thesis
Peer Reviewed

Factors affecting relative pitch perception

McClaskey, Carolyn Marie
Advisor(s): Saberi, Kourosh

UC Irvine Electronic Theses and Dissertations (2016)

Sounds that evoke a sense of pitch are ubiquitous in our environment and important for speech, music, and auditory scene analysis. The frequencies of these sounds rarely remain constant, however, and the direction and extent of pitch change are often more important than the exact pitches themselves. This dissertation examines the mechanisms underlying how we perceive relative pitch distance, focusing on two types of stimuli: continuous pitch changes and discrete pitch changes.

In a series of experiments testing continuous pitch changes, listeners heard pure-tone frequency sweeps and reported whether they moved up or down. Sweeps varied in the extent of frequency change, the rate of frequency change, and sweep center frequency. Results provide evidence for a sampling mechanism in which listeners extract the start and end pitches of each sweep and then compare them to determine sweep direction. A comparison of performance between frequency regions shows a smaller effect of sweep rate at high frequencies (>6 kHz), suggesting that the mechanism by which listeners extract start/end pitches at low frequencies is based on temporal pitch processing.

To examine discrete frequency changes, nonmusicians, amateur musicians, and formally trained “expert” musicians heard two different pitch-intervals and were asked to indicate which was larger. Intervals varied in the size of the comparison interval and were presented in both low and high frequency regions. Expert musicians performed significantly better than other listeners, while amateur musicians performed similarly to nonmusicians. Contrary to previous studies, all groups demonstrated better performance for smaller intervals. A comparison of frequency region also suggests a potential difference in listening strategy between groups: nonmusicians produced higher thresholds at high frequencies but amateur and expert musicians did not.

Overall, results provide novel evidence for the role of a sampling mechanism in sweep-direction identification, and present a previously undocumented effect of standard interval size in pitch-interval perception. The effects of frequency region found in both contexts furthermore suggest that temporal pitch processing mechanisms are used at low frequencies, and that different listening strategies may be used for relative pitch perception at higher frequencies where temporal pitch cues are less reliable.

Creative Commons 'BY-NC-SA' version 4.0 license

Article
Peer Reviewed

The Rhythm of Perception

UC Irvine Previously Published Works (2015)

Acoustic rhythms are pervasive in speech, music, and environmental sounds. Recent evidence for neural codes representing periodic information suggests that they may be a neural basis for the ability to detect rhythm. Further, rhythmic information has been found to modulate auditory-system excitability, which provides a potential mechanism for parsing the acoustic stream. Here, we explored the effects of a rhythmic stimulus on subsequent auditory perception. We found that a low-frequency (3 Hz), amplitude-modulated signal induces a subsequent oscillation of the perceptual detectability of a brief nonperiodic acoustic stimulus (1-kHz tone); the frequency but not the phase of the perceptual oscillation matches the entrained stimulus-driven rhythmic oscillation. This provides evidence that rhythmic contexts have a direct influence on subsequent auditory perception of discrete acoustic events. Rhythm coding is likely a fundamental feature of auditory-system design that predates the development of explicit human enjoyment of rhythm in music or poetry.

Article
Peer Reviewed

Robustness of speech intelligibility at moderate levels of spectral degradation

UC Irvine Previously Published Works (2017)

The current study investigated how amplitude and phase information differentially contribute to speech intelligibility. Listeners performed a word-identification task after hearing spectrally degraded sentences. Each stimulus was degraded by first dividing it into segments, then the amplitude and phase components of each segment were decorrelated independently to various degrees relative to those of the original segment. Segments were then concatenated into their original sequence to present to the listener. We used three segment lengths: 30 ms (phoneme length), 250 ms (syllable length), and full sentence (non-segmented). We found that for intermediate spectral correlation values, segment length is generally inconsequential to intelligibility. Overall, intelligibility was more adversely affected by phase-spectrum decorrelation than by amplitude-spectrum decorrelation. If the phase information was left intact, decorrelating the amplitude spectrum to intermediate values had no effect on intelligibility. If the amplitude information was left intact, decorrelating the phase spectrum to intermediate values significantly degraded intelligibility. Some exceptions to this rule are described. These results delineate the range of amplitude- and phase-spectrum correlations necessary for speech processing and its dependency on the temporal window of analysis (phoneme or syllable length). Results further point to the robustness of speech information in environments that acoustically degrade cues to intelligibility (e.g., reverberant or noisy environments).

Article
Peer Reviewed

Improved functional abilities of the life-extended Drosophila mutant Methuselah are reversed at old age to below control levels

UC Irvine Previously Published Works (2014)

Methuselah (mth) is a chromosome 3 Drosophila mutant with an increased lifespan. A large number of studies have investigated the genetic, molecular, and biochemical mechanisms of the mth gene. Much less is known about the effects of mth on preservation of sensorimotor abilities throughout Drosophila's lifespan, particularly in late life. The current study investigated functional senescence in mth and its parental-control line (w1118) in two experiments that measured age-dependent changes in flight functions and locomotor activity. In experiment 1, a total of 158 flies (81 mth and 77 controls) with an age range from 10 to 70 days were individually tethered under an infrared laser-sensor system that allowed monitoring of flight duration during phototaxic flight. We found that mth has a statistically significant advantage in maintaining continuous flight over control flies at age 10 days, but not during middle and late life. At age 70 days, the trend reversed and parental control flies had a small but significant advantage, suggesting an interaction between age and genotype in the ability to sustain flight. In experiment 2, a total of 173 different flies (97 mth and 76 controls) with an age range from 50 to 76 days were individually placed in a large well-lit arena (60 × 45 cm) and their locomotor activity quantified as the distance walked in a 1-min period. Results showed that mth flies had lower levels of locomotor activity relative to controls at ages 50 and 60 days. These levels converged for the two genotypes at the oldest ages tested. Findings show markedly different patterns of functional decline for the mth line relative to those previously reported for other life-extended genotypes, suggesting that different life-extending genes have dissimilar effects on preservation of sensory and motor abilities throughout an organism's lifespan.

Article
Peer Reviewed

Auditory, Visual and Audiovisual Speech Processing Streams in Superior Temporal Sulcus

UC Irvine Previously Published Works (2017)

The human superior temporal sulcus (STS) is responsive to visual and auditory information, including sounds and facial cues during speech recognition. We investigated the functional organization of STS with respect to modality-specific and multimodal speech representations. Twenty younger adult participants were instructed to perform an oddball detection task and were presented with auditory, visual, and audiovisual speech stimuli, as well as auditory and visual nonspeech control stimuli in a block fMRI design. Consistent with a hypothesized anterior-posterior processing gradient in STS, auditory, visual and audiovisual stimuli produced the largest BOLD effects in anterior, posterior and middle STS (mSTS), respectively, based on whole-brain, linear mixed effects and principal component analyses. Notably, the mSTS exhibited preferential responses to multisensory stimulation, as well as speech compared to nonspeech. Within the mid-posterior and mSTS regions, response preferences changed gradually from visual, to multisensory, to auditory moving posterior to anterior. Post hoc analysis of visual regions in the posterior STS revealed that a single subregion bordering the mSTS was insensitive to differences in low-level motion kinematics yet distinguished between visual speech and nonspeech based on multi-voxel activation patterns. These results suggest that auditory and visual speech representations are elaborated gradually within anterior and posterior processing streams, respectively, and may be integrated within the mSTS, which is sensitive to more abstract speech information within and across presentation modalities. The spatial organization of STS is consistent with processing streams that are hypothesized to synthesize perceptual speech representations from sensory signals that provide convergent information from visual and auditory modalities.

Article
Peer Reviewed

An fMRI Study of Audiovisual Speech Perception Reveals Multisensory Interactions in Auditory Cortex

UC Irvine Previously Published Works (2013)

Creative Commons 'BY' version 4.0 license

Article
Peer Reviewed

Informational Masking in Aging and Brain-lesioned Individuals

UC Irvine Previously Published Works (2023)

Auditory stream segregation and informational masking were investigated in brain-lesioned individuals, age-matched controls with no neurological disease, and young college-age students. A psychophysical paradigm known as rhythmic masking release (RMR) was used to examine the ability of participants to identify a change in the rhythmic sequence of 20-ms Gaussian noise bursts presented through headphones and filtered through generalized head-related transfer functions to produce the percept of an externalized auditory image (i.e., a 3D virtual reality sound). The target rhythm was temporally interleaved with a masker sequence comprising similar noise bursts in a manner that resulted in a uniform sequence with no information remaining about the target rhythm when the target and masker were presented from the same location (an impossible task). Spatially separating the target and masker sequences allowed participants to determine if there was a change in the target rhythm midway during its presentation. RMR thresholds were defined as the minimum spatial separation between target and masker sequences that resulted in a 70.7% correct-performance level in a single-interval 2-alternative forced-choice adaptive tracking procedure. The main findings were (1) significantly higher RMR thresholds for individuals with brain lesions (especially those with damage to parietal areas) and (2) a left-right spatial asymmetry in performance for lesion (but not control) participants. These findings contribute to a better understanding of spatiotemporal relations in informational masking and the neural bases of auditory scene analysis.
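
The 70.7% level quoted in the abstract above is the value that a two-down, one-up adaptive rule converges on. The abstract does not name the tracking rule, so treating it as two-down, one-up is an assumption here. A minimal sketch of where the number comes from (the function name is hypothetical, for illustration only):

```python
def n_down_one_up_target(n_down: int) -> float:
    """Proportion correct at which an n-down, one-up staircase is in equilibrium.

    Assumption: the track steps down after n_down consecutive correct
    responses and up after any single error. Equilibrium is where a step
    down and a step up are equally likely, i.e. p ** n_down == 0.5.
    """
    return 0.5 ** (1.0 / n_down)


# Two-down, one-up: p ** 2 = 0.5, so p = sqrt(0.5)
print(round(100 * n_down_one_up_target(2), 1))  # prints 70.7
```

Other rules target other points on the psychometric function under the same reasoning; a three-down, one-up rule, for example, settles near 79.4% correct.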
