eScholarship
Open Access Publications from the University of California

UC Berkeley

Content-based tools for editing audio stories

Author(s):

  • Rubin, Steve
  • Berthouzoz, Floraine
  • Mysore, Gautham J
  • Li, Wilmot
  • Agrawala, Maneesh
  • et al.

Published Web Location

http://dl.acm.org/citation.cfm?id=2501993
No data is associated with this publication.
Abstract

Audio stories are an engaging form of communication that combine speech and music into compelling narratives. Existing audio editing tools force story producers to manipulate speech and music tracks via tedious, low-level waveform editing. In contrast, we present a set of tools that analyze the audio content of the speech and music and thereby allow producers to work at a much higher level. Our tools address several challenges in creating audio stories, including (1) navigating and editing speech, (2) selecting appropriate music for the score, and (3) editing the music to complement the speech. Key features include a transcript-based speech editing tool that automatically propagates edits in the transcript text to the corresponding speech track; a music browser that supports searching based on emotion, tempo, key, or timbral similarity to other songs; and music retargeting tools that make it easy to combine sections of music with the speech. We have used our tools to create audio stories from a variety of raw speech sources, including scripted narratives, interviews, and political speeches. Informal feedback from first-time users suggests that our tools are easy to learn and greatly facilitate the process of editing raw footage into a final story.
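
The idea of propagating transcript edits to the speech track can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each transcript word already carries start and end times from some alignment step, and the names `Word` and `keep_segments` are hypothetical. Deleting words in the text then amounts to computing which spans of audio to keep.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One transcript word with assumed alignment times (seconds)."""
    text: str
    start: float
    end: float


def keep_segments(words, deleted):
    """Return (start, end) audio spans to keep after deleting words by index.

    Adjacent kept words are merged into one span so that the natural
    pauses between them are preserved.
    """
    segments = []
    prev_kept = None
    for i, word in enumerate(words):
        if i in deleted:
            continue
        if segments and prev_kept == i - 1:
            # Contiguous with the previous kept word: extend the span.
            segments[-1] = (segments[-1][0], word.end)
        else:
            # First kept word, or first word after a deletion: new span.
            segments.append((word.start, word.end))
        prev_kept = i
    return segments


# Hypothetical aligned transcript; deleting the filler word "um" (index 1).
words = [
    Word("we", 0.0, 0.2),
    Word("um", 0.3, 0.6),
    Word("present", 0.7, 1.2),
    Word("tools", 1.25, 1.7),
]
segments = keep_segments(words, deleted={1})
print(segments)
```

A real editor would also need to splice the kept spans together without audible artifacts, which this sketch does not attempt.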
