Keywords
UIST2.0 Archive - 20 years of UIST
Back
Back to keywords index

audio

audio

In Proceedings of UIST 1997
Article Picture

Audio aura: light-weight audio augmented reality (p. 211-212)

In Proceedings of UIST 1998
Article Picture

A dynamic grouping technique for ink and audio notes (p. 195-202)

In Proceedings of UIST 2000
Article Picture

The AHI: an audio and haptic interface for contact interactions (p. 149-158)

audio server

In Proceedings of UIST 1992
Article Picture

Tools for building asynchronous servers to support speech and audio applications (p. 71-78)

Abstract plus

Distributed client/server models are becoming increasingly prevalent in multimedia systems and advanced user interface design. A multimedia application, for example, may play and record audio, use speech recognition input, and use a window system for graphical I/O. The software architecture of such a system can be simplified if the application communicates to multiple servers (e.g., audio servers, recognition servers) that each manage different types of input and output. This paper describes tools for rapidly prototyping distributed asynchronous servers and applications, with an emphasis on supporting highly interactive user interfaces, temporal media, and multi-modal I/O.

The Socket Manager handles low-level connection management and device I/O by supporting a callback mechanism for connection initiation, shutdown, and for reading incoming data. The Byte Stream Manager consists of an RPC compiler and run-time library that supports synchronous and asynchronous calls, with both a programmatic interface and a telnet interface that allows the server to act as a command interpreter. This paper details the tools developed for building asynchronous servers, several audio and speech servers built using these tools, and applications that exploit the features provided by the servers.

audio user interface

In Proceedings of UIST 2002
Article Picture

Mediated voice communication via mobile IP (p. 141-150)

Abstract plus

Impromptu is a mobile audio device which uses wireless Internet Protocol (IP) to access novel computer-mediated voice communication channels. These channels show the richness of IP-based communication as compared to conventional mobile telephony, adding audio processing and storage in the network, and flexible, user-centered call control protocols. These channels may be synchronous, asynchronous, or event-triggered, or even change modes as a function of other user activity. The demands of these modes plus the need to navigate with an entirely non-visual user interface are met with a number of audio-oriented user interaction techniques.

audio visualization

In Proceedings of UIST 2003
Article Picture

SmartMusicKIOSK: music listening station with chorus-search function (p. 31-40)

Abstract plus

This paper describes a new music-playback interface for trial listening, SmartMusicKIOSK. In music stores, short trial listening of CD music is not usually a passive experience -- customers often search out the chorus or "hook" of a song using the fast-forward button. Listening of this type, however, has not been traditionally supported. This research achieves a function for jumping to the chorus section and other key parts of a song plus a function for visualizing song structure. These functions make it easier for a listener to find desired parts of a song and thereby facilitate an active listening experience. The proposed functions are achieved by an automatic chorus-section detecting method, and the results of implementing them as a listening station have demonstrated their usefulness.

non-speech audio

In Proceedings of UIST 1993
Article Picture

SpeechSkimmer: interactively skimming recorded speech (p. 187-196)

In Proceedings of UIST 1994
Article Picture

ENO: synthesizing structured sound spaces (p. 49-57)

Abstract plus

ENO is an audio server designed to make it easy for applications in the Unix environment to incorporate non-speech audio cues. At the physical level, ENO manages a shared resource, namely the audio hardware. At the logical level, it manages a sound space that is shared by various client applications. Instead of dealing with sound in terms of its physical description (i.e., sampled sounds), ENO allows sounds to be presented and controlled in terms of higher-level descriptions of sources, interactions, attributes, and sound space. Using this structure, ENO can facilitate the creation of consistent, rich systems of audio cues. In this paper, we discuss the justification, design, and implementation of ENO.

In Proceedings of UIST 1995
Article Picture

Hands-on demonstration: interacting with SpeechSkimmer (p. 71-72)

In Proceedings of UIST 1995
Article Picture

A tool to support speech and non-speech audio feedback generation in audio interfaces (p. 171-179)

real-time audio buffering

In Proceedings of UIST 2001
Article Picture

Real-time audio buffering for telephone applications (p. 193-194)

Abstract plus

A system that uses an ear proximity sensor to actively manage periods of distraction during telephone conversations is described. We detect when the phone is removed from the ear, record any incoming audio, and play it back when the phone is returned to the ear. By dropping silent intervals and speeding up playback with a pitch-preserving algorithm, we quickly return to real-time without the loss of information. This real-time audio buffering technique also allows us to create a user-activated, lossless instant replay function.

spatial audio

In Proceedings of UIST 1998
Article Picture

Audio hallway: a virtual acoustic environment for browsing (p. 163-170)