User Interface Software and Technology

activity recognition

In Proceedings of UIST 2006

Sensing from the basement: a feasibility study of unobtrusive and low-cost home activity recognition (p. 91-100)

James Fogarty, Carolyn Au, Scott E. Hudson

Keywords: activity recognition, sensing in the home, sensor-based model

continuous speech recognition

In Proceedings of UIST 1995

Demonstration of a reading coach that listens (p. 77-78)

Jack Mostow, Alexander G. Hauptmann, Steven F. Roth

4'05", 47Mb

Keywords: children, continuous speech recognition, education, non-reader, speech interface for children

document recognition

In Proceedings of UIST 2004

Video-based document tracking: unifying your physical and electronic desktops (p. 99-107)

Jiwon Kim, Steven M. Seitz, Maneesh Agrawala

3'41", 23Mb

Keywords: document recognition, intelligent office, interactive desktop, video analysis

fingerprint recognition

In Proceedings of UIST 1998

A user interface using fingerprint recognition: holding commands and data objects on fingers (p. 71-79)

Atsushi Sugiura, Yoshiyuki Koseki

Keywords: fingerprint recognition, input device, multimodal user interface, multi-computer user interface

gesture recognition

In Proceedings of UIST 1998

The music notepad (p. 203-210)

Andrew Forsberg, Mark Dieterich, Robert Zeleznik

4'27", 58Mb

Keywords: direct display, gestural input, gesture recognition, handwriting recognition, interaction, music notation, user interface

In Proceedings of UIST 2003

EdgeWrite: a stylus-based text entry method designed for high accuracy and stability of motion (p. 61-70)

Jacob O. Wobbrock, Brad A. Myers, John A. Kembel

5'22", 21Mb

Keywords: pda, assistive technology, computer access, corner, edge, gesture recognition, graffiti, handheld, motor impairment, palm, pebbles, text entry, text input, unistroke

In Proceedings of UIST 2004

SHARK²: a large vocabulary shorthand writing system for pen-based computers (p. 43-52)

Per-Ola Kristensson, Shumin Zhai

4'21", 27Mb

Keywords: gesture recognition, shorthand, shorthand recognition, stenography, text input

In Proceedings of UIST 2006

Camera phone based motion sensing: interaction techniques, applications and performance study (p. 101-110)

Jingtao Wang, Shumin Zhai, John Canny

4'52", 43Mb

Keywords: fitts law, camera phone, computer vision, gesture recognition, handwriting recognition, human performance, input technique and device, mobile device, mobile phone, motion estimation

In Proceedings of UIST 2007

Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes (p. 159-168)

Jacob O. Wobbrock, Andrew D. Wilson, Yang Li

Keywords: dynamic time warping, gesture recognition, mark, rapid prototyping, recognition rate, rubine, statistical classifier, stroke, symbol, unistroke, user interface

Abstract

Although mobile, tablet, large display, and tabletop computers increasingly present opportunities for using pen, finger, and wand gestures in user interfaces, implementing gesture recognition largely has been the privilege of pattern matching experts, not user interface prototypers. Although some user interface libraries and toolkits offer gesture recognizers, such infrastructure is often unavailable in design-oriented environments like Flash, scripting environments like JavaScript, or brand new off-desktop prototyping environments. To enable novice programmers to incorporate gestures into their UI prototypes, we present a "$1 recognizer" that is easy, cheap, and usable almost anywhere in about 100 lines of code. In a study comparing our $1 recognizer, Dynamic Time Warping, and the Rubine classifier on user-supplied gestures, we found that $1 obtains over 97% accuracy with only 1 loaded template and 99% accuracy with 3+ loaded templates. These results were nearly identical to DTW and superior to Rubine. In addition, we found that medium-speed gestures, in which users balanced speed and accuracy, were recognized better than slow or fast gestures for all three recognizers. We also discuss the effect that the number of templates or training examples has on recognition, the score falloff along recognizers' N-best lists, and results for individual gestures. We include detailed pseudocode of the $1 recognizer to aid development, inspection, extension, and testing.

In Proceedings of UIST 2008

OctoPocus: a dynamic guide for learning gesture-based command sets (p. 37-46)

Olivier Bau, Wendy E. Mackay

2'10", 33Mb

Keywords: dynamic guide, feedback, feedforward, gesture recognition, mouse input, octopocus, pen input

Abstract

We describe OctoPocus, an example of a dynamic guide that combines on-screen feedforward and feedback to help users learn, execute and remember gesture sets. OctoPocus can be applied to a wide range of single-stroke gestures and recognition algorithms and helps users progress smoothly from novice to expert performance. We provide an analysis of the design space and describe the results of two experiments that show that OctoPocus is significantly faster and improves learning of arbitrary gestures, compared to conventional Help menus. It can also be adapted to a mark-based gesture set, significantly improving input time compared to a two-level, four-item Hierarchical Marking menu.

handwriting recognition

In Proceedings of UIST 1998

The music notepad (p. 203-210)

Andrew Forsberg, Mark Dieterich, Robert Zeleznik

4'27", 58Mb

Keywords: direct display, gestural input, gesture recognition, handwriting recognition, interaction, music notation, user interface

In Proceedings of UIST 1998

Integrating pen operations for composition by example (p. 211-212)

Toshiyuki Masui

1'33", 22Mb

Keywords: pbd, pbe, pobox, composition by example, handwriting recognition, marking menu, pen input

In Proceedings of UIST 2006

Camera phone based motion sensing: interaction techniques, applications and performance study (p. 101-110)

Jingtao Wang, Shumin Zhai, John Canny

4'52", 43Mb

Keywords: fitts law, camera phone, computer vision, gesture recognition, handwriting recognition, human performance, input technique and device, mobile device, mobile phone, motion estimation

In Proceedings of UIST 2006

CueTIP: a mixed-initiative interface for correcting handwriting errors (p. 323-332)

Michael Shilman, Desney S. Tan, Patrice Simard

2'31", 6Mb

Keywords: constraint, correction interface, handwriting recognition, mixed initiative, user study

interactivity-augmented object recognition

In Proceedings of UIST 2005

Circle & identify: interactivity-augmented object recognition for handheld devices (p. 107-110)

Byungkon Sohn, Geehyuk Lee

Keywords: interactivity-augmented object recognition, smart environment, spatial mouse

object recognition

In Proceedings of UIST 2009

Bonfire: a nomadic system for hybrid laptop-tabletop interaction (p. 129-138)

Shaun K. Kane, Daniel Avrahami, Jacob O. Wobbrock, Beverly Harrison, Adam D. Rea, Matthai Philipose, Anthony LaMarca

4'15", 47Mb

Keywords: ambient interaction, computer vision, extended display, gesture, laptop, micro-projector, object recognition, peripheral display, surface, tabletop, tangible bit

Abstract

We present Bonfire, a self-contained mobile computing system that uses two laptop-mounted laser micro-projectors to project an interactive display space to either side of a laptop keyboard. Coupled with each micro-projector is a camera to enable hand gesture tracking, object recognition, and information transfer within the projected space. Thus, Bonfire is neither a pure laptop system nor a pure tabletop system, but an integration of the two into one new nomadic computing platform. This integration (1) enables observing the periphery and responding appropriately, e.g., to the casual placement of objects within its field of view, (2) enables integration between physical and digital objects via computer vision, (3) provides a horizontal surface in tandem with the usual vertical laptop display, allowing direct pointing and gestures, and (4) enlarges the input/output space to enrich existing applications. We describe Bonfire's architecture, and offer scenarios that highlight Bonfire's advantages. We also include lessons learned and insights for further development and use.

recognition

In Proceedings of UIST 2000

SATIN: a toolkit for informal ink-based applications (p. 63-72)

Jason I. Hong, James A. Landay

Keywords: satin, gesture, informal, ink, interpreter, pen, recognition, recognizer, sketching, toolkit

In Proceedings of UIST 2000

Fluid sketches: continuous recognition and morphing of simple hand-drawn shapes (p. 73-80)

James Arvo, Kevin Novins

Keywords: morphing, recognition, sketching

In Proceedings of UIST 2001

Guided gesture support in the paper PDA (p. 197-198)

Daniel Avrahami, Scott E. Hudson, Thomas P. Moran, Brian D. Williams

Keywords: hybrid paper electronic interface, augmented reality, interaction on paper, interaction technique, recognition

In Proceedings of UIST 2008

Lineogrammer: creating diagrams by drawing (p. 161-170)

Robert C. Zeleznik, Andrew Bragdon, Chu-Chi Liu, Andrew Forsberg

4'43", 22Mb

Keywords: alignment, beautification, diagram, disambiguation, drawing, gesture, handwriting, pen, pen-centric, pressure, recognition, ruler, sketching, snapping, symmetry

Abstract

We present the design of Lineogrammer, a diagram-drawing system motivated by the immediacy and fluidity of pencil-drawing. We attempted for Lineogrammer to feel like a modeless diagramming "medium" in which stylus input is immediately interpreted as a command, text label or a drawing element, and drawing elements snap to or sculpt from existing elements. An inferred dual representation allows geometric diagram elements, no matter how they were entered, to be manipulated at granularities ranging from vertices to lines to shapes. We also integrate lightweight tools, based on rulers and construction lines, for controlling higher-level diagram attributes, such as symmetry and alignment. We include preliminary usability observations to help identify areas of strength and weakness with this approach.

In Proceedings of UIST 2010

A framework for robust and flexible handling of inputs with uncertainty (p. 47-56)

Julia Schwarz, Scott Hudson, Jennifer Mankoff, Andrew D. Wilson

3'14", 25Mb

Keywords: ambiguity, input handling, recognition

Abstract

New input technologies (such as touch), recognition based input (such as pen gestures) and next-generation interactions (such as inexact interaction) all hold the promise of more natural user interfaces. However, these techniques all create inputs with some uncertainty. Unfortunately, conventional infrastructure lacks a method for easily handling uncertainty, and as a result input produced by these technologies is often converted to conventional events as quickly as possible, leading to a stunted interactive experience. We present a framework for handling input with uncertainty in a systematic, extensible, and easy to manipulate fashion. To illustrate this framework, we present several traditional interactors which have been extended to provide feedback about uncertain inputs and to allow for the possibility that in the end that input will be judged wrong (or end up going to a different interactor). Our six demonstrations include tiny buttons that are manipulable using touch input, a text box that can handle multiple interpretations of spoken input, a scrollbar that can respond to inexactly placed input, and buttons which are easier to click for people with motor impairments. Our framework supports all of these interactions by carrying uncertainty forward all the way through selection of possible target interactors, interpretation by interactors, generation of (uncertain) candidate actions to take, and a mediation process that decides (in a lazy fashion) which actions should become final.

recognition error

In Proceedings of UIST 2000

Multimodal system processing in mobile environments (p. 21-30)

Sharon Oviatt

Keywords: mobile interface design, multimodal architecture, mutual disambiguation, recognition error, robust performance, speech and pen input

recognition rate

In Proceedings of UIST 2007

Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes (p. 159-168)

Jacob O. Wobbrock, Andrew D. Wilson, Yang Li

Keywords: dynamic time warping, gesture recognition, mark, rapid prototyping, recognition rate, rubine, statistical classifier, stroke, symbol, unistroke, user interface

shorthand recognition

In Proceedings of UIST 2004

SHARK²: a large vocabulary shorthand writing system for pen-based computers (p. 43-52)

Per-Ola Kristensson, Shumin Zhai

4'21", 27Mb

Keywords: gesture recognition, shorthand, shorthand recognition, stenography, text input

sketch recognition

In Proceedings of UIST 2004

SketchREAD: a multi-domain sketch recognition engine (p. 23-32)

Christine Alvarado, Randall Davis

Keywords: bayesian network, input and interaction technology, intelligent ui, pen-based ui, sketch recognition

speech recognition

In Proceedings of UIST 1994

Putting people first: specifying proper names in speech interfaces (p. 29-37)

Matt Marx, Chris Schmandt

Keywords: conversational system, error-repair, speech recognition, user interface

In Proceedings of UIST 1995

Speech for multimedia information retrieval (p. 79-80)

Alexander G. Hauptmann, Michael J. Witbrock, Alexander I. Rudnicky

5'37", 62Mb

Keywords: informedia, news-on-demand, multimedia indexing and search, speech recognition, video information retrieval

In Proceedings of UIST 1995

A tool to support speech and non-speech audio feedback generation in audio interfaces (p. 171-179)

Lisa J. Stifelman

4'12", 51Mb

Keywords: auditory feedback, hand-held computer, non-speech audio, speech recognition, speech user interface, text-to-speech synthesis

In Proceedings of UIST 2008

Search Vox: leveraging multimodal refinement and partial knowledge for mobile voice search (p. 141-150)

Tim Paek, Bo Thiesson, Yun-Cheng Ju, Bongshin Lee

2'41", 32Mb

Keywords: mobile search, multimodal, speech recognition

Abstract

Internet usage on mobile devices continues to grow as users seek anytime, anywhere access to information. Because users frequently search for businesses, directory assistance has been the focus of many voice search applications utilizing speech as the primary input modality. Unfortunately, mobile settings often contain noise which degrades performance. As such, we present Search Vox, a mobile search interface that not only facilitates touch and text refinement whenever speech fails, but also allows users to assist the recognizer via text hints. Search Vox can also take advantage of any partial knowledge users may have about the business listing by letting them express their uncertainty in an intuitive way using verbal wildcards. In simulation experiments conducted on real voice search data, leveraging multimodal refinement resulted in a 28% relative reduction in error rate. Providing text hints along with the spoken utterance resulted in even greater relative reduction, with dramatic gains in recovery for each additional character.

speech recognition and synthesis

In Proceedings of UIST 1992

Tools for building asynchronous servers to support speech and audio applications (p. 71-78)

Barry Arons

Keywords: asynchronous message passing, audio server, distributed client-server architecture, remote procedure call, speech and audio application, speech recognition and synthesis

symbol recognition

In Proceedings of UIST 2004

Hierarchical parsing and recognition of hand-sketched diagrams (p. 13-22)

Levent Burak Kara, Thomas F. Stahovich

4'06", 29Mb

Keywords: simulink, pen computing, pens, sketch understanding, symbol recognition, visual parsing

recognition

activity recognition

Sensing from the basement: a feasibility study of unobtrusive and low-cost home activity recognition (p. 91-100)

continuous speech recognition

Demonstration of a reading coach that listens (p. 77-78)

document recognition

Video-based document tracking: unifying your physical and electronic desktops (p. 99-107)

fingerprint recognition

A user interface using fingerprint recognition: holding commands and data objects on fingers (p. 71-79)

gesture recognition

The music notepad (p. 203-210)

EdgeWrite: a stylus-based text entry method designed for high accuracy and stability of motion (p. 61-70)

SHARK²: a large vocabulary shorthand writing system for pen-based computers (p. 43-52)

Camera phone based motion sensing: interaction techniques, applications and performance study (p. 101-110)

Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes (p. 159-168)

OctoPocus: a dynamic guide for learning gesture-based command sets (p. 37-46)

handwriting recognition

The music notepad (p. 203-210)

Integrating pen operations for composition by example (p. 211-212)

Camera phone based motion sensing: interaction techniques, applications and performance study (p. 101-110)

CueTIP: a mixed-initiative interface for correcting handwriting errors (p. 323-332)

interactivity-augmented object recognition

Circle & identify: interactivity-augmented object recognition for handheld devices (p. 107-110)

object recognition

Bonfire: a nomadic system for hybrid laptop-tabletop interaction (p. 129-138)

recognition

SATIN: a toolkit for informal ink-based applications (p. 63-72)

Fluid sketches: continuous recognition and morphing of simple hand-drawn shapes (p. 73-80)

Guided gesture support in the paper PDA (p. 197-198)

Lineogrammer: creating diagrams by drawing (p. 161-170)

A framework for robust and flexible handling of inputs with uncertainty (p. 47-56)

recognition error

Multimodal system processing in mobile environments (p. 21-30)

recognition rate

Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes (p. 159-168)

shorthand recognition

SHARK²: a large vocabulary shorthand writing system for pen-based computers (p. 43-52)

sketch recognition

SketchREAD: a multi-domain sketch recognition engine (p. 23-32)

speech recognition

Putting people first: specifying proper names in speech interfaces (p. 29-37)

Speech for multimedia information retrieval (p. 79-80)

A tool to support speech and non-speech audio feedback generation in audio interfaces (p. 171-179)

Search Vox: leveraging multimodal refinement and partial knowledge for mobile voice search (p. 141-150)

speech recognition and synthesis

Tools for building asynchronous servers to support speech and audio applications (p. 71-78)

symbol recognition

Hierarchical parsing and recognition of hand-sketched diagrams (p. 13-22)
