HoverCross:
Seamless Diagram Creation and Editing for Pen-Based Interfaces
Abstract
A central problem
in pen-based interfaces is how to transition smoothly between drawing and
editing. Separate drawing and editing modes can be awkward and distracting, while
modeless editing gestures are error-prone. We present HoverCross,
a seamless inking and editing interface that provides a simple and reliable
method for users to create and edit drawings without explicit mode changes. HoverCross combines the strengths of several recent
developments in penbased interfaces. With HoverCross, users ink normally and then select objects or
ink strokes by crossing over them in the hover space above the tablet screen.
They can then edit their selection through a context menu on the canvas. The
results of our user study indicate that HoverCross provides
an efficient, fluid and robust transition between drawing and editing. Furthermore,
over half of our participants prefer HoverCross over
existing interfaces for several common diagram creation and editing tasks.
The promise of
pen-based interfaces is that they provide a natural way for users to create
diagrams and take free-form notes. However, the integration of diagram creation
and diagram editing remains a barrier to the widespread use of these
interfaces. Pens are more convenient and natural for drawing, but editing with
a pen remains cumbersome when the user is forced to rely on graphical user
interfaces designed for a mouse and keyboard. Because the user must use the pen
for both drawing and editing (or suffer the inconvenience of switching between
the pen and the keyboard), a core challenge for pen-based computing is to
construct an interface that allows users to switch easily between the two
tasks, while allowing the system to unambiguously interpret a given pen stroke
as drawing or editing.
Background & related work
Recent years have
brought many proposed solutions to this problem. Traditional solutions (e.g.,
Windows Journal) require the user to enter “edit mode,? usually by pressing a
software button. This solution is simple, but the problems with modes are well
known [11]. Using hardware buttons (e.g., the bezel buttons on the side of the
Tablet PC) to trigger mode switching helps solve these issues because the mode switch
is temporary—as soon as the user releases the button, the interface reverts to
drawing mode. Thus, there is little potential for mode confusion. Studies
suggest that this approach can be quite effective and natural [7, 4]. However,
simply pressing the button requires not only extra physical effort, but in many
cases also an extra hand.
Other researchers
take a recognition-based approach [9, 12], attempting to distinguish
automatically between drawing and editing strokes (e.g., lasso selections or
gestures). However, even with choice mediators [8] to help resolve recognition
ambiguities, recognition errors can be confusing. Furthermore, as we argue
below, it is not clear that lasso selection is optimal for many selection
tasks.
Recently,
researchers have explored using the hover space to invoke editing commands. The
hover space is the space above the
surface of a digital tablet where the pen is still tracked but does not
generate ink events. Grossman et al. present Hover Widgets [2], in which the
user performs gestures in the hover space to activate editing menus. Following
on this work, Subramanian et al. explore the possibility of using several
layers of the hover space to perform different editing tasks [10], while Kattinakere et al. formally model users? ability to track
(e.g., execute gestures) in hover space [5].
While each of
these recently developed approaches brings us closer to the dream of seamless
pen-based drawing and editing, we believe our arsenal of drawing and editing
techniques is not yet complete. Specifically, for common diagram creation and
editing tasks, HoverWidgets or a multi-level hover space
solution may be too heavyweight.
We explore the
power and simplicity that can be obtained by combining the simplest aspects of
many existing techniques. Our interface, called HoverCross,
combines the strengths of hover space editing, crossing-based selection (e.g.,
[1]), and simple gesture-triggered context menus (e.g., [3]), resulting in an
interface that is easy to learn, reliable, and fast for common drawing and
editing tasks. Furthermore, our approach integrates nicely with almost any
other penbased editing technique, so if desired,
users can simply ignore it and fall back on traditional editing methods.
Uniqueness of the approach
In many modal
pen-based interfaces (e.g.,Windows
Journal), it is selection, not editing in general, that is typically relegated
to its own mode. This division makes sense, as selecting objects in a drawing
is typically the first step in performing almost any editing task. Once the
user selects an object, she can edit it via a menu or direct manipulation. An
explicit selection mode seems necessary because any apparent selection stroke
just as easily could be a drawing stroke.
The key insight
behind HoverCross is that relegating the process of
selection to the hover space allows users to switch seamlessly between drawing
and selecting without pressing any buttons. Furthermore, because the user
performs only selection in the hover space, a simple crossing interface
suffices, and there is no need to perform gestures in the hover space. The
system then leverages the context of the selection to give the user additional
power through a gesture-invoked context menu or direct interaction with the
selected objects.
In our interface,
the user draws normally on the screen to create diagrams containing simple
shapes and text. The diagrams and text the user draws currently are recognized
by the Microsoft gesture and text recognition engines, but HoverCross
also works with unrecognized ink.
To edit the
diagram, the user hovers the pen briefly over the
tablet to trigger selection. The brief pause prevents triggering selection every
time the stylus enters the range, as it inevitably does on its way to the
screen. Once selection is triggered, a small vertical line appears in the
middle of each object (Figure 1), and the user simply needs to cross the stylus
over the line in either direction in hover space to select the object; crossing
the line again deselects it. We call each vertical line the “handle? for the corresponding
object. Figure 2 shows selected objects. The user can easily cross over
multiple objects to select all of them. To clear the selection, the user moves
the pen in hover space off the edge of the screen or quickly crosses over each
of the selected objects.

Figure 1: Vertical bars appear in each shape to give users a crossing
target.

Figure 2: The context menu that appears when the user makes a simple
gesture. The menu
appears at the tip of the stylus at the end of the
gesture, and actions apply to all selected shapes.
Using the hover
space for selection is particularly convenient for cluttered diagrams in which
the user wishes to edit several objects separated by intervening shapes or
text. The user can enter the hover space to cross one object and then
immediately exit the hover space by moving the pen tip away from the screen.
The selected object remains selected even when the pen is outside of hover
space. To select another object, the user simply needs to re-enter hover space
near that object. This process can be repeated as many times as needed to
easily select multiple objects spread out across the screen.
Once the user has
selected at least one item, she can either edit the selected item, or continue
to draw new portions of the diagram. The user has several options for editing
the diagram. She can move selected shapes by dragging one of them with her pen
on the screen. As long as she does not cross the handle in hover space,
touching the shape will not deselect it. For more complicated editing tasks,
the user can draw a simple gesture (an arc to the right) anywhere directly on
the screen to bring up a ring-shaped context menu around the tip of the stylus
(Figure 2). The content of the menu depends on the type of objects selected,
such as text or shapes. Another short movement allows the user quickly to
choose from the given options to manipulate the selection. With little
practice, the user easily can coordinate drawing, selecting, and editing in one
fluid motion.
The HoverCross technique is appropriate for any application
that combines pen-based drawing and editing. Our example application supports
the creation of clean diagrams that combine shapes and text, such as might be
produced in PowerPoint or Visio. We chose this domain because creating polished
diagrams typically requires more intensive editing than simply drawing freeform
notes and diagrams. In a previous pilot study in which we observed people
creating slides in PowerPoint, we found that they relied heavily on copying,
pasting, resizing and moving shapes within their diagrams. This domain thus
allows us to evaluate the utility of HoverCross for
its intended purpose.
In many ways, our
HoverCross interface is incredibly simple, yet quite
powerful. First, because drawing and selection take place in two distinct
spaces, there can be no ambiguity about the user’s intent. Second, users learn only
one simple gesture. Although our gesture is so simple it could conceivably be
confused with drawing strokes, it would be straightforward to construct a
gesture that is rarely confused with drawing strokes, such as one of the
compound gestures in [6]. Users perform the gesture on the screen, so there are
no issues of learning to execute more complex gestures in hover space. Third,
the context-specific in-place menus allow the user easily to access the most
common editing commands.
Confirming
previous results [1], the crossing selection metaphor has several advantages
over traditional lasso selection. For selecting single shapes or shapes roughly
in a horizontal row, crossing the shapes? targets is much faster than lassoing
them. Additionally, for shapes spread out in space, crossing in hover space
behaves like “Control-clicking? with a mouse and keyboard, allowing users to
select some objects while avoiding intervening ones.
A final strength
of this interface is that any part of it may be implemented in conjunction with
almost any other interface technique. If the user does not want to use it, she
can simply ignore it. Because the hover selection interface requires a short
pause to invoke, the user is not likely to trigger it by mistake, so it may be
combined with a traditional modal selection interface. For example, a user
might use modal lasso selection to select a group of tightly clumped objects,
and then use HoverCross to deselect one of them.
While we offer the context menu for convenience, additional menus may be added
easily to the interface.
Results & contributions
We conducted a user
study to verify our hypothesis that the combination of HoverCross
and our context-based ring menu is simple to use, does not distract the user
from the current task, and provides a fluid and efficient transition between
editing and sketching. We compared three selection interfaces: HoverCross, a variation of HoverCross
called Cross, and the traditional lasso select. Cross differs from HoverCross in that users cross objects with the pen on the
screen while holding down a nonpreferred hand button.
Li et al. showed that using a nonpreferred hand
button is the current most effective way to invoke edit mode [7]. Lasso select
invoked using a modal interface button is the current approach in most
commercial software.
Eighteen users
participated in our study. All were students at Harvey Mudd
College, and all had at least some experience using a tablet computer. No user
had experience with a hover or crossing-based interface.
After receiving
instructions and trying out the three interface techniques for about 2-5
minutes, users performed four tasks using each of the three interface
techniques. We designed our tasks to evaluate the utility of HoverCross in a number of different situations. In the
first task, users simultaneously created and edited recognized shapes. We
instructed them to draw a square, a circle and a triangle on the left side of
the screen, and to move each shape to the right side of the screen immediately
after drawing it. Our next two tasks evaluated the utility of HoverCross for selecting overlapping shapes. In task 2 we
instructed them to move an entire group of pre-drawn, overlapping shapes, while
in task 3 we instructed them to move only a subset of the overlapping shapes.
Finally, in task 4 we instructed them to change all the circles to red and all
the squares to blue in a pre-drawn sketch that had circles and squares spread
out across the screen. They could select as many or as few at a time as they
wished.
The order of the
tasks remained fixed across users but we varied the order of the interfaces
between tasks and between users; this ordering is perfectly balanced. We expect
that there were learning effects across tasks, but these effects serve to make
the interfaces more comparable, making users? lack of experience with Cross and
HoverCross less prevalent.
Overall, users
felt that it was easy and fast to select objects with HoverCross.
For tasks 1, 3, and 4, eight users preferred HoverCross
over the other two interfaces, and twelve users preferred either HoverCross or Cross over Lasso. Users enjoyed the fluid
transition between sketching and selecting, without the hassle of explicit
indication. In addition, for small selections, drawing a circle around the
objects to lasso them seemed excessive. When objects were spread out over the
canvas, users felt that selecting a few of them with lasso select was awkward,
while with HoverCross they could simply move the pen
across the screen to the objects of interest. Many users therefore preferred HoverCross despite prior familiarity with lasso. In fact,
such prior familiarity allowed them to conclude that HoverCross
is indeed an improvement. This result suggests that there is a set of users
that would benefit greatly from HoverCross in this
situation.
For task 2, which
involved selecting a group of closely placed objects, majority of the users
preferred lasso select. This result is understandable, because crossing over
each individual object in the group is more cumbersome than drawing a lasso
around it. However, one user still preferred HoverCross
for this task. He reflected that, while both interfaces are effective, HoverCross was more interesting and fun.
We believe these
results show great promise for the HoverCross
interface. First, the users who preferred the lasso select interface admitted
that the most significant reason that they preferred this interface was because
of its familiarity. We believe that users also will get faster at using HoverCross with practice. Second, the general preference
for HoverCross over Cross shows the promise of using
the hover space as a fluid and reliable method for invoking selection. Finally,
while the results of task 2 show the limitations of HoverCross,
we easily can combine the HoverCross interface with a
traditional button-triggered lasso select interface to allow the user to choose
the editing method that is most appropriate for a given situation.
We are
considering a few possible future improvements that could make our simple interface
even more effective. First, the desired time lag before entering hover select
mode varies by user, so we wish to allow the user to adjust this time. Second, some
users suggested variation of the selection handle, such as using a smaller one,
designating direction for selecting and deselecting, etc. We would like to try
a few of the ideas, implementing them and conducting studies to compare the
effectiveness of each.
HoverCross combines simple and effective ideas from
many recent advances in pen-based interfaces to provide an elegant interface
for inking and editing. Used in combination with other pen-based interaction
techniques, HoverCross will bring us one step closer
to the goal of creating pen-based interfaces that combine the freedom of paper
with the power of the computer in a useful, and usable, way.
References
1.
G. Apitz and F. Guimbreti`ere. CrossY: A
crossing-based drawing application. In Proc. of UIST, 2004.
2.
T. Grossman, K. Hinkley, P. Baudisch,
M. Agrawala, and R. Balakrishnan. Hover widgets: Using the tracking state to
extend capabilities of pen-operated devices. In Proc. of CHI,
2006.
3.
K. Hinckley, P. Baudisch, G. Ramos, and F. Guimbreti` ere. Design and analysis of delimiters for selection-action pen gesture
phases in scriboli. In Proc. of
CHI, 2005.
4.
K. Hinckley, F. Guimbreti`ere, P. Baudisch,
R. Sarin, M. Agrawala, and
E. Cutrell. The springboard: Multiple modes in one spring-loaded control. In Proc. of CHI, 2006.
5.
R. S. Kattinakere, T. Grossman, and S. Subramanian. Modeling steering within
above-the-surface interaction layers. In Proc. of CHI,
2007.
6.
J. LaViola and R. Zeleznik. Mathpad2: A system for the creation and
exploration of mathematical sketches. Proc. of SIGGRAPH,
23(3), 2004.
7. Y. Li, K.
Hinckley, Z. Guan, and J. Landay. An experimental
analysis of mode switching techniques in penbased
user interfaces. In Proceedings of CHI, 2005.
8.
J. Mankoff, S. E. Hudson, and G. D. Abowd.
Providing integrated toolkit-level support for ambiguity in recognition-based
interfaces. In Proc. of CHI, 2000.
9.
E. Saund and E. Lank. Stylus input and editing without prior
selection of mode. In Proc. of UIST, 2003.
10.
S. Subramanian, D. Aliakseyeu, and A. Lucero. Multilayer interaction
for digital tables. In Proc. Of
UIST, 2006.
11. L. Tesler. The smalltalk
environment. Byte, 6:90?147, August 1981.
12.
R. Zeleznik and T. Miller. Fluid inking: augmenting the medium of
free-form inking with gestures. In Proc. of Graphics
Interface, 2006.