HoverCross: Seamless Diagram Creation and Editing for Pen-Based Interfaces

 

Alice Zhu

Harvey Mudd College

 

Abstract

 

A central problem in pen-based interfaces is how to transition smoothly between drawing and editing. Separate drawing and editing modes can be awkward and distracting, while modeless editing gestures are error-prone. We present HoverCross, a seamless inking and editing interface that provides a simple and reliable method for users to create and edit drawings without explicit mode changes. HoverCross combines the strengths of several recent developments in penbased interfaces. With HoverCross, users ink normally and then select objects or ink strokes by crossing over them in the hover space above the tablet screen. They can then edit their selection through a context menu on the canvas. The results of our user study indicate that HoverCross provides an efficient, fluid and robust transition between drawing and editing. Furthermore, over half of our participants prefer HoverCross over existing interfaces for several common diagram creation and editing tasks.

 

 

Problem & Motivation

 

The promise of pen-based interfaces is that they provide a natural way for users to create diagrams and take free-form notes. However, the integration of diagram creation and diagram editing remains a barrier to the widespread use of these interfaces. Pens are more convenient and natural for drawing, but editing with a pen remains cumbersome when the user is forced to rely on graphical user interfaces designed for a mouse and keyboard. Because the user must use the pen for both drawing and editing (or suffer the inconvenience of switching between the pen and the keyboard), a core challenge for pen-based computing is to construct an interface that allows users to switch easily between the two tasks, while allowing the system to unambiguously interpret a given pen stroke as drawing or editing.

 

 

Background & related work

 

Recent years have brought many proposed solutions to this problem. Traditional solutions (e.g., Windows Journal) require the user to enter “edit mode,? usually by pressing a software button. This solution is simple, but the problems with modes are well known [11]. Using hardware buttons (e.g., the bezel buttons on the side of the Tablet PC) to trigger mode switching helps solve these issues because the mode switch is temporary—as soon as the user releases the button, the interface reverts to drawing mode. Thus, there is little potential for mode confusion. Studies suggest that this approach can be quite effective and natural [7, 4]. However, simply pressing the button requires not only extra physical effort, but in many cases also an extra hand.

 

Other researchers take a recognition-based approach [9, 12], attempting to distinguish automatically between drawing and editing strokes (e.g., lasso selections or gestures). However, even with choice mediators [8] to help resolve recognition ambiguities, recognition errors can be confusing. Furthermore, as we argue below, it is not clear that lasso selection is optimal for many selection tasks.

 

Recently, researchers have explored using the hover space to invoke editing commands. The hover space is the space above the surface of a digital tablet where the pen is still tracked but does not generate ink events. Grossman et al. present Hover Widgets [2], in which the user performs gestures in the hover space to activate editing menus. Following on this work, Subramanian et al. explore the possibility of using several layers of the hover space to perform different editing tasks [10], while Kattinakere et al. formally model users? ability to track (e.g., execute gestures) in hover space [5].

 

While each of these recently developed approaches brings us closer to the dream of seamless pen-based drawing and editing, we believe our arsenal of drawing and editing techniques is not yet complete. Specifically, for common diagram creation and editing tasks, HoverWidgets or a multi-level hover space solution may be too heavyweight.

 

We explore the power and simplicity that can be obtained by combining the simplest aspects of many existing techniques. Our interface, called HoverCross, combines the strengths of hover space editing, crossing-based selection (e.g., [1]), and simple gesture-triggered context menus (e.g., [3]), resulting in an interface that is easy to learn, reliable, and fast for common drawing and editing tasks. Furthermore, our approach integrates nicely with almost any other penbased editing technique, so if desired, users can simply ignore it and fall back on traditional editing methods.

 

 

Uniqueness of the approach

 

In many modal pen-based interfaces (e.g.,Windows Journal), it is selection, not editing in general, that is typically relegated to its own mode. This division makes sense, as selecting objects in a drawing is typically the first step in performing almost any editing task. Once the user selects an object, she can edit it via a menu or direct manipulation. An explicit selection mode seems necessary because any apparent selection stroke just as easily could be a drawing stroke.

 

The key insight behind HoverCross is that relegating the process of selection to the hover space allows users to switch seamlessly between drawing and selecting without pressing any buttons. Furthermore, because the user performs only selection in the hover space, a simple crossing interface suffices, and there is no need to perform gestures in the hover space. The system then leverages the context of the selection to give the user additional power through a gesture-invoked context menu or direct interaction with the selected objects.

 

In our interface, the user draws normally on the screen to create diagrams containing simple shapes and text. The diagrams and text the user draws currently are recognized by the Microsoft gesture and text recognition engines, but HoverCross also works with unrecognized ink.

 

To edit the diagram, the user hovers the pen briefly over the tablet to trigger selection. The brief pause prevents triggering selection every time the stylus enters the range, as it inevitably does on its way to the screen. Once selection is triggered, a small vertical line appears in the middle of each object (Figure 1), and the user simply needs to cross the stylus over the line in either direction in hover space to select the object; crossing the line again deselects it. We call each vertical line the “handle? for the corresponding object. Figure 2 shows selected objects. The user can easily cross over multiple objects to select all of them. To clear the selection, the user moves the pen in hover space off the edge of the screen or quickly crosses over each of the selected objects.

 

 

Figure 1: Vertical bars appear in each shape to give users a crossing target.

 

 

 

Figure 2: The context menu that appears when the user makes a simple gesture. The menu

appears at the tip of the stylus at the end of the gesture, and actions apply to all selected shapes.

 

Using the hover space for selection is particularly convenient for cluttered diagrams in which the user wishes to edit several objects separated by intervening shapes or text. The user can enter the hover space to cross one object and then immediately exit the hover space by moving the pen tip away from the screen. The selected object remains selected even when the pen is outside of hover space. To select another object, the user simply needs to re-enter hover space near that object. This process can be repeated as many times as needed to easily select multiple objects spread out across the screen.

 

Once the user has selected at least one item, she can either edit the selected item, or continue to draw new portions of the diagram. The user has several options for editing the diagram. She can move selected shapes by dragging one of them with her pen on the screen. As long as she does not cross the handle in hover space, touching the shape will not deselect it. For more complicated editing tasks, the user can draw a simple gesture (an arc to the right) anywhere directly on the screen to bring up a ring-shaped context menu around the tip of the stylus (Figure 2). The content of the menu depends on the type of objects selected, such as text or shapes. Another short movement allows the user quickly to choose from the given options to manipulate the selection. With little practice, the user easily can coordinate drawing, selecting, and editing in one fluid motion.

 

The HoverCross technique is appropriate for any application that combines pen-based drawing and editing. Our example application supports the creation of clean diagrams that combine shapes and text, such as might be produced in PowerPoint or Visio. We chose this domain because creating polished diagrams typically requires more intensive editing than simply drawing freeform notes and diagrams. In a previous pilot study in which we observed people creating slides in PowerPoint, we found that they relied heavily on copying, pasting, resizing and moving shapes within their diagrams. This domain thus allows us to evaluate the utility of HoverCross for its intended purpose.

 

In many ways, our HoverCross interface is incredibly simple, yet quite powerful. First, because drawing and selection take place in two distinct spaces, there can be no ambiguity about the user’s intent. Second, users learn only one simple gesture. Although our gesture is so simple it could conceivably be confused with drawing strokes, it would be straightforward to construct a gesture that is rarely confused with drawing strokes, such as one of the compound gestures in [6]. Users perform the gesture on the screen, so there are no issues of learning to execute more complex gestures in hover space. Third, the context-specific in-place menus allow the user easily to access the most common editing commands.

 

Confirming previous results [1], the crossing selection metaphor has several advantages over traditional lasso selection. For selecting single shapes or shapes roughly in a horizontal row, crossing the shapes? targets is much faster than lassoing them. Additionally, for shapes spread out in space, crossing in hover space behaves like “Control-clicking? with a mouse and keyboard, allowing users to select some objects while avoiding intervening ones.

 

A final strength of this interface is that any part of it may be implemented in conjunction with almost any other interface technique. If the user does not want to use it, she can simply ignore it. Because the hover selection interface requires a short pause to invoke, the user is not likely to trigger it by mistake, so it may be combined with a traditional modal selection interface. For example, a user might use modal lasso selection to select a group of tightly clumped objects, and then use HoverCross to deselect one of them. While we offer the context menu for convenience, additional menus may be added easily to the interface.

 

 

Results & contributions

 

We conducted a user study to verify our hypothesis that the combination of HoverCross and our context-based ring menu is simple to use, does not distract the user from the current task, and provides a fluid and efficient transition between editing and sketching. We compared three selection interfaces: HoverCross, a variation of HoverCross called Cross, and the traditional lasso select. Cross differs from HoverCross in that users cross objects with the pen on the screen while holding down a nonpreferred hand button. Li et al. showed that using a nonpreferred hand button is the current most effective way to invoke edit mode [7]. Lasso select invoked using a modal interface button is the current approach in most commercial software.

 

Eighteen users participated in our study. All were students at Harvey Mudd College, and all had at least some experience using a tablet computer. No user had experience with a hover or crossing-based interface.

 

After receiving instructions and trying out the three interface techniques for about 2-5 minutes, users performed four tasks using each of the three interface techniques. We designed our tasks to evaluate the utility of HoverCross in a number of different situations. In the first task, users simultaneously created and edited recognized shapes. We instructed them to draw a square, a circle and a triangle on the left side of the screen, and to move each shape to the right side of the screen immediately after drawing it. Our next two tasks evaluated the utility of HoverCross for selecting overlapping shapes. In task 2 we instructed them to move an entire group of pre-drawn, overlapping shapes, while in task 3 we instructed them to move only a subset of the overlapping shapes. Finally, in task 4 we instructed them to change all the circles to red and all the squares to blue in a pre-drawn sketch that had circles and squares spread out across the screen. They could select as many or as few at a time as they wished.

 

The order of the tasks remained fixed across users but we varied the order of the interfaces between tasks and between users; this ordering is perfectly balanced. We expect that there were learning effects across tasks, but these effects serve to make the interfaces more comparable, making users? lack of experience with Cross and HoverCross less prevalent.

 

Overall, users felt that it was easy and fast to select objects with HoverCross. For tasks 1, 3, and 4, eight users preferred HoverCross over the other two interfaces, and twelve users preferred either HoverCross or Cross over Lasso. Users enjoyed the fluid transition between sketching and selecting, without the hassle of explicit indication. In addition, for small selections, drawing a circle around the objects to lasso them seemed excessive. When objects were spread out over the canvas, users felt that selecting a few of them with lasso select was awkward, while with HoverCross they could simply move the pen across the screen to the objects of interest. Many users therefore preferred HoverCross despite prior familiarity with lasso. In fact, such prior familiarity allowed them to conclude that HoverCross is indeed an improvement. This result suggests that there is a set of users that would benefit greatly from HoverCross in this situation.

 

For task 2, which involved selecting a group of closely placed objects, majority of the users preferred lasso select. This result is understandable, because crossing over each individual object in the group is more cumbersome than drawing a lasso around it. However, one user still preferred HoverCross for this task. He reflected that, while both interfaces are effective, HoverCross was more interesting and fun.

 

We believe these results show great promise for the HoverCross interface. First, the users who preferred the lasso select interface admitted that the most significant reason that they preferred this interface was because of its familiarity. We believe that users also will get faster at using HoverCross with practice. Second, the general preference for HoverCross over Cross shows the promise of using the hover space as a fluid and reliable method for invoking selection. Finally, while the results of task 2 show the limitations of HoverCross, we easily can combine the HoverCross interface with a traditional button-triggered lasso select interface to allow the user to choose the editing method that is most appropriate for a given situation.

 

We are considering a few possible future improvements that could make our simple interface even more effective. First, the desired time lag before entering hover select mode varies by user, so we wish to allow the user to adjust this time. Second, some users suggested variation of the selection handle, such as using a smaller one, designating direction for selecting and deselecting, etc. We would like to try a few of the ideas, implementing them and conducting studies to compare the effectiveness of each.

 

HoverCross combines simple and effective ideas from many recent advances in pen-based interfaces to provide an elegant interface for inking and editing. Used in combination with other pen-based interaction techniques, HoverCross will bring us one step closer to the goal of creating pen-based interfaces that combine the freedom of paper with the power of the computer in a useful, and usable, way.

 

 

References

 

1. G. Apitz and F. Guimbreti`ere. CrossY: A crossing-based drawing application. In Proc. of UIST, 2004.

2. T. Grossman, K. Hinkley, P. Baudisch, M. Agrawala, and R. Balakrishnan. Hover widgets: Using the tracking state to extend capabilities of pen-operated devices. In Proc. of CHI, 2006.

3. K. Hinckley, P. Baudisch, G. Ramos, and F. Guimbreti` ere. Design and analysis of delimiters for selection-action pen gesture phases in scriboli. In Proc. of CHI, 2005.

4. K. Hinckley, F. Guimbreti`ere, P. Baudisch, R. Sarin, M. Agrawala, and E. Cutrell. The springboard: Multiple modes in one spring-loaded control. In Proc. of CHI, 2006.

5. R. S. Kattinakere, T. Grossman, and S. Subramanian. Modeling steering within above-the-surface interaction layers. In Proc. of CHI, 2007.

6. J. LaViola and R. Zeleznik. Mathpad2: A system for the creation and exploration of mathematical sketches. Proc. of SIGGRAPH, 23(3), 2004.

7. Y. Li, K. Hinckley, Z. Guan, and J. Landay. An experimental analysis of mode switching techniques in penbased user interfaces. In Proceedings of CHI, 2005.

8. J. Mankoff, S. E. Hudson, and G. D. Abowd. Providing integrated toolkit-level support for ambiguity in recognition-based interfaces. In Proc. of CHI, 2000.

9. E. Saund and E. Lank. Stylus input and editing without prior selection of mode. In Proc. of UIST, 2003.

10. S. Subramanian, D. Aliakseyeu, and A. Lucero. Multilayer interaction for digital tables. In Proc. Of UIST, 2006.

11. L. Tesler. The smalltalk environment. Byte, 6:90?147, August 1981.

12. R. Zeleznik and T. Miller. Fluid inking: augmenting the medium of free-form inking with gestures. In Proc. of Graphics Interface, 2006.