1Dept. of Computing & Information Science
University of
Guelph
Guelph, Ontario
Canada N1G 2W1
2Computer Systems Research Institute
University of
Toronto
Toronto, Ontario
Canada M5S 1A4
Keywords: interaction techniques, pointing and dragging tasks, Fitts' law, human performance modeling
The graphical user interface, popularized in 1983 with the introduction of the Apple Macintosh, has redefined the way humans interact with computers. Present-day mouse-driven interfaces employ sophisticated yet natural techniques for user input. "Pointing", "dragging", "inking", etc., form the core repertoire of interaction techniques in graphical user interfaces.
This paper presents and critiques prediction models for the common tasks of pointing and dragging. Our aim is (a) to illustrate the potential benefits and problems in using Fitts' law models as an engineer's approximate model as per Card, Moran, and Newell's (1983) Model Human Processor, and (b) to establish which target dimension is the most appropriate "target width" for pointing and dragging tasks on a 2D CRT display. We will present models from past research, describe an experiment building new models, and compare and reconcile the differences between these models.
Just as language is a tool for thought, models are tools for organizing and articulating ideas for the researcher or designer. One such model that captures the common acts of pointing and dragging in interactive systems is the Three-State Model for Graphical Input (Buxton, 1990). Pointing is represented as a State 1 action and dragging as a State 2 action (see Figure 1). Selection is a brief transition from State 1 to State 2 and back again via a mouse button. (State 0 actions are the "out-of-range" motions possible with a mouse or stylus while airborne.) The three-state model forms a vocabulary for exploring relationships -- affordances or constraints -- between input devices and interactive techniques.

Although Fitts' law has surfaced extensively as a prediction model for pointing tasks, its application to dragging tasks is limited to the studies by Gillan, Holden, Adam, Rudisill, and Magee (1990, 1992) and MacKenzie, Sellen, and Buxton (1991). We will review these following a brief introduction to Fitts' law. For extensive reviews, see MacKenzie (1992), Meyer, Smith, Kornblum, Abrams, and Wright (1990), or Welford (1968).
MT = log2(2A/W) (1)
A is the distance or amplitude to move and W is the
width or tolerance of the region within which the move terminates. From
Equation 1, the time to complete a movement task is predicted as
MT = a + b ID (2)
where a and b are the intercept and slope coefficients from
linear regression.Variations of Fitts' law have surface to correct systematic biases in regression analyses. These include the Welford (1968) formulation:
MT = a + b log2(A/W + 0.5) (3)
and the Shannon formulation (MacKenzie, 1989):
MT = a + b log2(A/W + 1) (4)
Equation 4 is preferred because it
(a) provides a slightly better fit with observations,
(b) exactly mimics the information theorem underlying Fitts' law,
and
(c) always gives a positive rating for ID.

We suggest two ways to correct this. The first is to use the Shannon formulation for ID, which always yields a positive rating for ID. A second and additional strategy is to substitute for W a measure more consistent with the 2D nature of the task. In Figure 2 the inherent 1D nature of the model is maintained by measuring W along the approach axis. Shown as W' in the figure, we call this the "W' model". The W' model is appealing because it allows a 1D interpretation of a 2D task, thus maintaining the theoretical premise of the law.
Another possible substitution for target width is "the smaller of W or H". This pragmatic approach has intuitive appeal in that the smaller of the two dimensions seems more indicative of the accuracy demands of the task. We call this the "SMALLER-OF" model.
We conducted an experiment to test the different models for target width on a standard 2D target selection task using a mouse (MacKenzie and Buxton, 1992). The design employed a balanced range of short-and-wide and tall-and-narrow targets approached from various angles. The results indicated that both the SMALLER-OF and W' models are empirically superior to the STATUS QUO model and that the difference between the SMALLER-OF and W' models is insignificant. The model with the highest correlation was
MT = 230 + 166 log2(A/W + 1) (5)
where W equaled the smaller of W or H (SMALLER-OF
model). Equation 5 had a correlation of r = .950 and a standard
error of estimate of SE = 63 ms. The latter measure is important in
establishing confidence intervals for subsequent applications of Fitts' law
models as engineering tools.For further evidence, we need only examine the observations of Gillan et al. (1990, 1992), who used conditions of W = 0.25, 1.0, 3.5, and 6.0 cm with H held constant at 0.5 cm (the height of a character). The targets were words or phrases of length 1, 5, 14, or 26 characters. The observed selection time decreased from the 1-character to the 5-character conditions for each amplitude condition (as expected for both models); however, MT remained the same across the 5-, 14-, and 26-character conditions. The latter effect, although not accounted for by the STATUS QUO model, is fully expected with the SMALLER-OF model because target height was constant and consistently smaller than target width.

If the angles of movement change or if the text block covers several lines, the approach angles (and W') change somewhat, but the two-dimensional extensions discussed above still apply (Figure 4). The model is applied exactly the same for other point-drag sequences, such as pull-down menus or scroll bars.

The studies by Gillan et al. (1990, 1992) and MacKenzie et al. (1991) are the only existing applications of Fitts' law to dragging tasks. Gillan et al. (1990, 1992) tested Fitts' law in point-select and point-drag-select tasks. They concluded that "dragging time in a point-drag sequence is under control of two features of a computer display: the dragging distance and the height of the text object" (1990, p. 232). Two models were compared: one substituting the constant 0.5 cm for target width and another substituting target height, H. A higher correlation was found in the latter case, and this led to the conclusion above. We are suspicious of the "target height" model because character width was positively correlated with character height. It is felt that Gillan et al. (1990) inadvertently confirmed the strength of the STATUS QUO model for one-dimensional tasks. In a subsequent paper, they concluded that "dragging time was affected by both dragging distance and the font size of the text object" (1992, p. 306). This is a more reasonable conclusion, however it is not generalizable to dragging tasks with arbitrary targets such as scroll bars or menus.
In the only other test of Fitts' law in dragging tasks, MacKenzie et al. (1991) tested serial pointing and dragging and found a slightly less efficient rate of information processing during dragging than during pointing (3.0 bits/s vs. 4.2 bits/s). A serial task similar to Fitts (1954) was employed, so the models of are restricted practical use. However, since dragging immediately follows pointing in the point-drag sequence, the serial dragging model may be appropriate in this limited case. The mouse-dragging model was
MT = 135 + 249 log2(A/W + 1) (6)
with r = .992 and SE = 38 ms. Equation 6 for dragging,
and the pointing model presented earlier (Equation 5), will be tested
later against the models from the experiment described in the next
section.
In this section we describe an experiment using a point-select (State 1) task followed by a drag-select (State 2) task. It is claimed that the effect is that of two Fitts' law tasks in sequence. Two prediction equations should apply, reflecting the inherent information processing capacities in each task.
For each trial, a small circle appeared near the centre of the CRT display, and a target, in the form of a horizontal scroll bar, appeared elsewhere (see Figure 5). Subjects were instructed to manipulate the mouse to move the cursor inside the circle, then wait for a visual cue before beginning. The cue was a small black rectangular bar which appeared on the left of the screen (see Figure 5) and slowly expanded in size for about 1 second. After the bar stabilized, a move could begin. Subjects could take as long as necessary to prepare for each move, but were told to move as quickly and accurately as possible once the cursor left the circle. The graduating cue prevented them from treating the experiment as a reaction time task as its end point, the start signal, was not well defined.
Timing began when the cursor left the circle. The task was a point-select action followed immediately by a drag-select action. For the point-select action, subjects acquired the left rectangle in the horizontal bar (at the top in Figure 5). For the drag-select action subjects dragged the rectangle horizontally and deposited it in the right rectangle. The procedure resembled the operation of a horizontal scroll bar on the Macintosh though no scrolling resulted.
The point-select task was timed from the cursor leaving to the start circle to the button-down action in the left rectangle of the horizontal bar. The drag-select task was timed from the button-down action terminating the point-select task to the button-up action where the rectangle was dropped on the right. A pointing error was recorded if the button-down action was outside the left rectangle. A dragging error occurred if the two rectangles did not overlap when the button-up action occurred.

Only 102 of 600 possible cells were used to keep the experiment manageable and to exhaust a wide and relevant range of test conditions. Thirty-four distance/size conditions (see Table 1) were crossed with the three approach angles. Drag amplitudes were selected in power-of-four increments starting at 2 x W. This provided at least W units of separation between the pick-up and drop regions for dragging.
==========================================================================
Point Amplitudea Drag Amplitudea
------------------ ------------------
Width Height 2 4 8 16 32 2 4 8 16 32 Combinations
--------------------------------------------------------------------------
1 2 x . x . x x . x . . 6
2 2 x . x . x . x . x . 6
4 2 . x . x . . . x . x 4
8 2 . . x . x . . . x . 2
1 4 . x . x . x . x . x 6
2 4 . x . x . . x . x . 4
4 4 . x . x . . . x . x 4
8 4 . . x . x . . . x . 2
Total 34
===========================================================================
a x = used; . = not used (Note: point and drag conditions crossed)
The mean time to complete moves was 633 ms (SD = 213 ms) for the pointing phase of tasks followed by 827 ms (SD = 226 ms) for the dragging phase. Mean error rates were 1.8% (SD = 3.7%) for pointing and 4.2% (SD = 6.0%) for dragging.
Although 102 unique conditions were tested (see Table 1), the data were aggregated by conditions unique to each of the pointing and dragging phases of the tasks. Aggregating by point-amplitude, width, height, and angle left 54 conditions for the pointing analysis. Aggregating by drag-amplitude, width, and height left 15 conditions for the dragging analysis. Regressing MTP (ms) on ID, where ID = log2(A/SOWH + 1), yielded
MTP = 177 + 169 ID (7)
with r = .9637 (p < .001) and SE = 54 ms.[1] Regressing MTD (ms) on ID,
where ID = log2(A/W + 1), yielded
MTD = 345 + 198 ID (8)
with r = .9711 (p < .001) and SE = 54 ms. The high
correlations in these analyses are, in themselves, support for the hypothesis
that point-drag-select tasks can be modeled as two separate Fitts' law tasks,
each with its own prediction equation.
A scatter plot of points is shown in Figure 6 for pointing and in Figure 7 for dragging. Each point plotted was derived from the mean of more than 300 observations. The dashed lines apply to the present experiment, with the range of conditions delimited on the left and right, and the 95% confidence intervals (on observed points) delimited on the top and bottom. The solid lines show the 95% confidence intervals predicted from the earlier models.


The use of a model derived from a serial dragging task (Equation 6) may be inappropriate for the dragging phase of a discrete point-drag-select task. It is evident in Figure 7 that Equations 6 and 8 have very different intercepts and slopes. The regression coefficients and standard errors (SEs) for the predictions equations and for each regression coefficient are summarized in Table 2 for the four equations in question.
==============================================================================
Regression Coefficients
------------------------------------------------------
SE Intercept, SE Slope, SE IP(bits
Equation ra (ms) a (ms) (ms) b (ms/bit) (ms/bit) (bits/s)b
------------------------------------------------------------------------------
**** Pointing ****
5 .9501 64 230 21 166 6.2 6.0
6 .9637 54 177 19 169 6.5 5.9
**** Dragging ****
7 .9921 38 135 36 198 13.0 5.1
8 .9711 54 345 36 198 13.0 5.1
==============================================================================
a p < .001
b IP = 1/b
The two dragging equations in Table 2 are quite different. Using Equation 8 as the reference, the slope is 3.9 SEs higher and the intercept 5.8 SEs lower for Equation 6. There are several possible sources of this disparity. First, the models are from experiments conducted separately using different subjects. Second, the range of conditions was different. IDs ranged from 1 to 6 bits in MacKenzie et al. (1991) and from 1.8 to 5 bits in the present experiment. Finally, the tasks were different. Equation 8 was derived from a serial dragging task, but was applied to data for the dragging phase of a discrete point-drag-select task.
We expect even greater disparities if the derived models were applied to State 1 and State 2 actions in real applications on interactive graphics systems, where an assortment of user actions arise. Tasks such as selecting words or blocks of text in a word processing environment, acquiring and manipulating icons, or selecting an item in a pull-down menu would severely test the generality of a model built in a research environment. As engineering tools, designers are cautioned not to rely on establish Fitts' law models to provide accurate predictions unless the device and task conditions in the new interface closely match those from the original research.
Correlations for the STATUS QUO, SMALLER-OF, and target height models respectively were .9711 (p < .001), .9688 (p < .001), and .6403 (p < .005). The poor showing of the target height model was fully anticipated. Since target height is measured perpendicular to the line of approach (in left-to-right dragging tasks), there is no reasonable basis for it to serve as target width in the model. Target height would have only a slight effect on movement time, since motion was one-dimensional along the horizontal axis. Therefore, the top ranking for the STATUS QUO model (which is the same as the W' model in this case) was not surprising.
This paper has demonstrated that the point-drag sequence common on interactive systems with a graphical user interface can be modeled as two separate Fitts' law tasks -- a point-select task followed by a drag-select task. Prediction models with high correlations and low standard errors were developed; however, when compared with models from previous research using similar tasks, the predictions were not as close as the standard errors implied. Based on this, we conclude that caution must be exercised in taking models built in a research setting and applying them subsequently on real systems: A model with a very high correlation may not stand-up to subsequent predictions in different settings.
The present attempt to apply derived models to subsequent tasks illustrates the difficulty in adopting models such as Fitts' law to practical problems in interface design. Meeting the usual statistical tests for validity seems easy in comparison to the challenges in applying the model later. Expectations must be kept low. A statistically sound model will be accompanied by a small standard error of estimate; but confidence intervals will not be met later unless the model was derived under conditions very similar to the application.
A problem in applying Fitts' law in two dimensional tasks is in choosing an appropriate "target width" to substitute as "W" in the calculation of task difficulty. The claim of Gillan et al. (1990) that target height is the appropriate substitute for target width in dragging tasks was not supported. The experiment described herein varied the dragging target's width and height independently. Both the STATUS QUO model and the SMALLER-OF model outperformed the target height model. We conclude that the appropriate substitute for target width in two-dimensional pointing or dragging tasks is either the smaller of the target's width or height (SMALLER-OF model), or the width of the target along the angle of approach (W' model).
This research was supported by the Natural Sciences and Engineering Research Council of Canada, Xerox Palo Alto Research Center, Digital Equipment Corp., and Apple Computer Inc. We gratefully acknowledge this contribution, without which, this work would not have been possible.
Boritz, J., Booth, K. S., & Cowan, W. B. (1991). Fitts's law studies of directional mouse movement. Proceedings of Graphics Interface '91, 216-223. Toronto, Ontario: Canadian Information Processing Society.
Buxton, W. (1990). A three-state model of graphical input. In D. Diaper et al. (Eds.), Human-Computer Interaction -- INTERACT '90, 449-456. Amsterdam: Elsevier.
Card, S. K., Moran, T. P., & Newell, A. (1983). The psychology of human-computer interaction. Hillsdale, NJ: Erlbaum.
Fitts, P. M. (1954). The information capacity of the human motor system in controlling the amplitude of movement. Journal of Experimental Psychology, 47, 381-391.
Fitts, P. M., & Peterson, J. R. (1964). Information capacity of discrete motor responses. Journal of Experimental Psychology, 67, 103-112.
Gillan, D. J., Holden, K., Adam, S., Rudisill, M., & Magee, L. (1990). How does Fitts' law fit pointing and dragging? Proceedings of the CHI '90 Conference on Human Factors in Computing Systems, 227-234. New York: ACM.
Gillan, D. J., Holden, K., Adam, S., Rudisill, M., & Magee, L. (1992). How should Fitts' law be applied to human-computer interaction. Interacting with Computers, 4, 291-313.
MacKenzie, I. S. (1989). A note on the information-theoretic basis for Fitts' law. Journal of Motor Behavior, 21, 323-330.
MacKenzie, I. S. (1992). Fitts' law as a research and design tool in human-computer interaction. Human-Computer Interaction, 7, 91-139.
MacKenzie, I. S., & Buxton, W. (1992). Extending Fitts' law to two-dimensional tasks. Proceedings of the CHI '92 Conference on Human Factors in Computing Systems, 219-226. New York: ACM.
MacKenzie, I. S., Sellen, A., & Buxton, W. (1992). A comparison of input devices in elemental pointing and dragging tasks. Proceedings of the CHI '91 Conference on Human Factors and Computing Systems, 161-166. New York: ACM.
Meyer, D. E., Smith, J. E. K., Kornblum, S., Abrams, R. A., & Wright, C. E. (1990). Speed-accuracy operating tradeoffs in aimed movements: Toward a theory of rapid voluntary action. In M. Jeannerod (Ed.), Attention and performance XIII (pp. 173-226). Hillsdale, NJ: Erlbaum.
Welford, A. T. (1968). Fundamentals of skill. London: Methuen.