SHORE 2001 Logo
SHORE 2001 Logo University of Maryland Logo
Student HCI Online Research Experiments
Abstract
Introduction
Experiment
Results
Discussion
Conclusions
Acknowledgements
References
Appendices
Credits
Feedback
Presentation Slides
SHORE 2001 : Layout and Readability : Visualization of Shallow Trees with Nodal Attributes using Fisheye Table, Table Lens, and Treemap

Visualization of Shallow Trees with Nodal Attributes using Fisheye Table, Table Lens, and Treemap

Authors

Fred Bernal

fredric@glue.umd.edu

Steve Betten

sbetten@wam.umd.edu

Chris Horn

chorn@wam.umd.edu

Abstract

As computers become an integral part of businesses' and consumers' daily lives, the ability to handle large amounts of data has become mandatory. Thus, data visualization techniques have been devised to help users view datasets in their entirety, allowing them to explore and analyze their data in ways that are not possible by traditional database management systems. This experiment compares three information visualization techniques: Fisheye Table, Table Lens, and Treemap. The goal of the experiment was to determine which technique allows users to answer certain questions in the least time with the least errors and with the most satisfaction. Experimentation with 18 subjects produced some statistically significant results. The statistical analysis confirmed the advantages of using sorted tabular visualizations for some types of tasks but did not support the claim that using filters would reduce subjects' times to correct completion of certain tasks. The analysis also did not support the claim that Fisheye Table's continuous fisheye view promotes faster subject performance than with Table Lens' discrete fisheye view.

Introduction

Currently, many people use spreadsheet programs at work and at home. Sometimes the spreadsheet data represents hierarchical (or tree) information. When data sets become too large to effectively extract information such as trends and correlations from them, then suitable visualization techniques must be found.

Over the years, there have been several approaches to obtaining optimal views of spreadsheet data. This experiment studies three applications that use different data visualization techniques. One approach is the Fisheye Tables from the University of Maryland Human Computer Interaction Laboratory (HCIL). Another approach is Table Lens from Xerox Palo Alto Research Center, which is now sold as Eureka from Insight Eureka. A third approach, also from the University of Maryland HCIL, is Treemap. The goal of the experiment was to determine which approach allows users to answer certain questions in the least time with the least errors and with the most satisfaction. Experimentation with 18 subjects produced some statistically significant results.

Visualization Applications

Fisheye Tables, Table Lens, and Treemap are similar in that they display an entire dataset to the user in one screen and that the data they use is tabular (in the case of Treemap, values from some of the table columns indicate each item's position in the hierarchy).

The Fisheye Table application uses the Fisheye data visualization technique to view tabular data. Data fields are organized into columns, where the user can sort by a single data field at a time, with the click of a mouse. To view the data on one screen, the Fisheye technique keeps a portion of rows in focus, while keeping the edges of the dataset in view, but out of focus to the user. Using the minimal movement of a mouse, the user can scroll up and down the rows to bring the edges in focus, and can easily scan the entire dataset. Each row represents a leaf node of the data set hierarchy. data that Treemap maintains. A snapshot of Fisheye Table is below.

Eureka embodies the Table Lens data visualization technique to view tabular data. Like the Fisheye technique, the Table Lens technique arranges data in tabular form, where columns can be sorted with a single mouse click. However, Table Lens initially gives the user a view of the entire dataset in compressed form, where individual data values are not viewed and scrolling is not necessary. Eureka accomplishes this by using colors to discriminate non-numeric data fields, and using variable length bars to discriminate numeric data fields. The resulting display allows the user to recognize patterns and trends among the fields instantly. To isolate specific data, Eureka also allows the user to filter the dataset based on any column's data values, and also uncompress and focus the rows that are of interest. Users have the option of anchoring columns, thus allowing them to sort other columns based on anchored columns' positions. A snapshot of Eureka is below.

The program Treemap embodies the Treemap data visualization technique to view hierarchical data. This technique visually maintains the hierarchical view of the data. Since the entire data set is viewed, the user has a powerful, fast, and efficient way to navigate through the data set, as well as the ability to recognize patterns and trends that may exist among the hierarchical levels. Treemap also utilizes filtering, sizing, and coloring features to allow the user to discriminate and isolate specific data easily. One popular commercial application of the Treemap technique is SmartMoney.com's Map of the Market. A snapshot of Treemap is below.

Review of Previous Experiments

Researchers have considered the problem of visualizing tabular data sets that have more data entries than can fit on the screen using a spreadsheet program. One solution is fisheye views, which give the user the ability to focus on some rows of the table while preserving the context of the entire table by presenting the unfocused rows in smaller scale [1]. One program that uses this approach is Table Lens from the Xerox Palo Alto Research Center [5]. Another such program is Fisheye Table from the University of Maryland Human Computer Interaction Laboratory (HCIL). Fisheye programs allow users to investigate large tabular data sets more effectively than they would with a spreadsheet program [5, 10]. The Table Lens graphically differs from the Fisheye Table in that the Table Lens displays all the unfocused columns at the same compressed height whereas the heights of unfocused rows in the Fisheye Table taper off in proportion to their distances from the focused rows.

If the tabular data represents a hierarchy, then another solution is to display the tabular data as a hierarchy of nested rectangles in which clicking each leaf node reveals the attributes that the hierarchy does not convey; this is the approach of the Treemap from the University of Maryland HCIL [7]. The Treemap can show entire large data sets in one screen, instead of requiring a fisheye focus+context compromise. The Treemap program also allows users to encode attribute values using rectangle sizes and colors.

Another consideration in visualizing large tabular data sets is how user interaction can maximize the amount of relevant information on the screen. One interactive function is sorting. Sorting allows the user to cluster entries that are related by a given attribute so that it is easier to compare their values for other attributes. Another interactive function is filtering. Filtering lets users eliminate from view the entries that are irrelevant to the task at hand, leading to faster task completion [5, 6]. More rapid, incremental, and reversible interactive functions promote better user performance and satisfaction [11]. The Table Lens filters are inadequate according to that rule; they are slow and difficult to reverse, because the user has to open a dialog box and click a command button to apply each filter, resulting in a new window containing the filtered data. In contrast, Treemap filters are rapid and reversible, because the display updates for every incremental filter change that the user applies via on-screen range scroll bars.

An unanswered interaction question is whether the continuous taper of row heights in the Fisheye Table encourages users to preview unfocused rows in a way that increases task completion speed in a way that the compressed rows of the Table Lens do not promote.

Relevant Psychological Theories

Many factors influence how quickly individuals can interpret data presented visually. Individual differences such as experience, intelligence, physical ability play a role. Environmental conditions such as display quality, lighting, ergonomics, distractions impact cognition as well. Holding those and other related factors constant, what remains is how well a program's features promote data exploration how well a program's data representations aid cognition. The designers of the three products examined in this experiment have aided data conceptualization in many ways. This section focuses on aids of perceptual redundancy, focus+context, and dynamic queries.

Sebrechts describes perceptual redundancy as "two different dimensions,...both represent[ing]...a common property" [12]. In Treemap, perceptual redundancy occurs when users set the size by and color by legends to the same property in a query. The sought after result is perceived with greater ease as both the largest and most brightly colored rectangle on the screen. This redundancy makes it possible for a user to finger an item faster than one would have had using only a single encoding. Redundancy is provided in Table Lens by allowing the user to both view the relative lengths of bars in a chart and sort them as well. The result allows one to more quickly perceive maxima and minima because of both their relative lengths in a column and their relative positions at the extremes of a column. Similarly, in Fisheye Tables, one can sort a column so that one can more quickly perceive maxima and minima based on its position at the extremes of a column.

Treemap and Table Lens present data graphically so that the user can grasp at a glance quantitative elements and relationships. For example, a user may identify correlations from graphical representations of datasets. In Treemaps, if one were to color nodes by flow and size nodes by severity of problems it may become visually apparent to a user that output value drops below a certain amount only when the number of problems rise above some value. In Table Lens and Fisheye Tables, users can identify "cross variable correlation" when they sort a column and thereby the values of another column become mostly sorted as well. In addition, statistical distributions become apparent if one knows how to recognize shape and skew [14].

An outstanding feature for facilitating cognition in Table Lens and Fisheye Table is focus+context, i.e. the ability to focus on more pertinent information while keeping less pertinent information on the screen solely to provide context. In Treemaps, cognition is aided with dynamic queries, the ability to rapidly, reversibly, and incrementally apply and view the results of filtering operations. One benefit of these features is that the user can adjust the breadth and depth of data representation to aid comprehension and exploration. Given that "cognitive research has shown us that people have trouble comprehending even simple displays and understanding relatively simple data" it stands to reason that dynamic queries and focus+context, which simplify complex displays will reduce users' cognitive load and promote understanding [13].

In Treemap, users can directly manipulate and simplify the display without losing context by filtering with double sliders. Such dynamic queries focus users' attention on the results by graying out nodes that fail any filter. Treemap preserves context by displaying all nodes in the data set. Users tend to maintain their orientation with respect to the data and the program's interface when context is preserved.

Examples of focus+context are as follows. In Table Lens, at any time, each row is either in or out of focus. Rows that are in focus are taller and display details in text. Unfocused rows are too short to contain text and instead summarize details with bars of appropriate color, length, and position. In Fisheye Table, only the rows near the cursor are displayed in detail. Rows far above and far below the cursor are visible but the text they contain is illegible.

Experiment

Introduction and Hypotheses

This experiment compares the effectiveness and satisfaction of people using several different techniques for visualizing 100 units that form a shallow, fixed-depth tree; each unit possesses several numerical attributes. The experiment varies a single independent variable (visualization technique) with three treatments: Fisheye Table, Table Lens, and Treemap. Each subject answered the same four questions about the data using each treatment. Below is a diagram illustrating part of a typical hierarchy of the data. The dependent variables are time to correctly answer of the question, the number of incorrect answers given before the correct answer, and subjective satisfaction.

Previous studies suggest the following predictions:

  1. Sorted table column v. size- or color-coded 2-D space
    For data sets containing about 100 units, usually subjects can sort a table by a column then search the column for a specified value faster than they can scan an unsorted 2D space for a unit possessing that value, even if the value is size- or color- encoded. (However, scanning the 2D size- or color-encoded space might be faster for orders-of-magnitude larger data sets, especially if the differences in size and color are large enough.)

  2. Filtered v. unfiltered
    Given a sorted table column, subjects take less time to filter out irrelevant rows (using a program feature) then find a specified value by visually searching the remaining rows than they take to find a specified value by visually searching the rows without filtering them first.

A prediction not based on previous research is that, given a sorted table column, subjects take less time to find a specific value in the column using continuous fisheye techniques (such as the Fisheye Table) than they do using discrete fisheye techniques (such as the Table Lens), because, with continuous fisheye techniques, users can see more data in nearby unfocused rows.

Combining these predictions leads to our hypotheses. In the hypotheses, the concept of better performance means less time to correct completion and less or equal number of incorrect answers. Each question tested a different aspect of the treatments in an attempt to clarify the strengths and weaknesses of each treatment. The hypotheses are:

  1. On question 1, which requires determining which unit possesses the maximum value for an attribute, subjects will perform best using the Table Lens and the Fisheye Table techniques (because they let the user sort by column). Users will not perform as well using the Treemap technique (because its color- and size-encoding will not help as much as sorting).

  2. On question 2, which requires determining how many units meet range restrictions for multiple attributes, subjects will perform best using the Treemap technique (because of its rapid, incremental, and reversible filters). The next most efficient treatment will be the Table Lens technique (because it also has filters, but they are less rapid and less reversible). Subjects will perform least well using Fisheye Table technique (because it lacks filters).

  3. On question 3, which requires determining the unit that possesses the minimum value for an attribute out of a set of units that meet a value range restriction for another attribute, subjects will perform best using the Table Lens technique (because it provides filtering and sorting). The next most efficient treatment will be the Treemap technique (because it provides filtering but not sorting). Subjects will perform least well using the Fisheye Table technique (because it has no filtering).

  4. On question 4, which requires repeatedly finding units that possess specified values for an attribute, subjects will perform best using the Fisheye Table technique (because it provides sorting and continuous fisheye layouts). The next most efficient treatment will be the Table Lens technique (because it provides sorting and discrete fisheye layouts). Subjects will perform least well using the Treemap technique (which lacks sorting but provides a hierarchical view that might simplify this task).

  5. Subjective ratings of satisfaction will rank the treatments in this order with statistically significant differences: Table Lens, Treemap, Fisheye (because that is the ranking of the treatments by decreasing number of features that the interface provides).

  6. Subjective ratings of ease of use will not contain any statistically significant differences between the treatments since all the programs provide straightforward interfaces to the features they provide and to the styles of interaction that they promote.

Pilot Study Results

The objectives of the pilot study were to ensure that subjects could understand the training and could answer each question in a reasonable amount of time and also to determine how much time to allot for each subject to complete the experiment. Running the pilot tests also provided the experimenters with practice running the experiment. Three subjects participated in the pilot study. Originally, the pilot study included five questions. The most drastic pilot study result was that two of the questions were extremely difficult for subjects to complete in a reasonable amount of time. One of these questions required too much scrolling with two of the treatments. Restricting the number of units that this question involved made it manageable. Another question required users to sum values while scrolling, which proved too difficult a task for all the pilot subjects. The question, which required subjects to sum attribute values for groups of units, did not seem to be worth the frustration it caused the subjects, so it was eliminated from the experiment, leaving four questions. The pilot subjects commented on certain weaknesses in the training materials. Revisions of the experiment materials included suggestions from pilot subjects, such as explaining how to reset each visualization to its original settings in order to start each question with a clean slate. Other suggestions included spending more time on how each visualization presented the hierarchical aspects of the data. The maximum time that any pilot subject took for training was 15 minutes, and the maximum time that any pilot subject took for answering the questions for all three treatments was 15 minutes. Taking into account training and testing time and about 5 minutes for miscellaneous activities, such as filling out consent forms or questionnaires, gave a total experiment time of 35 minutes.

Subjects

Eighteen subjects took part in the experiment. All subjects had at least one year of experience using a window manager such as Microsoft Windows 95 or Apple MacOS.

Materials

Before performing any tasks, subjects filled out a three-question background questionnaire that asked for each subject's sex, color-blindness (if any), and years of experience with several similar window managers (such as Microsoft Windows 95 or Apple MacOS). The training materials for the experiment included an overview of the data (i.e., the structure of the hierarchy and the types of numerical attributes that each unit had), as well as an introduction to the features of each of the three visualization programs. Some of these program introductions demonstrated program features via short walkthroughs of how to solve simple questions using them. The four questions for the tasks were on a single sheet, and subjects could determine the answer to each question using any of the features of each visualization program that the training materials described. After completion of the tasks, each user filled out a brief subjective satisfaction form. The training questionnaires, materials, and questions are in the appendices.

Procedures

The experiment was counterbalanced within subjects by having each subject answer the four questions using all three visualization techniques. To prevent subjects from memorizing answers, the answers were not necessarily the same for each treatment. To counterbalance the effects of subjects' familiarity with the questions on the second two treatments, an equal number of subjects followed each of the six permutations of the treatments.

Each run of the experiment went as follows: First, subjects read and signed a consent form. Then they filled out the background questionnaire. Next, the experimenters explained the structure of the data that all the programs would be visualizing. Then for each treatment the following occurred: the experimenter demonstrated the features of the program, then gave subjects up to five minutes to familiarize themselves with the program and to ask questions to clarify how the program worked. Next, subjects answered each question in the following way: first, subjects read the question and told experimenters when they understood it (or asked clarifying questions until they understood it); then the experimenters started hand-held timers and observed how the subjects answered the question until they verbally provided the correct answer to the question, at which point the experimenters would record the time to correct completion. After subjects completed every question for every treatment, they filled out a subjective satisfaction questionnaire.

Each experimenter ran the experiment with six subjects, one with each permutation of the treatments.

Sample Solutions

Sample solutions to the questions, including screenshots at each stage are here.

Results

This experiment collected information on the time it took 18 subjects to complete 4 tasks, their error rates, and their responses to two subjective questions. The primary measure of performance was the time it took subjects to correctly solve questions using three different tools. Microsoft Excel was used to analyze the data. For each task and subjective question, a one-way ANOVA was generated. If the analysis indicated that within the three treatments there was a statistically significant result with respect to an alpha value of 0.05 then a paired t-test was employed. An analysis of the paired t-test indicated which pairs of treatments supported the statistically significant result. The appendices include the raw data.

Performance Time

For Task 1, a single factor analysis of variance (ANOVA) on performance time showed that there was a main effect of tool, F(2, 51) = 7.68, p = 0.00 at alpha = 0.05. Paired t-tests showed that the main effect favored both Table Lens and Fisheye over Treemap as had been hypothesized. The results were statistically significant for the pairs Fisheye and Treemap, t(17) = 3.41, p = 0.00, and Table Lens and Treemap, t(17) = 3.37, p = 0.00. There was no statistically significant difference between Fisheye and Table Lens as was expected, t(17) = 0.47, p = 0.64. A comparison of mean values with standard deviation bars for Task 1 is displayed below. The appendices include analysis tables for Task 1 performance time.

Comparison of Mean Time for Task 1 (n=18).

For Task 2, a single factor analysis of variance (ANOVA) on performance time showed that there was no main effect of tool, F(2, 51) = 2.11, p = 0.13 at alpha = 0.05. Paired t-tests were not needed since the results were not statistically significant. A comparison of mean values with standard deviation bars for Task 2 is displayed below. The appendices include analysis tables for Task 2 performance time.

Comparison of Mean Time for Task 2 (n=18).

For Task 3, a single factor analysis of variance (ANOVA) on performance time showed that there was no main effect of tool, F(2, 51) = 0.44, p = 0.65 at alpha = 0.05. Paired t-tests were not needed since the results were not statistically significant. A comparison of mean values with standard deviation bars for Task 3 is displayed below. The appendices include analysis tables for Task 3 performance time.

Comparison of Mean Time for Task 3 (n=18).

For Task 4, a single factor analysis of variance (ANOVA) on performance time showed that there was no main effect of tool, F(2, 51) = 2.31, p = 0.11 at alpha = 0.05. Paired t-tests were not needed since the results were not statistically significant. A comparison of mean values with standard deviation bars for Task 4 is displayed below. The appendices include analysis tables for Task 4 performance time.

Comparison of Mean Time for Task 4 (n=18).

Error Rate

Out of 216 possible data points only 13 errors occurred. A single factor ANOVA on the error rate for each task showed that the number of errors in general is not a function of any treatment. However, enough errors occurred in Tasks 2 and 4 in a single treatment to barely make for a statistically significant result. Task 1, f(2, 51) = 2.13, p = 0.13; Task 2, f(2, 51) = 3.40, p = 0.04; Task 3, f(2, 51) = 1.55, p = 0.22; Task 4, f(2, 51) = 3.40, p = 0.04. The appendices include error analysis tables for Task 1, Task 2, Task 3, and Task 4. Also see the four figures below.

In Task 2, t-tests show that an error is just barely likely to occur in Table Lens over the other treatments, t(17) = 3.40, p = 0.04 when using the one-tailed result. When using a two-tailed result, there is no statistically significant difference with respect to Table Lens and the other two treatments, t(17) = 3.40, p = 0.08. See the associated figure below. The appendices include analysis tables for Task 2 error.

In Task 4, t-tests show that an error is just barely likely to occur in Treemap over the other treatments, t(17) = 1.84, p = 0.04 when using the one-tailed result. When using a two-tailed result, there is no statistically significant difference with respect to Treemap and the other two treatments, t(17) = 1.84, p = 0.08. See the associated figure below. The appendices include analysis tables for Task 4 error.

Comparison of Mean Errors for Task 1 (n=18).

Comparison of Mean Errors for Task 2 (n=18).

Comparison of Mean Errors for Task 3 (n=18).

Comparison of Mean Erorrs for Task 4 (n=18).

Subjective Questions

Were users more satisfied with any one treatment? A single factor analysis of variance (ANOVA) on overall satisfaction showed that there was a main effect of tool, F(2, 51) = 8.41, p = 0.00 at alpha = 0.05. Paired t-tests showed that the main effect favored both Table Lens and Treemap over Fisheye. The results were statistically significant for the pairs Table Lens and Fisheye, t(17) = 4.28, p = 0.00, and Treemap and Fisheye, t(17) = 3.57, p = 0.00. There was no statistically significant difference between Treemap and Table Lens, t(17) = 1.44, p = 0.17. A comparison of mean values with standard deviation bars for User Satisfaction is diplayed below. The appendices include analysis tables for Subjective Satisfaction.

Comparison of Mean Subjective Satisfaction (n=18).

Were the treatments equally easy to use? A single factor analysis of variance (ANOVA) on Ease of Use showed that all tools were on an equal footing, F(2, 51) = 1.09, p = 0.34 at alpha = 0.05. Paired t-tests were not needed since the results were not statistically significant. A comparison of mean values with standard deviation bars for Ease of Use is displayed in the figure below. The appendices include analysis tables for Subjective Ease of Use.

Comparison of Mean Subjbective Ease of Use (n=18).

Summary

 

Hypothesis

Supported

1

On question 1, which requires determining which unit possesses the maximum value for an attribute, subjects will perform best using the Table Lens and the Fisheye Table techniques (because they let the user sort by column). Users will not perform as well using the Treemap technique (because its color- and size-encoding will not help as much as sorting).

Yes

2

On question 2, which requires determining how many units meet range restrictions for multiple attributes, subjects will perform best using the Treemap technique (because of its rapid, incremental, and reversible filters). The next most efficient treatment will be the Table Lens technique (because it also has filters, but they are less rapid and less reversible). Subjects will perform least well using Fisheye Table technique (because it lacks filters).

No

3

On question 3, which requires determining the unit that possesses the minimum value for an attribute out of a set of units that meet a value range restriction for another attribute, subjects will perform best using the Table Lens technique (because it provides filtering and sorting). The next most efficient treatment will be the Treemap technique (because it provides filtering but not sorting). Subjects will perform least well using the Fisheye Table technique (because it has no filtering).

No

4

On question 4, which requires repeatedly finding units that possess specified values for an attribute, subjects will perform best using the Fisheye Table technique (because it provides sorting and continuous fisheye layouts). The next most efficient treatment will be the Table Lens technique (because it provides sorting and discrete fisheye layouts). Subjects will perform least well using the Treemap technique (which lacks sorting but provides a hierarchical view that might simplify this task).

No

5

Subjective ratings of satisfaction will rank the treatments in this order with statistically significant differences: Table Lens, Treemap, Fisheye (because that is the ranking of the treatments by decreasing number of features that the interface provides).

Partially*

6

Subjective ratings of ease of use will not contain any statistically significant differences between the treatments since all the programs provide straightforward interfaces to the features they provide and to the styles of interaction that they promote.

Yes

* The term "partially" requires clarification: Hypothesis 5 is actually two hypotheses: (5.1) Table Lens will rank statistically significantly higher than Treemap, and (5.2) Treemap will rank statistically significantly higher than Fisheye Table. Statistical analysis rejects hypothesis 5.1 but affirms hypothesis 5.2. In place of hypothesis 5.1, statistical analysis supports the following statement: Table Lens will rank equally with Treemap (that is, there will be no statistically significant difference between their ranks).

Discussion

The statistics for question 1 confirm hypothesis 1. That is, subjects performed statistically significantly better using Fisheye Table and Table Lens than they did using Treemap; there was no statistically significant difference in subject performance between Fisheye Table and Table Lens. Thus, the statistical analysis supports the claim that, for data size of around 100 units, subjects find units faster by visually scanning sorted rows than by visually scanning a 2-dimensional space of color- and size-encodod nested rectangles.

For question 2, there is no statistical significance in subjects' time to correct completion between the tasks, but for Table Lens there were statistically significantly more incorrect answers provided than for the other two treatments. These results lead us to reject hypothesis 2. One possible explanation for the inability of the presence of filtering features to promote faster performance times in Table Lens and Treemap is that many subjects, even after five minutes of training with each treatment, did not feel comfortable enough to take advantage of the filtering abilities of the treatments, since filtering was the most cognitively complex feature of any of the treatments. Without the benefit of filters, neither Table Lens nor Treemaps has any significant advantage over Fisheye Table.

Also, specifically for Treemaps, there is an explanation for why subjects who did use filters did not perform statistically significantly better. The explanation comes from observation of subjects and from subjects' comments during the experiment; the boxes surrounding the individual filters range sliders have significantly more empty space in the middle of each one than there is between each one. Also the side borders of each range slider filter are only a few pixels from the edge of the control panel. It seems that because of these two facts the subjects often misjudged which range slider was associated with which attribute and would have to backtrack to determine the correct answers. Errors from misjudged filters did not seem to occur with Table Lens because each filter has its own dialogue box; however, the amount of time needed to apply Table Lens filters via dialogue boxes appears to have eliminated any benefit to applying them on the relatively small data sets of 100 items. Perhaps with orders-of-magnitude larger data sets, the benefits of filters would be more pronounced.

For question 3, the statistics do not show any statistically significant differences between the treatments in time to correct completion or in number of incorrect answers, leading to a rejection of hypothesis 3. The explanations from question 2 all probably apply to this question, except that in addition subjects who had difficultly using Treemap filters to answer question 2 tended to avoid using filters in question 3, thus nullifying the benefits of Treemap having filtering features.

Statistics for question 4 do not dictate any statistically significant difference in time to correct completion, leading to a rejection of hypothesis 4. That is, the experiment did not support the claim that Fisheye Table's continuous fisheye view promotes faster subject performance than with Table Lens' discrete fisheye view. One explanation for the statistically significant number of incorrect answers using Treemap comes from subjects; they complained about difficulty they had differentiating between county names because of the small serif font used and because the names were only one letter long.

The results of the statistical analysis partially confirm hypotheses 5; the ranking of the treatments by subjective satisfaction is statistically significant and, from greatest to least, is Table Lens and Treemap (tied at first place), then Fisheye Table (at third place). Thus, there appears to be a correlation between the amount of functionality in an interface and the overall user satisfaction; however, the difference in amount of functionality between Table Lens and Treemap are not as significant to subjects as hypothesized. This correlates with the fact that the differences in amount of functionality between Table Lens and Treemap is much less than the differences between them and Fisheye Table.

The results of the statistical analysis confirm hypothesis 6; subjects did not assign any statistically significant differences to the three treatments.

Conclusions

Impact for Practitioners

The most advantageous action that users of the Table Lens and Treemap information visualization techniques could take is to practice applying filters much more. If users of these two programs do so, they will probably see significant decrease in the amount of time it takes them to perform tasks that involve determining which items meet numerical range and other attribute value restrictions.

Suggestions for Software Developers

There is room for the developers of all three programs used in this experiment to improve their software.

Fisheye Table: Fisheye Table lacks many powerful features that the similar Table Lens possesses. One such feature is sorting in reverse order; just as in Table Lens, the program could allow the user to toggle the sort order by repeatedly clicking the appropriate column header. Several subjects complained about the time wasted scrolling to the bottom of the table rows to retrieve minimum values. Another feature to add to Fisheye Table, one which both Table Lens and Treemap possess, is filtering. With proper training and rapid enough filters, filtering can be a versatile and potent tool for the user.

Table Lens: After trying out Treemap filters, some subjects reported that Table Lens filters seemed awkward because they were not nearly as interactive as Treemap filters. Specifically, developers could improve the Table Lens filters by eliminating dialog-box access to filters in favor of more direct access to filters from the main window. This change would facilitate a second filters improvement: having the display update after every filter change instead of only after the user clicks the Apply button. A third filter improvement that would drastically increase the reversibility of Table Lens filters would be to not have each application of a filter open a new window with the remaining data; a much better approach is that of Treemaps, which applies all filters to the current window. In short, Table Lens would probably benefit greatly in terms of user performance if the developers made its filters more like Treemap interactive filters.

Treemap: Observations of subjects and comments from subjects dictate that user performance with Treemap filters would increase significantly from only minor graphic changes. More specifically, as the discussion section of this report suggests, adding some pixel spacing to all the outer sides of each filter would substantially decrease user performance times by minimizing or eliminating user mismatches between filter names and the range sliders for other filters.

Refinement of Theory and Suggestions for Future Researchers

The discussion section of this report explains how many of the explanations for why our hypotheses failed seem to revolve more around experiment design issues than about user interface issues. Rejecting the theories behind the hypotheses for which lack of foresight in experiment design seemed to play a part would be bad practice.

Thus, the only theory that this experiment's results seem to challenge is the claim that the Fisheye Table's continuous fisheye layout promotes faster user performance than Table Len's discrete fisheye layout. However, the number of categories in question that compared the two fisheye techniques was relatively small (less than a dozen values for County). For such a small number of categories, for Table Lens, many subjects used the awkward method of approximating which blips "D" and "G" were, then checking by holding the cursor over nearby blips until the desired County name appeared, in order to avoid the long times that Table Lens takes to zoom in on single rows. In contrast, the continues fisheye technique of fisheye tables allowed subjects to focus on the current row plus or minus about six rows and thereby to more accurately anticipate correct cursor movement and to move the cursor quickly to the desired County value. Possibly, the potential benefits of this row lookahead did not show relative to Table Lens because the small number of catagories did not accentuate the awkwardness and slowness of the technique that many subjects used with Table Lens. Thus, a suggestion for future research is to run an experiment with a task similar to Question 4 in this experiment, but to have many more values (categories) in the column that the subject is searching. Such an experiment might reveal that the Fisheye Table continuous layout is superior to the Table Lens discrete layout, at least for some highly interactive tasks.

Given the chance to re-run this experiment, the major change to make (besides having more categories for Question 4 as the previous paragraph describes) would be to restrict the features that subjects would be allowed to use to get the answer for each question. This enforced uniformity of methods (as opposed to the intended but unenforced uniformity of this experiment design) would yield results that would be much more trustworthy for revising theories about the different techniques used in answering the questions.