CHI '95 ProceedingsTopIndexes
VideosTOC

Exploring Large Tables with the Table Lens

Ramana Rao and Stuart K. Card

Xerox Palo Alto Research Center
3333 Coyote Hill Road
Palo Alto, CA
rao@parc.xerox.com
card@parc.xerox.com

© ACM

Abstract

The Table Lens is a new technique for visualizing and making sense of large tables. By fusing symbolic and graphical representations into a single manipulable focus+context display and providing a small set of interactive operations (e.g. sorting), the Table Lens supports navigating around a large data space easily isolating and investigating interesting features and patterns. This high-bandwidth interactivity enables an extremely powerful style of direct manipulation exploratory data analysis.

Keywords:

Information Visualization, Exploratory Data Analysis, Graphical Representations, Focus+Context Technique, Fisheye Technique, Tables, Spreadsheets, Relational Tables.

Introduction

The size of information set which users can coherently bring together on the display of an interactive computer system limits the complexity of problems that can be addressed. We have been exploring the application of interactive graphics and animation to visualizing and making sense of larger information sets than would otherwise be practical by other means. Our video illustrates the use of the Table Lens (Rao and Card, 1994), a novel focus+context technique for visualizing and manipulating large tables, in a number of data analysis scenarios.

FOCUS+CONTEXT TECHNIQUE

The Table Lens (Rao and Card, 1994) supports effective interaction with much larger tables than conventional spreadsheets do. A spreadsheet can display a maximum of 660 cells at once on a 19 inch display (at cell size of 100 by 15 pixels, 82dpi). The Table Lens can comfortably manage about 30 times as many cells and can display up to 100 times as many cells in support of many tasks. The scale advantage is obtained by using a so-called ``focus+context'' or ``fisheye'' technique. These techniques allow interaction with large information structures by dynamically distorting the spatial layout of the structure according to varying interest levels of the parts. Thus, they often support visualizing an entire information structure at once as well as zooming in on specific items. This interplay between focus and context supports searching for patterns in the big picture and fluidly investigating interesting details without losing framing context. A number of such techniques have been developed in the last ten years as referenced in Rao and Card, 1994.

The Table Lens technique has been motivated by the particular nature of tables. The most salient feature of a table is the regularity of its content: information along rows or columns is interrelated, and can be interpreted on some reading as a coherent set, e.g. members of a group or attributes of an object. This is reflected in the fact that tables usually have labels at row and column edges that identify some portion of the meaning of the items in the row or column. These observations indicated a need to preserve the coherence of rows and columns and their labels despite distortions to the table. Thus, the Table Lens mutates the layout of a table without bending any rows or columns as illustrated in Figure 1.

FIGURE 1: Focal warping of 10 by 14 table with a focus area of 3 by 2 cells.

Since rows and columns aren't bent by the warping, they can be scanned entirely by a single horizontal or vertical eye motion. Furthermore, this enables label display, multiple focal areas, and multiple focal levels. Multiple focus areas are important for a number of reasons including comparing distal areas of the table and maintaining focus on summary rows or columns while investigating other portions of the table. Multiple focal levels allows dealing with larger tables and opens up a promising new design space.

As can be seen in Figure 1, though cells are allocated spaces along each dimension independently, there is an interaction in cell geometry. In fact, four types of cell regions are created by the distortions on the two axis: focal, row focal, column focal, and nonfocal. Focal cells are in the focus area along both axes, row focal and column focal are both half focal in that they are in the focal area of only one of the two axes, and nonfocal are in the context area along both axes.

GRAPHICAL REPRESENTATIONS

A second aspect of this work is the merging of graphical representations directly into the process of table visualization and manipulation. Initially, graphical representations were incorporated because of their natural economy in showing cell values. However, a second, perhaps more important, advantage is the effectiveness with which patterns and features can be spotted in the graphical rendering of the table data.

Our current scheme for incorporating graphical representation into the table (shown in Figure 2) is tailored for the most common type of table: the cases-by-variable array. In particular, this means that the underlying table represents a number of cases (the rows) for each of which values of various variables (the columns) are provided. For example, some of the analysis scenarios use a tables of baseball players performance/classification statistics for 1986. This table contains 323 players by 23 variables, 17 quantitative (e.g. At bats, Hits, Home Runs, Salary '87) and 6 category (e.g. Team, Offensive Position, Team '87).

FIGURE 2: The table lens with multiple focal regions and a sorted column revealing correlations.

A graphical representation is selected for each of it columns based on the type of variable in the column e.g. category, quantitative, or date. Graphical vocabulary including text, color, shading, length, and position to represent underlying cell values is utilized based on a number of factors besides the variable type including besides the data value itself, also the kind of region and its specific geometry as well as user options and results of user operations (e.g. search).

DATA ANALYSIS SCENARIOS

The scenarios in the video illustrate how the Table Lens can be used to explore patterns in large tables and investigate various explanatory models using three datasets: the baseball data described above, performance data for 406 cars, and Xerox stock performance data for 492 days. In each of them, we are able to quickly find interesting correlations or patterns that made sense based on a basic understanding of the domain. For example, the baseball observations would be readily acknowledged by any baseball fan (of course, statistics-loving) as baseball-sensical.

(The data, obtained from the CMU StatLib server, was collected by the American Statistical Association from Sports Illustrated and the 1987 Baseball Encyclopedia Update, Collier Books.)

ADVANTAGES

The baseball statistics table contains 323 rows by 23 columns for a total of 7429 cells. This is 11 times (an order of magnitude) more cells than our estimated maximum of 660 cells in a standard spreadsheet display. We calculate that the maximum size table the Table Lens can display on a 19 inch screen is about 68,400 cells more than two orders of magnitudes greater than a spreadsheet. Figure 3 depicts the advance in size of information sets achieved by our technique. The gray strip shows the displayable region of a typical spreadsheet program, where all cells are focal. The rest of the figure shows how a larger information set can be handled by progressively converting focal area into non-focal area.

FIGURE 3: The gray strip indicates the region reachable using a spreadsheet. The Table Lens by trading full-sized cells for non-focal can show over 2 orders of magnitude more cells.

Moreover, most of the patterns easily found using Table Lens would be much harder or very unlikely to be detected using a traditional spreadsheet. Most exploratory data analysis packages (e.g. S) require much greater overhead to learn and don't offer Table Lens's ease of interaction. The combination of our focus+context technique and graphical mapping scheme, with a small set of manipulation operations enables performing exploratory data analysis in a highly interactive and direct manner.

References

R. Rao and S.K. Card. The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus+Context Visualization for Tabular Information. In Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems. ACM, April 1994.