Joëlle Coutaz, Daniel Salber, Eric Carraux
CLIPS-IMAG, BP 53
38041 Grenoble Cedex 9, France
+33 76 51 48 54
{joelle.coutaz,
daniel.salber,
eric.carraux}@imag.fr
Nathalie Portolan
CCETT, BP 59
35512 Cesson Sévigné
+33 99 12 43 84
portolan@ccett.fr
Current theoretical and heuristic frameworks for evaluating interactive systems do not cover multimodal interaction properly. At CLIPS-IMAG, we have developed NEIMO, a generic and flexible multiworkstation usability lab, to observe and analyze multimodal interaction experimentally [2]. This video shows NEIMO applied to a multiservice telecommunication terminal to elicit the usage of multimodality in the context of telecommunication tasks.
Meanwhile, behavioral data about the subject as well as experimenters' annotations are recorded automatically. NEIMO captures information at various levels of abstraction from keystroke level such as mouse events and speech acts, to high level tasks such as sending a fax.
In its current version, the NEIMO platform includes 4 Apple Macintosh Quadras connected by Ethernet. The user interfaces for the subject and the wizards workstations are prototyped with HyperCard. Apple Events are used as the standard communication mechanism but a specific tool has been developed for efficient transmission of video over Ethernet. (Sound is not yet transferred over the network.) Behavioral data are recorded using the QuickTime format.
Figure 1. Configuration of the NEIMO platform.
The speech-wizard translates the multimodal command into actions understandable by the system. To accomplish this, he can hear the subject talking and a miniature reproduction of the subject's screen allows the speech-wizard to track the user's mouse and keyboard actions.
If the subject makes a linguistic mistake such as uttering a wrong command name, the speech-wizard sends an error message through a dedicated tool. As shown in Figure 2, error messages are predefined and organized into categories (lexical&syntactic and domain-dependent errors), or may be customized on the fly. In the normal case, the speech-wizard simulates the subject's actions using direct manipulation on the miniature screen which, in this example, triggers a phone call at the task-wizard's.
The task-wizard, who plays the called person, incites the subject to use advanced features such as the Vphone and mirror facilities of the multiservice terminal. The subject hesitates. The annotation-wizard who can observe the subject's behavior through his own workstation (sound+ miniature screen), records a comment about the subject's hesitation. This information, which complements the subject's wrong mouse clicks, will be pointed out by the analysis tool in the next phase.
Figure 2. The screen of the speech-wizard: on the left, a miniature of the subject's screen; on the right, the error messages tool (enlarged artificially for the purpose of readability).
In addition, the tool makes tasks interleaving explicit using a Gantt diagram laid out on a perspective wall. The diagram is enhanced with clickable "bubbles" that reveal the annotations recorded on the fly by the annotation-wizard.
In addition to observation and annotation, NEIMO supports Wizard of Oz experiments. Most existing Wizard of Oz systems support the observation of one modality only or are limited by technical constraints. NEIMO has been designed from scratch to support multimodality. A significant amount of effort has been dedicated to implementation issues to satisfy performance requirements.
NEIMO is multiworkstation, generic and flexible: 1) It supports any number of wizards; 2) It is organized around a reusable and extensible kernel of common services onto which specific user interfaces can be plugged; 3) Workstations are configurable at start up time: wizards roles (e.g., speech recognition, annotations, etc.) can be freely allocated among the workstations. In addition, data capture can be set up at the appropriate level of abstraction.
2. Salber, D. & Coutaz, J. Applying the Wizard of Oz Technique to the Study of Multimodal Systems. In Human Computer Interaction, 3rd International Conference EWHCI'93, L. Bass, J. Gornostaev, C. Unger Eds., Springer Verlag, Lecture notes in Computer Science, Vol. 753, 1993, 219-230.
3. Salber, D. & Coutaz, J. A Wizard of Oz Platform for the Study of Multimodal Systems. in Proc. INTERCHI'93 Adjunct Proceedings, S. Ashlung, K. Mullet, A. Henderson, E. Hollnagel, T. White Eds., ACM New York, 1993, 95-96.
4. Catinis, L & Caelen, J. Analyse du comportement multimodal de l'usager humain dans une tâche de dessin. in Proc. IHM'95, Cepadues Eds: Toulouse, 1995, 123-129.