CHI 97 Electronic Publications: Papers
CHI 97 Prev CHI 97 Electronic Publications: Papers Next

Designing Effective Multimedia Presentations

Peter Faraday & Alistair Sutcliffe
Centre for HCI Design,
City University.
London EC1 0HB, UK
+44 171 477 8994
P.M.Faraday@city.ac.uk

ABSTRACT

The paper reports four studies concerning attention to and comprehension of Multimedia presentations. The Multimedia sequence used was taken from a commercially produced CD-ROM, 'The Etiology of Cancer'. First, an eye tracking study of the presentation is reported. A second study was then conducted on the recall of the materials used in the eye tracking study. The results of these studies were used to propose design guidelines for Multimedia presentations. The guidelines were applied to produce a re-authored version of the original presentation. A further two studies were then conducted on the re-authored version to assess the impact of the design guidelines.

Keywords

Multimedia, Usability testing, Empirical studies.

© Copyright ACM 1997



INTRODUCTION

One of the problems in the design of multimedia (MM) interfaces is knowing whether the presentation will successfully deliver information to an audience. While studies have been conducted on cognitive issues relating to MM design e.g. [10, 12], there are very few studies of attention and comprehension; hence current usability guidelines are based on minimal evidence. This paper outlines a series of studies to provide improved design advice for MM presentations.

Our first study used an eye tracking system to investigate the sequence of visual information processing by users interacting with a multimedia presentation. We were interested in describing the attention and the viewing process for different presentation components, such as text captions, labels and still and moving objects; and how the serial nature of visual attention would cope with a complex and rapidly changing MM presentation. Few studies have made use of eye tracking equipment to look at animated materials [3,15], and none have attempted to track subjects viewing a complex multimedia presentation. However, attention to appropriate information is vital if a message is to be successfully delivered by an MM presentation.

Our second study examined comprehension of the same presentation by testing free recall, in which subjects wrote down what they could remember of the presentation. The comprehension process is though to act upon the elements processed by attention, and combines them with the user's existing domain knowledge to produce propositions, or units of meaning [16]. For instructional presentations the propositions we used were. objects, action, procedure and causal relationships [2]. We were interested in the information types recalled from the presentation and in relating them to the components in the MM design. We also investigated the effects of the user's existing knowledge upon comprehension by testing two subject groups with, or without, domain knowledge. The scope of the investigation focused on training titles. There is evidence that pedagogical design is a key determinant of the effectiveness of MM tutorial applications; however, few studies have addressed how presentation design effects learning. Accordingly we focus on attention and information assimilation rather than the effect of pedagogy on interactive learning.

We used the results of the first two studies to propose a set of design guidelines, which were applied to re-author the presentation. In the latter two studies, we assessed the impact of the guidelines by testing information recall with the re-authored presentation in a multimedia and a text and speech only version to investigate the effect of redesign in mono-medium or multimedia format.

STUDY ONE

The first study explored users' attention to a MM presentation using an eye tracking system to match the pattern of subjects' fixations in the visual media. An 18 second animation was taken from a commercial MM title 'Etiology of Cancer' showing DNA repair by photoreactivation. This is illustrated as a series of screen shots in figure 1. The speech track which accompanied the presentation is given under the relevant screen shots. Figure 1a shows the initial state of the presentation, with the DNA and pyrimidine dimer labelled. In figure 1b the photolyase object is shown moving across the screen and attaching to the pyrimidine dimer. In figure 1c the photolyase is struck by light energy from the lower right of the screen to the center of the photolyase. In figure 1d an arrow, a second photolyase and the repaired DNA are revealed; and then the photolyase object is shown moving off the screen to the left.

Study Method & Procedure

Six subjects were drawn from research students within City University. A pupilometer system was fixed upon the subjects eye, whilst they viewed the MM presentation. This produced an x,y location of the pupil every 50 msec and filtered out spurious data from eye blinks. Subjects used a bite clamp to avoid head movement and were held approximately 50cm from a 17 inch VGA colour monitor.

Data Analysis

Figure 1a : Pyrimidine dimers are reconverted to monomers

Figure 1b : by the action of photo- reactivating enzymes called photolyases which

Figure 1c : form a stable protien-pyrimidine dimer complex and then use light energy absorbed from the visible range

Figure 1d : to convert the dimer into monomers without disrupting the double strands.

The raw data of pupil location was time sliced into four 5 second parts for each subject to ease analysis. A PC based tool was built to plot the data for each subject as a scan path (see figure 2). Each subject's scan path was analysed into a fixation graph, representing fixation clusters or pauses of the subject's eye at a location. The data for the six subjects was synchronised over time to build up a composite trace.

Figure 2 : Single subject scan path on figure 1a

Results

Whilst it is known that pupil location and attention may not always coincide, in general a fixation at a location will indicate attentional processing [8]. The fixation graphs were overlaid on the presentation. Figure 3 a,b,c,d shows the order of fixations for the six subjects, with the sequence of fixation shown as circled numbers.

Figure 3a

Figure 3b

Figure 3c

Figure 3d

(1) Five subjects initially fixated on the pyrimidine dimer label and object; one subject fixated on the deformed, central part of the DNA (see figure 3a).

(2a), (2b) Subjects shifted to the pyrimidine dimer and switched back and forth to its static label. The pyrimidine dimer object and label received a greater number of fixations than the DNA object and label; possibly because the pyrimidine dimer was cued within the speech track.

(3a), (3b)Following fixation upon the pyrimidine dimer and label, three subjects then shifted between the DNA object and it's label (figure 3a). Fewer fixations may have been given to the DNA label than to the pyrimidine dimer label because it was not reinforced by the speech track.

(4) The photolyase object appears and then moves along a path toward the pyrimidine dimer object. For all six subjects this produced a rapid alignment upon the photolyase object when it appeared, with subsequent fixations tracking its path (see figure 3b).

(5a), (5b) Fixations followed the photolyase object's path to its end point (5a), where revealing the photolyase label (5b) caused a shift between it and its label for all subjects (figure 3c). The photolyase label was also cued by the speech track.

(6) Three of the subjects shifted fixation to animation of the light energy object striking the photolyase. The motion of the light energy was more rapid than for (5), causing shifts of fixation rather then tracking. The other three subjects shifted to the photolyase object but did not track the light energy.

(7a), (7b) Subjects shifted back to the photolyase object and label (figure 3c); possibly directed by the end point of the light object's motion in (6). Attention to the pyrimidine dimer may also have been partially cued by the phrase 'protein pyrimidine complex' in the speech track.

(8) All subjects shifted from the photolyase to the revealed arrow symbol (see figure 3d) and followed the arrow downwards. Four subjects then fixated on the second photolyase object in the centre of the repaired DNA object; while two attended to the repaired DNA label.

(9a), (9b) Five subjects initially tracked the second photolyase object as it detached and moved away. Two subjects broke off tracking to re-fixate upon the first photolyase object; possibly to compare the two photolyase objects.

(10) After (9), two subjects shifted to the repaired DNA label (figure 1d). The label may have not been fixated in (8) because of the onset of motion of the photolyase shifting attention. Tracking the photolyase path in (9), may, however have directed fixation to the repaired DNA label which lies at the end of the path.

(11) Five of the subjects shifted to the repaired DNA object at the end of the presentation. Fixations were generally upon the centre of the repaired DNA from where the second photolyase had departed in (9).

Initial Design Guidelines

The results provide the basis for proposing the following set of tentative guidelines for MM attentional design :

  • Use object motion to control attention and viewing order. The onset of motion will generally produce a shift to the object and fixations will then track the object's path e.g. (4), (6). Fixations will be directed toward the end point of the path. These results agree with previous studies of attention to motion [6,7].
  • Use animation with care : it may be unwarranted if the object is not important as tracking may prevent attention to other information. This may have been the case for the 'repaired DNA' label (10) which was only fixated after tracking the photolyase by two subjects. The motion of the photolyase shifted attention before the revealed label could be attended to. Several subjects also failed to track the light energy object (6), either because they were reading the label (7b), or the animation moved too quickly.
  • Gradually reveal labels, symbols and objects, to control viewing order. the state change effectively attracts attention e.g. revealing the photolyase label in (5), or the arrow and repaired DNA in (8).
  • Reveal related elements in a sequence. Avoid revealing several presentation items simultaneously as this may hinder the order of attention; e.g. in (2), (3) at the start of the presentation when the DNA label, DNA object and photolyase are revealed together, consequently there was little order in the fixation to the DNA object and label.
  • Reveal important information to emphasise it. Static presentation elements may be neglected in favour of elements which are revealed or are in motion. The static DNA label in (3) received far fewer fixations than the revealed photolyase label in (4).
  • Ensure that an object and its label appear together to improve identification. Displaying a label and an object produces fixation shifts between the object and the label e.g. between the pyrimidine dimer and its label in (2). The findings of [5] are similar suggesting that switches of attention are used to integrate information from the caption and on the object.
  • Use speech to reinforce information presentation, reveal objects and labels when cued in the speech track. Cueing labels within the speech track will produce a shift attention to the object and its label, e.g. pyrimidine dimer (2) and photolyase (5) were both re-fixated when cued. This follows reports [1,4] that speech cues should be tightly synchronised to labels. This may be particularly important in promoting integration between the speech track and visual objects.
  • Allow reading time after cueing a label. Avoid reveals or animation for the duration of speech segment which cues a label. If the label is complex, reading speed will be similar to that for speech track to pronounce it [9].
  • Use symbols to direct attention to specific objects and locations. Symbols may direct fixations to other objects, rather than upon the symbol itself. The arrow symbol (8) shifted fixations to the repaired DNA object. The learnt meaning of symbols may be used to control or guide attention within the presentation.

    STUDY TWO

    Our second study investigated comprehension of the same MM presentation using two groups of eight subjects. The first group was drawn from research students within City University, who had low domain knowledge; the second from biochemistry post-graduates who had high domain knowledge, but were pretested to ensure that they had no specific knowledge of the subject matter itself. Both groups were shown the presentation once and were then instructed to write down all that they could recall of the presentation.

    Data Analysis

    In order to score the accuracy of subject recall, the MM presentation was analysed into a set of ten reference propositions by an independent expert. The propositions were grouped into either high level organising concepts of sequence, problem or cause-effect, or into lower level actions or objects within each part of the sequence. The subjects written recall was split into propositions, and then matched to the reference propositions.

    Results

    There was no significant difference in recall between high and low domain knowledge subjects. The results (see table 1) show that some parts of the presentation were well recalled, whilst others were lost. Slightly more was recalled by the high domain knowledge group, in particular proposition (iv), 'Photolyase attaches to the pyrimidine dimer'. This may be because these subjects were better able to interpret the implications of the animated actions. Propositions (iii, vi-ix) were well recalled by both groups, possibly because they were conveyed by animation and reinforced by speech; while the other propositions were not. This demonstrates how poor design of the viewing sequence may lead important parts of the message (e.g the reaction in v, vi) to be missed.

    Generally components which were effectively attended to from the eye tracking study experiment were also well recalled. However, other factors in the design may have effected comprehension; leading to some refinement of the initial guidelines :

    Table 1 : Low 'v' High domain knowledge recall for original presentation

  • Speech information should reinforce image. Propositions given only in the image or animation without speech cueing were poorly recalled. The initial state of the DNA (i) with the damaged dimer was poorly recalled. This was shown implicitly in the image and was not mentioned in the speech track. The formation of the pyrimidine dimer complex (v) and the photolyase detaching, (x) were both shown as animation without any speech cues. These results agree with previous recommendations [2, 12] that it is vital for speech to reference the object or action being shown.
  • Captions or labels may be useful in re-inforcing the speech track. Propositions given in speech which were reinforced by labels were well recalled; e.g. the photolyase object (iii), the repaired DNA (viii), and the light energy (vii) were presented with labels.
  • If the speech information is complex or important then present it concurrently in a caption. Propositions given only in the speech track were generally poorly recalled; e.g. (ii) photoreactivation; (iv), (v) protein-pyrimidine complex propositions. Facts relating to the Protein-Pyrimidine complex (iv), (v) which are the central propositions in the sequence, needed special emphasis.
  • Cue animations with speech. Animated objects which were cued by the speech track were well recalled. Almost all subjects recalled light energy (vii), and the dimer reconverted to a monomer, (viii) which was shown as an animated change between the two DNA objects and cued with speech.
  • Show the effect of actions. Complex actions shown in animations were poorly recalled; in particular the formation of the protein-pyrimidine complex (v), and in the action of light energy upon the protein-pyrimidine complex (vi). However, this may have been because these animations failed to show the effect of actions, since animation may only be valuable if it helps explain the cause and effect [11]. Thus for the formation of the protein-pyrimidine complex should have been shown explicitly as a reaction between the photolyase and dimer. When light energy strikes the Protein-Pyrimidine complex, a reaction should have been shown, indicating that it triggered repair of the DNA. Many subjects incorrectly recalled that light energy struck the Photolyase or Dimer, or it came from the DNA.

    IMPROVING DESIGN

    We used the guidelines to produce a re-authored version of the presentation. We began by adding a caption to the first frame, to support initial state of the DNA and locating damage by the Pyrimidine Dimer (figure 5a). The existing Pyrimidine Dimer label was revealed first to attract attention to Dimer object, then after allowing reading time, the new caption was revealed. A cue was also added to the speech track to reinforce the caption.

    Figure 5a : DNA is damaged by a pyrimidine dimer, which distorts the DNA's double strands. A photoreactivating enzyme, photolyase, binds to the pyrimidine dimer

    Figure 5b : The pyrimidine dimer and photolyase now form a protien-pyrimidine complex.

    Figure 5c : Next, protien pyrimidine complex absorbs light energy from visble range.

    Figure 5d This causes the DNA to be repaired by photoreactivation

    Figure 5e : The pyrimidine dimer is now reconverted to a monomer, without disrupting the DNA double strands.Finally, the photolyase dissociates from the DNA.

    The animation of the photolyase and revealed labels for Pyrimidine Dimer and Photolyase were left unchanged. To re-inforce the animation of Photolyase binding the Pyrimidine dimer, an additional speech cue was added and synchronised with the arrival of the Photolyase : 'photolyase binds to the pyrimidine dimer'.

    To improve recall of the Protein-Pyrimidine complex formation, an animation was added to emphasise the process of the Photolyase and Dimer combining. A new Protein-Pyrimidine complex object formed from its two constituents, which then changed colour and shape, to signify a new combined state (figure 5b). An additional Protein-Pyrimidine complex label and caption were also added and synchronised with the speech track .

    The animation of light energy striking the Protein-Pyrimidine complex was then amended, to show a reaction by a small explosion style effect added to the complex as the light energy struck (figure 5c). The light energy was paused as it struck, to allow attention to shift to it. An additional caption was then shown re-inforcing the speech .

    After the light energy animation was finished, the Protein-Pyrimidine complex was shown to pull apart, separating the photolyase and dimer (figure 5d). An additional caption and speech cue re-inforced 'photoreactivation'. The arrow symbol was then displayed and the repaired DNA revealed. To make the process clear, animation showed the dimer disappearing and the DNA re-joining (figure 5e). Finally, the repaired DNA label was given as a caption and moved to the centre of the DNA, where attention had been directed by the animation (figure 5e) allowing reading time. The animation of the photolyase was shown, with a speech cue 'photolyase detaches' to re-inforce it

    STUDY THREE

    To investigate whether our design guidelines would improve recall, we used a further group of eight subjects with low domain knowledge and showed them the re-authored presentation (figure 5). We then scored recall in the same way as before.

    Results

    The results are shown in table 2 in the 'Multimedia' column. A significant difference was found between recall in the low domain knowledge group in the original design (table 1, col 1) and the improved design (Mann-Whitney U, P<0.05). There was a clear improvement in recall over the original design, and in some cases performance was superior to those of the high domain knowledge group, although this difference was not significant. The improvement in recall gives some validation for our design guidelines :

    Table 2 :Recall of the re-authored multimedia design and text-speech only for low domain knowledge subjects

  • Recall of the initial state of the DNA, i) was improved by the addition of a caption and speech cue.
  • Recall of the simple animation photolyase attaches, iv) and detaches, ix) were both improved by the use of speech cues to the animations.
  • Recall of photoreaction, ii) was improved by using a caption and redesigning the speech track to be make this object more salient.
  • Recall of the more complex animations involving the Protein-Pyrimidine complex being formed, v) and being struck by the light energy were both improved, vi). These animations were re-authored to clarify the process, and reinforced by an additional label and caption.

    STUDY FOUR

    Whilst the results from study three seemed to demonstrate the value of our guidelines, they did not differentiate between the effect of redesign and adding information to the original presentation. In an attempt to be sure of the impact of these changes, we performed a further study using text and speech only. The text script derived from study three was accompanied by a matching speech track. If recall was improved above that found for the original presentation in study two, then this could be attributed to the addition of text captions and information in the speech track. If the recall was poorer than in study three, then the use of the re-authored images and animations had aided performance.

    Results

    The results, shown in table 2 in the 'text-speech' column, demonstrate the value of effective use of multimedia. Recall performance with text-speech was significantly worse than for the low knowledge group in the improved MM design (Mann Whitney U, P<0.001), but was not significantly different from the low knowledge group for the original presentation. This showed that for re-authoring to provide additional linguistic information was not in itself sufficient to improve comprehension : animation and images were also vital for low knowledge subjects to understand the presentation.

    CONCLUSIONS

    The studies reported have shown some of the dangers of attempting to design MM presentations without concern for attention and comprehension. MM designers should consider how to control shifts of attention according to the content being shown. Attention and the viewing sequence can be influenced by design of highlighting, and reinforcement with speech. Designers also need to model the user's viewing process to provide time for reading captions and to avoid attentional clashes. If the presentation is complex it may also be useful to guide attention to important elements using reveals, labels, symbols or speech.

    Attentional design may be more important for low domain knowledge subjects, who need to be guided at exactly what to attend to. This is vital for instructional design, and may useful when information delivery is critical e.g. in command and control applications.

    Our second study of subject's recall showed that design problems may lead to difficulties in comprehension. We suggested comprehension may be improved by design of components for attracting the users' attention, thereby ensuring that the presentation could be formed into coherent whole. This may be achieved by scripting the speech track to more tightly reference and synchronise with the visual elements of the presentation; and by using animations and captions to support the content.

    The final two studies demonstrated the effectiveness of our design guidelines. In study three, the re-authored version of the presentation showed a marked improvement in recall while the fourth study showed the value of presenting the information in a range of media. However MM presentation design is only one component of delivering effective learning. Pedagogical techniques, e.g. active engagement are key components, as is rhetorical design of the message. We have partially addressed the latter in [14] and intend to explore how different types of message (e.g. causation, procedure) may be delivered by media combinations.

    The main implication of our work is to point out how attention and the viewing process can be controlled for effective information delivery. In task centred MM getting the message across is vital, so immediate recall is an appropriate measure. However immediate memory does not always correlated with longer term recall [13], so in tutorial applications pedagogical techniques to encourage depth of memory encoding may be the key, although these techniques will still fail if they do not employ effective means of directing the user's viewing sequence, as demonstrated in our study.

    This work provides some tentative steps toward placing MM design and evaluation within a firmer theoretic framework. Several problems remain for future research. We would like to increase the number of subjects in study three and four by performing the studies on high domain knowledge subjects. We also feel it is important to address how design advice can be integrated into the life cycle of MM production, following [14]. Design advice is necessary during the formative processes of authoring, such as when scripting and choosing media.

    ACKNOWLEDGMENTS

    The Etiology of Cancer CD-ROM is copyright of American Medical Television and Silver Platter.

    REFERENCES

    1. Baggett, P. The Role of Temporal overlap of Visual and Auditory Material in Forming Dual Media Associations. Journal of Educational Psychology 76 (1984) 408 - 417
    2. Bieger, G.R. & Glock, M.D. The Information Content of Picture-Text Instructions. Journal of Experimental Education 53 (1984), 68-76.
    3. Brandt, H. The Psychology of Seeing. New York : The Philosophical library (1954)
    4. Grimes, T. Mild Auditory-Visual dissonance in TV News may exceed viewer attention. Human Communication Research 18, 2 (1991), 268 - 298
    5. Hegarty, M. & Just, M.A. Constructing mental models of machines from text and diagrams. Journal of Memory & Language, 32, 717-742, 1993.
    6. Hillstrom, A.P. & Yantis, S. Visual motion and attentional capture. Perception & Psychophysics, 55(4), (1994), 399-411.
    7. Hochberg, J & Brooks, V. The Perception of Motion Pictures. In Handbook of Perception and Human Performance X, LEA, 1980
    8. Johnson, W.A. & Dark, V..J. Selective Attention. Annual review of psychology, 37, (1987), 43-45.
    9. Loftus, G.R. Eye Fixations on Text and Scenes. In Eye Movements in Reading, Perception, and Language Processes. Academic Press, 1983, pp.359-376.
    10. Large, A, Beheshti & J., Breuleux, A. Multimedia and Comprehension : A cognitive study. Journal of American Soc. for Information Science, 515-528, 1994
    11. Park, O. & Hopkins, R. Instructional conditions for using dynamic visual displays : a review. Instructional Science 21 (1993), 427 - 449
    12. Rieber, L. P. Animation in Computer Based Instruction. ETR&D 38 (1990), 77 - 86
    13. Stasko, J., Badre, A. & Lewis, C. Do Algorithm Animations Assist Learning ? Proceedings CHI'93 ACM, 1993, pp. 61-66
    14. Sutcliffe, A.G. & Faraday, P.M. Designing Presentation in Multimedia Interfaces. Proceedings CHI'94 ACM, 1994, pp. 92-98
    15. Yarbus A. Eye Movements & Vision. Plenum.1967.
    16. Van Dijk, T.A. & Kintsch, W. Strategies of Discourse Comprehension. New York: Academic Press. 1983


    CHI 97 Prev CHI 97 Electronic Publications: Papers Next

    CHI 97 Electronic Publications: Papers