CHI 97 Electronic Publications: Late-Breaking/Short Talks
CHI 97 Prev CHI 97 Electronic Publications: Late-Breaking/Short Talks Next

Video Matters! When Communication Ability is Stressed, Video Helps

Elizabeth S. Veinott, Judith S. Olson, Gary M. Olson
Collaboratory for Research on Electronic Work (CREW)
University of Michigan
701 Tappan Street
Ann Arbor, MI 48109-1234
veinott@umich.edu, olsons@crew.umich.edu
Xiaolan Fu
Institute of Psychology
Chinese Academy of Sciences
Beijing, China
fuxl@cpy.cjfh.ac.cn

ABSTRACT

This study assesses whether remotely located pairs of people working on a collaborative task benefit from using video, looking in particular at people for whom communication is stressed. In this study, we extend the research on video-mediated communication to the domain of non-native speaker interactions. Thirty-six pairs performed a map task using either audio-only or audio-plus-video for communication. Half the pairs were non-native speakers, half were native speakers. As in many studies of video connectivity with native speakers, no benefit from the video was found. However, non-native speakers performed significantly better with a video connection than with audio only.

Keywords

Video-mediated communication, remote work, non-native speaker interaction

© 1997 Copyright on this material is held by the authors.



INTRODUCTION

Thirty years of research has shown very little effect of video on remote communication [1]. The classic studies of Chapanis [2] showing no effect of video on quality have not been refuted. Video is no better than audio-only for problem solving map tasks [3], collaborative writing tasks [4], and collaborative design task [5]. Clearly, when one wants to show the object under discussion, video connectivity matters [6], but the question remains whether video has any value in showing facial expressions and gestures.

Most of the video-mediated communication research to date has focused on different tasks, but perhaps the effect of video appears when communication clarity is strained. People might use all forms of communication possible if verbal ability is impaired. Indeed, speaking to someone in a language that is not their native language could involve gestures and eye-contact to confirm when understanding is established. Research has shown that the moment there is any sort of processing difficulty, people supplement audition with vision [7]. In this study, we extend the research on video-mediated communication to look at its effect on non-native speaker interactions. If, as research has shown, people who easily communicate verbally find no advantage of seeing each other, would those who are stressed in communication use and benefit from video?

To test this notion, we compared the task performance of native speakers of English (NS) with non-native speakers of English (NNS), who were presumably more stressed in their verbal communication ability. These pairs collaborated over audio-only (audio) or audio-plus-video (video) communication channels. We were interested in how access to visual information affects the outcome of the communication. We hypothesized that there would be an interaction between language and communication media. If pairs share a native language and culture, then video connectivity may not provide any added benefit over audio. However, if native language and culture are not shared, then video might facilitate the communication by allowing for non-verbal communication and the ability to visually monitor each other's understanding.

Specifically, we do not expect to find a difference between NS pairs in the two media conditions. However, our central hypothesis is that the NNS pairs in the video condition will perform more effectively than their counterparts in the audio condition.

PROCEDURE

Seventy-two (36 pairs) graduate students attending the University of Michigan participated in the study for pay. Twenty pairs were native English speakers, 16 were non-native English speaker pairs. None of the pairs knew each other prior to participating in the study. NNS pairs were comprised of participants from different countries who spoke different native languages. NNS pairs in the two media conditions did not differ significantly in their time studying English, time living in the United States, or scores on the Test of English as a Foreign Language (TOEFL).

The task was modeled after the map task reported in [3]. In this task, one person was the instruction giver (SPEAKER) and the other was the instruction follower (RECEIVER). Each person had a copy of the map, but only the SPEAKER's map had the path on it. In order to introduce uncertainty and a need for clarification, some of the 12 landmarks on the two maps were different. The RECEIVER's task was to accurately draw the path on his or her own map based on the SPEAKER's instructions. The pairs worked on four maps of equal difficulty. They worked in different rooms and traded off the roles of SPEAKER and RECEIVER. In the video condition, only the speaker's face and hands were visible, not the object of conversation.

The study involved four conditions and each pair participated in only one condition (a between-Ss design). The conditions were: NNS-Audio, NNS-Video, NS-Audio, and NS- Video. Map order was counter-balanced within each condition.

Performance was a combination of standardized quality and time scores. Quality was measured as the amount by which the RECEIVER's map deviated from the SPEAKER's map. Task completion time was measured in seconds. Both the quality and time data were transformed using natural logarithms. In order to have one performance score for each pair, both the time and quality data were normalized and added together to produce a normalized performance score. This overall performance score was possible because there was no evidence of a speed/accuracy tradeoff. Pairs showed poor performance in either time, accuracy, or both.

RESULTS

On average pairs took about 12 minutes to complete each map and they deviated from the path by approximately 36cm2. Native speakers and non-native speakers indeed were affected by video differently. A 2 X 2, between-subjects analysis-of-variance on the performance data indicated a significant interaction between communication media and language (F(1,35)= 4.35, p<.05). NS pairs in the audio and video condition did not differ significantly. However, the difference between audio and video for the NNS was significant, F(1,15)=5.12, p <.05, indicating that NNS performed better with video (Table 1.)

Table 1. NNS and NS standardized performance scores
Audio Audio+Video
NS pairs -.74
(n=10)
-.34
(n=10)
NNS pairs 1.35
(n=7)
.15
(n=9)
Note: Higher scores indicate worse performance.

In addition, NNS pairs in the video condition had fewer miscommunications, as measured by the number of wrong turns, than NNS pairs in the audio condition (NNS-Audio = 3.71 vs. NNS-Video=1.75). This difference was not found for native speakers (NNS-Audio=.7 vs. NNS-Video=.9).

Post-experiment questionnaire data provided some additional support for the results. NNS pairs reported finding the video more useful than NS pairs (F(1,34)=6.20, p<.05).

CONCLUSIONS

There are clear advantages of video connectivity for people whose verbal communication ability is stressed. Video helps NNS pairs to work together more effectively. These pairs produce higher quality work, spend less time, and have fewer miscommunications that result in errors. Video potentially allows them to monitor their common ground more effectively by allowing them to communicate non-verbally and identify misunderstandings more quickly. In contrast, video does not seem to benefit the native speakers. These results have important implications for people in multinational companies, who rely on remotely located teams distributed across language boundaries. Video is a benefit for their communication quality.

REFERENCES

  1. Finn, K., Sellen, A., & Wilbur, S. (Eds.) . (1997). Video-mediated communication. Hillsdale NJ: Lawrence Erlbaum Associates.
  2. Chapanis, A. (1975). Interactive human communication. Scientific American, 232, 36-42.
  3. Boyle, E. , Anderson, A. and Newlands, A. (1994). The effects of visibility on dialogue performance in a cooperative problem solving task, Language and Speech, 37,1, 1-20.
  4. Green and Williges, R. C. (1995). Evaluation of alternative media used with a groupware editor in a simulated telecommunication environment. Human Factors, 37, 3, 283-289.
  5. Olson, J., Olson, G. and Meader, D. (1995). What mix of video and audio is useful for remote real-time work. In Proceedings of CHI, 362-368.
  6. Farmer, S. M. and Hyatt, C. W. (1994). Effects of task language demand and task complexity on computer-mediated work groups, Small Group Research, 25,3, 331-336.
  7. Reisberg, D., McLean, J. and Goldfield, A. (1987). Easy to hear but hard to understand: A lip-reading advantage with intact auditory stimuli. In R. Campbell & B. Dodd (Eds.) Hearing by Eye: The psychology of lip-reading. Hillsdale, N. J.: Lawrence Erlbaum Associates

CHI 97 Prev CHI 97 Electronic Publications: Late-Breaking/Short Talks Next

CHI 97 Electronic Publications: Late-Breaking/Short Talks