1980 Richard A. Bolt, Chris Schmandt & Eric A. Hulteen, MIT Architecture Machine Group

Put-That-There

Voice + gesture at the graphics interface

VoiceGestureMultimodal

Overview

“Put-That-There” was a pioneering voice-and-gesture system demonstrated at MIT’s Architecture Machine Group (a predecessor to the Media Lab). A user could create and rearrange simple graphics on a large video display by speaking naturally while pointing — for example, saying “Put that there” and indicating the object and destination with hand gestures. The system compensated for imperfect speech recognition by combining redundant input channels, context, and speech-based feedback.

Deep dive

Publication.

The work was presented at SIGGRAPH ’80 as “Put-that-there: Voice and gesture at the graphics interface.” MIT’s Media Lab publication page also hosts the paper PDF.

Design philosophy.

The project explicitly assumed that speech-recognition hardware would never be 100% accurate. To raise “effective accuracy,” it combined voice, gesture, syntax, semantics, context-sensitive interpretation, immediate visual feedback, and spoken clarification questions.

Interaction.

Users referred to objects and locations deictically: “Create a blue square there,” “Put that below that,” etc. The system asked aloud when input was ambiguous, making the conversation loop visible and correctable.

Notable demos.

The canonical demo shows a user moving colored shapes around a large-format screen using only voice and pointing, anticipating the multimodal interfaces now common in phones, cars, and AR/VR systems.

Impact.

Put-That-There is a foundational reference for multimodal HCI. It inspired decades of research on gesture+speech input and is still revisited today; a 2025 arXiv paper proposes “Revisiting put-that-there” for modern head-mounted displays using large language models.

Weird / fun facts.

The project’s very name became shorthand for the entire genre of multimodal, deictic interaction. The phrase has outlived the specific hardware by more than four decades.

Team & pioneers

Richard A. Bolt Researcher at MIT's Architecture Machine Group; principal author and visionary behind the voice-and-gesture interface.
Chris Schmandt Longtime MIT speech and audio-interface researcher; co-author of the SIGGRAPH '80 paper.
Eric A. Hulteen Co-author of the "Put-that-there" paper, contributing to the system's implementation and evaluation.
MIT Architecture Machine Group The precursor to the MIT Media Lab, where multimodal interaction was explored as a way to compensate for imperfect speech recognition.

Media

Put-That-There SIGGRAPH '80 title page — Title page of the scanned SIGGRAPH ’80 paper. Source: MIT Media Lab

Overview

Deep dive

Team & pioneers

Media

Sources