The Conversational Layer: A New Interpretive Medium for Museums

Mar 9

WonderWay users feed their curiosity by asking questions about the exhibition in real time.

The “conversational layer” is a new interpretive medium where visitors ask questions in real time, by voice, while staying present with the collection. Unlike generic chatbots, it is grounded in museum-approved sources and governed like interpretation. Done well, it supports discovery, access, and belonging, including visitor-chosen lenses that surface connections often missing from default interpretation. Museums should treat this as an interpretive layer with guardrails: curatorial authority, provenance and attribution, privacy and consent, accessibility, and bias reduction.

Museums have always invented mediums.

The label turned scholarship into a public voice. The audio guide extended interpretation into time. And long before "immersive" became a buzzword, museum teams were already building interpretive experiences that let visitors participate, experiment, and feel wonder firsthand.

At the American Museum of Natural History, my teams and I helped pioneer new forms of engagement inside exhibitions. Interactive stations where visitors could test ideas like scientists. Media installations that blended sound, animation, and narrative into atmosphere. Projection-based experiences that made invisible systems visible. Early tablet and touchscreen interactives that turned learning into exploration rather than instruction.

Now a new medium is emerging, one that doesn't replace those tools but changes what becomes possible around them: the conversational layer.

What the conversational layer is

The conversational layer is an interpretive experience that responds to a visitor in real time through dialogue, using museum-approved knowledge. It's not a linear script. It's not a search box. It's not a generic chatbot trained on the internet. It's a living interface that sits between a person and a collection and does something museums have always wanted but could rarely do at scale: it follows curiosity. A visitor can begin with a high-level question that frames an entire exhibition, then zoom in to a single object, then zoom out again to connect it to a bigger idea.

For example, a visitor might ask:

“I’m at The Met Fifth Avenue, in the European Paintings galleries.”
“I’m standing in front of Rembrandt’s Aristotle with a Bust of Homer.”
“What’s happening here?”
“Okay, who is this guy, and why should I care?”
“Why is he touching the statue like that?”
“Is this a portrait, or is it more like a story?”
“Why does this feel kind of intense, even though nothing is ‘happening’?”
“What’s the big idea of this painting, in plain English?”
“Is this painting basically about ego, or am I projecting?”
“What detail do people usually miss?”
“If I only remember one thing about this, what should it be?”

The conversational layer is built for that chain reaction, from gallery-level orientation to object-level depth.

Why voice matters, and why real time matters

Most AI experiences today are text-based. They pull visitors into typing, scrolling, correcting, and retyping. That's friction, and in a museum it competes with the very thing the visitor came for: presence.

Voice changes the equation.

A real-time voice conversation is faster than typing, easier than searching, and more natural than navigating menus. It lowers the activation energy of curiosity. If a question arises, the visitor can simply ask it. No thumbs, no screen gymnastics, no breaking the flow. Just as importantly, voice helps visitors stay with the collection. Their attention remains on the object, the space, the light, the scale, the feeling of being there. The phone becomes a conduit, not the destination. When interpretation can live in the air, the museum stops competing with the device and starts reclaiming the visitor's gaze. Voice isn't a gimmick. It's the interface that makes conversation viable as an interpretive medium inside physical space.

Why this matters now

Museums are welcoming audiences with radically different expectations. People arrive fluent in video, feeds, and personalization. Many arrive multilingual. Many arrive with cultural contexts that don't match the assumptions of traditional interpretation. And a growing number arrive with limited patience for anything that feels one-size-fits-all. At the same time, museums are under pressure to deepen relevance, strengthen community trust, and defend curatorial authority in a world where AI can generate plausible nonsense in seconds. This is why the conversational layer matters. Done well, it can make interpretation more accessible and more human. Done poorly, it can be a reputational hazard.

So the question isn't "Should museums use AI?" The real question is: can we define a medium that's worthy of museums? Audio guides are linear. You choose a track, you listen, you move on. They're excellent for storytelling, but they don't adapt when a visitor's questions change. Most chatbots are generic. They're trained broadly, answer confidently, and are rarely tied to a museum's editorial standards, interpretive voice, or provenance requirements.

The conversational layer is different in three ways:

1) It's curiosity-driven, not route-driven. Visitors aren't forced into the museum's sequence. The museum meets visitors in their own.

2) It's grounded in curated knowledge. The system should retrieve from museum-approved sources, not invent. Interpretation remains authored, even when it's delivered dynamically.

3) It's designed for presence. When conversation is voice-first, visitors can look, listen, ask, and keep looking. The interpretation supports the encounter instead of pulling people away from it.

What museums gain

A conversational layer can unlock value across several institutional goals, without changing the collection itself.

1) Engagement that feels like discovery

When a visitor can ask follow-up questions, learning becomes a dialogue rather than a lecture. The object becomes less static. The visit becomes less passive. People leave with a sense of discovery, not just exposure.

2) Access across languages and learning styles

Interpretation can be offered in multiple languages, at different reading levels, with transcripts for accessibility, and with context that doesn't assume prior knowledge.

3) Belonging through relevance, without stereotyping

There's a careful, powerful idea here: interpretation can become more personally resonant when visitors can choose lenses that reflect their interests and contexts.

The key is choice, not inference.

A visitor should be able to say, "Give me the story through the lens of craft," or "Help me understand this through migration," or "Explain this as if I'm new to art history," or "Tell me why this matters today." This is how personalization becomes respectful and inclusive, rather than creepy.

For me, this isn't theoretical. I'm Venezuelan, and I often walk through museums instinctively looking for the threads that connect what I'm seeing to my heritage, to histories of trade, extraction, migration, and cultural exchange that shaped Latin America. Those stories are frequently present in the scholarship, but rarely surfaced in the default interpretation. Conversation makes it possible for visitors to pull those connections forward, on purpose, without asking the museum to redesign interpretation for every identity, every time.

4) A new way to "return" voice and context

Not every object can return home. But interpretation can carry more than one authority. The conversational layer can make space for credited voices from origin communities, artists, historians, and contemporary descendants to coexist with curatorial framing. This isn't a substitute for repatriation. It's something else: a way to restore perspective, credit, and context where museums choose to do so.

What museums must protect

This medium only works if museums can defend it with confidence. That requires guardrails.

Curatorial authority: The museum defines what the system is allowed to say, and what it's not.

Provenance and attribution: Sources are visible, reviewable, and updated through editorial workflows.

Consent and privacy: Visitors should understand what's being collected and why, with a clear opt-out.

Bias and harm reduction: Sensitive topics require careful framing, community review where appropriate, and language that doesn't flatten lived experience.

The conversational layer has to be governed like interpretation, not like a tech experiment.

Where WonderWay fits

WonderWay is one implementation of the conversational layer.

It's voice-first, designed to keep visitors present with the collection, and grounded in curated, museum-approved knowledge rather than open-web improvisation. The goal is simple: help museums offer interpretation that feels alive, accessible, and rigorous.

But the bigger point isn't the product. The bigger point is the medium.

Museums have always adapted their interpretive tools to meet the public where they are. The conversational layer is the next tool to define.

An invitation

If you're exploring what museum interpretation could look like in 2026 and beyond, you don't have to start with a full transformation.

Start with one gallery, a small set of objects, and a clear question: does conversation deepen curiosity and understanding?

Then build from there.

This is a moment for museums to shape the medium, not inherit it from the tech world. I'm inviting institutions, curators, educators, and visitor experience teams to engage and brainstorm together: what should the conversational layer become, and what must it never be?

Because the future of museums isn't more information. It's more meaning.

Interested in conversational interpretation for your museum?

WonderWay partners with museums to pilot a voice-first conversational layer in a single gallery, using curated institutional sources and clear governance. If your team is exploring AI for interpretation, access, multilingual engagement, or evaluation in 2026, reach out to discuss a Charter Pilot.

Contact: hello at wonderway.ai

Author: Hélène Alonso
Hélène Alonso is founder of WonderWay and a professor at New York University. She is a museum technology leader with over two decades of experience at institutions including the American Museum of Natural History, Liberty Science Center, and the Wildlife Conservation Society. Her work focuses on artificial intelligence infrastructure for museums, institutional knowledge systems, and the future of cultural interpretation.

TL,DR

Key takeaways

Voice-first conversation reduces friction and keeps attention on objects, not screens.
Conversational interpretation follows curiosity, from exhibition-level orientation to object-level depth.
Museums can support belonging through visitor-chosen lenses, without stereotyping or inferring identity.
This medium must be governed like interpretation: sources, provenance, editorial review, consent, and accessibility.
A small, contained pilot can validate whether conversation increases engagement and understanding.

FAQ

What is conversational interpretation in museums?
Conversational interpretation is a museum experience where visitors ask questions and receive real-time responses grounded in museum-approved knowledge, often delivered by voice so visitors stay present with the collection.

How is this different from an audio guide?
Audio guides are linear. Conversational interpretation adapts to a visitor’s questions and interests, enabling follow-up and deeper exploration.

How is this different from a chatbot?
A museum-grade conversational layer is governed and sourced. It retrieves from curated institutional materials instead of improvising from the open web.

Why use voice instead of text chat?
Voice reduces friction and helps visitors keep attention on the object and surroundings rather than typing and staring at a phone.

How do museums keep curatorial control?
By using curated sources, editorial review workflows, provenance checks, and explicit constraints on what the system can say.

What about privacy and consent?
Visitors should see clear consent language, have opt-out options, and museums should minimize data collection, retaining only what is needed to evaluate and improve the experience.

Definitions

Conversational layer: A voice-first interpretive interface that responds to visitor questions in real time using museum-approved knowledge.
Curated library: The vetted sources a museum authorizes for interpretation, such as label text, catalog entries, curator notes, and approved essays.
Governance: Editorial review, provenance checks, consent and privacy, accessibility, and bias reduction practices that keep the experience museum-grade.

Helene Alonso