February 10, 2026
Erzhen Hu, Student Researcher, and Ruofei Du, Interactive Perception & Graphics Lead, Google XR
DialogLab is a research prototype that provides a unified interface to configure conversational scenes, define agent personas, manage group structures, specify turn-taking rules, and orchestrate transitions between scripted narratives and improvisation.
Conversational AI has fundamentally reshaped how we interact with technology. While one-on-one interactions with large language models (LLMs) have seen significant advances, they rarely capture the full complexity of human communication. Many real-world dialogues, including team meetings, family dinners, or classroom lessons, are inherently multi-party. These interactions involve fluid turn-taking, shifting roles, and dynamic interruptions.
For designers and developers, simulating natural and engaging multi-party conversations has historically required a trade-off: settle for the rigidity of scripted interaction or accept the unpredictability of purely generative models. To bridge this gap, we need tools that blend the structural predictability of a script with the spontaneous, improvisational nature of human conversation.
To address this need, we introduce DialogLab, presented at ACM UIST 2025, an open-source prototyping framework designed to author, simulate, and test dynamic human-AI group conversations. DialogLab provides a unified interface to manage multi-party dialogue complexity, handling everything from defining agent personas to orchestrating complex turn-taking dynamics. Through integrating real-time improvisation with structured scripting, this framework enables developers to test conversations ranging from a structured Q&A session to a free-flowing creative brainstorm. Our evaluations with 14 end users or domain experts validate that DialogLab supports efficient iteration and realistic, adaptable multi-party design for training and research.
DialogLab is a research prototype that supports authoring, simulating, and testing dynamic human–AI group conversations. Designers can 1) configure group, party, snippet characteristics, 2) test with simulation and live interaction, and 3) gain insights with timeline view and post-hoc analytics.
DialogLab decouples a conversation’s social setup — such as participants, roles, subgroups, and relationships — from its temporal progression. This separation enables creators to author complex dynamics via a streamlined three-stage workflow: author, test, verify.
At its core, the DialogLab framework defines conversations along two dimensions:
Our framework separates the social setup (roles, parties) from the temporal flow (snippets, turn-taking rules), allowing for modular conversation design. Left: An example of authoring group dynamics for demo presenters and Q&A audiences; Right: An example of authoring conversation dynamics in three stages: opening, debate, consensus.
DialogLab guides creators through a structured author-test-verify workflow, supported by a visual interface designed for rapid iteration.
Demonstration of the DialogLab prototype, which supports the authoring, simulating, and testing of dynamic human-AI group conversations.
We evaluated DialogLab with 14 participants across game design, education, and social science research. Participants completed two tasks in DialogLab: designing an academic social event, and testing a group discussion with AI under three conditions:
Participants rated each condition at a 5-point Likert scale. Participants found the human control mode to be significantly more engaging, and generally more effective and realistic for simulating real-world conversations.
Bar chart comparing Human Control, Autonomous, and Reactive systems across Ease of Use, Engagement, Effectiveness, and Realism.
Participants’ feedback further highlighted the system's ability to balance automation with control:
DialogLab is more than just a research prototype; it's a step toward a future where human-AI collaboration is richer and more nuanced. The potential applications are vast:
Example applications of DialogLab, including practicing conference Q&A sessions, simulating debates, and creating game dialog design.
Moving forward, we envision richer multimodal behaviors, such as non-verbal gestures and facial expressions, could be integrated into this framework, We could also explore the use of photorealistic avatars and 3D environments like ChatDirector to create even more immersive and realistic simulations in our open-source XR Blocks framework. We hope this research will inspire continued innovation in the exciting and emerging field of human-AI group conversation dynamics.
See video demonstration of DialogLab to learn more.
Key contributors to the project include Erzhen Hu, Yanhe Chen, Mingyi Li, Vrushank Phadnis, Pingmei Xu, Xun Qian, Alex Olwal, David Kim, Seongkook Heo, and Ruofei Du. We would like to extend our thanks to Adarsh Kowdle for providing feedback or assistance for the manuscript and the blog post. This project is partly sponsored by the Google PhD fellowship.