All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Interaction Models: Native Real-Time Multimodal AI Collaboration

By

Thinking Machines Lab

20d ago· 16 min readen

Summary

The article introduces "interaction models," a new approach to human-AI collaboration where AI systems handle interaction natively—continuously processing audio, video, and text in real time—rather than relying on external scaffolding or turn-based interfaces. These models are trained from scratch with a multi-stream, micro-turn design to ensure real-time responsiveness, aiming to make AI collaboration feel as natural as human-to-human interaction.

Key quotes

· 3 pulled
We think interactivity should scale alongside intelligence; the way we work with AI should not be treated as an afterthought.
Interaction models let people collaborate with AI the way we naturally collaborate with each other—they continuously take in audio, video, and text, and think, respond, and act in real time.
To ensure real-time responsiveness, we adopt a multi-stream, micro-turn design.
Snippet from the RSS feed
Interaction models move beyond turn-based AI interfaces by handling multimodal, real-time collaboration natively across audio, video, and text.

You might also wanna read