Multimodal Live API

The Multimodal Live API enables low-latency, two-way interactions that use text, audio, and video input, with audio and text output.




Model response type




Connect

Disconnect

cloud_off disconnected



mic videocam present_to_all
send
A dialog that is opened by default.