Google demonstrated Gemini Omni at Google I/O 2026 performing real-time video understanding: identifying objects, answering questions about live scenes, and executing on-screen tasks, all simultaneously. The post drew 58,000 upvotes on r/artificial. Multiple AI researchers in the comments described the demonstration video as a meaningful step change from prior multimodal models. Top comment, with 8,400 upvotes: "I watched it twice and I still have no idea how it processed all of that in under 400 milliseconds."
Comments on "Gemini Omni Real-Time Video AI Demo Breaks the Internet - r/artificial"