VC
Value Add VC
⚡HomePulse⚡Helpful Apps📝Blog
← Value Add PulseAI$0.10/sec video

Google's Gemini Omni Flash Hits the API, Turning Enterprise Video Production Into a Conversation

Google rolled out Gemini Omni Flash — the first model in its new multimodal 'Omni' family — to developers via the Gemini API, Google AI Studio and the Gemini Enterprise Agent Platform. The model generates and conversationally edits 720p video through natural-language prompts at $0.10 per second of output, putting a 10-second clip at roughly a dollar, and every clip carries Google's SynthID watermark.

$0.10/sec of 720p video
Price
~$1
10-Second Clip Cost
720p (no 1080p/4K yet)
Max Resolution
Gemini API, AI Studio, Enterprise Agent Platform
Availability
Google I/O 2026 (consumer)
Debut
TC
Trace Cohen
Early-stage VC & angel · Founder, New York Venture Partners
June 30, 2026
1 min read
ShareXLinkedInEmail
KEY TAKEAWAYS FOR VCs & FOUNDERS
1

First conversational video-editing model available via API, not just a consumer demo, opening enterprise marketing and L&D use cases

2

Pricing at $1 for a 10-second clip undercuts most professional video-generation tools on a per-second basis

3

SynthID watermarking and C2PA Content Credentials on every clip address AI-generated media provenance concerns proactively

4

Positions Google against OpenAI's Sora and Runway in the fast-growing enterprise AI video generation category

TC
The VC Read · Trace's TakeTrace Cohen

A dollar for a ten-second video clip that you can then conversationally re-edit is the moment enterprise video production stops being a specialized function and becomes something any marketing generalist does themselves — that's the same shift that happened to graphic design when Canva made it self-serve, just five years faster because the underlying models improved that quickly. Capping at 720p instead of going straight to 4K is a smart sequencing choice; Google is optimizing for iteration speed and API adoption first, and can raise resolution once developers are hooked on the conversational workflow. For founders building marketing or L&D tools, this is table stakes infrastructure you should be building on top of now, not competing against. Watch whether OpenAI matches this price point with Sora; if it does, video generation becomes commoditized within two quarters.

🤖 AI Landscape →🏢 Enterprise AI Adoption →

Google made Gemini Omni Flash available to developers and enterprise customers via the Gemini API, Google AI Studio, and the Gemini Enterprise Agent Platform starting July 1, 2026, after debuting the model to consumers at I/O earlier in the year. Omni Flash is the first entrant in Google's new 'Omni' model family, designed for video, image and text tasks with video output built into the same multimodal model rather than requiring a separate generation pipeline.

The standout feature is conversational editing, powered by Google's Interactions API: users can iteratively refine a generated video by describing changes in natural language, and the model preserves the parts of the clip they want to keep rather than regenerating from scratch. That workflow is aimed squarely at marketing and learning-and-development teams — the organizational functions that produce the highest volume of short-form video and have historically needed dedicated editors or expensive agency relationships to iterate quickly.

“The competitive field includes OpenAI's Sora, Runway's Gen-4, and Google's own higher-end Veo models.”

Pricing is aggressive: $0.10 per second of generated 720p video puts a 10-second clip at roughly a dollar, though Omni Flash currently caps at 720p while competing Veo tiers scale up to 4K, meaning Google is trading resolution ceiling for cost and iteration speed in this initial release.

The competitive field includes OpenAI's Sora, Runway's Gen-4, and Google's own higher-end Veo models. Omni Flash's differentiation is less about peak visual fidelity and more about API accessibility and conversational iteration speed — attributes that matter more for high-volume enterprise content production than for high-end film or advertising work, which will likely stay on Veo or Sora's premium tiers.

What to watch: whether Google adds 1080p or 4K output to Omni Flash as usage scales, how enterprise marketing teams' actual spend on the API compares to traditional video production budgets, and whether OpenAI responds with its own lower-cost, API-accessible Sora tier to match Omni Flash's price point.

ShareXLinkedInEmail
More onGoogle →

Originally reported by VentureBeat. Analysis and editorial commentary by Value Add Pulse.

← Back to Pulse

Read Next

AI3-tier launch

OpenAI Previews GPT-5.6 Sol, Terra and Luna — Restricted to Trusted Partners at Washington's Request

OpenAI unveiled a three-tier GPT-5.6 family — Sol (flagship, $5/$30 per M tokens), Terra (balanced, $2.50/$15) and Luna (fast/cheap, $1/$6) — on June 26 and released it to only ~20 partner orgs after sharing the deployment plan with the US government. It is the first frontier model gated by explicit federal government coordination before wide release.

AI750 tok/sec

Cerebras Runs OpenAI GPT-5.6 Sol at 750 Tokens per Second, Setting a New Frontier-Model Speed Record

Cerebras Systems will deploy OpenAI's GPT-5.6 Sol on its wafer-scale WSE-3 chips at up to 750 tokens per second in July — roughly 10x faster than any Nvidia GPU deployment of a frontier model in production. The partnership is a strategic proof point for Cerebras ahead of its planned 2026 IPO.

AI~$800M est.

Adobe Acquires Topaz Labs to Bring Best-in-Class AI Image Enhancement Native to Creative Cloud

Adobe agreed on June 25 to acquire Topaz Labs, the pro-grade AI photo and video upscaling and denoising firm favored by photographers and filmmakers. Terms weren't disclosed but industry estimates put the deal in the $700M-$1B range, folding Topaz's Gigapixel, DeNoise and Video AI models directly into Photoshop, Lightroom and Premiere.

@Trace_Cohen·t@nyvp.com