May 19, 2026AgentsOpen SourceFramework

ViMax Runs Video Generation Like a Film Crew

HKUDS dropped ViMax on GitHub Trending, +504 stars today, 5.4k total. End-to-end video generation built as a multi-agent system — Director, Screenwriter, Producer, Video Generator. Throw it an idea or a novel chapter and it walks through script, then storyboard, then multi-camera shots, then consistency checks, then final cut.

The thing that makes ViMax interesting isn't the video quality, which depends on whatever backend model you plug in — Google AI Studio, MiniMax, others. It's the structural bet: long-form video generation is bottlenecked by character consistency, scene continuity, reference management. Those are coordination problems, not pixel problems. Treating them with specialized agents and continuity gates instead of one giant model is a real structural argument.

Four entry points: Idea2Video (raw concept to story), Novel2Video (full novel to episodic content), Script2Video (custom screenplay to production), AutoCameo (your photo becomes a character in the film). MIT license. Active development with 329 commits, no formal release yet.

If you've been waiting for the moment when "multi-agent for X" stops being a research-paper buzzword and starts looking like a buildable product, ViMax is one of the cleanest examples in video.

GitHub: https://github.com/HKUDS/ViMax
← Previous
Forge Squeezes 86% Out of an 8B Local Model
Next →
SkillsVote Treats Agent Skills Like an Election
← Back to all articles

Comments

Loading...
>_