
JoyAI-Echo
JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation
The Lens
JoyAI-Echo is JD.com's open release for generating long video with synced audio from a text prompt. Not the usual five-second clip. It targets multi-shot sequences up to about five minutes and keeps the same character and voice consistent across shots. The weights are actually released, not just a paper and a demo page, which is the difference between a real tool and a press release.
The catch is hardware. You're looking at roughly 46GB of VRAM for the full setup and around 70GB of weights to download. That's an 80GB H100 or A100, data-center territory. There are reduced settings for smaller cards, with the obvious tradeoffs in quality and length.
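If you want to sanity-check a machine before starting the download, here is a minimal sketch. It is not part of JoyAI-Echo. The thresholds are just the rough figures quoted above (about 46GB of VRAM, about 70GB of weights), and the function names `free_disk_gb` and `preflight` are made up for illustration. It uses only the Python standard library, so you supply the VRAM number yourself, for example from your GPU's spec sheet.

```python
import shutil

# Rough figures from this review, not official requirements.
REQUIRED_VRAM_GB = 46.0
REQUIRED_DISK_GB = 70.0


def free_disk_gb(path: str = ".") -> float:
    """Free space on the filesystem holding `path`, in decimal gigabytes."""
    return shutil.disk_usage(path).free / 1e9


def preflight(vram_gb: float, disk_gb: float) -> list:
    """Return a list of human-readable shortfalls; empty means both checks pass."""
    problems = []
    if vram_gb < REQUIRED_VRAM_GB:
        problems.append(
            f"VRAM: have {vram_gb:.0f}GB, need about {REQUIRED_VRAM_GB:.0f}GB"
        )
    if disk_gb < REQUIRED_DISK_GB:
        problems.append(
            f"Disk: have {disk_gb:.0f}GB free, need about {REQUIRED_DISK_GB:.0f}GB"
        )
    return problems


if __name__ == "__main__":
    # Example: a hypothetical 24GB consumer card, checked against local disk space.
    report = preflight(vram_gb=24.0, disk_gb=free_disk_gb())
    for line in report or ["Looks feasible at full settings."]:
        print(line)
```

A failed VRAM check doesn't rule the model out. It means you're in reduced-settings territory, with the quality and length tradeoffs that come with it.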
Read the license before you build anything on this. It ships under the LTX-2 Community License, which forbids commercial use. So it substitutes for Sora, Runway, Kling, and Veo for research, prototyping, and personal projects, but not for anything you plan to sell. For commercial long-form video you're back to the paid services, or to a permissively licensed model like the Wan and Hunyuan families.
Between a non-commercial license and data-center GPU requirements, this is a research showcase for most people, not a production tool. Impressive that it exists and runs. Just know what you can and can't do with it.
Free vs Self-Hosted vs Paid
Source available
Free: Download and run the model at no cost. Weights and code are public.
Self-hosted: The realistic path, if you have the hardware. Plan for 40GB+ of VRAM and roughly 70GB of weights. Reduced-quality settings exist for smaller GPUs.
The license is the real price: The LTX-2 Community License bars commercial use. You can use this for research, learning, and personal work, but the moment money is involved you need a commercial license from Lightricks or a different model entirely. For paid work, compare against hosted services like Runway and Kling, or permissive open models.
Free to download, but the LTX-2 Community License forbids commercial use. Heavy GPU needs (40GB+ VRAM).
License: Other
Review license manually.
Commercial use: ✗ Restricted
About
- Owner: JD.com (Organization)
- Stars: 1,387
- Forks: 121