LTX-2 AI - Unrestricted and Cinematic Video with Native Audio
Unrestricted · 20s length · Synchronized audio · Camera-like motion · 480p/720p/1080p
We also offer other advanced models, both for video and images!
Why You Should Choose LTX-2
An open-source video model for T2V and I2V with native audio sync, 20s clips, and 1080p resolution. Most importantly, it has no filter — create the videos you want, without Restrictions.
No Filter
Native Audio Sync
Longer Coherent Clips (5-20s)
Directable Camera Motion (LoRA)
Meet LTX-2: Open-Source Video Model with Native Audio
LTX-2 is a DiT-based audio-video foundation model that generates video and sound together. It targets production-ready outputs with longer coherent clips, more stable motion, and audio that stays aligned with on-screen action.
- Native Audio + Video GenerationAudio is generated alongside the frames, helping dialogue, impacts, and ambience stay synchronized instead of being added as an afterthought.
- Longer, Coherent ClipsCreate 5-20 second sequences that maintain consistency through movement with fewer floaty artifacts and less 'jello' motion.
- Text-to-Video & Image-to-VideoStart from a prompt or a reference image, choose quality settings, and generate a clip you can download and iterate on.
How to Generate with LTX-2
Create a video in three simple steps:
Core LTX-2 Capabilities
A production-oriented video model built for coherent motion and synchronized sound.
No Filter Generation
LTX-2 is open-source and unfiltered, empowering creators to produce content without limitations—perfect for artistic and professional applications.
Camera-Like Motion
Designed to reduce wobble and 'jello' artifacts in longer shots, with more stable motion and grounded scenes.
Native Audio Sync
Generates sound with the video so impacts, ambience, and dialogue can stay aligned with the frames.
Directability with LoRA
Supports LoRA-based controls (including camera moves) for more predictable, repeatable results in local workflows.
Text-to-Video & Image-to-Video
Generate a clip from a prompt or animate a reference image to match storyboards and concepts.
Up to 1080p Output
Choose 480p, 720p, or 1080p depending on speed, cost, and quality needs.
What You Can Create with LTX-2
native audio, longer clips, and flexible formats.
Output Resolution
Up to 1080p
480p / 720p / 1080p
Clip Duration
5-20s
Longer coherent clips
Pricing
From 4 credits/s
4/6/8 credits per second
Pricing
🎊 Begin creating with our AI image and video models! Select a credits package to get started. 🎊 Prices shown exclude VAT. Applicable taxes vary by location and will be calculated at checkout.
Basic
Ideal for exploring AI video creation.
Includes
- 800 credits
- Up to 20 video generations
- Up to 100 image generations
- Credits never expire
- HD quality download
- No watermark output
- Commercial use allowed
Excellent for experimentation and compact projects!
Popular
PopularTop pick among video creators.
Includes
12% More Credits- 2200 credits
- Up to 55 video generations
- Up to 275 image generations
- Credits never expire
- HD quality download
- No watermark output
- Commercial use allowed
- Priority support
Optimal value for frequent creators!
Pro
Designed for professionals and commercial use.
Includes
20% More Credits- 5000 credits
- Up to 125 video generations
- Up to 625 image generations
- Credits never expire
- HD quality download
- No watermark output
- Commercial use allowed
- Priority support
- Bulk generation capabilities
Tailored for production-grade workflows!
Common Questions About LTX-2 AI
Need additional information? Reach out to contact@ltx-2.pro for support.
What is LTX-2?
LTX-2 is an open-source, DiT-based audio-video foundation model designed for high-fidelity video generation with synchronized audio. It supports text-to-video and image-to-video workflows for 5-20 second clips.
Does LTX-2 generate audio natively?
Yes. LTX-2 generates audio and video together, and uses a dual-stream design connected by cross-attention to help keep sound aligned with motion frame-by-frame.
What resolutions and aspect ratios are supported?
Generate 480p, 720p, or 1080p outputs. Text-to-video supports 16:9 and 9:16 aspect ratios; image-to-video follows the source image framing.
How long can LTX-2 clips be?
Choose 5 to 20 seconds. Longer durations give more narrative space while aiming to keep consistency through motion.
Can I run LTX-2 locally?
Full-precision runs are VRAM-heavy, but quantized versions can fit on high-end consumer GPUs (often around 24GB VRAM). If you'd rather skip setup, use our web generator.
How can I control camera motion and style?
In local workflows, LTX-2 supports LoRA-based controls (including camera moves like dolly-in and pan). On the web, use clear camera language in your prompt and try a few variations for the best result.
Generate Your Next Video with LTX-2 AI
Create 5-20 second clips with synchronized audio, up to 1080p, directly in your browser.
