NEWTry LTX-2 text-to-video and image-to-video with native audio sync.

LTX-2 AI - Unrestricted and Cinematic Video with Native Audio

Unrestricted · 20s length · Synchronized audio · Camera-like motion · 480p/720p/1080p

We also offer other advanced models, both for video and images!

Loading...

Meet LTX-2: Open-Source Video Model with Native Audio

LTX-2 is a DiT-based audio-video foundation model that generates video and sound together. It targets production-ready outputs with longer coherent clips, more stable motion, and audio that stays aligned with on-screen action.

  • Native Audio + Video Generation
    Audio is generated alongside the frames, helping dialogue, impacts, and ambience stay synchronized instead of being added as an afterthought.
  • Longer, Coherent Clips
    Create 5-20 second sequences that maintain consistency through movement with fewer floaty artifacts and less 'jello' motion.
  • Text-to-Video & Image-to-Video
    Start from a prompt or a reference image, choose quality settings, and generate a clip you can download and iterate on.

How to Generate with LTX-2

Create a video in three simple steps:

Core LTX-2 Capabilities

A production-oriented video model built for coherent motion and synchronized sound.

No Filter Generation

LTX-2 is open-source and unfiltered, empowering creators to produce content without limitations—perfect for artistic and professional applications.

Camera-Like Motion

Designed to reduce wobble and 'jello' artifacts in longer shots, with more stable motion and grounded scenes.

Native Audio Sync

Generates sound with the video so impacts, ambience, and dialogue can stay aligned with the frames.

Directability with LoRA

Supports LoRA-based controls (including camera moves) for more predictable, repeatable results in local workflows.

Text-to-Video & Image-to-Video

Generate a clip from a prompt or animate a reference image to match storyboards and concepts.

Up to 1080p Output

Choose 480p, 720p, or 1080p depending on speed, cost, and quality needs.

Stats

What You Can Create with LTX-2

native audio, longer clips, and flexible formats.

Output Resolution

Up to 1080p

480p / 720p / 1080p

Clip Duration

5-20s

Longer coherent clips

Pricing

From 4 credits/s

4/6/8 credits per second

Pricing

🎊 Begin creating with our AI image and video models! Select a credits package to get started. 🎊 Prices shown exclude VAT. Applicable taxes vary by location and will be calculated at checkout.

Flexible credit purchases - acquire credits as needed, with no expiration date.

Basic

$10one-time

Ideal for exploring AI video creation.

Includes

  • 800 credits
  • Up to 20 video generations
  • Up to 100 image generations
  • Credits never expire
  • HD quality download
  • No watermark output
  • Commercial use allowed

Excellent for experimentation and compact projects!

Popular

Popular
$28$25one-time

Top pick among video creators.

Includes

12% More Credits
  • 2200 credits
  • Up to 55 video generations
  • Up to 275 image generations
  • Credits never expire
  • HD quality download
  • No watermark output
  • Commercial use allowed
  • Priority support

Optimal value for frequent creators!

Pro

$62$50one-time

Designed for professionals and commercial use.

Includes

20% More Credits
  • 5000 credits
  • Up to 125 video generations
  • Up to 625 image generations
  • Credits never expire
  • HD quality download
  • No watermark output
  • Commercial use allowed
  • Priority support
  • Bulk generation capabilities

Tailored for production-grade workflows!

FAQ

Common Questions About LTX-2 AI

Need additional information? Reach out to contact@ltx-2.pro for support.

1

What is LTX-2?

LTX-2 is an open-source, DiT-based audio-video foundation model designed for high-fidelity video generation with synchronized audio. It supports text-to-video and image-to-video workflows for 5-20 second clips.

2

Does LTX-2 generate audio natively?

Yes. LTX-2 generates audio and video together, and uses a dual-stream design connected by cross-attention to help keep sound aligned with motion frame-by-frame.

3

What resolutions and aspect ratios are supported?

Generate 480p, 720p, or 1080p outputs. Text-to-video supports 16:9 and 9:16 aspect ratios; image-to-video follows the source image framing.

4

How long can LTX-2 clips be?

Choose 5 to 20 seconds. Longer durations give more narrative space while aiming to keep consistency through motion.

5

Can I run LTX-2 locally?

Full-precision runs are VRAM-heavy, but quantized versions can fit on high-end consumer GPUs (often around 24GB VRAM). If you'd rather skip setup, use our web generator.

6

How can I control camera motion and style?

In local workflows, LTX-2 supports LoRA-based controls (including camera moves like dolly-in and pan). On the web, use clear camera language in your prompt and try a few variations for the best result.

Generate Your Next Video with LTX-2 AI

Create 5-20 second clips with synchronized audio, up to 1080p, directly in your browser.