Skip to main content

Video Generation

Access 15+ AI video model families with 45+ model variants in one platform — text-to-video, image-to-video, reference-to-video, motion control, audio generation, and AI avatar modes. Credit Cost: 8 credits (5s) | 16 credits (10s) | 24 credits (15s)
Generation Time: 2–5 minutes
Durations: 5s, 10s, or 15s (model-dependent)
Access: app.wrldwide.ai/video

Kling Models

Kling 3.0 Pro

Premium 15s generation with voice control and keyframe animation
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s, 15s
  • Aspect Ratios: 1:1, 16:9, 9:16
  • Features: Voice control, first + last frame keyframe support

Kling 3.0 (Standard)

High-quality 15s video with improved motion consistency
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s, 15s
  • Aspect Ratios: 1:1, 16:9, 9:16

Kling O3 Pro ✨ NEW

Next-generation Kling with highest quality and voice control
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s, 15s
  • Features: Premium quality, voice control, first + last frame (image mode)

Kling O3 ✨ NEW

Next-gen Kling with native audio and reference-to-video
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s, 15s
  • Features: Native audio, reference-preserving cinematic style transfer

Kling 2.6

Versatile model with motion control and AI avatar modes
  • Modes: Text → Video, Image → Video, Motion Control, AI Avatar (talking avatar)
  • Durations: 5s, 10s
  • Aspect Ratios: 1:1, 16:9, 9:16

Kling 2.6 Pro (Image-to-Video)

Professional I2V with first + last frame keyframe support
  • Mode: Image → Video
  • Durations: 5s, 10s
  • Features: First + last frame keyframe support

Sora 2 (OpenAI)

OpenAI’s flagship video model
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s (T2V) | 5s (I2V)
  • Aspect Ratios: 1:1, 16:9, 9:16
  • Features: Watermark removal option, seed control

Veo 3 (Google DeepMind)

Google’s video model with multi-image support
  • Modes: Text → Video, Image → Video
  • Aspect Ratios: 16:9, 9:16, Auto
  • Features: Multi-image reference, seed control, Fast and Quality variants

Runway Gen-3

Runway with camera control
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s
  • Aspect Ratios: 1:1, 16:9, 9:16
  • Quality: 720p, 1080p
  • Features: Camera control keywords, 720p / 1080p quality selector

Seedance Models

Seedance 1.5 ✨ #1 RANKED

ByteDance’s top model — joint audio + video generation
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s
  • Aspect Ratios: 1:1, 16:9, 9:16, 4:3, 3:4, 21:9
  • Resolution: 480p, 720p
  • Features: Native audio generation, end frame support (I2V)

Seedance 1.0

Budget-friendly Seedance with good quality
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s
  • Features: Audio support

Hailuo 2.3 (Minimax)

Director-mode video with cinematic quality
  • Modes: Text → Video, Image → Video
  • Features: Director-style control, natural motion

Pixverse 4.5 ✨ NEW

Stylized video generation with creative effects
  • Modes: Text → Video, Image → Video
  • Features: Creative stylized effects

WAN 2.6

Professional video with resolution control
  • Modes: Text → Video, Image → Video
  • Durations: 5s, 10s
  • Resolution Options: 480p, 720p, 1080p

LTX-2

Fast open-source video generation
  • Modes: Text → Video, Image → Video
  • Duration: 5s, 10s

LTX-2 19B ✨ NEW

Premium 19B parameter model for higher quality
  • Mode: Image → Video
  • Duration: 5s, 10s
  • Features: 19B parameters, improved motion quality

Veo 3.1 ✨ NEW

Google’s reference-to-video model with multi-image support
  • Mode: Reference → Video
  • Duration: 5s, 10s
  • Features: Multi-image reference support, cinematic style transfer

Veo 3 Fast ✨ NEW

Speed-optimized Veo 3 with audio generation
  • Mode: Text → Video
  • Duration: 5s, 10s
  • Features: Native audio generation, faster processing

Veo 2 ✨ NEW

Google’s image-to-video model
  • Mode: Image → Video
  • Duration: 5s, 10s

PixVerse 5 ✨ NEW

Enhanced creative effects and improved motion
  • Modes: Text → Video, Image → Video
  • Duration: 5s, 10s
  • Features: Enhanced creative effects, improved motion quality over 4.5

Vidu Q3 ✨ NEW

Fast turbo video generation
  • Modes: Text → Video, Image → Video
  • Duration: 5s, 10s
  • Features: Fast turbo processing

Grok Video ✨ NEW

xAI video generation with native audio
  • Modes: Text → Video, Image → Video
  • Duration: 5s, 10s
  • Features: xAI native audio generation

Hailuo 02 (MiniMax)

Standard MiniMax video generation
  • Modes: Text → Video, Image → Video
  • Duration: 5s, 10s

Seedance 1.0 Pro ✨ NEW

Pro-tier image-to-video with improved quality
  • Mode: Image → Video
  • Duration: 5s, 10s
  • Features: Enhanced quality over standard Seedance 1.0

WAN 2.5

Preview version of WAN video generation
  • Mode: Image → Video
  • Duration: 5s, 10s

WAN 2.2

14B parameter WAN model
  • Mode: Image → Video
  • Duration: 5s, 10s
  • Features: 14B parameters

WAN Effects ✨ NEW

Stylized video effects
  • Mode: Image → Video
  • Duration: 5s, 10s
  • Features: Stylized effects and creative transformations

MMAudio V2 ✨ NEW

AI audio generation for existing videos
  • Mode: Video → Video (adds AI-generated audio)
  • Features: Generate matching audio for any existing video — music, sound effects, ambient audio

Generation Modes

Text → Video

Generate video from a text prompt alone. Most models support this mode.

Image → Video

Upload a reference image and animate it. Nearly all models support I2V. Provides much more control over subject and composition.

Reference → Video

Veo 3.1. Upload multiple reference images and the model creates a video incorporating all of them. Great for product showcase reels.

Motion Control

Kling 2.6 only. Upload a reference image + a source video. The model applies the motion pattern from the video to your image. Great for product animations.

AI Avatar

Kling 2.6 only. Upload a portrait photo + audio file. Generates a talking avatar with synchronized lip movement.

Add Audio

MMAudio V2. Upload an existing video and generate AI-matched audio — music, sound effects, or ambient audio that matches the visual content.

How to Use

1

Go to Video Generator

2

Select Model

Choose from all available models. Quick guide:
  • Best overall quality → Kling 3.0 Pro
  • Audio in video → Seedance 1.5 (ranked #1 for audio+video)
  • Camera control → Runway Gen-3
  • Longest clips (15s) → Kling 3.0 or Kling 3.0 Pro
  • Motion transfer → Kling 2.6 Motion Control
  • AI talking avatar → Kling 2.6 AI Avatar
  • Budget → Seedance 1.0, LTX-2, WAN 2.6
3

Choose Mode

  • Text → Video — Generate from prompt only
  • Image → Video — Upload reference image and animate it
  • Motion Control — Upload image + source video (Kling 2.6)
  • AI Avatar — Upload portrait + audio (Kling 2.6)
4

Configure Settings

  • Duration: 5s, 10s, or 15s (model-dependent)
  • Aspect Ratio: 1:1, 16:9, 9:16 (model-dependent)
  • Resolution: 480p, 720p, 1080p (model-dependent)
  • Audio: Enable/disable (Kling, Seedance)
  • Seed: For reproducibility (Sora 2, Veo 3)
5

Write Prompt

Luxury perfume bottle rotating slowly on velvet surface,
golden hour lighting, slow dolly zoom in, premium commercial aesthetic
6

Generate

Click Generate and wait 2–5 minutes

Model Comparison

Model FamilyMax DurationAudioBest For
Kling 3.0 Pro15sTop quality + long clips
Kling 3.015sLonger videos
Kling O3 Pro15sNext-gen premium
Kling O315sNative audio + style
Kling 2.610sMotion transfer + avatar
Kling 2.6 Pro10sI2V with keyframes
Sora 210s T2V / 5s I2VOpenAI quality
Veo 310sMulti-image reference
Veo 3.110sReference-to-video
Veo 3 Fast10sFast + audio
Veo 210sGoogle I2V
Runway Gen-310sCamera control
Grok Video10sxAI native audio
Vidu Q310sFast turbo
PixVerse 510sEnhanced effects
Pixverse 4.510sStylized/effects
Seedance 1.510s#1 audio+video
Seedance 1.0/Pro10sBudget audio
Hailuo 2.310sDirector mode
Hailuo 0210sStandard MiniMax
WAN 2.610sResolution control
WAN 2.5/2.210sLegacy WAN
WAN Effects10sStylized effects
LTX-210sFast open-source
LTX-2 19B10sPremium open-source
MMAudio V2N/AAdd audio to video

Prompt Best Practices

Structure Your Prompts

[Subject/Action] + [Camera Movement] + [Lighting] + [Style/Mood]
Example:
Luxury watch on rotating pedestal, slow orbit camera around product,
soft studio lighting, premium commercial aesthetic

Camera Movement Keywords

  • Dolly: dolly in, dolly out
  • Pan: pan left, pan right
  • Tilt: tilt up, tilt down
  • Zoom: slow zoom in, zoom out
  • Orbit: orbit around subject, circular motion
  • Static: static camera, locked shot

Model-Specific Tips

Use for your most important content. Supports 15-second clips for more narrative. Enable voice control for narrated videos. Use first + last frame for precise transformations.
Best for cinematic style transfer — upload a reference image and describe a visual style. Native audio generation means you can add music or sound effects directly.
Currently the #1 ranked model for joint audio-video generation. Ideal for product demos with background music, social media content with sound, or any video where the audio should feel native rather than dubbed.
Add camera movement keywords directly in the prompt: slow dolly in, pan left, orbit shot. Best for controlled, predictable camera movements in commercial content.
Use seed values to get reproducible results. Toggle watermark removal for clean output. Best for high-narrative, story-driven video content.
The only model with multi-image reference support. Upload multiple product shots and the model will create a video incorporating all of them. Good for product showcase reels.
Upload a product image + a reference video with the motion you want. The model transfers the motion pattern onto your product. Great for creating product rotation videos from a single photo.

Cinema Studio Integration

Take video generation further with professional cinema controls:
  • 11 Cinema Lenses — Spherical and anamorphic lenses
  • Focal Length — 8mm to 50mm
  • Aperture — f/1.4, f/4, f/11
  • 6-Axis Camera Control — Horizontal, vertical, pan, tilt, zoom, roll sliders
See Cinema Studio for full details.

Use Cases

Premium Product Commercial

[Kling 3.0 Pro, 15s, Image-to-Video]
Premium skincare serum bottle, slow zoom in, soft golden hour lighting,
luxury aesthetic, cinematic depth of field, commercial grade

Social Media Reel

[Seedance 1.5, 10s, Text-to-Video, 9:16]
Energetic fitness product unboxing, fast cuts, vibrant lighting,
modern music-video aesthetic

Product Rotation

[Kling 2.6 Motion Control]
Upload: product image + reference rotation video
→ Apply rotation motion to product

Talking Brand Avatar

[Kling 2.6 AI Avatar]
Upload: presenter portrait + voice-over audio
→ Generate talking avatar for ad

Next Steps