FAL Model Selector
Quick decision guide for choosing fal.ai models. Read references/ for full parameters.
Key Concepts
Image-to-Video vs Reference-to-Video
| | Image-to-Video | Reference-to-Video |
|---|---|---|
| Input | 1 image (starting frame) | Multiple images (1-5 references) |
| Purpose | Animate that exact image | Maintain subject consistency in new scenes |
| The image becomes... | The first frame of the video | A "template" for what the subject looks like |
| Camera/scene | Starts from that exact view | Can be completely different scenes |
Image-to-Video = "animate this exact image"
- Product photo → animate it rotating
- Portrait → make the person smile/blink
Reference-to-Video = "generate video featuring this subject"
- 3 photos of mascot → video of mascot in new scene
- Product from different angles → consistent commercial
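The distinction above can be sketched as a small request builder. This is a minimal illustration, not the fal.ai API itself: the helper name `build_video_request` and the flag `animate_exact_image` are hypothetical, and the argument name `image_urls` for reference images is an assumption to be checked against references/. The endpoint IDs are taken from the tables in this guide.

```python
from typing import Dict, List, Tuple

# Endpoint IDs as listed in this guide's decision tables.
IMAGE_TO_VIDEO = "fal-ai/kling-video/v2.1/pro/image-to-video"
REFERENCE_TO_VIDEO = "fal-ai/veo3.1/reference-to-video"


def build_video_request(
    prompt: str, image_urls: List[str], animate_exact_image: bool
) -> Tuple[str, Dict[str, object]]:
    """Return (endpoint, arguments) for the chosen video mode (hypothetical helper)."""
    if animate_exact_image:
        # Image-to-video: exactly one image, used as the first frame.
        if len(image_urls) != 1:
            raise ValueError("image-to-video takes exactly one image")
        return IMAGE_TO_VIDEO, {"prompt": prompt, "image_url": image_urls[0]}

    # Reference-to-video: 1-5 images that describe the subject, not the first frame.
    if not 1 <= len(image_urls) <= 5:
        raise ValueError("reference-to-video takes 1-5 reference images")
    # ASSUMPTION: "image_urls" is an illustrative parameter name.
    return REFERENCE_TO_VIDEO, {"prompt": prompt, "image_urls": list(image_urls)}
```

The explicit flag, rather than the image count, decides the mode because a single image can serve either as a starting frame or as a lone reference.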
Quick Decision Matrix
Video: Text → Video
| Need | Model | Price/5s | Why |
|---|---|---|---|
| Best quality | fal-ai/kling-video/v2.6/pro/text-to-video | $0.35-0.70 | Cinematic, native audio |
| Google (Veo) | fal-ai/veo3.1 | $1-2 | Google's best, 4K, natural sound |
| OpenAI (Sora) | fal-ai/sora-2/text-to-video | $0.50 | Detailed narratives, lip-sync |
| Budget | fal-ai/ltx-2/text-to-video | $0.30 | Good quality, includes audio |
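The Price/5s column can be turned into a rough budget for other clip lengths. The sketch below assumes cost scales linearly with duration, which this guide does not confirm, so treat the result as an estimate only. Both function names are hypothetical.

```python
from typing import Tuple


def parse_price_range(cell: str) -> Tuple[float, float]:
    """Parse a Price/5s cell such as '$0.50' or '$0.35-0.70' into (low, high)."""
    parts = cell.strip().lstrip("$").split("-")
    return float(parts[0]), float(parts[-1])


def estimate_cost(cell: str, seconds: float) -> Tuple[float, float]:
    """Scale a per-5-second price range to a clip length.

    ASSUMPTION: pricing is linear in duration; actual billing may differ.
    """
    low, high = parse_price_range(cell)
    factor = seconds / 5.0
    return round(low * factor, 4), round(high * factor, 4)


if __name__ == "__main__":
    # A 10-second clip at the Kling text-to-video price listed above.
    print(estimate_cost("$0.35-0.70", 10))
```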
Video: Image → Video
| Need | Model | Price/5s | Why |
|---|---|---|---|
| Best quality | fal-ai/kling-video/v2.1/pro/image-to-video | $0.45 | Cinematic, precise motion |
| Google (Veo) | fal-ai/veo3.1/image-to-video | $1-2 | 4K resolution, Google quality |
| OpenAI (Sora) | fal-ai/sora-2/image-to-video | $0.50 | OpenAI quality, realistic |
| Multi-scene | wan/v2.6/image-to-video | $0.50-0.75 | Scene segmentation |
| Budget | fal-ai/pixverse/v5.5/image-to-video | $0.15-0.40 | Multiple styles |
| Start+End frame | fal-ai/bytedance/seedance/v1.5/pro/image-to-video | $0.26 | Control both ends |
Video: Reference → Video (Multiple Images)
| Need | Model | Price/5s | Why |
|---|---|---|---|
| Google (Veo) | fal-ai/veo3.1/reference-to-video | $0.50-1.75 | Up to 5 refs, best consistency |
| Multi-ref | fal-ai/vidu/q2/reference-to-video | $0.50 | Multiple reference images |
| First+Last frame | fal-ai/veo3.1/first-last-frame-to-video | $1-3 | Controlled transitions |
Video: Extend/Remix
| Need | Model | Price/5s | Why |
|---|---|---|---|
| Extend (Google) | fal-ai/veo3.1/extend-video | $1-2 | Up to 30s total |
| Remix (OpenAI) | fal-ai/sora-2/video-to-video/remix | $0.50 | Restyle existing videos |
Image: Generate (Text → Image)
| Need | Model | Price | Why |
|---|---|---|---|
| Best quality | fal-ai/flux-2-pro | $0.03/MP | Zero-config, production-ready |
| Google (Gemini) | fal-ai/gemini-3-pro-image-preview | $0.15-0.30 | Up to 4K, web search |
| OpenAI (GPT) | fal-ai/gpt-image-1.5 | varies | DALL-E quality, editing |
| Text/typography | fal-ai/ideogram/v3 | $0.03-0.09 | Best for logos, posters |
| Budget | fal-ai/flux-2 | $0.012/MP | Good quality, cheaper |
| Custom style | fal-ai/flux-2/lora | $0.021/MP | Up to 3 LoRAs |
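Per-megapixel prices depend on output size, so a quick calculation helps when comparing the FLUX rows above. This sketch assumes one megapixel is 1,000,000 pixels and that billing is proportional with no rounding; the provider may round differently. The function names are hypothetical.

```python
def megapixels(width: int, height: int) -> float:
    """Image area in megapixels (ASSUMPTION: 1 MP = 1,000,000 pixels)."""
    return (width * height) / 1_000_000


def image_cost(width: int, height: int, price_per_mp: float) -> float:
    """Estimated cost of one image at a per-megapixel price, without rounding."""
    return megapixels(width, height) * price_per_mp


if __name__ == "__main__":
    # Compare the two per-MP prices listed above for a 1024x1024 output.
    for name, price in [("fal-ai/flux-2-pro", 0.03), ("fal-ai/flux-2", 0.012)]:
        print(f"{name}: ${image_cost(1024, 1024, price):.4f}")
```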
Image: Edit
| Need | Model | Price | Why |
|---|---|---|---|
| General edit | fal-ai/flux-2/edit | $0.012/MP | Multi-image, natural language |
| OpenAI (GPT) | fal-ai/gpt-image-1.5/edit | varies | DALL-E editing quality |
| Google (Gemini) | fal-ai/gemini-3-pro-image-preview/edit | $0.15-0.30 | 4K editing capability |
| Text in image | fal-ai/qwen-image-edit-plus | - | Superior text editing |
| Fast iteration | fal-ai/reve/edit | $0.04/img | Quick, remix support |
| Style transfer | fal-ai/flux/dev/image-to-image | $0.03/MP | Preserve composition |
Image: Specialized Apps
| Need | Model | Why |
|---|---|---|
| Virtual try-on | fal-ai/image-apps-v2/virtual-try-on | Clothing on person |
| Product photo | fal-ai/image-apps-v2/product-photography | Studio lighting |
| Relighting | fal-ai/image-apps-v2/relighting | 17 lighting presets |
| Remove object | fal-ai/image-apps-v2/object-removal | Text-based targeting |
| Style transfer | fal-ai/image-apps-v2/style-transfer | 26 artistic presets |
| Outpaint | fal-ai/image-apps-v2/outpaint | Extend boundaries |
Image: Enhancement
| Need | Model | Price | Why |
|---|---|---|---|
| Upscale (portrait) | clarityai/crystal-upscaler | $0.016/MP | Facial detail focus |
| Remove background | fal-ai/birefnet/v2 | - | Multiple models, up to 2304px |
| Segmentation | fal-ai/sam-3/image | $0.005 | Text/point/box prompts |
Image: Character/Identity
| Need | Model | Why |
|---|---|---|
| Consistent character | fal-ai/photomaker | Identity-preserving portraits |
| Brand style | fal-ai/flux-2/lora | Custom trained LoRAs |
Video: Enhancement
| Need | Model | Price | Why |
|---|---|---|---|
| Upscale | clarityai/crystal-video-upscaler | $0.10/MP-sec | Up to 5K |
| Lipsync | fal-ai/sync-lipsync/v2/pro | ~1.67x base | Natural features |
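The MP-sec unit multiplies frame area by clip length. As a rough worked example, the sketch below prices a video upscale at the $0.10/MP-sec rate listed above. It assumes 1 MP = 1,000,000 pixels and leaves open whether the billed resolution is the input or the output; this guide does not say, so pass whichever applies. The function name is hypothetical.

```python
def video_upscale_cost(
    width: int, height: int, seconds: float, price_per_mp_sec: float = 0.10
) -> float:
    """Estimated cost = megapixels per frame x duration in seconds x rate.

    ASSUMPTION: 1 MP = 1,000,000 pixels and no rounding is applied.
    """
    mp = (width * height) / 1_000_000
    return mp * seconds * price_per_mp_sec


if __name__ == "__main__":
    # 5 seconds billed at 1920x1080: 2.0736 MP x 5 s x $0.10.
    print(f"${video_upscale_cost(1920, 1080, 5):.4f}")
```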
Other
| Need | Model | Price | Why |
|---|---|---|---|
| TTS | fal-ai/maya | $0.002/sec | Emotions, accents |
| Image → 3D | fal-ai/hunyuan3d-v3/image-to-3d | $0.375 | GLB/OBJ/FBX |
Detailed References
references/video-generation.md - 16+ video models (incl. all Veo/Sora endpoints)
references/image-tools.md - 20+ image models (incl. Gemini/GPT Image)
references/other-tools.md - TTS, 3D, upscaling, lipsync
Common Patterns
import fal_client

# Product demo video
result = fal_client.subscribe(
    "fal-ai/kling-video/v2.1/pro/image-to-video",
    arguments={
        "prompt": "Product rotates smoothly, camera zooms in",
        "image_url": "https://example.com/product.jpg",
        "duration": "5",
    },
)

# Remove background
result = fal_client.subscribe(
    "fal-ai/birefnet/v2",
    arguments={
        "image_url": "https://example.com/photo.jpg",
        "model": "General",
    },
)

# Studio product photo
result = fal_client.subscribe(
    "fal-ai/image-apps-v2/product-photography",
    arguments={
        "product_image_url": "https://example.com/product.jpg",
        "aspect_ratio": "1:1",
    },
)

# Virtual try-on
result = fal_client.subscribe(
    "fal-ai/image-apps-v2/virtual-try-on",
    arguments={
        "person_image_url": "https://example.com/person.jpg",
        "clothing_image_url": "https://example.com/shirt.jpg",
    },
)
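These patterns can also be chained, feeding one model's output into the next. The sketch below removes a background and then passes the cutout to the product-photography app. It is a sketch under assumptions: the helper name is hypothetical, and the result shape `{"image": {"url": ...}}` for the background-removal step is assumed rather than confirmed by this guide, so check references/ before relying on it. The `subscribe` callable is injected so the flow can be exercised without network access.

```python
from typing import Any, Callable, Dict

# Any callable with the shape subscribe(endpoint, arguments=...) -> dict.
Subscribe = Callable[..., Dict[str, Any]]


def cutout_then_product_photo(source_url: str, subscribe: Subscribe) -> Dict[str, Any]:
    """Remove the background, then generate a studio product photo (hypothetical helper)."""
    cutout = subscribe(
        "fal-ai/birefnet/v2",
        arguments={"image_url": source_url, "model": "General"},
    )
    # ASSUMPTION: the background-removal result exposes the image URL here.
    cutout_url = cutout["image"]["url"]
    return subscribe(
        "fal-ai/image-apps-v2/product-photography",
        arguments={"product_image_url": cutout_url, "aspect_ratio": "1:1"},
    )
```

In real use, pass the client function directly, for example `cutout_then_product_photo("https://example.com/photo.jpg", fal_client.subscribe)`.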