AI Models
By default, the platform selects an AI model for each request based on what the request needs and the user's credit balance. Every plan can use every model — more capable models cost more credits. Most apps do not need to configure models manually.
Default Selection
When an app makes an AI request without specifying a model, the platform chooses an appropriate available model. The selected model depends on the request type, required capabilities, and efficient defaults for the user's plan.
All users can access every model if they have enough credits. Plans differ mainly in monthly credit allowance, concurrency, and default model selection — not hard model locks.
Model Tiers
Models are grouped into three tiers:
| Tier | Best for | Credit cost |
|---|---|---|
| Lite | Fast, lower-cost tasks such as formatting, classification, and quick summaries | Lowest per request |
| Standard | Balanced quality and cost for most everyday tasks | Moderate per request |
| Advanced | Complex reasoning, higher-quality writing, and demanding tasks | Highest per request |
Fallback Behavior
Automatic model selection includes graceful fallback. If the first model cannot handle a requested capability, such as a specific attachment type or web search, the platform can try another available model so the request still succeeds when possible.
Fallback only applies when the platform is choosing the model. If an app pins a specific model, the platform uses that model and returns an error if it cannot satisfy the request.
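
The difference between automatic selection and pinning can be sketched in a few lines. This is a minimal illustration, not the platform's implementation: the `Capability` values, the `ModelInfo` and `AIRequest` shapes, the `resolveModel` function, and the `model-a`/`model-b` IDs are all hypothetical names invented for this example.

```typescript
// Hypothetical capability names; the platform's real list is not documented here.
type Capability = "text" | "image-attachment" | "web-search";

// Hypothetical description of one candidate model.
interface ModelInfo {
  id: string;
  capabilities: Capability[];
}

// Hypothetical request shape. pinnedModelId is set only when the app pins a model.
interface AIRequest {
  required: Capability[];
  pinnedModelId?: string;
}

// True when the model supports every capability the request needs.
function supports(model: ModelInfo, required: Capability[]): boolean {
  return required.every((cap) => model.capabilities.indexOf(cap) !== -1);
}

// Returns the ID of the model that should serve the request, or throws.
function resolveModel(request: AIRequest, candidates: ModelInfo[]): string {
  if (request.pinnedModelId !== undefined) {
    // Pinned: only the pinned model is considered, with no fallback.
    for (const model of candidates) {
      if (model.id === request.pinnedModelId && supports(model, request.required)) {
        return model.id;
      }
    }
    throw new Error(`Pinned model "${request.pinnedModelId}" cannot satisfy this request`);
  }
  // Automatic: walk the candidates in preference order and take the first fit.
  for (const model of candidates) {
    if (supports(model, request.required)) {
      return model.id;
    }
  }
  throw new Error("No available model can satisfy this request");
}

const demoModels: ModelInfo[] = [
  { id: "model-a", capabilities: ["text"] },
  { id: "model-b", capabilities: ["text", "web-search"] },
];

// model-a lacks web search, so automatic selection falls back to model-b.
console.log(resolveModel({ required: ["text", "web-search"] }, demoModels)); // prints model-b
```

With `pinnedModelId: "model-a"` and the same requirements, the sketch throws instead of falling back, which mirrors the pinned-model behavior described above.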
Model Types
The platform supports models for text, images, video, and speech. For developer-facing details, see the AI text, AI image, AI video, and AI speech references.
To customize model behavior in an app, see Model Selection.
Available Models
The tables below list every model currently available on the platform, grouped by modality. Use the Model ID value when exposing a model selector or pinning a specific model.
Text Models
Used by useAIChat, useAIText, and useAIObject for conversations, text generation, and structured data extraction.
| Model | Model ID | Provider | Tier | Description |
|---|---|---|---|---|
| Gemini 3.1 Flash Lite | gemini-3.1-flash-lite | Google | Lite | Google's most efficient model. Ultra-fast and cost-effective. |
| GPT-5.4 Nano | gpt-5.4-nano | OpenAI | Lite | OpenAI's fastest and most affordable GPT-5.4 variant for lightweight tasks. |
| Gemini 3 Flash | gemini-3-flash-preview | Google | Standard | Google's most balanced model. Quick, accurate, affordable. |
| GPT-5.4 Mini | gpt-5.4-mini | OpenAI | Standard | A faster, more cost-efficient version of GPT-5.4 for well-defined tasks. |
| Claude Haiku 4.5 | claude-haiku-4-5 | Anthropic | Standard | Anthropic's fastest model with near-frontier intelligence. |
| Gemini 3.1 Pro | gemini-3.1-pro-preview | Google | Advanced | Google's best model with world-class multimodal understanding. |
| GPT-5.4 | gpt-5.4 | OpenAI | Advanced | OpenAI's latest model for coding and agentic tasks across industries. |
| GPT-5.5 | gpt-5.5 | OpenAI | Advanced | OpenAI's new class of intelligence for coding and professional work. |
| Claude Sonnet 4.6 | claude-sonnet-4-6 | Anthropic | Advanced | Anthropic's mid tier. Strong accuracy, fast. |
| Claude Opus 4.6 | claude-opus-4-6 | Anthropic | Advanced | Anthropic's flagship. Highest quality. |
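
When exposing a model selector, one option is to keep the Model IDs in a small typed catalog and filter it by tier. The sketch below copies a subset of rows from the table above. The `TextModelOption` shape and the `modelsInTier` and `isKnownModelId` helpers are hypothetical names for this example and are not part of the platform's API.

```typescript
type Tier = "Lite" | "Standard" | "Advanced";

// Hypothetical shape for one entry in a model selector.
interface TextModelOption {
  label: string; // display name shown to the user
  id: string;    // Model ID value from the table
  tier: Tier;
}

// A subset of the text models listed above, copied from the table.
const TEXT_MODELS: TextModelOption[] = [
  { label: "Gemini 3.1 Flash Lite", id: "gemini-3.1-flash-lite", tier: "Lite" },
  { label: "GPT-5.4 Nano", id: "gpt-5.4-nano", tier: "Lite" },
  { label: "Claude Haiku 4.5", id: "claude-haiku-4-5", tier: "Standard" },
  { label: "GPT-5.4", id: "gpt-5.4", tier: "Advanced" },
  { label: "Claude Opus 4.6", id: "claude-opus-4-6", tier: "Advanced" },
];

// Returns the selector options that belong to one tier.
function modelsInTier(tier: Tier): TextModelOption[] {
  return TEXT_MODELS.filter((model) => model.tier === tier);
}

// Guards against pinning an ID that is not in the catalog.
function isKnownModelId(id: string): boolean {
  return TEXT_MODELS.some((model) => model.id === id);
}

// List the Lite options, for example to populate a "fast and cheap" group.
for (const option of modelsInTier("Lite")) {
  console.log(`${option.label}: ${option.id}`);
}
```

Validating an ID with a helper like `isKnownModelId` before pinning it keeps typos from surfacing later as request errors, since pinned models do not fall back.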
Image Models
Used for generating and editing images within your apps.
| Model | Model ID | Provider | Tier | Description |
|---|---|---|---|---|
| GPT Image 2 | openai/gpt-image-2 | OpenAI | Lite | OpenAI's latest image model. Extremely detailed with fine typography. |
| Grok Imagine Image | xai/grok-imagine-image | xAI | Lite | xAI's image model for generation and editing. |
| Gemini 2.5 Flash Image (Nano Banana) | fal-ai/gemini-25-flash-image | Google | Standard | Google's lightweight image model. Fast and affordable. |
| Gemini 3.1 Flash Image (Nano Banana 2) | fal-ai/gemini-3.1-flash-image-preview | Google | Standard | Google's fast tier. Quick, good quality. |
| Gemini 3 Pro Image (Nano Banana Pro) | fal-ai/gemini-3-pro-image-preview | Google | Advanced | Google's flagship. Best quality, higher res. |
Video Models
Used for generating videos from text or images.
| Model | Model ID | Provider | Tier | Description |
|---|---|---|---|---|
| Grok Imagine Video | xai/grok-imagine-video | xAI | Lite | xAI's video model for generation, image-to-video, reference images, editing, and extension. |
| Veo 3.1 Fast | fal-ai/veo3.1/fast | Google | Standard | Fast video generation with audio. Powered by Google Veo. |
| Seedance 2.0 Fast | bytedance/seedance-2.0/fast | ByteDance | Standard | ByteDance's faster Seedance 2.0 video model with native audio, image animation, and reference guidance. |
| Kling V3 | fal-ai/kling-video/v3/standard | Kling | Standard | Fast cinematic video generation with native multi-shot support. Powered by Kling 3.0. |
| Kling O3 | fal-ai/kling-video/o3/standard | Kling | Standard | Kling's faster O3 model for text-to-video, image animation, video references, and video editing. |
| Veo 3.1 | fal-ai/veo3.1 | Google | Advanced | High quality video generation with audio. Powered by Google Veo. |
| Seedance 2.0 | bytedance/seedance-2.0 | ByteDance | Advanced | ByteDance's cinematic video model with native audio, image animation, and reference-image guidance. |
| Kling V3 Pro | fal-ai/kling-video/v3/pro | Kling | Advanced | Cinematic video generation with fluid motion, native audio, and multi-shot support. Powered by Kling 3.0. |
| Kling O3 Pro | fal-ai/kling-video/o3/pro | Kling | Advanced | Kling's O3 Pro model for text-to-video, image animation, video references, and video editing. |
Speech Models
Used for text-to-speech generation.
| Model | Model ID | Provider | Tier | Description |
|---|---|---|---|---|
| GPT-4o Mini TTS | gpt-4o-mini-tts | OpenAI | Lite | OpenAI's fast tier. Natural, cost-effective. |
| ElevenLabs Turbo v2.5 | fal-ai/elevenlabs/tts/turbo-v2.5 | ElevenLabs | Standard | ElevenLabs' low-latency model. Fast, natural speech in 32 languages. |
| TTS-1 HD | tts-1-hd | OpenAI | Advanced | OpenAI's highest-quality TTS model. |
| Eleven v3 | fal-ai/elevenlabs/tts/eleven-v3 | ElevenLabs | Advanced | ElevenLabs' most expressive model. Rich emotion, 70+ languages. |