Adaptive

Adaptive is an intelligent model router that automatically selects the best AI model for each task. Instead of manually choosing between dozens of models, Adaptive analyzes your prompt and routes it to the model that will deliver the best result.

How it works

When you select Adaptive in the model picker, Windsurf evaluates each request and dynamically chooses the right underlying model. Simple tasks get routed to fast, efficient models. Complex tasks get routed to more capable ones. This means you get the right level of intelligence for every prompt without overspending on premium models for routine work. Adaptive helps your usage allowance last longer by avoiding unnecessary use of expensive models.

Selecting Adaptive

To use Adaptive, open the model picker below the Cascade input box and select Adaptive at the top of the list. Once selected, Adaptive will be used for all subsequent messages in the conversation.

You can switch away from Adaptive to a specific model at any time.

Adaptive is the best default for most users.

Pricing

Adaptive pricing depends on your billing plan.

Self-serve
Enterprise (Cognition Platform - ACUs)
Enterprise (Legacy Credits)

Adaptive draws down your quota at a fixed per-token rate, regardless of which underlying model is selected for a given request. Currently, Adaptive consumes quota and overage at an introductory promotional rate (through April 27, 2026).

Token type	Cost per 1M tokens
Input tokens	$0.50
Output tokens	$2.00
Cache read tokens	$0.10

These rates also apply to extra usage beyond your included quota.

Because Adaptive routes simpler tasks to lighter models, it typically consumes fewer tokens overall than manually selecting a frontier model for every request. This makes it the most cost-effective option for most users.
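As a rough illustration of how a fixed per-token rate translates into cost, the sketch below applies the promotional rates from the pricing table to a single request. The function name `estimate_cost` and the way token counts are passed in are hypothetical, and the sketch assumes cache read tokens are counted separately from regular input tokens. Actual billing may differ.

```python
from decimal import Decimal

# Promotional rates in USD per 1M tokens, copied from the pricing table.
RATES_PER_MILLION = {
    "input": Decimal("0.50"),
    "output": Decimal("2.00"),
    "cache_read": Decimal("0.10"),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, cache_read_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cache read tokens are billed separately from input tokens.
    """
    usage = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
    }
    # The rate is the same no matter which underlying model handled the request.
    return sum(RATES_PER_MILLION[kind] * count / ONE_MILLION
               for kind, count in usage.items())


if __name__ == "__main__":
    print(estimate_cost(200_000, 50_000, 1_000_000))
```

Under these assumptions, a request with 200,000 input tokens, 50,000 output tokens, and 1,000,000 cache read tokens comes to $0.30 ($0.10 for each token type).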

Tips for getting the most out of Adaptive

Be specific with your prompts. Clear, focused instructions help Adaptive route to the right model and reduce unnecessary token usage.
Leverage prompt caching. Staying on the same model across turns in a conversation enables caching, which significantly reduces input token costs. Adaptive takes this into account when routing.
Use Adaptive as your default. For most workflows, Adaptive is the best starting point. Switch to a specific model only when you have a particular reason to, for example when you need a specific model’s reasoning capabilities for a complex task.
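To see why prompt caching matters, here is a simplified sketch comparing the input-side cost of a multi-turn conversation with and without caching, using the promotional input and cache read rates listed under Pricing. The billing model is an assumption made for illustration: each turn resends the full prior context, and with caching that prior context is billed at the cache read rate. Output tokens and the growth of context from model replies are ignored, and `conversation_input_cost` is a hypothetical helper, not part of any Windsurf API.

```python
from decimal import Decimal

# Promotional rates in USD per token, derived from the per-1M-token prices.
INPUT_RATE = Decimal("0.50") / Decimal(1_000_000)
CACHE_READ_RATE = Decimal("0.10") / Decimal(1_000_000)


def conversation_input_cost(new_tokens_per_turn, cached):
    """Estimate input-side cost across turns (hypothetical, simplified model).

    new_tokens_per_turn: new input tokens added on each turn.
    cached: if True, context from earlier turns is billed as cache reads.
    """
    total = Decimal(0)
    context = 0  # tokens already sent in earlier turns
    for new_tokens in new_tokens_per_turn:
        if cached:
            # Earlier context is read from cache; only new tokens are full price.
            total += context * CACHE_READ_RATE + new_tokens * INPUT_RATE
        else:
            # Everything is billed as fresh input on every turn.
            total += (context + new_tokens) * INPUT_RATE
        context += new_tokens
    return total


if __name__ == "__main__":
    turns = [10_000, 2_000, 2_000]
    print("without caching:", conversation_input_cost(turns, cached=False))
    print("with caching:   ", conversation_input_cost(turns, cached=True))
```

For the three-turn example, this model gives $0.018 without caching and $0.0092 with caching, roughly half the input cost. The gap widens as the conversation grows, because a larger share of each turn is previously seen context.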
