Using Fast Context
When Cascade receives a query that requires code search, Fast Context will trigger automatically. You can also force it to activate by usingCmd+Enter (Mac) or Ctrl+Enter (Windows/Linux) when submitting your query.
You’ll notice Fast Context is working when:
- Cascade quickly identifies relevant files across your codebase
- Large codebase queries complete faster than before
- Cascade spends less time reading irrelevant code
How It Works
Fast Context usesSWE-grep and SWE-grep-mini, custom models trained specifically for rapid code retrieval. These models combine the speed of traditional embedding search with the intelligence of agentic exploration.
When you make a query to Cascade that requires searching through your codebase, Fast Context automatically activates to:
- Identify relevant files and code sections using parallel tool calls
- Execute multiple searches simultaneously
- Return targeted results in seconds rather than minutes
SWE-grep Models
Fast Context is powered by the SWE-grep model family:- SWE-grep: High-intelligence variant optimized for complex retrieval tasks
- SWE-grep-mini: Ultra-fast variant serving at over 2,800 tokens per second