Feb 20, 2025
Alongside our existing OpenAI GPT stack, we’ve added Anthropic Claude and Google Gemini, creating a three-engine hybrid architecture. Agents can now switch models on the fly, balancing speed, reasoning depth, and cost.
Added
Claude 3 Sonnet – superior long-form reasoning and safety filtering
Gemini 2.5 Flash / Pro – fastest generation, 128 k-token context window
Dynamic Model Router – auto-assigns GPT, Claude, or Gemini based on task type
Improved
Unified prompt templates for consistent cross-model output
Upgraded cost-management: average call cost down 28 %
Parallel multi-model inference cuts complex project completion time by 22 %
Fixed
Resolved JSON parsing errors from model-specific schema differences
Patched fallback-retry logic under rate limits to prevent double billing
Tuned streaming buffers to eliminate lost output fragments