Feb 20, 2025

Alongside our existing OpenAI GPT stack, we’ve added Anthropic Claude and Google Gemini, creating a three-engine hybrid architecture. Agents can now switch models on the fly, balancing speed, reasoning depth, and cost.

Added

  • Claude 3 Sonnet – superior long-form reasoning and safety filtering

  • Gemini 2.5 Flash / Pro – fastest generation, 128 k-token context window

  • Dynamic Model Router – auto-assigns GPT, Claude, or Gemini based on task type

Improved

  • Unified prompt templates for consistent cross-model output

  • Upgraded cost-management: average call cost down 28 %

  • Parallel multi-model inference cuts complex project completion time by 22 %

Fixed

  • Resolved JSON parsing errors from model-specific schema differences

  • Patched fallback-retry logic under rate limits to prevent double billing

  • Tuned streaming buffers to eliminate lost output fragments