Overview

SynapseX provides access to the world’s leading AI models through a unified API endpoint. All models are available via the SynapseX inference platform — no need to manage multiple API keys or provider accounts. Switch between models instantly in the VS Code Agent or configure your preferred default model in the CLI.

Claude Sonnet 4

Best for: Complex reasoning, architecture design, code reviewAnthropic’s flagship model for enterprise coding. Exceptional at multi-step reasoning and understanding large codebases.Model ID: claude-sonnet-4-5

GPT-4o

Best for: Rapid code generation, multimodal tasksOpenAI’s flagship model with vision capabilities. Excellent for UI generation from screenshots and fast iteration.Model ID: gpt-4o

Gemini 2.5 Pro

Best for: Long context, entire-codebase analysisGoogle’s frontier model with the largest context window. Ideal for analyzing large repositories or long documents.Model ID: gemini-2.5-pro

DeepSeek R1

Best for: Mathematical reasoning, algorithm designState-of-the-art open-source model with chain-of-thought reasoning. Cost-effective for compute-intensive tasks.Model ID: deepseek-r1

Full Model Reference

Anthropic — Claude Models

ModelContextBest For
claude-opus-4200K tokensHardest tasks, deep reasoning, long-horizon planning
claude-sonnet-4-5200K tokensBalanced performance — recommended default
claude-haiku-3-5200K tokensFast, cost-efficient for simple tasks
claude-3-7-sonnet200K tokensExtended thinking for complex math and logic

OpenAI — GPT Models

ModelContextBest For
gpt-4o128K tokensMultimodal, vision, fast generation
gpt-4o-mini128K tokensCost-effective for high-volume tasks
o3200K tokensAdvanced reasoning with extended thinking
o4-mini200K tokensFast reasoning for coding tasks
gpt-4.11M tokensVery long context, document analysis

Google — Gemini Models

ModelContextBest For
gemini-2.5-pro1M tokensLargest context, full repo analysis
gemini-2.5-flash1M tokensFast multimodal processing
gemini-2.0-flash1M tokensReal-time streaming, low latency

Meta — Llama Models

ModelContextBest For
llama-4-maverick1M tokensOpen source, self-hosted option
llama-4-scout10M tokensMassive context window tasks
llama-3.3-70b128K tokensCost-effective open source

DeepSeek Models

ModelContextBest For
deepseek-r164K tokensChain-of-thought reasoning, math
deepseek-v364K tokensHigh-quality general coding
deepseek-r1-lite32K tokensCost-efficient reasoning

Mistral Models

ModelContextBest For
mistral-large-2128K tokensEuropean data residency compliance
mixtral-8x22b64K tokensHigh throughput tasks
codestral256K tokensCode-specific generation

Perplexity — Search-Augmented Models

ModelContextBest For
sonar-pro128K tokensWeb-augmented queries, real-time info
sonar-reasoning128K tokensResearch with citations
sonar128K tokensFast web search

Qwen Models (Alibaba Cloud)

ModelContextBest For
qwen2.5-72b128K tokensCost-effective at scale
qwen2.5-coder-32b128K tokensSpecialized code generation
qwq-32b32K tokensReasoning tasks

Model Selection Guide

By Task Type

TaskRecommended Model
Architecture design & planningclaude-opus-4 or claude-sonnet-4-5
Rapid prototypinggpt-4o or claude-haiku-3-5
Large codebase analysisgemini-2.5-pro
Security auditclaude-sonnet-4-5
Math & algorithm designdeepseek-r1 or o3
UI generation from screenshotsgpt-4o (vision)
Web-augmented researchsonar-pro
High-volume automationdeepseek-v3 or qwen2.5-72b
EU data residencymistral-large-2

By Context Size Needed

Context RequiredUse Model
< 32K tokens (small files)claude-haiku-3-5, gpt-4o-mini
32K–128K tokens (modules)claude-sonnet-4-5, gpt-4o
128K–200K tokens (large features)claude-opus-4, gemini-2.5-flash
200K–1M tokens (full repositories)gemini-2.5-pro, gpt-4.1
> 1M tokens (massive codebases)llama-4-scout (10M context)

Switching Models

In VS Code Agent

Click the model selector dropdown in the SynapseX chat panel. Your conversation context is preserved when switching models mid-session.

In SynapseX CLI

# Create an agent with a specific model
synapsex agent create repo-analyzer --model gpt-4o

# Invoke that agent
synapsex agent invoke repo-analyzer "Analyze this repository"

Via API

import requests

response = requests.post(
    "https://api.synapsex.ai/v1/chat/completions",
    headers={"Authorization": f"Bearer {SYNAPSEX_API_KEY}"},
    json={
        "model": "claude-sonnet-4-5",
        "messages": [{"role": "user", "content": "Explain this code..."}]
    }
)

Model Availability

All models are available through:
  • SynapseX VS Code Agent — model selector in chat panel
  • SynapseX CLIsynapsex agent create --model <model>
  • SynapseX APImodel field in chat completions request
  • SynapseX Platform — model selector in the web dashboard
Model availability may vary by subscription plan. Contact support for enterprise access to additional models or higher rate limits.

API Overview

Getting started with the SynapseX API

Authentication

API keys and authentication methods

Best Practices

When to use which model

VS Code Agent

Configure your default model