How to use Claude Sonnet 4.6 inside VS Code: three setup paths compared

claudevs-codesetup-guideclinecontinue-devapipricingworkflow

Claude Sonnet 4.6 is the strongest daily-driver coding model available right now: 1 million token context window, $3 per million input tokens, and fast enough for interactive use. Getting it into VS Code is a 10-minute job — but there are three meaningfully different ways to do it, and they suit different workflows. This guide walks through each path with exact steps, then tells you which one to start with.

Pricing verified against Anthropic’s model overview page on May 9, 2026.


What you’re choosing between

PathExtensionCost modelBest for
AClaude Code (official)Claude.ai subscription or API pay-per-useMost developers; Anthropic-native experience
BClineAPI pay-per-use (BYOK)Autonomous agent workflows; full cost control
CContinue.devAPI pay-per-use (BYOK)Open-source preference; mixed-model setups

All three give you Claude Sonnet 4.6 inside the VS Code editor. The difference is in how much setup, how much control, and what happens to your token bill.


The API cost picture first

Before picking a path, understand what Claude Sonnet 4.6 actually costs:

  • Input tokens: $3.00 per million
  • Output tokens: $15.00 per million
  • Batch API (for non-interactive tasks): 50% off both rates
  • Prompt caching: cached input drops to $0.30 per million (90% reduction)
  • Context window: 1 million tokens (~750,000 words)

In practice, a typical hour of chat-assisted coding — describing a function, reviewing a diff, asking why a test fails — consumes roughly 50,000–100,000 input tokens and 5,000–20,000 output tokens. That’s $0.15–$0.60 per hour. Heavy autonomous agent sessions (Cline doing a multi-file refactor) run $0.50–$1.00 per task.

Compared to Cursor Pro at $20/month flat, direct API access is cheaper for occasional use and more expensive for heavy daily use. The crossover is roughly 15–20 hours of active coding per month.

New Anthropic accounts get approximately $5 in free credits to test the API. There is no ongoing free tier — once the starter balance is gone, you need to purchase credits or enable auto-reload at platform.claude.com.


Path A: Official Claude Code extension

The Claude Code VS Code extension is Anthropic’s own product. It’s the path with the least friction and the tightest integration.

Requirements

  • VS Code 1.98.0 or higher (check via Help → About)
  • An Anthropic account (free to create at claude.ai) OR an API key from platform.claude.com

Install

  1. Press Ctrl+Shift+X (Windows/Linux) or Cmd+Shift+X (Mac) to open Extensions
  2. Search for Claude Code, publisher: Anthropic
  3. Click Install
  4. A Spark icon (✦) appears in the Editor Toolbar once a file is open

First sign-in

Click the Spark icon and sign in with your Claude.ai account in the browser that opens. If you prefer to use an API key directly, set ANTHROPIC_API_KEY in your shell environment and launch VS Code from that terminal with code . so it inherits the variable.

What you get

The Claude Code panel opens as a sidebar or editor tab. The key features:

  • @-mentions: type @filename to pull a specific file into context, or @src/components/ with a trailing slash for a folder. Press Alt+K (Windows/Linux) or Option+K (Mac) to insert a reference to your current selection with line numbers.
  • Permission modes: click the mode indicator at the bottom of the prompt box. Default mode asks before each file edit. Plan mode has Claude describe what it will do and waits for your approval before touching anything — useful when you’re giving it a multi-step task. Auto-accept is for when you’re certain and want flow.
  • Checkpoints: hover over any message to rewind. You can fork the conversation from a point in history, revert file changes to that point, or both. This is the escape hatch when an agent session goes sideways.
  • Multiple conversations: Ctrl+Shift+Esc (Windows/Linux) opens a new conversation in a separate tab. Useful for running one refactor while asking questions about another part of the codebase.
  • MCP server support: run claude mcp add in the integrated terminal to connect external tools, then manage them with /mcp in the chat panel.

The extension includes the Claude Code CLI. Open the integrated terminal (Ctrl+`) and run claude for terminal-based workflows — these share the same conversation history as the panel.

Billing

If you use a Claude.ai Pro subscription ($20/month), Claude Code usage is included but rate-limited. For heavier use, switch to API-key billing at platform.claude.com — you pay per token at the rates above, with no monthly floor.


Path B: Cline

Cline (formerly known as Claude Dev, now published by Cline Bot Inc.) is the most-installed third-party AI coding agent for VS Code: 3.9 million installs as of May 2026, version 3.82.0. It’s free to install; you bring your own API key.

Install

  1. Open Extensions (Ctrl+Shift+X)
  2. Search Cline, publisher: Cline Bot Inc.
  3. Click Install

Configure with Claude Sonnet 4.6

  1. Click the Cline icon in the Activity Bar (left sidebar)
  2. Click the gear icon → API Provider → select Anthropic
  3. Paste your Anthropic API key from platform.claude.com/settings/keys
  4. Under Model, select claude-sonnet-4-6
  5. Click Save

What Cline does differently

Cline is explicitly an autonomous agent. You describe a task, and it plans the steps, creates and edits files, runs terminal commands, and navigates the browser — all while showing you exactly what it intends to do before each action. This is different from the Claude Code extension, which is more of a collaborative assistant you direct step by step.

Token usage is displayed in real time in the Cline sidebar. You can see your running cost per task. This transparency is useful: complex refactoring tasks where Cline has to read dozens of files first can accumulate 200,000–400,000 input tokens before writing a single line.

Cline also supports OpenRouter, Google Gemini, AWS Bedrock, local models via Ollama/LM Studio, and any OpenAI-compatible API — so if you want to route to Claude through a different provider (e.g., AWS Bedrock for enterprise data residency), Cline handles it.

When to use Cline over the official extension

Cline wins when the task is: “take this spec and build a working feature, checking in with me at each decision point.” The official Claude Code extension wins for: “I’m writing code, I want a smart pair programmer in the sidebar.” Cline is more autonomous; Claude Code is more collaborative.


Path C: Continue.dev

Continue.dev is an open-source VS Code extension licensed under Apache 2.0. Its differentiating feature: it lets you mix models. You can use a small, fast local model (like Qwen2.5-Coder 7B via Ollama) for tab completion — where latency matters most — while routing chat and agent tasks to Claude Sonnet 4.6. This is the cheapest overall configuration for a developer who writes a lot.

Install

  1. Open Extensions (Ctrl+Shift+X)
  2. Search Continue, publisher: Continue
  3. Click Install

Configure Claude Sonnet 4.6 in config.yaml

Continue uses a config.yaml file for configuration. Open it by clicking the Continue icon in the sidebar → gear icon. Add an Anthropic provider block:

models:
  - name: Claude Sonnet 4.6
    provider: anthropic
    model: claude-sonnet-4-6
    apiKey: ${{ secrets.ANTHROPIC_API_KEY }}
    roles:
      - chat
      - edit
      - apply

For tab completion, add a separate block pointing to a local model to avoid the latency and cost of calling Claude Sonnet on every keystroke:

  - name: Qwen2.5-Coder 7B (local)
    provider: ollama
    model: qwen2.5-coder:7b
    roles:
      - autocomplete

Your ANTHROPIC_API_KEY should be set as an environment variable or stored in a secrets file — Continue documentation explains both options at docs.continue.dev.

What Continue gives you

Three features worth knowing:

  • Tab completion with a local model: zero API cost for the keypress-level completions; Claude only runs when you explicitly use the chat or agent panel.
  • Agent mode: introduced in 2026, lets Continue autonomously edit multiple files. Comparable to Cline’s approach but with a different configuration model.
  • No vendor lock-in: swap models in config.yaml without reinstalling anything. If Anthropic pricing changes, you’re one line away from routing elsewhere.

Continue’s weakest area: the initial configuration has a steeper learning curve than the other two paths, and the agent mode is less mature than Cline’s.


Side-by-side comparison

Claude Code (official)ClineContinue.dev
Setup time~5 min~8 min~15 min
Cost modelSubscription or APIAPI (BYOK)API (BYOK)
Tab completionNoNoYes (with local model)
Autonomous agentYes (with Plan mode)Yes (core feature)Yes (2026 addition)
Checkpoints/rewindYesNoNo
Model flexibilityClaude family onlyAny providerAny provider
Open sourceNoYes (Apache 2.0)Yes (Apache 2.0)
MCP server supportYesLimitedLimited

Honest take: which path to start with

Start with the official Claude Code extension. It requires the least configuration, the checkpoint/rewind system is a genuine safety net when you’re learning agentic workflows, and the VS Code integration (inline diffs, @-mentions with line ranges, plan-then-execute mode) is polished in a way the third-party options haven’t matched yet.

Use Cline if you’re doing sustained autonomous agent work — multi-file feature builds, large refactors — and you want per-task cost visibility and the flexibility to swap providers. The 3.9 million install count reflects real developer trust.

Use Continue.dev if you care about open source, want tab completion without burning API budget on every keypress, or need to mix Claude with local models for a privacy-sensitive codebase. Cross-reference the Cline vs Roo Code vs Continue comparison at DevToolReviews for a deeper look at the tradeoffs.

If you’re running local models alongside Claude and want to understand the hardware requirements, the local LLM guide at runaihome.com breaks down what actually runs at what VRAM level. The cursor-local-llama-hardware-tiers-2026 article on this site covers the same question for Cursor users specifically: Cursor + Local Llama hardware tiers.

One thing all three paths share: every token you send to Claude Sonnet 4.6 via the API is charged at $3/MTok input, $15/MTok output. Large context windows are powerful, but stuffing your entire codebase into every prompt is expensive. Be selective with @-mentions and context — Claude does not need to read every file to write one function.


1V1 STARTER KIT · CLAUDE API

From zero to your first Claude API call in 30 minutes.

Complete API integration templates, 30 high-frequency prompts, error-code handling library. Python + Node.js + cURL examples included.

Get it for $19 (early bird) →

Sources

Last updated May 9, 2026. Pricing and features change frequently; verify current state before purchasing.

Was this article helpful?