Paradigm Shift
Claude uses your tools.
Your agents use Claude.
Claude Code Channels makes AI tools bidirectional for the first time. Claude can call your local tools, your agents can call Claude -- and it all runs at a fixed monthly cost you choose, not open API metering that could hit $2,000 in a bad month.
Cohort was the first third-party integration. Three hours from protocol to working agents. One week from integration to open-source launch. 3 AM to 3 AM.
What just changed
- Claude is a text box. You type, it responds.
- Your tools can't call Claude. Claude can't call your tools.
- API access means open metering: $800-2,000+/mo.
- Every API call is a new cold start. No persistent context.
- You pay per token. More agents = more cost.
- Claude calls your tools. Your agents call Claude.
- Bidirectional: one protocol, interchangeable access.
- Fixed cost: $100 or $200/mo. You know the number.
- Persistent sessions. Claude keeps context across tasks.
- Cohort manages the spend intelligently for you.
The API model charged you per token and you had no idea what the bill would be.
The Channels model gives you a fixed budget and Cohort makes every dollar count.
Bidirectional tool use
This is the part that's genuinely unprecedented. Claude Code can call any tool your system exposes. And your system can push work into Claude Code's session. Same protocol. Same session. No API keys, no token counting, no surprise bills.
Claude calls your tools
Through Cohort's MCP integration, Claude Code gains access to your entire agent ecosystem. Need a security review? Claude calls the security agent. Need content strategy? Claude calls the content strategist. Each agent brings persistent memory, domain expertise, and contribution scoring -- capabilities Claude doesn't have alone.
Tool call: cohort_roundtable("Review this auth flow for vulnerabilities",
agents: ["security", "python_dev"])
// Security agent responds with findings from its persistent memory
// Claude applies the fixes, in the same session, with full file context
Your agents call Claude
Cohort's local agents run at 104 tok/s on your GPU for free. But some tasks need frontier reasoning -- complex refactors, multi-file architectural changes, nuanced code review. Your agents escalate those tasks to Claude Code through the Channel, and Claude executes them in a persistent session with your repo fully loaded.
Escalate to Claude Code: "Refactor auth middleware to support
both JWT and session tokens. 14 files affected."
// Claude Code receives the brief with channel context
// Executes the refactor across all 14 files
// Reports back with structured results + diff
Fixed cost, not open metering
The API model was a blank check. You paid per token, per call, per agent turn. A busy week could cost $500. A runaway loop could cost thousands. Channels changes the economics entirely.
23 agents, roundtables, briefings, code queue. All on your GPU. Channels-ready.
Fixed monthly cost. You pick the plan. Cohort manages the spend intelligently.
Per-token billing. Unpredictable. Every agent turn costs money. Every runaway loop costs more.
How Cohort manages your fixed budget
Local-first triage. 95% of agent work runs on your GPU at zero cost. Only tasks that genuinely need frontier reasoning get escalated to Claude Code.
Context distillation. Before escalating to Claude, local models pre-process and condense the context. Claude gets a structured briefing instead of raw data -- so every interaction uses minimal capacity.
Session persistence. Claude Code keeps context across tasks. No cold starts. No re-reading your entire repo for each request. One session handles many tasks efficiently.
Safeguards. Unified 3-tier response gate (emergency rate limits, heuristic scoring, session-aware orchestrator gating), meeting mode stakeholder controls, and graceful degradation back to local models. Your fixed spend stays fixed.
How it works under the hood
Per-channel session isolation
Each Cohort channel gets its own independent Claude Code session. A security audit in #security-review doesn't share context with a website build in #web-project. Up to 5 simultaneous sessions, with priority eviction when you need a 6th.
Context hydration
When a new Claude Code session starts for a channel, Cohort summarizes the channel's recent discussion using your local LLM and injects it as a structured briefing. Claude joins the conversation mid-stream without anyone having to catch it up.
Auto-launch and idle reaping
Sessions start on demand when work arrives and shut down automatically after 10 minutes idle. No manual process management. If a session dies, its claimed requests are automatically failed and re-queued.
Three-tier context degradation
Tier A: Local LLM summarizes last 50 messages into ~1000 tokens. Tier B: Heuristic filter, ~4000 chars. Tier C: Last 20 messages as-is. Each tier falls back to the next if the local model is unavailable.
One IDE. Both systems. Zero switching.
Claude Code runs in the VS Code terminal. Cohort's agent team runs in the VS Code sidebar. They share the same repo, the same MCP tools, and the same channel sessions. You never leave the editor.
Cohort's agent sidebar (left) and Claude Code's terminal (right), sharing the same channel session.
Claude Code edits files, runs tests, and commits -- using Cohort's MCP tools to consult agents, check scores, and manage meeting dynamics.
Stakeholder gating controls who speaks. Phase detection tracks progress. The 3-tier response gate prevents loops. All visible in the sidebar.
No browser tabs. No terminal switching. Channels, meetings, tasks, and reviews -- all in the same VS Code window you're already coding in.
First integration, day one
Anthropic shipped the protocol at 3 AM. By end of day, Cohort had full session management running through it.
Why it took three hours, not three weeks
MCP-native architecture. Cohort's entire tool surface is already MCP. Channels are MCP servers. The plugin was a thin bridge -- no architecture changes needed.
Request/response queue already existed. Cohort's code queue has a full lifecycle state machine. The Channel plugin just polls it. No new infrastructure.
Agent orchestration was done. Unified 3-tier response gate, meeting mode with stakeholder gating, compiled roundtables, deliverables preprocessing -- all production-tested. Channels needed a transport layer, not new logic.
What Claude Code gets through Cohort
When Claude Code runs with Cohort's channel plugin, it gains access to the full ecosystem via MCP tools:
Security, QA, Python dev, web dev, content strategy, documentation, analytics, marketing -- each with persistent memory
Queue management, meeting control, health checks, model settings, rate limits, agent decisions -- all available as slash commands
Claude can delegate bulk work to your local GPU via /delegate. Your hardware does the heavy lifting; Claude does the thinking.
57 tools Claude Code gains through Cohort
Every tool available via MCP the moment a Channel session starts. 54 from the Cohort API server + 3 from the Channel plugin.
All tools are available via MCP. No API keys required for local tools. Claude Code discovers them automatically when the Channel session starts.
Learn more
Official docs. What Channels are, how to set up Telegram and Discord, security model, enterprise controls.
code.claude.com →Protocol spec for building custom channel plugins. MCP capabilities, notification format, reply tools.
code.claude.com →Full source. Channel bridge, context hydration, session management, code queue pipeline. Apache 2.0 license.
GitHub →Agent roundtables, contribution scoring, local inference, response tiers, safeguards. The system Channels plugs into.
Features →Fixed cost. Bidirectional tools. Your data stays local.
23 agents on your GPU. Claude Code when you need it. No surprise bills.