Skip to content
LIVE
OPUS 4.8$5 / $25per Mtok
GPT-5.5$5 / $30per Mtok
GEMINI 2.5 PRO$1.25 / $10per Mtok
SONNET 4.6$3 / $15per Mtok
SWE-BENCHleader GPT-5.568.7%
MMLU-PROleader GPT-5.594.2
GPQAleader GPT-5.578.3
AFTAv1.0 whitepaper live at /whitepaper
OPUS 4.8$5 / $25per Mtok
GPT-5.5$5 / $30per Mtok
GEMINI 2.5 PRO$1.25 / $10per Mtok
SONNET 4.6$3 / $15per Mtok
SWE-BENCHleader GPT-5.568.7%
MMLU-PROleader GPT-5.594.2
GPQAleader GPT-5.578.3
AFTAv1.0 whitepaper live at /whitepaper
All systems operational0 AI providers monitored, polled every 2 minutes
Live status
All harnesses

Devin

Cognition Labs

Cognition's Devin is a hosted autonomous SWE agent. Each session runs in its own persistent VM with a workspace, browser, and shell, plus Slack and IDE integrations so the agent can be assigned tasks like a human engineer. Cognition also publishes DeepWiki, a separate retrieval system over indexed repos that Devin uses to ground long-horizon work.

Type
agent-platform
License
Proprietary
Model story
Proprietary mix
Vendor
Cognition Labs

Leaderboard Placements

BenchmarkBest base modelScoreRank
SWE-bench Verified Proprietary (Sonnet 4.6 + planner)61.7#12 / 15
Terminal-Bench ---
Aider Polyglot ---
SWE-Lancer Proprietary (Sonnet 4.6 + planner)32.5#4 / 5

Distribution

Hosted SaaS. Web app, Slack, GitHub, and Linear integrations. No self-host option.

Model Story

Proprietary model mix. Cognition does not disclose which model serves which step but has stated Sonnet 4.6 is in the rotation alongside an in-house planner.

Pricing

Per-seat subscription with usage limits; team and enterprise plans available.

Who It's For

Teams that want an agent assignable through ticketing systems and willing to trade open-source transparency for managed infrastructure.

Notable Features

  • Persistent VM workspaces per task
  • DeepWiki repo retrieval system
  • Slack and Linear assignment surfaces
  • Async task queues
  • Code review and PR-author workflow
Vendor site for Devin:https://devin.ai

Other Harnesses