AutoGPT

Build, deploy, and run autonomous AI agents that automate complex multi-step workflows using a visual block-based graph editor.

182.2Kstars
46.2Kforks
Other
Python

AutoGPT is an open-source platform for building, deploying, and managing autonomous AI agents that automate complex workflows without deep engineering expertise. Users connect typed “blocks” — each performing a single discrete action — into directed acyclic graphs via a low-code visual builder, then deploy those graphs as persistent agents that can be triggered by external events and run continuously.

The platform ships as a full Docker Compose stack combining a FastAPI backend, Next.js frontend, PostgreSQL with pgvector, RabbitMQ for inter-service messaging, Redis for distributed locking, and Supabase for authentication. An integrated marketplace lets users share and deploy pre-built agents without starting from scratch.

At its core, AutoGPT unifies a large and growing library of integration blocks — covering AI providers (Claude, GPT-4, Llama, Groq), social platforms, productivity tools, databases, code execution environments, and MCP-compatible tool servers — under a consistent type-safe interface. Every block validates its inputs and outputs against JSON schemas, making graph wiring reliable and debuggable.

A built-in AI copilot (powered by the Claude Agent SDK) assists with agent design and troubleshooting. Performance-sensitive workloads can use the platform’s batch execution path, which submits requests to Anthropic’s batch API asynchronously to reduce cost at scale, with transparent fallback to synchronous execution when needed.

What You Get

  • Visual graph builder with a library of 80+ typed integration blocks spanning AI models, social media, productivity tools, databases, webhooks, and code execution
  • Persistent autonomous agent execution engine backed by RabbitMQ and Redis, supporting event-triggered and scheduled runs
  • Built-in AI copilot powered by the Claude Agent SDK to assist with agent design, debugging, and optimization
  • MCP (Model Context Protocol) Tool Block that connects to any compliant MCP server, discovers its tools at runtime, and adapts I/O schema dynamically
  • Multi-provider LLM support including Anthropic Claude, OpenAI GPT-4, Groq, and Ollama, all routed through a unified credential manager
  • Batch execution path leveraging Anthropic’s batch API for cost-efficient large-scale workloads with automatic synchronous fallback
  • Integrated agent marketplace for sharing and discovering pre-built agent graphs
  • Supabase authentication, PostgreSQL with pgvector for vector search, and FalkorDB knowledge graph for long-term agent memory via Graphiti

Common Use Cases

  • Content automation: subscribe to RSS feeds or YouTube channels, summarize new content with an LLM, and publish posts to social media automatically
  • Lead enrichment pipelines: pull prospects from Apollo or LinkedIn, enrich with web search, score with AI, and push qualified leads to a CRM
  • Internal monitoring agents: poll GitHub issues or Slack channels, triage with AI condition blocks, and route alerts to the right team via email or Discord
  • Video and media production: detect trending Reddit topics, generate scripts with an LLM, synthesize audio with ElevenLabs, and assemble short-form videos
  • Data extraction workflows: scrape or fetch structured data with Firecrawl or Exa, parse and transform it through text blocks, and write results to a spreadsheet or SQL database
  • Customer support automation: receive inbound messages via webhook, retrieve relevant knowledge graph context, generate AI responses, and escalate edge cases to a human-in-the-loop step

Under The Hood

Architecture The platform follows a service-oriented multi-process design where distinct concerns — REST API, WebSocket server, execution engine, scheduler, notification manager, copilot chat bridge, and database manager — each run as independent AppProcess subclasses coordinated by a central process launcher. The execution model is graph-based: users wire typed blocks into directed acyclic graphs, and the ExecutionManager drives block execution through a thread pool backed by RabbitMQ for inter-service messaging. A ClusterLock mechanism built on Redis prevents duplicate graph execution across distributed instances. The copilot subsystem maintains its own executor path and routes between a Claude Agent SDK path and a baseline OpenAI-compatible fallback, keeping AI assistance cleanly separated from the core automation engine.

Tech Stack The backend is a Python 3.10+ application built on FastAPI for the REST layer with a custom WebSocket server, using Prisma as the ORM against PostgreSQL extended with pgvector for semantic search. RabbitMQ handles inter-service message queuing, Redis provides distributed locking and caching, and FalkorDB serves as the knowledge graph store for the Graphiti-based long-term memory subsystem. The AI layer spans Anthropic Claude, OpenAI GPT, Groq, and Ollama through a unified LLM abstraction with per-provider credential management. E2B provides sandboxed code execution for unsafe workloads. The frontend is Next.js 15 with Tailwind, Radix UI, and Storybook. The entire platform ships as a Docker Compose stack with Supabase handling authentication and Kong as the API gateway.

Code Quality The test suite is extensive with hundreds of test files across unit tests, integration tests, E2E Playwright suites, and snapshot tests — using pytest with asyncio mode on the backend and Vitest plus Playwright on the frontend. Error handling is explicit and typed throughout, with custom exception classes like BlockError, BlockExecutionError, InsufficientBalanceError, and ModerationError propagated cleanly through the stack rather than swallowed. Ruff enforces linting with a Python 3.10 target and pyright provides static type checking on the backend. The copilot module demonstrates particularly thorough test coverage of complex behaviors including rate limiting, token tracking, and cost attribution.

What Makes It Unique AutoGPT distinguishes itself through three technically notable implementation choices. The block graph model treats every integration as a type-safe composable unit with JSON-schema-validated inputs and outputs, enabling visual wiring of arbitrarily complex multi-step automations without writing code. The MCP Tool Block allows the platform to discover and invoke tools from any Model Context Protocol-compliant server at runtime, dynamically adapting its own input/output schema — making the platform a universal MCP client at the workflow level. A dedicated batch execution path submits heavy workloads to Anthropic’s batch API asynchronously at reduced cost, with LaunchDarkly feature flags gating progressive rollout and transparent per-turn fallback to synchronous execution.

Self-Hosting

AutoGPT uses a dual licensing model. The classic agent tools, Forge framework, agbenchmark, and legacy frontend — everything outside the autogpt_platform folder — are released under the MIT License, which permits free commercial use, modification, and redistribution with no copyleft obligations. The active platform itself, housed in the autogpt_platform folder, is licensed under the Polyform Shield License 1.0.0. The Polyform Shield License is source-available but not open source in the OSI sense: you may read, run, and modify the code for your own use, but you are prohibited from using it to offer a competing commercial service to third parties. For most self-hosting scenarios — running the platform internally for your organization’s own agents — this restriction does not apply, but you should review the license carefully before building a product on top of it or reselling access.

Self-hosting the full AutoGPT platform is a substantial operational undertaking. The Docker Compose stack spans roughly a dozen services including PostgreSQL, Redis (clustered), RabbitMQ, FalkorDB, Supabase (auth, meta, storage), Kong (API gateway), and several Python backend processes. The README explicitly flags this as a technical process requiring at minimum 4 CPU cores, 8 GB RAM, and 10 GB of free storage. You are responsible for uptime, database backups, secret rotation, and keeping up with frequent releases — the project ships new beta versions roughly every one to two weeks.

The cloud-hosted beta (available via waitlist at agpt.co) removes all infrastructure burden and provides subscription tiers (Basic, Pro, Max, Business, Enterprise) that gate features like higher execution concurrency, more LLM credits, and priority support. Self-hosters gain full data residency and can use their own API keys for LLM providers, but give up the managed upgrade path, built-in billing and credit tracking, SLA guarantees, and the support channels that come with a paid tier. The project’s rapid release cadence means self-hosted instances can fall behind the cloud offering quickly if not actively maintained.

Join founders buildingwith open source

Opinionated takes, migration guides, cost-saving tips, and insights from the open source ecosystem.

Subscribe on Substack

No spam. Unsubscribe anytime.

Join 750+ subscribers
No spam. Unsubscribe anytime.

Search