AI Agents · Computer Use · Cross-Platform Automation

Forge

Multi-agent AI workflow platform with computer use, cross-platform GUI/terminal automation, and visual blueprint orchestration. 44 node types, multi-model providers, agent-on-agent coordination, and multi-machine dispatch.

Next.js 14 FastAPI LangChain OpenAI Anthropic Supabase Python 3.12 TypeScript React Flow

Overview

Forge is an AI agent orchestration platform that goes beyond chatbots. Users build visual DAG workflows (blueprints) combining deterministic code nodes with LLM-powered agent nodes, then execute them with real-time SSE streaming. The platform supports GUI automation, terminal orchestration, multi-model providers, knowledge base RAG, eval frameworks, and cross-platform computer use across macOS, Linux, and Windows.

Core Capabilities

Visual Blueprint Editor — Drag-and-drop DAG builder with 44 node types across 9 categories. Topological execution engine with concurrent layer resolution and context assembly.
Computer Use (GUI + Terminal) — 12 GUI automation nodes (screenshot, OCR, click, type, hotkey, scroll, drag, focus, find, wait, clipboard, apps) and 6 terminal nodes (session, run, send, logs, poll, fanout). Cross-platform: Steer CLI on macOS, xdotool on Linux, pyautogui on Windows.
Agent-on-Agent Orchestration — Spawn and control external coding agents (Claude Code, Codex CLI, Gemini CLI, Aider) as workers in tmux sessions. Full lifecycle: spawn, prompt, monitor, wait, capture, stop.
Multi-Machine Dispatch — Route nodes to different execution targets with capability-based routing. Target registry with health checks and aggregated capabilities.
Multi-Model Providers — OpenAI, Anthropic, Google APIs with per-node model selection, health monitoring, and model comparison.
Knowledge Base + RAG — Document collections with chunked upload, semantic search, and knowledge_retrieval blueprint node.
Eval Framework — 5 grading methods including screenshot_match and ocr_contains for computer use verification.
Human-in-the-Loop — Approval gate nodes pause execution for human review with approve/reject workflows.
Safety & Security — App blocklist, command blocklist, rate limiting (30 actions/min), approval gates, and full audit logging for all computer use actions.
Observability — Distributed traces for all executions, prompt versioning with diff/rollback, cost analytics with per-model breakdowns.

Scale

44 blueprint node types across 9 categories
13 pre-built blueprint templates
3 LLM providers (OpenAI, Anthropic, Google)
3 platform targets (macOS, Linux, Windows)
20+ CLI command groups
17 database migrations with RLS
515 backend tests + 21 frontend tests
20+ API route modules

Architecture

Next.js 14 frontend with React Flow blueprint editor, consuming a FastAPI backend. The blueprint engine runs DAGs with topological sorting, concurrent layer execution, and context assembly with token budgets. Computer use nodes dispatch to platform-specific executors (Steer/xdotool/pyautogui for GUI, Drive/tmux/PowerShell for terminals). Multi-machine dispatch routes nodes to registered execution targets. Agent-on-agent orchestration spawns external coding agents in tmux sessions via the Drive layer. Supabase provides auth and PostgreSQL with RLS. Everything streams via SSE.

Version History

v1.9 — Agent-on-agent orchestration, multi-machine dispatch, screen recording, Linux & Windows computer use, cross-platform unification
v1.8 — Computer use extension (Steer + Drive), safety/security, remote execution, capability detection
v1.7 — Workflow marketplace, team features, organization RBAC
v1.6 — Knowledge base + RAG with semantic search
v1.5 — Observability traces + prompt versioning
v1.4 — Eval framework + human-in-the-loop
v1.3 — MCP integration + event triggers
v1.2 — Multi-model provider system
v1.1 — Blueprint system with visual DAG editor
v1.0 — Production release with demo mode and landing page

Live Demo ↗ View on GitHub ↗ Back to Home