Orpheus is a specialized cognitive runtime engineered to provide long-term persistence, coordination, and governance for autonomous AI agents. Unlike traditional serverless functions or container orchestrators, it uses a three-layer architecture: coordination through workers, execution via lightweight runc containers, and persistence through dedicated workspaces. The system optimizes for lifetime and continuity rather than raw latency, using queue-depth autoscaling to manage I/O-bound workloads that typically sit idle while waiting for LLM responses. Key components such as the ServiceManager and Registry enable integration with local inference engines like Ollama and vLLM, providing a platform-agnostic environment for complex reasoning systems. Recent technical refinements have introduced an actor model to eliminate race conditions and an ExecLog system for crash recovery and auditability. Ultimately, Orpheus bridges the gap between simple script execution and heavyweight cloud orchestration, providing the essential substrate for industrial-grade LLM applications.
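The intuition behind queue-depth autoscaling is worth making concrete: because agent workloads spend most of their time waiting on LLM responses, CPU utilization is a misleading scaling signal, and backlog size works better. The sketch below is illustrative only; the function and parameter names are assumptions for this example, not Orpheus's actual API.

```python
import math

def desired_workers(queue_depth: int, tasks_per_worker: int = 4,
                    min_workers: int = 1, max_workers: int = 32) -> int:
    """Derive a worker count from queue depth rather than CPU load.

    I/O-bound agents idle while awaiting LLM responses, so CPU-based
    autoscaling under-provisions; the pending-task backlog is a more
    honest signal. All names and defaults here are hypothetical.
    """
    wanted = math.ceil(queue_depth / tasks_per_worker)
    # Clamp to configured bounds so the pool never scales to zero
    # (losing warm workers) or beyond the host's capacity.
    return max(min_workers, min(max_workers, wanted))
```

For example, an empty queue keeps one warm worker, a backlog of 10 tasks yields 3 workers, and a surge of 1,000 tasks saturates at the 32-worker ceiling.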

When to Use Orpheus

  • AI agents that run for minutes (not milliseconds)
  • Conversational agents that need context
  • Agents that create or modify files
  • Workflows that shouldn’t lose state on restart