Skip to content
Agents · Jun 23, 2026

IBM releases CUGA, an open-source agent harness with two dozen example apps

CUGA provides a lightweight harness for building governed, multi-agent applications with configurable reasoning modes and built-in state management, demonstrated via two dozen single-file FastAPI examples.

Trust79
HypeLow hype

1 source · cross-referenced

ShareXLinkedInEmail
TL;DR
  • IBM Research released CUGA, an open-source agent harness designed to reduce boilerplate in agentic applications by handling orchestration, state, and guardrails out of the box.
  • The project includes two dozen working example apps, each packaged as a single FastAPI file, to demonstrate how to build and deploy governed agent workflows.
  • CUGA supports interchangeable tool bindings (OpenAPI, MCP, LangChain), declarative guardrails, multi-agent delegation, and provider switching via environment variables.
  • Reasoning modes (Fast, Balanced, Accurate) and configurable code execution sandboxes allow tuning cost, latency, and safety without rewriting agent logic.

IBM Research released CUGA, an open-source agent harness intended to reduce the boilerplate required to build agentic applications. The project emphasizes a "harness, not a framework" approach, providing pre-assembled components for planning, execution loops, tool calls, and state management so developers can focus on defining tools and prompts rather than wiring infrastructure.

CUGA includes two dozen working example applications, each packaged as a single FastAPI file wrapping a CugaAgent instance. These examples range from a movie recommender to an IBM Cloud architecture advisor, and are designed to be read, copied, and adapted. The examples are presented in a live gallery for interactive exploration.

The harness supports interchangeable tool bindings, including OpenAPI, MCP, and LangChain functions, allowing developers to mix local and hosted tools without rewriting adapter code. Declarative guardrails, multi-agent delegation over A2A, Docling-powered RAG, and provider switching (e.g., OpenAI, watsonx, Ollama) are provided out of the box.

CUGA introduces configurable reasoning modes—Fast, Balanced, and Accurate—along with configurable code execution sandboxes (local, Docker/Podman, or E2B cloud). These modes allow tuning for cost, latency, and safety without changing the agent definition, enabling the same agent logic to run efficiently across different deployment constraints.

The IBM Cloud architecture advisor example demonstrates the harness's approach: the entire agent is defined in four lines of constructor code, with tools and prompts specified separately. The agent uses a factory pattern to switch providers via environment variables, keeping application code agnostic to the underlying model.

State management is handled by the harness, with intermediate results tracked and reflected upon to catch errors and re-plan as needed. This reduces the risk of cascading failures in long-horizon tasks, a common failure mode for agentic systems.

CUGA's design is motivated by the observation that most agentic apps begin with a week of plumbing before the agent can do useful work. By inverting this workflow, IBM Research aims to shift the focus from infrastructure to application logic, while still supporting governance and scalability for production deployments.

Sources
  1. 01Hugging FaceBuild real agentic apps using CUGA: two dozen working examples on a lightweight harness
Also on Agents

Stories may contain errors. Dispatch is assembled with AI assistance and curated by human editors; despite the trust-score filter, mistakes happen. We correct publicly — every article links to its revision history. Nothing here is financial, legal, or medical advice. Verify before relying on any claim.

© 2026 Dispatch. No ads. No sponsorships. No paid placement. Reader-supported via Ko-fi.

Built by a person who cares about honest AI news.