IBM releases CUGA, an open-source agent harness with two dozen example apps
CUGA provides a lightweight harness for building governed, multi-agent applications with configurable reasoning modes and built-in state management, demonstrated via two dozen single-file FastAPI examples.
1 source · cross-referenced
- IBM Research released CUGA, an open-source agent harness designed to reduce boilerplate in agentic applications by handling orchestration, state, and guardrails out of the box.
- The project includes two dozen working example apps, each packaged as a single FastAPI file, to demonstrate how to build and deploy governed agent workflows.
- CUGA supports interchangeable tool bindings (OpenAPI, MCP, LangChain), declarative guardrails, multi-agent delegation, and provider switching via environment variables.
- Reasoning modes (Fast, Balanced, Accurate) and configurable code execution sandboxes allow tuning cost, latency, and safety without rewriting agent logic.
IBM Research released CUGA, an open-source agent harness intended to reduce the boilerplate required to build agentic applications. The project emphasizes a "harness, not a framework" approach, providing pre-assembled components for planning, execution loops, tool calls, and state management so developers can focus on defining tools and prompts rather than wiring infrastructure.
CUGA includes two dozen working example applications, each packaged as a single FastAPI file wrapping a CugaAgent instance. These examples range from a movie recommender to an IBM Cloud architecture advisor, and are designed to be read, copied, and adapted. The examples are presented in a live gallery for interactive exploration.
The harness supports interchangeable tool bindings, including OpenAPI, MCP, and LangChain functions, allowing developers to mix local and hosted tools without rewriting adapter code. Declarative guardrails, multi-agent delegation over A2A, Docling-powered RAG, and provider switching (e.g., OpenAI, watsonx, Ollama) are provided out of the box.
CUGA introduces configurable reasoning modes—Fast, Balanced, and Accurate—along with configurable code execution sandboxes (local, Docker/Podman, or E2B cloud). These modes allow tuning for cost, latency, and safety without changing the agent definition, enabling the same agent logic to run efficiently across different deployment constraints.
The IBM Cloud architecture advisor example demonstrates the harness's approach: the entire agent is defined in four lines of constructor code, with tools and prompts specified separately. The agent uses a factory pattern to switch providers via environment variables, keeping application code agnostic to the underlying model.
State management is handled by the harness, with intermediate results tracked and reflected upon to catch errors and re-plan as needed. This reduces the risk of cascading failures in long-horizon tasks, a common failure mode for agentic systems.
CUGA's design is motivated by the observation that most agentic apps begin with a week of plumbing before the agent can do useful work. By inverting this workflow, IBM Research aims to shift the focus from infrastructure to application logic, while still supporting governance and scalability for production deployments.
- Jun 23, 2026 · Latent Space — swyx
OpenAI board member and Gray Swan cofounders discuss why AI security requires a new approach as agent risks rise
Trust72 - Jun 21, 2026 · Hacker News — AI (100+ points)
Bayer and Thoughtworks describe PRINCE, an agentic RAG system for preclinical research
Trust79 - Jun 19, 2026 · Latent Space — swyx
Investor Anjney Midha on AI compute waste, AMP’s 1.2GW grid plan, and frontier systems efficiency
Trust71