OpenAI Agents SDK: Handoffs, Guardrails, and Safety

The frameworks we’ve covered so far are open-source or platform-agnostic. The OpenAI Agents SDK is different — it’s designed for OpenAI’s models and ecosystem first. But what it lacks in portability, it makes up for in three specific areas that no other framework has matched: handoffs, guardrails, and built-in safety.

OpenAI launched the Agents SDK in late 2025 as the successor to the earlier function calling API and Assistants API. It’s the unified agent framework for the entire OpenAI platform — GPT, o-series reasoning models, multimodal, and tool use.

What Is the OpenAI Agents SDK?#

The Agents SDK is a Python framework for building agents powered by OpenAI models. It wraps the Chat Completions API with agent-level abstractions: agents with instructions and tools, handoff between agents, guardrails for input/output safety, and tracing for observability.

1
from agents import Agent, Runner
2

3
agent = Agent(
4
    name="Support Agent",
5
    instructions="You are a customer support agent. Be helpful and concise.",
6
    tools=[get_order_status, process_refund]
7
)
8

9
result = Runner.run_sync(agent, "Where is my order #12345?")
10
print(result.final_output)

Simple for basic use. Where it gets powerful is the agent ecosystem.

Core Concept: Agents as Building Blocks#

Every agent is defined by the same structure:

1
from agents import Agent, Guardrail, handoff
2

3
agent = Agent(
4
    name="billing_agent",
5
    instructions="You handle billing inquiries. Transfer to escalation when needed.",
6
    model="gpt-4o",
7
    tools=[lookup_invoice, process_payment, refund_order],
8
    handoffs=[escalation_agent],  # Can transfer to other agents
9
    input_guardrails=[pii_check, budget_check],  # Check input before processing
10
    output_guardrails=[validation_check],  # Check output before returning
11
    output_type=BillingResponse,  # Structured output
12
)

Key fields:

instructions — System prompt (supports file references via file_search)
model — Any OpenAI model (gpt-4o, o3, o4-mini)
tools — Functions or MCP-configured tools
handoffs — Other agents this agent can transfer to
input_guardrails — Validate/transform input before the agent processes it
output_guardrails — Validate/filter output before returning to the user
output_type — Pydantic model for structured output

Handoffs: The Killer Feature#

Handoffs are the SDK’s standout capability. An agent can transfer a conversation to another agent seamlessly. The receiving agent gets the full conversation context.

1
from agents import Agent, handoff, Runner
2

3
# Define specialized agents
4
triage_agent = Agent(
5
    name="triage",
6
    instructions="Route to the right specialist",
7
    handoffs=["billing", "technical", "account"],
8
)
9

10
billing_agent = Agent(
11
    name="billing",
12
    instructions="Handle billing and payments",
13
    tools=[process_refund, lookup_invoice],
14
)
15

16
tech_agent = Agent(
17
    name="technical",
18
    instructions="Handle technical issues",
19
    tools=[run_diagnostics, check_logs],
20
)
21

22
# Handoff configuration with customization
23
billing_handoff = handoff(
24
    agent=billing_agent,
25
    tool_name_override="transfer_to_billing",
26
    tool_description_override="Transfer to billing for payment issues",
27
    on_handoff=lambda ctx: log_transfer(ctx, "billing"),
28
)
29

30
triage_agent.handoffs = [billing_handoff, tech_agent]

How handoffs work internally:#

The agent decides to handoff based on its instructions
The SDK pauses the current agent
The receiving agent receives the full conversation history
The receiving agent continues the conversation
If needed, the receiving agent can handoff back

This is different from function calling. A handoff transfers conversation state, not just a data result. The receiving agent knows everything that was discussed before.

Handoff Patterns#

1
# Round-robin delegation
2
analyst_agent = Agent(name="analyst", handoffs=[qa_agent])
3
qa_agent = Agent(name="qa", handoffs=[analyst_agent])
4

5
# Escalation chain
6
l1_agent = Agent(name="l1", handoffs=[l2_agent])
7
l2_agent = Agent(name="l2", handoffs=[l3_agent])
8
l3_agent = Agent(name="l3", handoffs=[human_agent])
9

10
# Parallel handoff (fan-out)
11
coordinator = Agent(name="coordinator", handoffs=[research_agent, writing_agent, review_agent])

Guardrails: Safety Built In#

Guardrails run before and after every agent invocation. They can validate, transform, or reject input/output.

1
from agents import Guardrail, Runner
2

3
class PIIGuardrail(Guardrail):
4
    """Check for personally identifiable information"""
5

6
    async def check_input(self, agent, input_data):
7
        if contains_pii(input_data):
8
            return GuardrailResult(
9
                passed=False,
10
                message="Input contains PII. Please remove personal information.",
11
                transformed=redact_pii(input_data)  # Auto-redact
12
            )
13
        return GuardrailResult(passed=True)
14

15
    async def check_output(self, agent, output_data):
16
        if contains_pii(output_data):
17
            return GuardrailResult(
18
                passed=False,
19
                message="Output must not contain PII",
20
                transformed=redact_pii(output_data)
21
            )
22
        return GuardrailResult(passed=True)
23

24
agent = Agent(
25
    name="safe_agent",
26
    input_guardrails=[PIIGuardrail()],
27
    output_guardrails=[PIIGuardrail()]
28
)

Built-in guardrails:#

Guardrail	What It Checks	Default Behavior
`PIIGuardrail`	Emails, phones, SSNs, credit cards	Auto-redact or reject
`ToxicityGuardrail`	Hate speech, harassment	Reject with explanation
`ContentGuardrail`	Topic restrictions (configurable)	Reject or redirect
`BudgetGuardrail`	Token/cost limits per session	Throttle or stop
`ValidationGuardrail`	Output schema compliance	Transform or retry

Custom guardrails#

1
class RateLimitGuardrail(Guardrail):
2
    async def check_input(self, agent, input_data):
3
        if await rate_limiter.exceeded(agent.name):
4
            return GuardrailResult(
5
                passed=False,
6
                message="Rate limit exceeded. Try again later."
7
            )
8
        return GuardrailResult(passed=True)
9

10
class CostGuardrail(Guardrail):
11
    async def check_input(self, agent, input_data):
12
        session_cost = await cost_tracker.current_session_cost()
13
        if session_cost > 5.00:  # $5 threshold
14
            return GuardrailResult(
15
                passed=False,
16
                message=f"Session cost (${session_cost:.2f}) exceeds limit. Simplify request."
17
            )
18
        return GuardrailResult(passed=True)

Structured Outputs#

The SDK has first-class support for structured outputs through Pydantic models:

1
from pydantic import BaseModel
2
from agents import Agent, Runner
3

4
class SupportResponse(BaseModel):
5
    summary: str
6
    resolution: str | None
7
    requires_escalation: bool
8
    category: str
9

10
agent = Agent(
11
    name="support",
12
    instructions="Resolve support tickets",
13
    output_type=SupportResponse,  # Agent must return this shape
14
)
15

16
result = Runner.run_sync(agent, "My payment failed")
17
print(result.final_output)  # SupportResponse instance
18
# SupportResponse(summary='Payment declined', resolution='Try different card', requires_escalation=False, category='billing')

The agent is forced to produce output matching the schema. If it can’t, guardrails can trigger a retry.

Tracing and Observability#

OpenAI Agents SDK ships with built-in tracing that integrates with OpenAI’s dashboard:

1
from agents import Runner, trace
2

3
# Automatic tracing in the dashboard
4
with trace("Support Workflow"):
5
    triage_result = Runner.run_sync(triage_agent, user_message)
6
    if triage_result.requires_handoff:
7
        specialist_result = Runner.run_sync(specialist_agent, user_message)

Every trace captures:

LLM calls with input/output tokens
Tool calls and their results
Handoff decisions and timing
Guardrail triggers and outcomes
Total latency, cost, and token usage

The dashboard is OpenAI’s — running on their infrastructure. This is both a feature (zero setup) and a drawback (no self-hosting).

Production Patterns#

Pattern 1: Support System with Escalation#

1
l1 = Agent(name="l1", handoffs=[l2], tools=[search_kb, reset_password])
2
l2 = Agent(name="l2", handoffs=[l3], tools=[query_database, process_refund])
3
l3 = Agent(name="l3", handoffs=[human], tools=[admin_access, override_policy])
4

5
result = Runner.run_sync(l1, customer_message)

The agent automatically escalates through the chain. Each level has increasingly powerful tools.

Pattern 2: Multi-Step Research#

1
researcher = Agent(
2
    name="researcher",
3
    tools=[web_search, fetch_url],
4
    handoffs=[writer]
5
)
6
writer = Agent(
7
    name="writer",
8
    tools=[format_document],
9
    handoffs=[reviewer]
10
)
11
reviewer = Agent(
12
    name="reviewer",
13
    input_guardrails=[FactCheckGuardrail()],
14
    output_guardrails=[StyleGuardrail()]
15
)

When to Choose OpenAI Agents SDK#

Best Fit#

You’re already on OpenAI — GPT-4o, o3, o4-mini users get the tightest integration
Input/output safety is critical — Guardrails are more mature than any other framework
Simple handoff patterns — Customer support, triage, escalation chains
Structured output requirements — Pydantic model enforcement is best-in-class

Avoid When#

You need multi-model support — OpenAI-only
You need human-in-the-loop — The SDK has pause/resume, but it’s less mature than LangGraph
You need complex graph workflows — AutoGen’s GroupChat and LangGraph’s state graphs handle this better
You need self-hosted traces — OpenAI dashboard only; no LangSmith equivalent

OpenAI Agents SDK vs the Rest#

Dimension	OpenAI SDK	LangGraph	CrewAI	AutoGen	Claude SDK
Handoffs	Best	Manual nodes	Hierarchical	GroupChat	No
Guardrails	Best	Custom code	Custom code	Custom code	Custom code
Structured output	Native Pydantic	Manual	Manual	Typed messages	Native
Multi-model	OpenAI only	Any LLM	Any LLM	Any LLM	Claude only
Graph control	Linear chains	Full graphs	Sequential/hierarchical	Conversations	Agent loop
Human-in-loop	Limited	Native	Via callbacks	Via callbacks	Via hooks
Tracing	Dashboard only	LangSmith	Basic	Azure Monitor	None
Setup time	30 minutes	2-3 days	Few hours	1-2 days	1 day

Next: The Finale#

Post	Framework
1	LangGraph
2	CrewAI
3	AutoGen
4	Claude Agent SDK
5	OpenAI Agents SDK (this)
6	Head-to-Head Comparison → coming next

Series: AI Agent Frameworks 2026 — Production Comparison. Post 5: OpenAI Agents SDK. Finale: Head-to-Head Comparison → coming next.