AI Agents trong Production — Day 6: Building an Internal Agent Platform

Năm ngày xây dựng agent infrastructure. Giờ gói nó thành platform.

Khác biệt giữa một đống agents và một platform là reusability, governance, và self-service. Ai trong công ty cũng có thể tạo và deploy agent mà không cần biết circuit breaker, cache invalidation, hay multi-region failover.

Bài này gói Days 1-5 vào Internal Agent Platform (IAP):

1
Agent Portal (UI)
2
  ├── Agent Registry
3
  ├── Tool Catalog
4
  └── Deployments
5

6
Platform API (REST)
7
  ├── Agent Builder (templates)
8
  ├── Governance (approvals)
9
  └── Dashboard (single pane)
10

11
Agent Runtime (Days 1-3)
12
  ├── Logger / Tracer / Cache
13
  ├── Retry / Breaker / Fallback
14
  └── Prompt Store / Experiment
15

16
Infrastructure Layer (Days 4-5)
17
  ├── Region Router
18
  ├── Global State
19
  └── Cache Warmup

Step 1: Agent Registry — Source of Truth#

Mọi agent trong công ty đều có định danh duy nhất.

`src/platform/agent-registry.ts`#

1
export interface AgentDefinition {
2
  id: string;
3
  name: string;
4
  displayName: string;
5
  version: string;
6
  owner: string;                    // Team
7
  status: "draft" | "active" | "deprecated" | "retired";
8
  tools: Array<{
9
    name: string;
10
    source: "built-in" | "mcp-server";
11
    required: boolean;
12
    permissions: string[];
13
  }>;
14
  deployment: {
15
    replicas: number;
16
    regions: string[];
17
    environment: "development" | "staging" | "production";
18
  };
19
  promptVersionId: string | null;
20
}
21

22
export class AgentRegistry {
23
  private agents = new Map<string, AgentDefinition>();
24

25
  async register(def: Omit<AgentDefinition, "id" | "createdAt" | "updatedAt">): Promise<AgentDefinition> {
26
    const id = def.name.toLowerCase().replace(/[^a-z0-9-]/g, "-");
27
    const agent = { ...def, id, createdAt: Date.now(), updatedAt: Date.now() };
28
    this.validate(agent);
29
    this.agents.set(id, agent);
30
    return agent;
31
  }
32

33
  list(filters?: { status?: string; environment?: string; owner?: string }): AgentDefinition[] {
34
    let r = Array.from(this.agents.values());
35
    if (filters?.status) r = r.filter(a => a.status === filters.status);
36
    if (filters?.owner) r = r.filter(a => a.owner === filters.owner);
37
    return r.sort((a, b) => b.updatedAt - a.updatedAt);
38
  }
39

40
  getToolUsage(): Map<string, string[]> {
41
    const usage = new Map<string, string[]>();
42
    for (const agent of this.agents.values()) {
43
      for (const tool of agent.tools) {
44
        const agents = usage.get(tool.name) || [];
45
        agents.push(agent.name);
46
        usage.set(tool.name, agents);
47
      }
48
    }
49
    return usage;
50
  }
51
}

Step 2: Agent Builder — Self-Service#

Bất kỳ team nào cũng có thể tạo agent từ template, không cần viết code.

`src/platform/agent-builder.ts`#

1
interface AgentTemplate {
2
  id: string;
3
  name: string;
4
  description: string;
5
  tools: string[];
6
  defaultPrompt: string;
7
  icon: string;
8
}
9

10
export class AgentBuilder {
11
  private templates = new Map<string, AgentTemplate>();
12

13
  registerTemplate(t: AgentTemplate): void { this.templates.set(t.id, t); }
14

15
  async createFromTemplate(params: {
16
    templateId: string; name: string; team: string; environment?: string;
17
  }) {
18
    const t = this.templates.get(params.templateId);
19
    if (!t) throw new Error(`Template "${params.templateId}" not found`);
20

21
    // Auto-generate agent definition + default prompt
22
    // Activate prompt at 100%
23
    // Return agent + promptVersionId
24
  }
25
}
26

27
// Templates built-in
28
builder.registerTemplate({
29
  id: "github-issue-manager",
30
  name: "GitHub Issue Manager",
31
  tools: ["list_issues", "get_issue", "create_issue", "search_issues", "add_comment"],
32
  defaultPrompt: "You are {{name}}, a GitHub issue management agent...",
33
  icon: "🐙",
34
});
35

36
builder.registerTemplate({
37
  id: "code-review-assistant",
38
  name: "Code Review Assistant",
39
  tools: ["list_prs", "get_pr_diff", "comment_on_pr", "check_lint", "run_tests"],
40
  defaultPrompt: "You are {{name}}, a code review assistant...",
41
  icon: "📝",
42
});

Tạo agent chỉ với 1 API call:

1
curl -X POST https://platform.company.com/v1/agents/create \
2
  -d '{
3
    "templateId": "github-issue-manager",
4
    "name": "Support Issue Tracker",
5
    "team": "support"
6
  }'

Step 3: Governance — Approval Workflows#

Không phải ai cũng được deploy production mà không review.

`src/platform/governance.ts`#

1
export class GovernanceEngine {
2
  private policies = new Map<string, ApprovalPolicy>();
3

4
  addPolicy(p: ApprovalPolicy): void { /* ... */ }
5

6
  needsApproval(agent: AgentDefinition, type: string): boolean {
7
    // Kiểm tra tất cả policies
8
    // Nếu agent deploy production → cần 2 approvals từ platform-engineering
9
  }
10

11
  async requestApproval(agentId: string, changeType: string, requestedBy: string): Promise<ApprovalRequest> {
12
    // Tạo request pending
13
  }
14

15
  async approve(id: string, by: string): Promise<ApprovalRequest> {
16
    // Nếu đủ approvals → auto-approve
17
  }
18
}
19

20
// Policies mặc định:
21
governance.addPolicy({
22
  id: "prod-deploy",
23
  conditions: [{ attribute: "deployment.environment", operator: "eq", value: "production" }],
24
  requiredApprovals: 2,
25
  approverTeams: ["platform-engineering"],
26
});

Step 4: Unified Platform API#

`src/platform/api.ts`#

1
GET    /v1/agents                  → list agents
2
GET    /v1/agents/:id              → get agent
3
POST   /v1/agents                  → register agent
4
PATCH  /v1/agents/:id              → update agent
5
POST   /v1/agents/create           → create from template
6
GET    /v1/agents/:id/prompts      → list prompt versions
7
POST   /v1/agents/:id/prompts     → new prompt version
8
POST   /v1/agents/:id/experiments → start A/B test
9
POST   /v1/agents/:id/deploy      → deploy / rollout
10
GET    /v1/tools/usage             → tool impact analysis
11
POST   /v1/approvals/:id/approve  → approve deployment
12
POST   /v1/approvals/:id/reject   → reject deployment
13
GET    /v1/dashboard               → aggregated metrics
14
GET    /v1/infra/regions           → region status

Step 5: Onboarding Flow#

Support team muốn có agent:

1
# 1. Browse templates
2
GET /v1/templates
3

4
# 2. Tạo agent từ template
5
POST /v1/agents/create → support-issue-tracker
6

7
# 3. Custom prompt
8
POST /v1/agents/support-issue-tracker/prompts
9

10
# 4. Deploy staging
11
POST /v1/agents/support-issue-tracker/deploy
12

13
# 5. Deploy production → cần approval
14
POST /v1/agents/support-issue-tracker/deploy
15
→ { status: "approval-required", approvalId: "apr-..." }
16

17
# 6. Platform engineering approve
18
POST /v1/approvals/apr-.../approve
19

20
# 7. Monitor dashboard
21
GET /v1/dashboard

Summary#

Concept	Implementation
Agent registry	AgentRegistry — source of truth
Self-service	AgentBuilder với templates
Governance	GovernanceEngine với policies
Unified API	Express router v1/
Dashboard	DashboardService — aggregate metrics

Final Checklist:#

Registry tracks tất cả agents
Self-service agent creation từ templates
Governance policies enforce approvals
Dashboard aggregates health
Tool usage tracking cho impact analysis

Day	Chủ đề
1	Observability & Telemetry ✅
2	Caching Strategies ✅
3	Error Handling & Resilience ✅
4	A/B Testing Prompts & Configs ✅
5	Multi-Region & High Availability ✅
6	Building an Internal Agent Platform ✅

Series: AI Agents trong Production. Day 6: Internal Agent Platform wrapping Days 1-5 vào self-service creation, governance, và unified API.