491 từ
2 phút đọc
Agent Security 2026: Agent Auditing & Compliance — SOC2, GDPR và PCI cho AI Agents

Agent đưa ra quyết định mà không auditable là liability.

Khi agent xử lý payment, modify database, gửi email — và có gì sai — bạn cần biết chính xác chuyện gì xảy ra. Agent nào. Tool nào. Parameters gì. Context gì dẫn đến quyết định. Ai approved (nếu có human). Agent đã nghĩ gì khi đưa ra quyết định.

Đây là compliance layer. Và nó là phần khó nhất khi deploy agents trong regulated environments.


Ba Compliance Frameworks#

FrameworkYêu cầuTại sao Agents khó
SOC 2Controls security, availability, processing integrityAgent decisions non-deterministic
GDPRRight to explanation, data minimization, deletionAgents ingest data opaque ways
PCI DSSCardholder data protection, audit trailsAgents may log payment data inadvertently

Pillar 1: Audit Trails#

Agent Audit Record#

@dataclass
class AgentAuditRecord:
audit_id: str
agent_id: str
agent_version: str # Prompt version, steering file hash
session_id: str
user_id: str
timestamp: datetime
raw_input: str # Original, encrypted
sanitized_input: str # PII-redacted
tool_calls: list[dict] # Tool name, params, result
raw_output: str # Encrypted
sanitized_output: str # PII-redacted
contains_pii: bool
contains_pci: bool
human_approvals: list[dict]
prev_hash: str # Hash chain for immutability
hash: str

What to Log#

EventLogRetention
Session startAgent version, user, session ID90 days
User messageRaw + sanitized90 days (raw PII: 30)
Tool callTool, params, result90 days
Tool call approvalApprover, timestamp7 years (regulated)
Guardrail triggerGuardrail, action1 year
Session endSummary, cost90 days

Pillar 2: Explainability#

class ReasoningCapturer:
def capture_thinking(self, thought):
self.reasoning_log.append({"type": "thought", "content": thought})
def capture_tool_decision(self, tool, args):
self.reasoning_log.append({"type": "tool_decision", "tool": tool})
def get_explainability_report(self):
# Generate human-readable explanation
# Chain of thought + why each tool was called

GDPR Right to Explanation#

async def generate_explanation(user_id, session_id):
records = await audit.get_session_trail(session_id)
return {
"automated_decision": {"made": True, "logic": "LLM with tool access"},
"decision_sequence": [
{"input": r.sanitized_input, "tools": [t["tool"] for t in r.tool_calls]}
for r in records
]
}

Pillar 3: PII/PCI Data Handling#

class PIIHandler:
# Regex patterns: email, phone, SSN, credit card
# NLP-based: Presidio Analyzer
async def process_input(self, text):
# 1. Fast regex detection
# 2. NLP-based detection
# 3. Redact with Presidio Anonymizer
# 4. PCI: never store full PAN, only last 4 digits

Data Retention#

TypeStandardPIIPCI
Conversation90 days30 days0 days (never store raw)
Tool calls90 days90 days365 days
Audit365 days365 days7 years

Pillar 4: SOC 2 Controls#

SOC 2 CriterionAgent Control
CC6.1 Logical accessMCP server auth (API key + JWT)
CC6.6 Incident detectionInjection guardrails
CC7.2 MonitoringFull audit logging
CC8.1 Change managementPrompt versioning (Git PR)
A1.2 Processing integrityTool call validation

Pillar 5: PCI Compliance#

class PCIComplianceLayer:
async def process_payment(self, params):
# PCI 3.4: Never log full card numbers
params["card_number"] = f"****-****-****-{pan[-4:]}"
# Send full PAN directly to processor, never store
# PCI 10.2: Audit trail (no PAN, no CVV, no expiry)
await self.pci_audit_log.append({
"event": "payment_processed",
"card_last_four": pan[-4:],
"amount": params["amount"],
})

Production Checklist#

  • Mọi interaction có audit record đầy đủ
  • Audit records encrypted at rest
  • Hash chains cho immutability
  • PII detected và redacted trước khi store
  • PCI card data never stored (only last 4 digits)
  • Reasoning traces captured cho explainability
  • GDPR right to explanation endpoint
  • GDPR right to erasure anonymizes
  • Data retention policies applied automatically
  • SOC 2 evidence collectable on demand
  • Audit logs append-only

Tiếp Theo#

BàiChủ đề
1Prompt Injection & Defense
2Tool Access Control
3MCP Server Security
4Agent Auditing & Compliance (bài này)
5Production Security Patterns — coming next

Series: Agent Security 2026 — Production Patterns. Bài 4: Agent Auditing & Compliance.

Advertisement

Agent Security 2026: Agent Auditing & Compliance — SOC2, GDPR và PCI cho AI Agents
https://minixium.com/vi/posts/agent-security-auditing-compliance-soc2-gdpr-pci-ai-agents-vi/
Tác giả
Minixium
Đăng vào lúc
2026-06-04
Giấy phép bản quyền
CC BY-NC-SA 4.0

Advertisement