Guide

All Guide resources on ChangeGamer — agent-first, machine-readable.

Getting Started for Agents
How autonomous agents should query, parse, and cite ChangeGamer resources.

#agents #usage #llms.txt · updated 2026-06-19 · .md
Finding and Evaluating MCP Servers
How to discover, assess, and safely integrate MCP servers into agent pipelines.

#mcp #tools #agents #security · updated 2026-06-11 · .md
Agentic Security Checklist
Cross-vendor security checklist, organized by threat surface, for building and operating AI agents. Synthesizes OWASP, NIST, Anthropic, OpenAI, Google SAIF, and MITRE ATLAS.

#security #agents #prompt-injection #mcp #checklist #owasp · updated 2026-06-15 · .md
MCP vs A2A: Two Protocols, Two Roles
Compact comparison of the Model Context Protocol (agent↔tool) and the Agent2Agent Protocol (agent↔agent): purpose, topology, transport, discovery, auth, governance, and when to use each.

#mcp #a2a #protocols #agents #interoperability · updated 2026-06-12 · .md
Reliable Tool Calling and Structured Outputs
How providers guarantee schema-valid tool calls and structured output — mechanisms, failure modes, and mitigations — for production agent builders.

#tool-calling #structured-outputs #json-mode #constrained-decoding #agents #reliability · updated 2026-06-15 · .md
Agent Memory and Context Management
Architecture reference for agent memory: types (working, long-term, episodic, semantic, procedural), context-management techniques (summarization, RAG, sliding windows, prompt caching), storage substrates, and memory frameworks — with security notes and cross-links to related guides.

#memory #context-window #rag #vector-databases #agents #architecture · updated 2026-06-15 · .md
Agent Observability and Tracing
Why agents need observability beyond app logs, how OpenTelemetry GenAI semantic conventions model agent runs as traces, key signals to capture, and a verified tooling landscape.

#observability #tracing #opentelemetry #agents #debugging #evals · updated 2026-06-15 · .md
RAG and Retrieval for Agents
End-to-end practitioner reference for Retrieval-Augmented Generation: pipeline stages, chunking strategies, dense/sparse/hybrid retrieval, reranking, agentic retrieval patterns, quality failure modes, and evaluation — with verified sources for every named technique.

#rag #retrieval #embeddings #chunking #reranking #agents #vector-databases #evaluation · updated 2026-06-15 · .md
Computer Use and Browser Automation for Agents
Two-layer reference: vendor computer-use APIs (Anthropic, OpenAI CUA, Google Gemini) that translate screenshots into actions, and the open harnesses (Playwright MCP, browser-use, Stagehand, Skyvern) that execute those actions. Covers loop mechanics, reliability tradeoffs, and security gates.

#computer-use #browser-automation #playwright #anthropic #openai #agents #gui #security · updated 2026-06-15 · .md
Multi-Agent Orchestration Patterns
Vendor-neutral reference covering when multi-agent systems pay off and nine named patterns — from single-agent baseline through hierarchical and blackboard architectures — with tradeoffs, cross-cutting concerns, and a decision guide.

#agents #multi-agent #orchestration #architecture #patterns #design · updated 2026-06-15 · .md
Code Execution Sandboxing for Agents
Isolation spectrum from language sandboxes to microVMs, WebAssembly as a portable sandbox, and a verified comparison of hosted agent-sandbox APIs — for agents that need to run model-generated code safely.

#sandboxing #security #code-execution #microvm #wasm #agents #isolation #firecracker #gvisor · updated 2026-06-15 · .md
Guardrails and Safety Filters for Agents
Runtime input/output/action controls that enforce policy independently of the model — tooling landscape, techniques, and layering guidance.

#safety #guardrails #agents #security #moderation #prompt-injection · updated 2026-06-15 · .md
Agent Cost and Latency Optimization
Practitioner reference for reducing the cost and latency of production AI agents: the compounding model, token-level levers (caching, pruning), request-level levers (Batch API, parallelism), model-level levers (routing, reasoning-effort controls), and architecture-level levers (step reduction, semantic caching, code offloading).

#cost #latency #optimization #agents #prompt-caching #batch-api #model-routing #architecture · updated 2026-06-15 · .md
Voice and Realtime Agents
Architectures, vendor APIs, and open frameworks for real-time speech-to-speech AI agents — cascaded pipeline vs. native multimodal, VAD/turn detection, barge-in, latency budget, and tool calling in a voice loop.

#voice #realtime #speech #stt #tts #vad #agents #webrtc #websocket · updated 2026-06-16 · .md
Prompt and Context Engineering for Agents
From crafting a single prompt to managing everything an agent sees across a trajectory: system-prompt design, context-window management, failure modes, and a high-leverage checklist.

#prompt-engineering #context-engineering #agents #system-prompt #context-window #few-shot #chain-of-thought · updated 2026-06-16 · .md
Agent Reasoning and Design Patterns
The canonical single-agent reasoning and acting loops: ReAct, Chain-of-Thought, Plan-and-Solve, ReWOO, Reflexion, Tree-of-Thoughts, and Self-Consistency — what each is, when to use it, and tradeoffs.

#agents #reasoning #react #chain-of-thought #planning #reflection #patterns · updated 2026-06-16 · .md
Durable Execution for Long-Running Agents
Vendor-neutral reference on durable execution: event logs, replay determinism, idempotency, retries, and human-in-the-loop pause/resume — plus a cross-vendor survey and tradeoffs guide for Temporal, Restate, DBOS, Inngest, Step Functions, Azure Durable Functions, Cloudflare Workflows, GCP Workflows, LangGraph, and OpenAI Agents SDK.

#agents #durable-execution #workflows #reliability #idempotency #human-in-the-loop · updated 2026-06-17 · .md
Multimodal Agents: Vision, Documents, and Screens
How agents perceive and reason over images: VLM mechanics, image-input APIs across major providers, open-weight VLM families, grounding/pointing, failure modes, and practical guidance for agent builders.

#multimodal #vision #vlm #images #ocr #grounding #agents #open-weight · updated 2026-06-16 · .md
Agent Identity and Authentication
How autonomous agents prove who they are and get authorized to act: workload identity vs. delegated authority, SPIFFE/SPIRE, cloud workload federation, OAuth token exchange, audience binding, and emerging standards — with practical guidance and verified sources.

#identity #authentication #oauth #spiffe #workload-identity #security #agents #delegation · updated 2026-06-16 · .md
Building an MCP Server
Implementation guide for MCP servers: architecture roles, the three server primitives, stdio vs Streamable HTTP transports, official SDKs, server lifecycle, remote-server concerns, testing with MCP Inspector, and publishing to the official registry.

#mcp #tools #protocols #agents #implementation · updated 2026-06-17 · .md
Streaming Responses for Agents
Transport formats, provider event schemas, and practical concerns for consuming streamed LLM responses in production agents: SSE mechanics, OpenAI and Anthropic chunk formats, partial-JSON tool-call parsing, backpressure, cancellation, and gateway proxying.

#streaming #sse #server-sent-events #openai #anthropic #gemini #tool-calling #latency #agents · updated 2026-06-21 · .md
Text-to-SQL and Database Agents
How agents answer questions over structured data by generating and executing SQL: schema context, few-shot prompting, self-correction, safety constraints, benchmarks (Spider, BIRD-SQL), and tooling (LangChain SQLDatabaseToolkit, LlamaIndex NLSQLTableQueryEngine, Vanna, MCP Postgres server).

#text-to-sql #sql #database #schema-linking #agents #rag #security #benchmarks #langchain #llamaindex · updated 2026-06-21 · .md
Knowledge Graphs and GraphRAG for Agents
Graph-structured retrieval: when and how to use knowledge graphs over vector RAG for multi-hop, relational, and global corpus queries.

#rag #knowledge-graph #graphrag #retrieval #neo4j #agents · updated 2026-06-21 · .md
Testing AI Agents in CI
How to write deterministic, fast, CI-friendly tests for non-deterministic agents: the three-layer test pyramid, LLM mocking, cassette/VCR-style replay, snapshot testing of tool-call trajectories, pass@k thresholds, and verified tooling.

#testing #CI #agents #mocking #determinism #tool-calling #pytest · updated 2026-06-21 · .md
Generative UI and Agent-to-UI Protocols
How agents drive UI dynamically: the AG-UI protocol, framework options (Vercel AI SDK, CopilotKit, assistant-ui, LangGraph), streaming component patterns, and human-in-the-loop UI design.

#generative-ui #ag-ui #frontend #streaming #human-in-the-loop #copilotkit #agents #protocol · updated 2026-06-21 · .md
Fine-Tuning vs RAG vs Prompting
Decision guide for agent builders: when to use prompting, RAG, or fine-tuning — and how they combine. Covers SFT, LoRA/QLoRA, DPO, distillation, and a symptom-to-fix table.

#fine-tuning #rag #prompting #lora #dpo #sft #distillation #agents #decision-guide · updated 2026-06-21 · .md
Data Privacy and PII for Agents
How autonomous agents expose PII — context ingestion, tool calls, memory, logs — and the controls that contain it: detection, redaction, data minimization, provider ZDR tiers, GDPR, EU AI Act, CCPA, and a practical compliance checklist.

#privacy #pii #gdpr #compliance #data-protection #redaction #agents #security · updated 2026-06-21 · .md