XML Prompt Engineering Cheatsheet: Steering LLMs at Scale

XML (eXtensible Markup Language) tags are the most robust mechanism for structuring prompts for advanced LLMs, particularly Anthropic’s Claude 3.5 Sonnet and Google’s Gemini 1.5 Pro. These models were heavily trained on XML structures, allowing them to parse namespaces, isolate context, and follow structured output boundaries with extreme reliability.

This reference sheet covers standard XML prompt hierarchies, variable injections, few-shot patterns, and output boundary tags.

Isolation: Wrap different parts of your prompt in distinct XML tags (e.g., <instructions>, <context>, <examples>) to prevent the model from confusing instructions with data.
Variable Injection: Place user inputs or database payloads inside clear, singular tags like <user_input> or <database_record>.
Few-Shot Examples: Group each training example in an <example> tag to establish clear patterns.
Output Constraints: Force the model to wrap its response inside dedicated tags (e.g., <output> or <json>) to make programmatic parsing simple.

Before diving into this cheatsheet, check out my previous deep-dive on Claude System Prompts & Steering Cheatsheet: Architecting AI Behavior to see how we structured these patterns in practice.

1. Why XML Works (The Structural Advantage)

Traditional plain-text prompts often suffer from “instruction drift,” where the model confuses the background context or user data with the actual instructions it must execute.

XML tags act as hard structural boundaries. The model’s attention mechanism natively recognizes these tags as document segments, preventing prompt injection attacks and improving instruction adherence.

2. Core Prompt Hierarchy Template

Below is a production-ready XML prompt hierarchy. This structure ensures that Claude or Gemini parses instructions, context, rules, and variables without confusion:

<prompt_context>
  <system_instructions>
    You are an expert systems engineer. Your task is to analyze the provided log files 
    and identify performance anomalies. Follow the rules defined in the rules section.
  </system_instructions>

  <rules>
    - Only output valid JSON blocks inside the output tag.
    - Do not write any conversational text before or after the JSON block.
    - If no anomaly is found, return an empty JSON array.
  </rules>

  <context_data>
    <environment_metadata>
      AWS Account: production-spoke-1
      Region: us-east-1
      Service: eks-billing-engine
    </environment_metadata>

    <active_schema>
      {
        "timestamp": "string",
        "severity": "string",
        "message": "string"
      }
    </active_schema>
  </context_data>

  <user_input>
    [2026-06-01T15:10:00Z] [ERROR] [EKS-CNI] ENI allocation failed on node ip-10-0-1-52.ec2.internal. Out of IPs.
    [2026-06-01T15:10:05Z] [WARN] [SSM] Session Manager connection dropped by client.
  </user_input>
</prompt_context>

3. Few-Shot Pattern Template

When you need the model to follow a complex output style or formatting rule, few-shot examples are critical. Wrap each example in a grouped structure:

<instructions>
  Translate the raw developer logs into plain-english, user-friendly incident summaries.
</instructions>

<examples>
  <example>
    <raw_log>
      [2026-06-01T15:10:00Z] [ERROR] [EKS-CNI] ENI allocation failed. Out of IPs.
    </raw_log>
    <ideal_summary>
      The Kubernetes cluster tried to allocate a network interface to a new container, 
      but the subnet ran out of available IP addresses.
    </ideal_summary>
  </example>

  <example>
    <raw_log>
      [2026-06-01T15:10:05Z] [CRITICAL] [KMS] Decryption failed. Invalid Key ARN.
    </raw_log>
    <ideal_summary>
      The encryption service could not decrypt the database credentials because 
      the encryption key identifier specified in the configuration is invalid or missing.
    </ideal_summary>
  </example>
</examples>

<raw_log>
  ${USER_LOG}
</raw_log>

4. Output Boundary Tags (Programmatic Parsing)

To reliably extract data from an LLM response (especially when running automated agents), force the model to write its response inside a final tag.

The Trigger Prompt

Append this exact instruction to the end of your prompt:

Analyze the logs. You MUST output your final audit strictly within <audit_result> tags. 
Do not write any conversational intro or outro text.

The LLM Output

The model will output:

<audit_result>
{
  "anomalies_detected": true,
  "severity": "CRITICAL",
  "root_cause": "EKS subnet ran out of IP addresses during prefix delegation allocation."
}
</audit_result>

Extraction Script (Node.js)

You can easily extract this content in your backend code using a simple regular expression:

function extractXmlTag(content: string, tag: string): string | null {
  const regex = new RegExp(`<${tag}>([\\s\\S]*?)<\\/${tag}>`);
  const match = content.match(regex);
  return match ? match[1].trim() : null;
}

const rawResponse = await callClaude(prompt);
const jsonResult = extractXmlTag(rawResponse, 'audit_result');
if (jsonResult) {
  const data = JSON.parse(jsonResult);
  console.log("Extracted Data:", data);
}

5. Pro Tips for Claude & Gemini

Avoid Over-nesting: Keep XML tags to a maximum of 3 levels deep (e.g., <prompt> -> <context> -> <variables>). Over-nesting can confuse the model’s structural attention.
Match Namespaces: Always ensure your closing tags match your opening tags exactly (<instructions> closed by </instructions>). High-end models will self-correct minor typo mismatches, but exact tags guarantee deterministic behavior.
Use Singular Elements: Do not write tags with spaces or special characters (<user-input> or <user_input> is fine; <user input> is invalid).

Deepen your understanding with these curated continuations.

View All Articles

system-prompts5 min read

Claude System Prompts & Steering Cheatsheet: Architecting AI Behavior

Master the structure of advanced system prompts for Claude 3.5. Learn role-play constraints, thinking processes, XML configurations, and steering guardrails.

VishnuJun 18, 2026