Case Study: Gulf + Middle East Hybrid Intelligence Skill

TL;DR

Evidence

Project state (self-reported)

Context / Constraint

Generic LLMs produce broad, fluent commentary on the Gulf and the wider Middle East: country narration, hand-wavy "Iran tensions," vague chokepoint risk, no transmission mechanism, no actor incentives, no trigger points, no evidence boundaries, and a tendency to collapse Iran-state / IRGC-affiliated / Iran-private commercial actors into one undifferentiated actor.

That output is not decision-useful for sanctions compliance, energy trading, shipping insurance, Gulf banking, sovereign-wealth deal teams, or Iran-watcher analysts who actually have exposure to the region.

The skill needed to be small enough to attach to any capable agent and strict enough to actually change the shape of regional analysis — without becoming a screening tool, vessel-tracking product, or compliance platform.

Problem

Most AI-generated regional analysis on the Gulf and Middle East is fluent but decision-light. It rarely traces how a sanction designation, a chokepoint incident, or a sovereign-wealth deployment transmits into bank exposure, refining margins, charter rates, insurance premia, or counterparty contamination. It rarely separates verified facts from informed inference. It rarely names trigger points that would update the view. And it consistently fails the Iran-state / IRGC / Iran-private distinction that every serious sanctions compliance question depends on.

That is fine for background reading. It is weak for sanctions-exposure decisions, energy-trade structuring, shipping route posture, Gulf-bank counterparty review, sovereign-wealth co-investment screens, or any regional risk decision that has to be defensible.

Actions

What it does now

Trust-layer update — 2026-05

Additions tightening behavior under bad regional inputs. Single-author work; does not change STATUS.md — external-review bars (B2.2, B2.7) remain open.

What it is not

Portfolio context

This skill is a vertical specialist layer in a four-repo portfolio designed to compose:

This repo does not duplicate any neighbor. The broader memo workflow lives in Global Think Tank Analyst; validation tooling lives in Agenda Intelligence MD; Central-Asia regional depth lives in its own vertical.

Why this version is better

The skill is small enough to attach to any capable agent, and strict enough to change the shape of regional output. The contract does not ask the model to sound regionally smart; it asks the model to trace mechanism, label evidence, name the trigger, distinguish Iran-actor types, and say what role the implication is for.

That is the part most generic Gulf / Middle East commentary misses.

Before / after (illustrative)

Excerpt condensed for this page. Full memos with full transmission mechanism, exposure map, leverage shifts, and triggers live in examples/. Evidence modes are explicitly labeled per example — live-source-backed, user-provided sources, illustrative source packet, or reasoning-only.

User question: "We are a European bank's sanctions desk reviewing onboarding of a UAE-licensed trading counterparty that appears on a US OFAC SDN-related listing under a non-Iran programme but is in good standing with its UAE regulator. We clear USD through a US correspondent. Should we onboard?"

Before — generic regional commentary:

The UAE is a major financial hub with strong regulatory standards. OFAC sanctions present additional complexity for non-US banks. Banks should carefully balance their compliance obligations across jurisdictions and consult their sanctions desks. Maintaining strong correspondent relationships is essential.

That is fluent regional commentary. It does not say which list is operative for this bank's exposure, what the actual transmission mechanism is, or what would update the view.

After — with the Gulf + Middle East skill attached:

The skill does not screen sanctions, retrieve sources, or verify facts. It forces the agent to apply the currency trigger (mandatory live OFAC lookup), refuse the "which list wins" framing, distinguish actor regimes, and produce role-specific implications.

Tech stack

Relevance

This project demonstrates how I think about useful agent infrastructure for high-stakes regional reasoning in a domain where actor distinctions, source-tier discipline, and currency-sensitivity are the binding constraints: small reusable layers, mechanism-first contracts, honest evidence discipline, and outputs aimed at sanctions, banking, energy, shipping, and sovereign-wealth decisions — composed cleanly with a horizontal skill and a separate infrastructure layer instead of bundling everything into one repo.

Project links

Author: Vassiliy Lakhonin