
πŸ”πŸ”—πŸ€– otel-a2a-relay (o2r)

A2A coordination as OTel spans. Drop-in relay between A2A agents that turns wire traffic into traces any OTel-native observability tool can render.

otel-a2a-relay is the canonical name (repo, package, protocol doc). o2r is the dictation-friendly shortname used in CLI entrypoints (o2r, o2r-harness), span identifiers (service.name=o2r, the relay's agent.name), and prose below.

[Hero GIF: animated session topology. A hot-pink relay hub at the center, two agent leaves on either side, a particle traveling along the arc that connects them, faint trails of past hops fading behind it.]

A real session, animated. The relay is the magenta hub at the center; A and B are the leaves. Each particle is one A2A hop, drawn from a real Phoenix span; arcs above and below the chord let outbound and return hops cross visibly instead of overdrawing. Generate your own with make demo && make gif CTX=demo. Detailed below in Animated session topology.

Pitch

Two A2A agents talk to each other through this relay. Every message becomes one or more OTel spans, exported via OTLP/HTTP to whatever you've pointed OTEL_EXPORTER_OTLP_ENDPOINT at. The trace IS the operations view, no derived state needed.

  • Agent-facing format: A2A (JSON-RPC 2.0 over HTTP, AgentCards, message/send, tasks/get, tasks/cancel).
  • Relay-persistence format: OTel spans, OpenInference attributes for Phoenix's Agent Graph and Sessions views.
  • Trace propagation: W3C traceparent end-to-end. Client → relay → peer is one trace (sketched after this list).
  • Default visualizer: Phoenix. Anything OTLP-native works.
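
A minimal client-side sketch of one hop, assuming the relay listens on http://localhost:8000 and takes JSON-RPC at its root path. The method name, the metadata.agent.* keys, and the traceparent behavior are from this README; the params shape and the metadata nesting are illustrative.

import httpx
from opentelemetry.propagate import inject

headers: dict[str, str] = {}
inject(headers)  # injects the current W3C traceparent, so client -> relay -> peer shares one trace

resp = httpx.post(
    "http://localhost:8000/",  # assumed relay address
    headers=headers,
    json={
        "jsonrpc": "2.0",
        "id": 1,
        "method": "message/send",
        "params": {
            "message": {"role": "user", "parts": [{"kind": "text", "text": "hello B"}]},
            # metadata.agent.id names the sender, metadata.agent.target the
            # recipient; the nesting shown here (vs dotted keys) is an assumption
            "metadata": {"agent": {"id": "A", "target": "B"}},
        },
    },
)
task = resp.json()["result"]  # message/send returns an A2A Task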

Workspace layout

This repository is a uv workspace with a backend-agnostic core and per-backend extensions. Each member is its own publishable Python package; cross-package deps are wired through the workspace (sketched after the list below).

  • core/ - otel-a2a-relay-core: the relay HTTP server, tracing.bootstrap(), the echo A2A peer, the in-memory task store. No backend coupling. Point OTEL_EXPORTER_OTLP_ENDPOINT at any OTLP/HTTP collector.
  • arize_phoenix/ - otel-a2a-relay-arize-phoenix: Phoenix-side validation harness, REST/GraphQL query helpers, animated topology GIF renderer, annotation+dataset bootstrapper, make view CLI.
  • tempo_grafana/ - otel-a2a-relay-tempo-grafana: Tempo-side bootstrap helper, harness probe, dockerized Tempo+Grafana stack with provisioned datasource and a LUCA-flow Grafana dashboard.
  • examples/luca-flow/ - the AURORA microsite multi-agent demo, backend-agnostic.
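
In a uv workspace that wiring typically looks like the sketch below. The member names match the layout above, but the file contents are illustrative, not copied from the repo.

# pyproject.toml at the workspace root (illustrative)
[tool.uv.workspace]
members = ["core", "arize_phoenix", "tempo_grafana", "examples/luca-flow"]

# arize_phoenix/pyproject.toml (illustrative)
[project]
name = "otel-a2a-relay-arize-phoenix"
dependencies = ["otel-a2a-relay-core"]

[tool.uv.sources]
otel-a2a-relay-core = { workspace = true }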

Quickstart

Pick a backend (or run both side by side - they coexist on different ports). All paths work identically through core's tracing.bootstrap().

Phoenix backend

uv sync --all-packages
make phoenix-fg                   # in another terminal (operator-owned)
make phoenix-bootstrap            # one-time annotation configs + datasets
OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:6006 make luca-demo
open http://localhost:6006        # Phoenix Sessions tab

Tempo + Grafana backend

uv sync --all-packages
make tempo-up                     # docker compose Tempo + Grafana
OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318 make luca-demo
open http://localhost:3000/d/luca-flow/luca-flow

Other backends

tracing.bootstrap() speaks standard OTLP/HTTP - point it at Honeycomb, Datadog, or any OTel-native backend by setting OTEL_EXPORTER_OTLP_ENDPOINT (sketch below). The protocol attributes (session.id, agent.role, o2r.*) work everywhere; backend-specific UX (annotation configs in Phoenix, dashboards in Grafana) is added by extension packages.
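
A sketch of what a peer's startup looks like. tracing.bootstrap() and its module path are from this repo; calling it with no arguments is an assumption about its signature.

import os

# any OTLP/HTTP collector works: Phoenix, Tempo, Honeycomb, Datadog, ...
os.environ.setdefault("OTEL_EXPORTER_OTLP_ENDPOINT", "http://localhost:4318")

from otel_a2a_relay_core import tracing

tracing.bootstrap()  # one TracerProvider per process, OTLP/HTTP exporter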

Make targets

Workspace-level (root Makefile):

  • sync - uv sync --all-packages
  • test / lint / fmt - dispatched per-package
  • luca-demo - run the AURORA flow against $OTEL_EXPORTER_OTLP_ENDPOINT
  • luca-test - byte-snapshot diff dist/ against examples/luca-flow/tests/snapshots/

Tempo+Grafana extension:

  • tempo-up / tempo-down / tempo-clean - docker compose lifecycle
  • tempo-logs / tempo-status - tail / inspect
  • tempo-harness - post worked-example trace, print Grafana link

Phoenix extension:

  • phoenix-fg - run Phoenix in foreground
  • phoenix-harness - post worked-example trace
  • phoenix-bootstrap - provision annotation configs + datasets
  • phoenix-bootstrap-dry-run - print plan, no writes

Topology

[Diagram: relay topology, simplest case: one client, one relay, one peer, one trace.]

This is the simplest shape the relay supports: one client, one relay, one peer, one trace. Real flows are more interesting. The LUCA-flow demo below runs eight workers, an orchestrator, a planner, a validator, and a deployer through this same relay, with star-topology enforcement, retries, a deliberate worker crash, and a rogue worker that gets gated by the relay.

The relay's peer registry comes from OTEL_A2A_RELAY_PEERS=A=http://...,B=http://.... The Makefile sets this for you. If a target in metadata.agent.target has no peer registered, the relay synthesizes a completed Task and skips the forward.
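
The format is small enough to sketch. This parser illustrates the registry format and the gate behavior described above; it is not the relay's actual code.

import os

def parse_peers(raw: str) -> dict[str, str]:
    """Split 'NAME=URL,NAME=URL' into {name: base_url}."""
    peers: dict[str, str] = {}
    for entry in raw.split(","):
        if not entry:
            continue
        name, _, url = entry.partition("=")  # split on the first '=' only
        peers[name.strip()] = url.strip()
    return peers

PEERS = parse_peers(os.environ.get("OTEL_A2A_RELAY_PEERS", ""))

def route(target: str | None) -> str | None:
    if target is None or target not in PEERS:
        return None  # caller synthesizes a completed Task and skips the forward
    return PEERS[target]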

Diagram source: scripts/render_topology.py. Regenerate with uv run --with matplotlib python scripts/render_topology.py.

Animated session topology

assets/topology.png (above) is the protocol-shape illustration, a fixed cartoon. assets/session-topology.gif (the hero at the top) is the temporal one: real OTel spans for one session, animated by start time, against the same star.

make phoenix-fg                # operator-owned, in another terminal
make demo                      # produces a `demo` session
OUT=mine.gif make gif CTX=demo # writes mine.gif from real Phoenix spans

The renderer pulls every span tagged with session.id == $CTX from Phoenix's GraphQL endpoint, reduces them into hops (parent -> agent), auto-detects the relay as the hub, sorts the leaves alphabetically for a stable color palette, and animates each hop in start-time order. Two hops in the same tick render with their arcs bowed in opposite directions, so a forward-and-return pair reads as crossings rather than as a single overdrawn line.
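
A sketch of that reduction, assuming each span record carries span_id, parent_id, start_time, and an agent attribute (field names are illustrative):

from collections import Counter

def to_hops(spans: list[dict]) -> list[tuple[str, str]]:
    # one hop per span: parent agent -> agent, in start-time order
    by_id = {s["span_id"]: s for s in spans}
    hops = []
    for s in sorted(spans, key=lambda s: s["start_time"]):
        parent = by_id.get(s.get("parent_id"))
        if parent is not None:
            hops.append((parent["agent"], s["agent"]))
    return hops

def hub_and_leaves(hops: list[tuple[str, str]]) -> tuple[str, list[str]]:
    counts = Counter(agent for hop in hops for agent in hop)
    hub = counts.most_common(1)[0][0]     # the relay touches every hop, so it wins
    leaves = sorted(set(counts) - {hub})  # alphabetical -> stable color palette
    return hub, leaves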

Determinism is baked in: the same session.id against the same Phoenix DB produces a byte-identical GIF. Tests assert this against a synthetic-span fixture in arize_phoenix/tests/fixtures/sessions.py, so a renderer regression fails CI before the README hero drifts. The renderer is Pillow-only (no matplotlib): freetype ships with Pillow, JetBrains Mono ships in arize_phoenix/src/otel_a2a_relay_arize_phoenix/viz/assets/, and the GIF palette is built once and reused across frames. To regenerate the README hero intentionally after a renderer change, run python -m tests.fixtures.regen_session_gifs from arize_phoenix/ and commit the new bytes.

The viz extra is opt-in:

uv sync --extra viz

make gif does this automatically. The base relay install stays Pillow-free.

Methods

  • message/send - send a message, get a Task back. The originator sets metadata.agent.id (sender) and optionally metadata.agent.target (recipient).
  • message/stream - same envelope as message/send, but the response is text/event-stream carrying A2A status-update and artifact-update events. The relay forwards the SSE through and emits one a2a.message.stream_chunk span event per artifact (consumption sketched after this list).
  • tasks/get - retrieve a Task by id from the relay's in-memory store. Each peer agent indexes its own tasks too.
  • tasks/cancel - mark a Task as canceled and emit an a2a.task.cancel span.
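
A consumption sketch for message/stream, assuming the same relay address as above. The SSE framing (data: lines) is standard; the params shape is illustrative.

import httpx

payload = {
    "jsonrpc": "2.0",
    "id": 2,
    "method": "message/stream",
    "params": {"message": {"role": "user", "parts": [{"kind": "text", "text": "stream this"}]}},
}
with httpx.stream("POST", "http://localhost:8000/", json=payload, timeout=None) as resp:
    for line in resp.iter_lines():
        if line.startswith("data: "):
            # one A2A status-update or artifact-update event per data line;
            # each artifact also becomes an a2a.message.stream_chunk span event
            print(line[len("data: "):])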

The peer agent serves an A2A AgentCard at /.well-known/agent.json (capabilities, skills, protocol version). The relay's GET /peers aggregates them for discovery.
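
Discovery, sketched with assumed ports. The AgentCard path and GET /peers are from this README; the card field names follow the A2A spec.

import httpx

card = httpx.get("http://localhost:9001/.well-known/agent.json").json()
print(card.get("name"), card.get("capabilities"))

peers = httpx.get("http://localhost:8000/peers").json()  # relay aggregates peer cards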

Span shape

Every a2a.task span carries session.id, a2a.task.id, agent.id, graph.node.id, graph.node.parent_id, openinference.span.kind=AGENT, plus input.value / output.value (OpenInference) and the a2a.message.text / a2a.message.reply_text shortcuts. State changes are span events (a2a.task.state_change with from / to). Stream chunks are span events (a2a.message.stream_chunk with seq / final).
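
What emitting that shape looks like through the standard OTel API; the attribute names are from this README, the values are illustrative.

from opentelemetry import trace

tracer = trace.get_tracer("o2r")

with tracer.start_as_current_span(
    "a2a.task",
    attributes={
        "session.id": "demo",
        "a2a.task.id": "task-123",
        "agent.id": "B",
        "graph.node.id": "B",
        "graph.node.parent_id": "o2r",
        "openinference.span.kind": "AGENT",
        "input.value": "hello B",
        "a2a.message.text": "hello B",
    },
) as span:
    span.add_event("a2a.task.state_change", {"from": "submitted", "to": "working"})
    span.add_event("a2a.message.stream_chunk", {"seq": 0, "final": True})
    span.set_attribute("output.value", "echo: hello B")
    span.set_attribute("a2a.message.reply_text", "echo: hello B")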

The original v0.1 protocol document at docs/protocol.md records the precedent: it explains why agent identity rides on attributes (Phoenix drops Resource attributes), why the Agent Graph uses graph.node.* (Phoenix doesn't expose span links), and why state changes are events, not spans (tree noise vs a queryable timeline).

Layout

  • core/src/otel_a2a_relay_core/server.py - the relay (FastAPI, JSON-RPC, peer routing, span emission).
  • core/src/otel_a2a_relay_core/agent.py - tiny echo peer agent.
  • core/src/otel_a2a_relay_core/store.py - thread-safe in-memory task store.
  • core/src/otel_a2a_relay_core/tracing.py - tracing.bootstrap(), backend-agnostic OTel setup.
  • core/src/otel_a2a_relay_core/telemetry.py - one TracerProvider per process, OTLP/HTTP exporter.
  • arize_phoenix/src/otel_a2a_relay_arize_phoenix/harness.py - Phoenix-validation harness.
  • arize_phoenix/src/otel_a2a_relay_arize_phoenix/bootstrap.py - annotation configs + datasets bootstrapper.
  • arize_phoenix/src/otel_a2a_relay_arize_phoenix/client.py - dogfood CLI (send, view, get, tasks, cancel, peers).
  • arize_phoenix/src/otel_a2a_relay_arize_phoenix/viz/ - GIF rendering of session topologies (Pillow).
  • tempo_grafana/src/otel_a2a_relay_tempo_grafana/bootstrap.py - bootstrap_tempo() helper.
  • tempo_grafana/src/otel_a2a_relay_tempo_grafana/harness.py - Tempo harness probe.
  • tempo_grafana/docker/ - dockerized Tempo + Grafana stack (datasource + dashboard provisioned).
  • examples/luca-flow/src/luca/ - AURORA-microsite multi-agent demo, backend-agnostic.
  • scripts/bg.sh - pidfile-backed background process manager.
  • scripts/wait-healthy.sh - poll /healthz until 2xx.
  • Makefile - workspace-level orchestrator that dispatches to per-package targets.

LUCA-flow demo

examples/luca-flow/ is a real multi-agent choreography that dogfoods the relay end-to-end. Eight worker subprocesses + an orchestrator + a planner + a validator + a deployer build the AURORA microsite (a fictional consumer desk lamp marketed as if it physically channels solar-wind charged particles) from real public-domain NASA imagery committed to the repo. Star topology is enforced by the relay; one worker deliberately crashes, another deliberately tries to bypass the orchestrator and gets a -32010 from the relay's gate.
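
The rogue worker's bypass attempt comes back as a plain JSON-RPC error. The -32010 code is from this README; the message text is illustrative.

{
  "jsonrpc": "2.0",
  "id": 7,
  "error": {"code": -32010, "message": "peer-to-peer hop rejected: all traffic goes through the orchestrator"}
}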

The demo only depends on otel-a2a-relay-core. Pick whichever backend you want to send the spans to:

# Phoenix
make phoenix-fg
OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:6006 make luca-demo

# Tempo + Grafana
make tempo-up
OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318 make luca-demo

The same flow runs in CI on every push (.github/workflows/luca-demo.yml), with Phoenix in CI as a background process. The built dist/ is uploaded as a workflow artifact. See examples/luca-flow/README.md for the choreography and validation rules.

Related

Operator CLI: coily will grow a channel for this once that side catches up. Origin discussion: coilysiren/coilyco-ai#24.

License

MIT.
