Feature Plan: End-to-End Observability and Traceability
Goal
Provide trace-level visibility into every message, from the ingress transport event through to UI delivery and acknowledgement.
Why This Fits GIA
- Multi-hop messaging systems require correlation IDs to debug reliably.
Scope
- Global trace IDs for message lifecycle.
- Structured logs and timeline diagnostics view.
- Basic metrics and SLA dashboards.
Implementation
- Inject trace_id at ingress/send initiation.
- Propagate through router, persistence, websocket, command/task flows.
- Standardize structured log schema across services.
- Add a timeline diagnostics page searchable by trace ID and session.
- Add core metrics: ingress latency, send latency, drop rate, retry counts.
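The injection and logging steps above could be sketched roughly as follows. This is a minimal illustration, not the planned implementation: the helper names (start_trace, log_event) and the record fields (ts, stage, event) are assumptions made for the example, and a context variable is just one possible way to carry the ID between hops inside a single process.

```python
import contextvars
import json
import time
import uuid

# Hypothetical holder for the current trace ID (assumption, not an existing GIA API).
_trace_id = contextvars.ContextVar("trace_id", default=None)


def start_trace(existing=None):
    """Set the trace ID at ingress/send initiation, reusing an upstream ID if given."""
    tid = existing or uuid.uuid4().hex
    _trace_id.set(tid)
    return tid


def log_event(stage, event, **fields):
    """Emit one structured log record in an assumed shared schema."""
    record = {
        "ts": time.time(),            # wall-clock timestamp in seconds
        "trace_id": _trace_id.get(),  # picked up from the current context
        "stage": stage,               # e.g. "router", "persistence", "websocket"
        "event": event,
        **fields,
    }
    print(json.dumps(record, sort_keys=True))
    return record
```

A caller would invoke start_trace() once when a message enters the system, then call log_event("router", "routed", session_id=...) at each hop. Crossing a process boundary would still require passing the ID explicitly, for example in a message header.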
Acceptance Criteria
- A single trace ID is enough to reconstruct the full message path.
- At least 95% of critical paths emit structured trace logs.
- Operators can isolate the bottleneck stage in under 2 minutes.
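As a rough illustration of how the first and third criteria might be checked, the sketch below rebuilds a timeline from structured log records and reports the stage preceded by the largest gap. The record shape (trace_id, stage, ts in seconds) and the function names are assumptions for the example, and attributing each gap to the later stage is a simplification.

```python
def build_timeline(records, trace_id):
    """Return the records for one trace ID, ordered by timestamp."""
    events = [r for r in records if r.get("trace_id") == trace_id]
    return sorted(events, key=lambda r: r["ts"])


def slowest_stage(timeline):
    """Return (stage, seconds) for the largest gap between consecutive events.

    The gap is attributed to the stage of the later event.
    """
    worst = (None, 0.0)
    for prev, cur in zip(timeline, timeline[1:]):
        gap = cur["ts"] - prev["ts"]
        if gap > worst[1]:
            worst = (cur["stage"], gap)
    return worst


# Made-up sample records, deliberately out of order.
records = [
    {"trace_id": "t1", "stage": "websocket", "ts": 12.75},
    {"trace_id": "t1", "stage": "ingress", "ts": 10.0},
    {"trace_id": "t2", "stage": "ingress", "ts": 11.0},
    {"trace_id": "t1", "stage": "persistence", "ts": 12.5},
    {"trace_id": "t1", "stage": "router", "ts": 10.5},
]

timeline = build_timeline(records, "t1")
print([e["stage"] for e in timeline])
print(slowest_stage(timeline))
```

With this sample data the timeline runs ingress, router, persistence, websocket, and the largest gap (2.0 seconds) falls before the persistence event.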
Out of Scope
- Full distributed tracing vendor integration.