The Incidents page groups related problems — anomalies, error spikes, latency regressions — into a single timeline so you can investigate them as a unit. When LumiqTrace detects that multiple signals are happening at the same time and likely have a common cause, it opens an incident automatically. You can also open incidents manually for any issue you want to track to resolution.Documentation Index
Fetch the complete documentation index at: https://docs.lumiqtrace.com/llms.txt
Use this file to discover all available pages before exploring further.
Incident detection and management require the Team or Scale plan.
How incidents are created
LumiqTrace creates incidents automatically when:- An anomaly is detected (cost spike, error rate jump, latency surge) and a related alert rule fires within 15 minutes
- Three or more traces with the same error code occur within a 10-minute window
- An AI analysis detects a pattern across multiple traces that it classifies as a systemic issue
Incident states
Each incident moves through three states:| State | Meaning |
|---|---|
| Detecting | The platform is still gathering signals — the issue may still be developing |
| Active | Confirmed ongoing issue requiring attention |
| Resolved | The issue is no longer occurring |
The incidents list
The main Incidents page shows a table of all incidents, ordered by most recent. Each row shows:- Severity —
high,medium, orlow, based on impact to cost or error rate - Title — a one-sentence summary of the issue
- Affected metric — which measurement is out of range
- State — detecting, active, or resolved
- Duration — how long the incident has been open
- Models affected — which model(s) are involved
Incident detail
Click any incident to open its detail view. The detail view shows:Timeline
A chronological feed of all signals related to this incident:- Anomaly detections with their explanation and severity
- Alert rule triggers with threshold and actual value
- Related error traces (grouped by error code)
- Configuration changes that may have contributed (from the audit log)
Root cause summary
For incidents where an AI analysis has been run, a root cause summary appears at the top of the timeline. This is generated by running the AI root cause analyzer across the correlated traces and summarizing the findings into a coherent narrative.Affected traces
A filtered list of the specific traces associated with this incident, with error codes, latency, and cost. Click any trace to open it in the flame graph view.Resolution notes
A free-text field where you can record what you found and how you fixed it. Resolution notes are preserved after the incident closes and appear in the incident history. Use them to build a runbook for recurring issues.Resolving an incident
Click Mark resolved on any active incident. You’ll be prompted to add a brief resolution note. Once resolved:- The incident state changes to “Resolved”
- The resolution timestamp and note are saved
- If the same underlying issue recurs, a new incident opens automatically — it does not reopen the closed one
Auto-remediation
On the Scale plan, you can configure auto-remediation rules that LumiqPilot applies automatically when an incident of a specific type opens. For example:- “When a GPT-4o error spike incident opens, switch to GPT-4o-mini”
- “When a rate-limit incident opens, reduce sample rate to 50%”