Severity & confidence

How Ghost-hunter ranks hypotheses.

Ghost-hunter ranks hypotheses on two axes: severity (how much spend the hypothesis explains) and confidence (how strongly the evidence supports it).

Severity

Level	Meaning
Critical	Explains ≥ 50% of the unexpected spend.
High	Explains 20–50%.
Medium	Explains 5–20%.
Low	Explains < 5% but is anomalous.

Severity is computed from the billing CSV before any LLM call — the LLM does not pick severity.

Confidence

Confidence bars come from Opus and reflect how much corroborating evidence exists. A hypothesis starts at low confidence and rises as proposed commands return supporting output.

Hypothesis 1: BigQuery on-demand scan blew up    [████████░░] 0.82
Hypothesis 2: Egress from us-central1 to EU      [████░░░░░░] 0.41
Hypothesis 3: Idle Cloud SQL replica             [██░░░░░░░░] 0.19

Ghost-hunter stops the loop when one hypothesis crosses 0.85 confidence or when no hypothesis has improved in three consecutive rounds.

Why this matters

The dual axis stops Ghost-hunter from chasing a high-confidence-but-low-severity rabbit hole, and stops it from anchoring on a high-severity-but-unsupported guess. The final report leads with the highest-severity hypothesis above the confidence threshold.