Clinical Decision Support

Executive Summary

Clinical Decision Support (CDS) is the technology category that delivers the right information to the right person at the right time to improve clinical decisions and patient outcomes. It is also the category with the longest history in healthcare informatics, the most studied failure mode (alert fatigue), and the most complex regulatory classification picture of any healthcare AI category. Modern LLM-based CDS extends classical rule-based CDS into natural language interaction, nuanced multi-factor reasoning, and proactive identification — but it inherits all the challenges of classical CDS plus new ones. This chapter covers CDS architecture, the alert fatigue problem and its engineering mitigations, the clinical workflow integration patterns that determine whether CDS is used or ignored, and the role of LLMs in extending classical CDS.

Learning Objectives

After reading this chapter, you will be able to:

Design a CDS system architecture that distinguishes rule-based, retrieval-augmented, and LLM-based CDS and routes queries appropriately
Identify the alert fatigue failure mode and apply specificity, severity stratification, and workflow integration design principles to reduce it
Implement a CDS Hooks service that delivers CDS at the appropriate workflow trigger with an appropriate response time
Evaluate CDS quality using override rate, action rate, and patient outcome metrics

Business Problem

CDS in healthcare is simultaneously one of the highest-value and most poorly executed capabilities in clinical AI. The value case is clear: drug-drug interaction alerts prevent adverse drug events; sepsis early warning systems enable earlier intervention; care gap reminders close preventive care gaps that increase population health costs. The execution problem is equally clear: a seminal study found that emergency physicians overrode 91% of drug-drug interaction alerts in a major academic medical center. When clinicians override 9 of 10 alerts, the alert system is not a decision support tool — it is an interruption that the organization has learned to ignore.

The engineering challenge of CDS is not producing alerts. It is producing alerts that are specific enough that the 9 overridden alerts were genuinely non-actionable, while ensuring that the 1 actionable alert is seen and acted upon. This is a precision problem, not a recall problem — and it requires architectural choices about alert thresholds, severity stratification, workflow placement, and clinical context that go far beyond adding a rule engine to the EHR.

Why This Technology Exists

Rule-based CDS systems have existed since the 1970s. MYCIN (1974), developed at Stanford, was an expert system for diagnosing bacterial infections — the ancestor of modern CDS. Commercial CDS content (drug-drug interaction databases, clinical order sets, dosing calculators) became standard EHR components in the 1990s and 2000s.

The limitation of rule-based CDS is that rules cannot express clinical nuance: a drug-drug interaction alert fires based on a drug pair without knowing the patient's renal function, the indication for the drug, the dose, or whether the prescribing physician is aware of the interaction and managing it. This context-blindness is the root cause of alert fatigue — the alert system fires for interactions that an informed clinician would judge acceptable, creating alert noise that trains clinicians to dismiss alerts without reading them.

LLMs enter CDS at the nuance layer: they can reason about whether a drug-drug interaction is clinically significant given the patient's specific context, they can synthesize multi-factor risk scores with explanatory text, and they can interact with clinicians in natural language to clarify the clinical reasoning behind a recommendation.

Conceptual Explanation

CDS Intervention Types

CDS is not a single technology; it encompasses a spectrum of intervention types that vary in their timing, intrusiveness, and required clinical action:

Intervention Type	Timing	Intrusiveness	Example
Alerting / Notification	Real-time, workflow-blocking	High	Drug allergy alert at order sign
Advisory	Real-time, non-blocking	Medium	Sepsis score display on patient chart
Order facilitation	Triggered, non-blocking	Low	Pre-populated order sets
Relevant information	On-demand	None	Knowledge retrieval (clinical RAG)
Expert systems	Asynchronous, proactive	None	Care gap identification

The Alert Fatigue Problem

Alert fatigue is the state in which clinicians systematically dismiss CDS alerts without reading them because past experience has taught them that most alerts are not actionable for their specific patient and situation. It is a conditioned behavior, not a character flaw — it is the rational response to a high-noise, low-signal alert environment.

Alert fatigue has two components:

Notification fatigue: Too many alerts interrupt clinical workflow
Relevance fatigue: Too many alerts are not relevant to the specific patient's situation

The engineering solutions address both:

Threshold calibration: Raise alert thresholds to fire only for interactions/situations above a clinical significance threshold
Context enrichment: Add patient context to the alert logic (renal function, existing medications, indication) to filter out alerts that are non-actionable given the patient's situation
Severity stratification: Distinguish truly dangerous alerts (which warrant workflow interruption) from informational alerts (which can be asynchronous)
Actionability design: Every alert must offer a specific action the clinician can take; alerts without an actionable response are noise

Core Architecture

graph TD subgraph "CDS Triggers" T1["Order Sign\norder-sign hook"] T2["Patient Chart Open\npatient-view hook"] T3["Encounter Discharge\nencounter-discharge hook"] T4["Background Process\nScheduled evaluation"] end subgraph "CDS Engine" RE["Rule Engine\nDrug interactions\nAllergy checks\nDosing limits"] ML["ML Models\nSepsis scores\nReadmission risk\nDeterioraction prediction"] RAG["Clinical RAG\nGuideline recommendations\nPrior auth criteria"] LLM["LLM Reasoning\nMulti-factor context\nNuanced recommendations"] end subgraph "Severity + Specificity Layer" SC["Severity Classifier\nCritical / Warning / Info"] CF["Context Filter\nPatient-specific relevance check"] TH["Threshold Guard\nFire only above specificity threshold"] end subgraph "Delivery Layer" BK["Blocking Alert\nCritical safety — stops workflow"] NB["Non-blocking Card\nWarning — visible, dismissible"] AS["Async Notification\nInfo — delivered outside workflow"] DB["Dashboard Widget\nProactive — no trigger required"] end T1 --> RE & ML T2 --> ML & RAG T3 --> RAG & LLM T4 --> ML & RAG RE --> SC ML --> SC RAG --> CF LLM --> CF SC --> TH CF --> TH TH -->|Critical| BK TH -->|Warning| NB TH -->|Info| AS T4 --> DB

Enterprise Considerations

CDS Governance Committee: Every new CDS alert requires a clinical governance review before deployment: Who should this alert fire for? What severity is appropriate? What is the expected override rate? Who will monitor performance? Organizations that deploy CDS alerts without governance review accumulate alert debt — a large catalog of poorly calibrated alerts that collectively produce the alert fatigue state.

Alert Override Documentation: When a clinician overrides a CDS alert, capturing the override reason (structured selection, not free text) creates a dataset for alert calibration. Override reasons cluster: if 70% of overrides for a drug interaction alert select "Patient already on this regimen — clinician aware," the alert is firing for a population that is not the intended target and the threshold needs adjustment.

CDS Performance Monitoring: Measure: alert fire rate, override rate, action rate (clinician took the suggested action), and (where measurable) patient outcome correlation. Alert fire rate > 10 per provider per shift is a fatigue risk. Override rate > 70% indicates insufficient specificity. Action rate < 10% indicates that the suggested action is not relevant to the alert population.

Healthcare Example

⊕ Healthcare Example

Educational Example — Illustrative Workflow. Not intended for clinical decision making.

The Reference Healthcare Organization deploys a sepsis early warning CDS system integrated with the nursing workflow:

Architecture:

An ML model trained on historical HMS patient data calculates a modified Early Warning Score (NEWS2) every 15 minutes for all inpatients
Scores are displayed on the nursing station dashboard (non-interrupting, advisory)
When a patient's score crosses 7 (clinical threshold validated against HMS patient population), an alert is generated and delivered to the assigned nurse's workstation

Alert design:

Severity: Warning (not Critical — the score does not confirm sepsis, it identifies patients requiring assessment)
Display: Non-blocking card in the nursing EHR workflow with three action options: "Assess patient now," "Physician notified," "Score inconsistent — document reason"
No alert is delivered for scores 5–6 (elevated but sub-threshold) — these appear on the dashboard only

Override rate monitoring:

Month 1 post-deployment: override rate 41% (too high)
Root cause analysis: 38% of overrides selected "Patient recently assessed — no change"
Adjustment: suppress alert if nurse accessed patient chart within 60 minutes
Month 3: override rate 22% (acceptable)
Action rate (nurse assessed and documented): 74% of fires

Common Mistakes

Building Blocking Alerts for Non-Critical Situations. Workflow-blocking alerts that require acknowledgment before the clinician can proceed are justified only for genuinely dangerous situations. Organizations that implement blocking alerts for "warning" level interactions produce exactly the dismissal behavior that causes alert fatigue — and train clinicians to acknowledge-without-reading as fast as possible.

No Alert Review Process. CDS alert libraries that are never reviewed accumulate obsolete alerts, miscalibrated thresholds, and retired rules that continue to fire. Establish an annual alert review process that evaluates every active alert for current clinical evidence, override rate, and action rate.

Delivering All Alerts Through One Channel. An alert that requires a decision must be delivered at the point of decision (order sign, discharge); an informational observation does not need to interrupt the workflow. Mixing all alert severity levels into the same workflow interruption channel produces the signal-to-noise ratio that causes fatigue.

Best Practices

Restrict blocking alerts to genuinely dangerous situations; use non-blocking cards for warnings
Capture structured override reasons — they are the primary calibration signal for alert improvement
Monitor override rate and action rate continuously; investigate deviations from targets promptly
Review the full alert catalog annually; retire alerts that have consistently high override rates and low action rates
Use LLM reasoning for nuanced context assessment only when rule-based checks have found something worth reasoning about — not as the first-line check for every order
Govern every new CDS alert through a clinical review process before deployment

Trade-offs

CDS Design	Alert Specificity	Alert Volume	Implementation Complexity	Alert Fatigue Risk
Rule-based only	Low-Medium	High	Low	High
Rule + context filtering	Medium	Medium	Medium	Medium
Rule + ML severity scoring	High	Low-Medium	Medium-High	Low
Rule + ML + LLM nuance	Highest	Lowest	High	Lowest

Interview Questions

Category: System Design / Clinical Informatics Difficulty: Senior Role: AI Architect / FDE

Answer Framework:

An 85% override rate is a textbook alert fatigue indicator. The alerts are firing but not being read — clinicians have learned that most alerts for their specific patient population are non-actionable, so the rational response is to dismiss them quickly and continue the workflow.

The diagnosis has three possible root causes, each with different remedies:

Root cause 1 — Threshold too low: The alert fires for drug interactions that are clinically significant in theory but rarely clinically significant in practice given the hospital's patient population. Remedy: analyze the override data — what reason do clinicians select when they override? If the dominant reason is "not clinically significant for this patient," the threshold needs calibration against the actual patient population.

Root cause 2 — Missing patient context: The alert fires without considering patient context (renal function, dose, indication, current monitoring). A drug interaction that requires dose adjustment in renal impairment should not fire the same alert for a patient with normal renal function. Remedy: add patient-context filters to the rule logic before the alert fires.

Root cause 3 — Alert catalog too broad: The alert catalog contains too many interaction pairs, including low-severity interactions that trained clinicians would manage without an alert. Remedy: audit the alert catalog against override rates by alert type; retire alerts with > 80% override rate and limited evidence of clinical consequence.

The recommendation: run a 90-day alert performance analysis with structured override reason capture (if not already in place), then prioritize root cause remediation based on the override reason distribution. Do not add new alerts until the existing catalog is calibrated.

Key Points to Hit:

85% override = alert fatigue, not clinical disagreement with legitimate alerts
Three root causes: threshold, context, catalog breadth
Override reason capture is the diagnostic tool — without structured reasons, calibration is guesswork
Retire before adding — the catalog is already over-alerting

Key Takeaways

CDS spans a spectrum from deterministic rule-based alerts to nuanced LLM reasoning — use the simplest appropriate mechanism for each alert type
Alert fatigue is the primary failure mode of clinical CDS; it is caused by low specificity (alerts fire when non-actionable) not by high volume per se
Severity stratification is the most important CDS design decision: blocking alerts must be rare, reserved for genuinely dangerous situations
Override reason capture is the primary calibration signal; alert review should be a continuous process, not a one-time deployment activity
LLMs add value to CDS at the nuance layer — reasoning about whether a rule-triggered alert is clinically significant given the specific patient's context

Clinical Decision Support#

Executive Summary#

Learning Objectives#

Business Problem#

Why This Technology Exists#

Conceptual Explanation#

Core Architecture#

Enterprise Considerations#

Healthcare Example#

Common Mistakes#

Best Practices#

Trade-offs#

Interview Questions#

Q: A hospital's CMO tells you that clinicians are overriding 85% of drug-drug interaction alerts in the EHR. What is your diagnosis and what would you recommend?#

Key Takeaways#

Further Reading#

Clinical Decision Support

Executive Summary

Learning Objectives

Business Problem

Why This Technology Exists

Conceptual Explanation

Core Architecture

Enterprise Considerations

Healthcare Example

Common Mistakes

Best Practices

Trade-offs

Interview Questions

Q: A hospital's CMO tells you that clinicians are overriding 85% of drug-drug interaction alerts in the EHR. What is your diagnosis and what would you recommend?

Key Takeaways

Further Reading