The Problem: 50 Calls a Day About Lab Results

We started working with TRT (testosterone replacement therapy) clinics last fall. The problem was obvious within the first call:

"Our phones ring constantly. Patients want to know their testosterone levels, estrogen, hematocrit. They call 5, 10, 50 times asking about the same bloodwork."

Front desk staff were drowning. Clinicians were losing focus. Patients were frustrated waiting for callbacks. And every call was someone asking the same question: "When will my labs be ready? What do they mean?"

The calculus was simple: A clinic with 500 active patients might field 1,000+ lab-related calls per month. At $15–25 per patient consult per year, that's $7,500–$12,500 in unnecessary phone labor per clinic.

We thought: what if patients could ask their AI about their bloodwork, trends, and hormone questions 24/7?

The Solution: health-labs.ai

We built health-labs.ai — an AI patient portal that:

  1. Ingests lab PDFs — Clinics upload their patient lab reports (Quest, LabCorp, any format)
  2. Extracts structured data — Parses testosterone, estrogen, hematocrit, liver panels, lipids
  3. Answers patient questions — "Is my testosterone trending up?" "What's a normal hematocrit?" "Should I be worried about my liver?"
  4. Learns clinic protocol — Integrates clinic guidelines so the AI responds within their treatment philosophy
  5. Reduces call volume — Patients get instant answers. Clinics stop bleeding phone labor.
60%Call Reduction
89%Patient Satisfaction
30 daysConcept to Live
ZeroHIPAA Violations

How We Built It: Tech Stack & Decisions

Architecture

We chose a pragmatic stack over hype:

The AI Layer: Why Claude & MCP

We tested three approaches:

  1. Prompt engineering alone — Too brittle. Lab values vary by lab company and units (mg/dL vs. ng/mL). The model would hallucinate normal ranges.
  2. RAG over medical databases — Better but slow. Retrieving generic medical info for every query added 2–3 seconds latency.
  3. Structured context + few-shotWin. We built a "context injection" pattern:
System: You are a patient health assistant for [Clinic Name].
Clinic Protocol:
- Normal testosterone: 600-900 ng/dL (clinic-specific target)
- Hematocrit management: If >50%, reduce dose
Patient Labs (from PDF):
- Testosterone: 725 ng/dL (within range)
- Hematocrit: 52% (slightly elevated)
Question: Should I worry about my hematocrit?

Claude's reasoning mode (thinking + response) is perfect here. The model reasons through the data, applies the clinic's protocol, then gives a safe, anchored answer.

Why Model Context Protocol mattered

We initially built custom integrations for each PDF format. After 20 PDFs, we were debugging vendor-specific quirks. We switched to a unified tool interface (MCP pattern) where:

This cut integration time in half and made onboarding new clinics trivial.

Decision 1: No FHIR, No HL7

Medical data integration is typically FHIR/HL7. We ignored that. Why?

Decision 2: Claude Over Open-Source Models

We tested Llama 2 + self-hosted. It worked. It was 60% cheaper. It failed on edge cases:

Claude's reasoning mode gave us guardrails for free. Cost difference? ~$0.02 per patient query. Worth it for 60% fewer support tickets.

Decision 3: Minimal Patient Auth

We used magic links (no passwords). Why?

Zero signup friction. Works.

Outcome: 60% Fewer Calls, Real Business Impact

One clinic we worked with had 480 active patients, averaging 2.1 lab-related calls per patient per year (1,000+ calls/year). 30 days after launch:

For a clinic, that's ~$8,000/year in reclaimed front-desk labor. Enough to hire one part-time staff or redirect that capacity to patient experience.

What We Learned

1. MVP doesn't mean broken.

We shipped one integrated feature (lab intake + QA) instead of half-built everything. Better to be perfect at one thing.

2. Context is king in medical AI.

Generic Claude is good. Claude with clinic protocol + structured patient data is great. We spend 30% of compute on context prep, not inference.

3. Compliance isn't a blocker, it's a feature.

HIPAA-compliant architectures actually simplify your code (audit logs, encryption at rest, patient consent). We built it in Day 1.

4. Doctors trust what they understand.

We don't hide the AI response logic. We show doctors exactly what data the AI saw and why it answered the way it did. That transparency built trust faster than 10 testimonials.

What's Next

We're onboarding 3 more clinics in Q2. The unit economics work. The technical foundation is solid. The market wants it.

If you're running a healthcare vertical (any specialty, any condition), the playbook is the same: Find where patients ask the same question 100 times. Build an AI to answer it. Watch the efficiency gains compound.

Want the technical details?

Building an AI Lab Portal — The Full Stack Breakdown →

Want to explore AI solutions for your business?

NexGenOS builds production AI systems. We specialize in integrating AI where it matters — healthcare, operations, customer support.