All posts
·9 min read

FDA Real-Time Trial Data Pilot: What Sponsors Should Do

FDA's real-time clinical trial data pilot puts reviewers inside the data stream mid-trial. What it means for Phase 2-3 sponsors and AI monitoring.

Rebecca Lauren
Rebecca Lauren
Head of Regulatory Intelligence
FDA Real-Time Trial Data Pilot: What Sponsors Should Do

FDA quietly opened a real-time clinical trial data pilot that puts agency reviewers inside the data stream while a trial is still running. Most of the reaction I've seen treats it as a process tweak. It isn't.

The pilot is structural. It changes when sponsors get FDA signal, what trial infrastructure has to be in place to participate, and whether being early on the list is an advantage or an exposure. Nobody fully knows what the program will look like at scale — it's a pilot, FDA hasn't named participants — but the direction is clear enough that Phase 2-3 sponsors should make decisions about it now.

What actually changed

Until now, FDA sat at the end of a long line. A sponsor ran a trial, locked the database, cleaned the SAS datasets, wrote the CSR, packaged the M5 deliverables, and shipped the whole thing to the agency. Reviewers opened a finished box.

The pilot moves the reviewers inside the trial while it runs. Scientific reviewers see data as it accumulates. Not the cleaned, locked, narrative-wrapped version. The live stream.

That's a structural shift, not a workflow tweak. The boundary between "active trial" and "regulator" used to be a wall. Now it's a window.

What we don't know (and what FDA hasn't said)

The Endpoints reporting is clear about the gaps. FDA hasn't published the participating sponsor list. It hasn't specified which data types are in scope — adverse events only, efficacy endpoints, biomarker data, all of the above. It hasn't said whether reviewers can act on what they see mid-trial, or only observe and store.

Feasibility at scale is unproven. A pilot with a handful of trials behaves differently than a program touching 400 active Phase 2-3 studies. Don't read this as established FDA policy. Read it as a directional bet.

Why FDA wants this

I'll give you the case for it from FDA's perspective, because it's not crazy.

Half the post-submission cycle is FDA discovering things in a CSR that could have been flagged six months earlier. Adverse event patterns. Protocol deviations clustered at one site. An endpoint that drifted because the assay drifted. By the time it surfaces in review, the trial's done. The sponsor's spent the money. The patients are already exposed.

Real-time visibility compresses that. If a reviewer can see a signal at week 24 instead of week 104 (after database lock, after CSR drafting, after a submission package is filed, after the PDUFA review goal date is set), the conversation about what to do is cheaper for everyone. Including the patients still being randomized.

It also fits the direction FDA has been moving for years. 21 CFR 312 already gives FDA authority over IND conduct during a trial. ICH E6(R3) — the revised GCP guideline reaching Step 4 in early 2025 — explicitly assumes a world where trial data flows electronically and quality is built in, not inspected in afterward. The Real-World Evidence framework and the Complex Innovative Trial Design program both pull in the same direction. None of those say "stream us your eCRF data live." But none of them rule it out, either.

What you'd actually need to participate

This is where I think most sponsors haven't done the math. Real-time data access isn't a button you flip. It's an infrastructure question.

You need:

  • An EDC that can produce a continuous, queryable export — not a once-a-month SAS dump
  • A clean data model. If your investigator at site 14 enters AEs as free text and your central reviewer manually MedDRA-codes them three weeks later, your "real-time" stream is real-time garbage
  • A standing data review process at the sponsor side that runs at least as fast as FDA's view. If FDA flags something on Tuesday and you find out on Friday from an email, that's bad for you
  • Defined site SOPs for data entry latency. "Within 7 days of visit" is industry-standard. For a real-time pilot it might need to be 24-48 hours
  • Pre-specified rules about what triggers an alert. Otherwise reviewers see noise and ping you about transient artifacts

CDISC SDTM (the standardized data tabulation format FDA expects in submissions) helps — if your data already lands in SDTM, the streaming question is mostly plumbing. And a clean clinical safety summary discipline before you start is half the work, because if you can't summarize your own safety data without drama, you can't stream it without drama either.

The hard question: advantage or exposure?

The question I keep getting from regulatory leads on Phase 2-3 programs is whether real-time FDA visibility is good or bad for sponsors.

Honest answer: depends who you are.

If you're confident in your data quality, this is an advantage. You get FDA aligned on signal interpretation before lock. You catch a controlled efficacy study issue at week 30 instead of in a Complete Response Letter eighteen months later. You shorten the post-submission cycle. You may even get earlier signal to stop a trial that's not going to make it, which saves money and patients both.

If your data quality is shaky, this is exposure. Every imputation, every missing CRF page, every protocol deviation lives under FDA's eyes in something close to real-time. You don't get to clean it up in a CSR narrative. The agency sees the mess.

The pilot will select for the first kind of sponsor. I expect the early participants will be groups with strong CROs, mature data ops, and a competitive reason to want fast FDA alignment — accelerated approval candidates, breakthrough therapy holders, designs that already have FDA's attention.

What this means for AI trial-monitoring vendors

This pilot is good news for AI monitoring vendors in a specific, narrow way. Once FDA expects continuous data, sponsors need their own continuous monitoring layer that runs at least one step ahead of the agency. Nobody wants FDA to see a signal first.

That's a real product opportunity. Centralized statistical monitoring (CSM, the risk-based monitoring approach E6(R3) explicitly endorses), anomaly detection at the site level, drift detection on biomarker assays, predicted-vs-observed analyses on enrollment and dropout — those tools become required infrastructure, not optional.

What it isn't: a green light to throw an LLM at trial data and call it monitoring. FDA scrutiny on AI-assisted clinical decision tools is rising in parallel with this pilot. If your vendor can't tell you what model version made what call on what data on what date, you don't have a monitoring system. You have a liability.

What I'd do this quarter

If I were sitting at a Phase 2 or Phase 3 sponsor right now, with no public information about who's in the pilot, I'd do three things.

First, I'd map my current data flow. Honestly. From investigator entry to sponsor-visible record, what's the latency? In hours, not "we try to keep it tight." If the answer is "two to four weeks for some sites," that's where I'd start.

Second, I'd schedule a Type C meeting if I had a major Phase 2 readout coming. The Type C meeting request is the right vehicle to ask FDA directly: what's your current expectation on data sharing cadence for our program? Don't volunteer for the pilot blind. Get FDA's posture first.

Third, I'd audit my efficacy and safety reporting infrastructure against the assumption that someone outside my company might read it before I do. If that audit makes me uncomfortable, that's the problem to fix — pilot or no pilot.

The longer view

Whether this specific pilot scales or quietly fades, the direction is set. FDA has been telling the industry for a decade that the post-hoc submission model is too slow, too expensive, and too late. E6(R3) is built on that premise. The RWE framework is built on that premise. Decentralized trials are built on that premise.

When the wall between trial and regulator becomes a window, the sponsors who win are the ones who'd already been working as if the window existed. Clean data, fast review cycles, defensible AI tools where AI is used, and a regulatory team that can answer FDA questions in days instead of months.

Regfo is a preclinical-to-IND compliance engine — that's the work I lead at the company. The pilot is downstream of where Regfo runs today. But the underlying ask is the same as ours: read your own document the way the regulator will read it, before they get to it. If you want to see what that kind of structured check looks like, the fastest path is to start a workspace and paste a study report or protocol synopsis you have on file. The mechanics translate.

I'd rather find out on Tuesday than in a CRL.

FAQ

What is FDA's real-time clinical trial data pilot?

It's a program that gives FDA scientific reviewers direct access to clinical trial data as it accumulates during the trial, instead of waiting for the sponsor to lock the database, write the CSR, and file a finished submission package. The pilot puts the reviewer inside the data stream while the trial is still running.

Is the pilot mandatory for Phase 2-3 sponsors?

No. It is a voluntary pilot at this stage. FDA has not published the participating sponsor list, has not specified mandatory data types, and has not signaled program scale. Treat it as a directional bet by the agency, not established policy. Sponsors who want to participate should expect their data infrastructure and review cadence to be the deciding factors.

How does this interact with ICH E6(R3) and GCP requirements?

ICH E6(R3) — the revised GCP guideline reaching Step 4 in early 2025 — assumes a world where trial data flows electronically and quality is built in rather than inspected in afterward. The pilot is consistent with that direction. Sponsors already implementing centralized statistical monitoring under E6(R3) will be better positioned to stream data to FDA without adding new infrastructure.

Is real-time FDA visibility an advantage or a risk for sponsors?

It depends on data quality. Sponsors with mature CROs, clean SDTM data, fast site-to-EDC latency, and disciplined safety monitoring should see an advantage — earlier alignment with FDA, shorter post-submission cycles, fewer Complete Response Letters tied to data issues. Sponsors with shaky data quality face exposure: every imputation, missing CRF page, and protocol deviation lives under FDA's eyes in close to real time, with no chance to clean it up in a CSR narrative first.

Check your studies against 1,054 rules

Upload your preclinical reports. Get a compliance score in 3 minutes.

Try Regfo free