Real-World Evidence Sources for Drug Safety: Registries and Claims Data

Real-World Evidence Sources for Drug Safety: Registries and Claims Data
Axton Ledgerwood 14 January 2026 0 Comments

Drug Safety Data Source Calculator

Compare Real-World Evidence Sources

Calculate which data source (patient registries or claims data) is more effective for detecting specific drug safety signals based on population size and event rarity.

Number of patients in your study population
How common is the side effect? (e.g., 10000 = 1 in 10,000 patients)
%
For registry data (87% is typical)
How this works: Uses data from the article: Registries require 500k patients for rare events (vs. 1M+ for claims data) with 87% data completeness vs. 52% for claims.

When a new drug hits the market, the clinical trial data tells you only part of the story. Trials involve thousands of patients-sometimes just a few hundred-and run for months, not years. But what happens when millions of people start taking that drug over decades? That’s where real-world evidence comes in. For drug safety, two of the most powerful sources are patient registries and claims data. Together, they help regulators, doctors, and pharmaceutical companies spot dangers that clinical trials missed.

What Real-World Evidence Actually Means

Real-world evidence (RWE) isn’t theoretical. It’s real data pulled from everyday healthcare systems. Think of it as the digital footprint of patient care: doctor visits, prescriptions filled, hospital stays, lab results, and even patient-reported symptoms. Unlike clinical trials, which control every variable, RWE captures how drugs behave in messy, real life-with all the comorbidities, different dosages, and lifestyle factors that trials can’t replicate.

The U.S. Food and Drug Administration (FDA) started formally recognizing RWE in 2018. Since then, it’s been used in over a dozen drug approvals. In 2022 alone, the FDA reviewed 107 RWE submissions-up from just 29 in 2018. That’s not a trend. It’s a transformation.

How Patient Registries Work

Patient registries are like detailed medical diaries for specific groups. They’re built around diseases (like cystic fibrosis or Parkinson’s) or specific drugs (like a new cancer therapy). Every entry includes more than just a diagnosis code. Registries track lab values, imaging results, side effects reported by patients, and even how well someone can walk or breathe over time.

Take the Cystic Fibrosis Foundation Patient Registry. It’s tracked over 30,000 patients since the 1960s. When the drug ivacaftor was approved, clinical trials showed promise-but the registry spotted something else: patients with rare CFTR mutations had dramatic improvements. That detail never showed up in trials because those mutations were too uncommon. Registries found it.

Registries aren’t perfect. They’re expensive to run. Setting one up costs between $1.2 million and $2.5 million and takes 18 to 24 months. Only 60-80% of eligible patients enroll, which can skew results. And about 35% of academic registries shut down within five years due to funding gaps.

But their data quality? Unmatched. Registries capture 87% of lab values, compared to just 52% in claims data. For rare side effects-like a sudden heart rhythm change in 1 in 10,000 patients-registries need half the population size to detect the signal. That’s why they’re the gold standard for specialized populations.

Claims Data: The Power of Scale

Claims data is the opposite of registries: less detail, but way more people. It’s what insurers and government programs like Medicare collect every time a doctor bills for a service. It includes diagnosis codes (ICD-10), procedures (CPT), and drug prescriptions (NDC codes). It doesn’t tell you someone’s blood pressure or whether they felt dizzy-but it does tell you they went to the ER three times in six months after starting a new medication.

Medicare claims data alone covers 65 million Americans. Commercial databases like IBM MarketScan track over 200 million lives. That scale lets researchers spot rare events. In 2015, the FDA analyzed 1.2 million Medicare records to check if a Parkinson’s drug, entacapone, increased heart risks. No link found. In 2014, they used 850,000 records to assess olmesartan’s cardiovascular risks in diabetics. Again, no red flags.

Claims data’s biggest strength is time. Medicare records go back 15+ years. Clinical trials rarely last longer than five. That’s why claims data is critical for spotting long-term risks-like bone fractures from osteoporosis drugs or liver damage from cholesterol meds that only show up after years of use.

But it has blind spots. Only 45-60% of lab results are captured. Patient-reported symptoms? Almost never. Diagnosis codes are wrong 15-20% of the time, according to the Agency for Healthcare Research and Quality. And without clinical context, a spike in ER visits could mean anything: a side effect, a bad fall, or just a patient who hates their doctor.

Doctor using a magnifying glass over a U.S. map with claims data dots and registry vitals above.

Registries vs. Claims Data: The Trade-Offs

Here’s the reality: neither source is enough on its own.

| Feature | Registries | Claims Data | |---------|------------|-------------| | Population size | 1,000-50,000 patients | 100 million+ lives | | Clinical detail | High (labs, imaging, outcomes) | Low (mostly codes and billing) | | Data completeness (labs) | 87% | 52% | | Longitudinal coverage | 5-10 years (on average) | 15+ years (Medicare) | | Cost to maintain | $300K-$600K/year | Lower (uses existing billing systems) | | Detection power for rare events | 500,000 patients needed | 1 million+ patients needed | | Bias risk | Selection bias (voluntary enrollment) | Coding errors, missing symptoms | Registries are like a high-resolution MRI. You see every detail-but only of a small slice of the brain. Claims data is like a satellite image. You see the whole country, but you can’t tell if someone’s coughing or sweating.

The FDA’s 2023 guidance says the best approach is combining both. A 2021 FDA methodology paper found that using both sources cuts false safety signals by 40%. For example, when the drug palbociclib got a new approval in 2019, the FDA didn’t rely on one source. They used claims data to find who was taking it, then checked registries to confirm whether those patients had the right cancer type and were getting proper monitoring.

Why This Matters for Drug Safety

Drug safety isn’t just about preventing bad outcomes. It’s about avoiding overreactions. In 2022, a Yale study found that 22% of safety signals flagged by claims data alone turned out to be false positives. One case: a spike in kidney reports after a new blood pressure drug launched. Claims data showed a 30% increase. But when researchers checked registries, they found the patients were older, sicker, and already on multiple kidney-affecting drugs. The real culprit? Polypharmacy-not the new drug.

Without registries, regulators might have pulled the drug. With both sources, they kept it on the market-and added clearer warnings for elderly patients.

That’s the power of triangulation. Registries explain the “why.” Claims data shows the “how many.” Together, they turn noise into actionable insight.

Balanced scale showing registries and claims data with AI tools connecting them in flat design.

What’s Changing Right Now

The landscape is evolving fast. In January 2024, the FDA released draft guidance requiring registries to hit 80% data completeness for key variables to be accepted for safety reviews. That’s a big step. It means half-baked registries won’t cut it anymore.

The European Medicines Agency’s Darwin EU network now connects 32 databases across 15 countries, covering 120 million people. That’s the largest real-world safety network in the world.

New tech is helping too. Novartis is testing wearable sensors that track heart rate and activity levels in heart failure patients on Entresto. That data is being merged with claims records. AI tools are now scanning millions of records to flag patterns humans might miss-cutting false alarms by 28% in a 2024 JAMA study.

The FDA’s REAL program is launching standardized registries for 20 priority diseases by 2026. The first focus? Rare diseases. Why? Because traditional trials can’t recruit enough patients. Registries are the only way to monitor safety in these groups.

What You Need to Know

If you’re a patient: know that your prescriptions and doctor visits are being used to make drugs safer. That’s not a flaw-it’s a feature.

If you’re a provider: accurate coding matters. A wrong ICD-10 code can send a false signal across a national database.

If you’re in pharma or policy: invest in both data types. Don’t pick one over the other. Use registries for depth, claims for breadth. And always validate signals with clinical review.

The era of relying only on clinical trials for drug safety is over. The future belongs to data that reflects real life-and the smartest systems are the ones that blend the precision of registries with the scale of claims data.