About the Data Core

The DOMStat Data Core provides a centralized resource for investigators across the research community to store, analyze and share large secondary databases to facilitate research. The Data Core server currently hosts 6 datasets from various investigators.

 

Illustration of DOMStat Data Core with server icons connected by circuit lines on yellow background.

Team Leaders

Dr. Yusuke Tsugawa

Yusuke Tsugawa

Director, Data Core; Associate Professor, Division of General Internal Medicine and Health Services Research; Associate Professor, Department of Health Policy and Management
Maya Fujimura

Maya Fujimura

Senior Public Administration Analyst, Division of General Internal Medicine and Health Services Research
David Elashoff, Director of DoMSTaT

David Elashoff

Director; Professor, Division of General Internal Medicine and Health Services Research; Professor, Department of Biostatistics; Professor, Department of Computational Medicine
Yash Motwani

Yash Motwani

Statistician, Division of General Internal Medicine and Health Services Research
Paige-Ashley Smith

Paige-Ashley Smith

Research Data Analyst, Division of General Internal Medicine and Health Services Research

Special thanks to: Dr. Carol Mangione, Dr. Dale Abel, Dr. Alan Fogelman.

Services

  • Secure virtual servers
    • DGIT-maintained
    • Located at UCSD Supercomputer Center
  • VPN Access for internal & external collaborators
  • Statistical software
  • VPN access setup
  • DUA development & renewal support
  • Data management plan draft support
  • Data reuse support
  • IRB submission support
  • Faculty-level collaborations
  • Analytic and database plan development
  • Support fro grant applications
  • Preliminary data and power circulation
  • Database construction
  • Variable creation
  • Analytic file preparation
  • Development of common codebase
  • Access to experienced programmers

By hosting widely used resources—such as Medicare claims, MarketScan, SEER-Medicare, HCUP, HCAI, and the AHA Governance Survey—the Data Core fosters collaboration across research teams. It also streamlines data sharing and supports impactful, evidence-driven discoveries.

Data

Databases Years PI
National Medicare data (100% data) 2016-2021 (FFS) 2018-2020 (MA) Yusuke Tsugawa
CA Medicare claims data (100% data) 2015-2019 (FFS) Anne Coleman
Merative MarketScan data (85% sample) 2013-2022 Tina Shih
HCUP SID&SEDD (12 states) 2000-2017 Rie Sakai-Bizmark
SEER-Medicare data 2000-2016 Tina Shih and Nick McAndrew
Health Care Access and Information (HCAI) 2000-2017 David Eisenman

Medicare FFS RIF Data (2016-2021)

  • 2 approaches to use data

  1. CMS data reuse: $2,000 fee
  2. No approval for projects that fall under existing DUA ("Patient, physician, health system, and regional factors associated with the quality and cost of care")
100% 20%
• MBSF (denominator) • Carrier file
• Inpatient file • Part D file
• Outpatient file
• Home Health file
• Hospice file
• SNF file
• DME file
• Long Term Care MDS
• ACO
• MD-PPAS (physician data)

Medicare Advantage RIF Data (2018-2020)

100% 20%
• MBSF (denominator) • Carrier file
• Inpatient file
• Outpatient file
• Home Health file
• SNF file

CA Medicare FFS RIF Data (2015-2019)

  • Only FFS data are available
  • Part A (facility) and Part B (carrier) claims
100% 20%
• MBSF (denominator)
• Carrier file
• Inpatient file
• Outpatient file

SEER-Medicare File

SEER-Medicare File Years
Cancer File 2000-2015
5% Cancer File 2000-2015
Master Beneficiary Summary File (MBSF) Base A/B/C/D* 1999-2020
Chronic Conditions Flags 2002-2016
MedPAR 2002-2016
Carrier Claims (NCH) 2002-2016
Outpatient 2002-2016
Home Health Agency (HHA) 2002-2016
Hospice 2002-2016
Durable medical equipment (DME) 2002-2016
Part D Event (PDE) - with Drug Characteristics File appended 2007-2016
Hospital Characteristics File 2002-2016
Geographic - zip code/census tract files (automatically provided) 1999-2018

Merative MarketScan Data

  • Two sets of databases available:

    • Set B: Covers 85% of all MarketScan data enrollees (2013-2019)
    • Substance Use Disorder: Restricted to individuals with substance use disorder, and family members who share the health plan (2015-2019)