Data Core
About the Data Core
The DOMStat Data Core provides a centralized resource for investigators across the research community to store, analyze and share large secondary databases to facilitate research. The Data Core server currently hosts 6 datasets from various investigators.

Team Leaders





Special thanks to: Dr. Carol Mangione, Dr. Dale Abel, Dr. Alan Fogelman.
Services
- Secure virtual servers
- DGIT-maintained
- Located at UCSD Supercomputer Center
- VPN Access for internal & external collaborators
- Statistical software
- VPN access setup
- DUA development & renewal support
- Data management plan draft support
- Data reuse support
- IRB submission support
- Faculty-level collaborations
- Analytic and database plan development
- Support for grant applications
- Preliminary data and power calculation
- Database construction
- Variable creation
- Analytic file preparation
- Development of common codebase
- Access to experienced programmers
By hosting widely used resources—such as Medicare claims, MarketScan, SEER-Medicare, HCUP, HCAI, and the AHA Governance Survey—the Data Core fosters collaboration across research teams. It also streamlines data sharing and supports impactful, evidence-driven discoveries.
Data
Databases | Years | PI |
---|---|---|
National Medicare data (100% data) | 2016-2021 (FFS); 2018-2020 (MA) | Yusuke Tsugawa |
CA Medicare claims data (100% data) | 2015-2019 (FFS) | Anne Coleman |
Merative MarketScan data (85% sample) | 2013-2022 | Tina Shih |
HCUP SID&SEDD (12 states) | 2000-2017 | Rie Sakai-Bizmark |
SEER-Medicare data | 2000-2016 | Tina Shih and Nick McAndrew |
Health Care Access and Information (HCAI) | 2000-2017 | David Eisenman |
Medicare FFS RIF Data (2016-2021)
Two approaches to using the data:
- CMS data reuse: $2,000 fee
- No approval required for projects that fall under the existing DUA ("Patient, physician, health system, and regional factors associated with the quality and cost of care")
100% | 20% |
---|---|
• MBSF (denominator) | • Carrier file |
• Inpatient file | • Part D file |
• Outpatient file | |
• Home Health file | |
• Hospice file | |
• SNF file | |
• DME file | |
• Long Term Care MDS | |
• ACO | |
• MD-PPAS (physician data) | |
Medicare Advantage RIF Data (2018-2020)
100% | 20% |
---|---|
• MBSF (denominator) | • Carrier file |
• Inpatient file | |
• Outpatient file | |
• Home Health file | |
• SNF file | |
CA Medicare FFS RIF Data (2015-2019)
- Only FFS data are available
- Part A (facility) and Part B (carrier) claims
100% | 20% |
---|---|
• MBSF (denominator) | |
• Carrier file | |
• Inpatient file | |
• Outpatient file | |
SEER-Medicare File
SEER-Medicare File | Years |
---|---|
Cancer File | 2000-2015 |
5% Cancer File | 2000-2015 |
Master Beneficiary Summary File (MBSF) Base A/B/C/D* | 1999-2020 |
Chronic Conditions Flags | 2002-2016 |
MedPAR | 2002-2016 |
Carrier Claims (NCH) | 2002-2016 |
Outpatient | 2002-2016 |
Home Health Agency (HHA) | 2002-2016 |
Hospice | 2002-2016 |
Durable medical equipment (DME) | 2002-2016 |
Part D Event (PDE) - with Drug Characteristics File appended | 2007-2016 |
Hospital Characteristics File | 2002-2016 |
Geographic - zip code/census tract files (automatically provided) | 1999-2018 |
Merative MarketScan Data
Two sets of databases available:
- Set B: Covers 85% of all MarketScan data enrollees (2013-2019)
- Substance Use Disorder: Restricted to individuals with substance use disorder, and family members who share the health plan (2015-2019)