Data Core
About the Data Core
The DOMStat Data Core provides a centralized resource for investigators across the research community to store, analyze and share large secondary databases to facilitate research. The Data Core server currently hosts 6 datasets from various investigators.
Team Leaders
Special thanks to: Dr. Carol Mangione, Dr. Dale Abel, Dr. Alan Fogelman.
Services
- Secure virtual servers
- DGIT-maintained
- Located at UCSD Supercomputer Center
- VPN access for internal and external collaborators
- Statistical software
- VPN access setup
- DUA development & renewal support
- Data management plan draft support
- Data reuse support
- IRB submission support
- Faculty-level collaborations
- Analytic and database plan development
- Support for grant applications
- Preliminary data and power calculation
- Database construction
- Variable creation
- Analytic file preparation
- Development of common codebase
- Access to experienced programmers
By hosting widely used resources such as Medicare claims, MarketScan, SEER-Medicare, HCUP, HCAI, and the AHA Governance Survey, the Data Core fosters collaboration across research teams, streamlines data sharing, and supports evidence-driven discoveries.
Data
| Databases | Years | PI |
|---|---|---|
| National Medicare data (100% data) | 2016-2021 (FFS); 2018-2020 (MA) | Yusuke Tsugawa |
| CA Medicare claims data (100% data) | 2015-2019 (FFS) | Anne Coleman |
| Merative MarketScan data (85% sample) | 2013-2022 | Tina Shih |
| HCUP SID&SEDD (12 states) | 2000-2017 | Rie Sakai-Bizmark |
| SEER-Medicare data | 2000-2016 | Tina Shih and Nick McAndrew |
| Health Care Access and Information (HCAI) | 2000-2017 | David Eisenman |
Medicare FFS RIF Data (2016-2021)
Two ways to use the data:
- CMS data reuse: $2,000 fee
- No additional approval needed for projects that fall under the existing DUA ("Patient, physician, health system, and regional factors associated with the quality and cost of care")
| 100% | 20% |
|---|---|
| • MBSF (denominator) | • Carrier file |
| • Inpatient file | • Part D file |
| • Outpatient file | |
| • Home Health file | |
| • Hospice file | |
| • SNF file | |
| • DME file | |
| • Long Term Care MDS | |
| • ACO | |
| • MD-PPAS (physician data) | |
Medicare Advantage RIF Data (2018-2020)
| 100% | 20% |
|---|---|
| • MBSF (denominator) | • Carrier file |
| • Inpatient file | |
| • Outpatient file | |
| • Home Health file | |
| • SNF file | |
CA Medicare FFS RIF Data (2015-2019)
- Only FFS data are available
- Part A (facility) and Part B (carrier) claims
| 100% | 20% |
|---|---|
| • MBSF (denominator) | |
| • Carrier file | |
| • Inpatient file | |
| • Outpatient file | |
SEER-Medicare File
| SEER-Medicare File | Years |
|---|---|
| Cancer File | 2000-2015 |
| 5% Cancer File | 2000-2015 |
| Master Beneficiary Summary File (MBSF) Base A/B/C/D* | 1999-2020 |
| Chronic Conditions Flags | 2002-2016 |
| MedPAR | 2002-2016 |
| Carrier Claims (NCH) | 2002-2016 |
| Outpatient | 2002-2016 |
| Home Health Agency (HHA) | 2002-2016 |
| Hospice | 2002-2016 |
| Durable medical equipment (DME) | 2002-2016 |
| Part D Event (PDE) - with Drug Characteristics File appended | 2007-2016 |
| Hospital Characteristics File | 2002-2016 |
| Geographic - zip code/census tract files (automatically provided) | 1999-2018 |
Merative MarketScan Data
Two sets of databases available:
- Set B: Covers 85% of all MarketScan data enrollees (2013-2019)
- Substance Use Disorder: Restricted to individuals with substance use disorder, and family members who share the health plan (2015-2019)