Data Sources

Data Sources

All of Us participants contribute to the program in many ways, such as by responding to surveys, sharing electronic health records, and providing biosamples. The All of Us Data and Research Center curates and validates data derived from these sources. All direct identifiers are removed before the data are made available for research.

Summaries of data contributed by participants can be found in the Data Browser. Registered researchers can access individual-level data through the Researcher Workbench. As the program grows, additional data types and sources will become available. View the data roadmap to learn more.

Electronic Health Records

All participants are invited to share their electronic health records (EHR) with the program. We use the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) to standardize all EHR data. Learn more about the OMOP CDM on the Data Methods page.

Biosamples And Bioassays (Genomics)

Most participants contribute blood, saliva, and/or urine samples that can be used for a variety of bioassays. DNA extracted from samples is sent to our genome centers for genomic analysis, including short-read whole genome sequencing, long-read whole genome sequencing, and genome-wide genotyping to call SNP and Indel variants and structural variants. In addition, a select subset of participant blood serum samples was analyzed for COVID-19 antibody testing in 2020.

Surveys

All participants are invited to complete surveys periodically. Current surveys include questions about participants’ identities and backgrounds, overall health, lifestyles, medical histories, health care access, experiences with COVID-19, and more. Survey response data can be found in the Data Browser and question source information can be found in the Survey Explorer. In addition, some external survey data are available in the Researcher Workbench and can be linked with All of Us data.

Physical Measurements

The program collects physical measurements from three possible sources: EHRs, participant-provided (self-reported) height and weight measurements, and/or an in-person visit for the collection of baseline physical measurements (“program physical measurements”) by trained staff at All of Us Research Program partner organizations. Data from these visits include measurements of height, weight, waist circumference, hip circumference, blood pressure, and heart rate. Program staff also note participants’ pregnancy status and wheelchair use at any visits.

Wearable Devices (Digital Health)

All of Us participants can contribute information from their wearable health tracking devices, such as Fitbit devices, which track biometric data including heart rate, physical activity, and sleep. Fitbit data also include device data, such as the type of Fitbit device, approximate battery level, and more.

Data Roadmap

In the coming years, All of Us will enroll more participants and make more types of data available, as funding allows.
Data availability and access timelines are estimates and subject to change.

*Previously two separate surveys, Personal Medical History and Family Health History
**Additional information on available genomic data can be found in the Genomic Quality Report and the Short-Read Structural Variant Quality Report