The Precision Health Analytics Platform is a suite of tools, services, and datasets available to researchers across campus. View the complete Analytics Platform User Guide [pdf] and other resources on the PH Analytics Platform Documentation site (UMICH [Level-1] password required).
Schedule a virtual consultation with a Research Scientific Facilitator to learn more about how the Analytics Platform can enhance your research.
| Resource | Resource type | Description | More information (umich password required) |
|---|---|---|---|
| De-identified electronic health record data through Precision Health DataDirect | tool | | Analytics Platform User Guide |
| Increased download capability for patient results | functionality | | Accessing Analytics Environments |
| High-performance compute platform | infrastructure | | Accessing Analytics Environments |
| Yottabyte Research Cloud | infrastructure | | Accessing Analytics Environments |
| Social determinants of health | data | | Accessing Neighborhood-based Socioeconomic Status Data for Patient Cohorts using DataDirect |
| Michigan Medicine/Precision Health COVID surveys | data | | About the COVID-19 survey |
| COVID starting population (predefined and validated cohort) | | | COVID-19 Data via DataDirect |
| Diabetes starting population | data | | |
| Chest X-ray data | data | | Accessing the COVID-19 chest X-ray Dataset on Turbo |
| Michigan Genomics Initiative (MGI) | data | | |
| Preoperative MGI participant surveys | data | | |
| Star allele calls | service | | Michigan Genomics Initiative: Accessing Star Allele Calls |
DataDirect is a self-serve software tool that lets researchers access and explore clinical data from the Michigan Genomics Initiative cohort and the electronic health records (EHR) of more than 4 million unique patients. Researchers may use DataDirect to generate aggregate counts for cohort studies (“Cohort Discovery Mode”) or to analyze de-identified patient health data (“De-Identified Mode”). See the Analytics Platform User Guide for information on DataDirect modes. U-M VPN login is required to access Precision Health DataDirect.
DataDirect is managed by Michigan Medicine’s Data Office for Clinical and Translational Research (DOCTR), which oversees access to several institutionally supported tools and also provides customized datasets in consultation with researchers. The Data Office administers a secure and compliant process for researchers requiring Michigan Medicine data. All users of Precision Health DataDirect are required to complete robust human subjects research training and appropriate data use agreements.
The Precision Health Analytics Platform, using Michigan Medicine Data Office tools and resources, provides access to genetic and clinical data on approximately 80K patients. This includes the ability to link clinical phenotype data to genotype data and facilitation of GWAS analysis.
Researchers can access their data in a secure, virtual, high-compute Linux- or Windows-based environment.
The Armis2 high-performance computing (HPC) environment is composed of task-managing administrative nodes and standard Linux-based two- and four-socket server-class hardware in a secure data center. The nodes are connected by both high-speed Ethernet (1 Gbps) and an InfiniBand network (40/100 Gbps), with a secure parallel file system for temporary data provided by HIPAA-aligned Turbo Research Storage. The two-socket nodes have up to 24 cores and 156 GB of memory. There are also 12 V100 GPUs currently on the cluster, and others can be moved onto it on request.
If you are a new user of Armis2, you will need to create an account by submitting an application form [umich password required]; this form is also accessible via the Armis2 User Guide homepage. On the form, please specify a) the PH-based need for an Armis2 account, and b) the HUM#(s) associated with your data request(s) on DataDirect (without this information, ARC-TS won’t be able to create an Armis2 account). Please allow one business day for your application to be processed. If you already have an Armis2 account, you will need to send an email to email@example.com specifying a) the PH-based need to use your Armis2 account, and b) the HUM#(s) associated with your data request(s) on DataDirect.
Precision Health also has a private set of six condo nodes on Armis2. Each node has eight RTX 2080 Ti GPUs (48 in total) and large volumes of fast local storage, and can see all data and software provided on Armis2. These nodes are optimized for machine learning/AI, computer vision, molecular dynamics, and any other GPU-accelerated workload. Precision Health–affiliated researchers who are interested in using the condo nodes should contact PHDataHelp@umich.edu.
The Yottabyte Research Cloud (YBRC) is a private cloud environment that provides high-performance, secure, and flexible computing environments enabling the analysis of sensitive datasets restricted by federal privacy laws, proprietary access agreements, or confidentiality requirements. The system is built on Yottabyte’s composable, software-defined infrastructure platform and represents U-M’s first use of software-defined infrastructure for research, allowing on-the-fly personalized configuration of any-scale computing resources. This platform allows the creation of any combination of network, CPU, RAM, and storage components into resource groups that can be used to build multi-tenant, multi-site infrastructure as a service.
Research Scientific Facilitators
Precision Health Research Scientific Facilitators are on hand to guide investigators across campus through processes that allow them to assemble datasets in a virtual, HIPAA-compliant server environment. Facilitators help researchers navigate self-serve tools such as DataDirect and EMERSE, find other ways of pulling clinical data (through DOCTR), submit biospecimen inquiries, assemble subject survey data, and more. Facilitators also strive to identify and integrate additional data lakes for centralized use.
Contact the Facilitators at PHDataHelp@umich.edu.
All IRB applications should go through IRBMED and not HSBS.
Types of IRB approval needed by investigators for clinical and/or genetic data:
- Cohort mode provides aggregate counts; no individual-level information is accessible. No IRB application is required.
- De-identified mode, which includes data from more than 4M Michigan Medicine patients, allows users to access individual-level structured patient health data without any HIPAA identifiers. An IRBMED “not regulated” determination is required by the system.
- PHI mode allows users to access and download structured patient health data including PHI. Only PHI types relevant to the specific research are accessible, consistent with an IRBMED approval or “exempt” determination; this only sometimes includes personal identifiers.
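The three modes above and their IRB requirements can be summarized as a small lookup table. The Python sketch below is illustrative only: the names `ModeRequirement`, `REQUIREMENTS`, and `irb_requirement` are hypothetical and are not part of DataDirect or any Precision Health tool. Consult IRBMED or the Data Office for an authoritative determination.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModeRequirement:
    """What a DataDirect mode exposes and what IRBMED outcome it needs."""
    data_level: str
    irb_requirement: str


# Hypothetical summary of the list above; not an official API or policy source.
REQUIREMENTS = {
    "cohort": ModeRequirement(
        data_level="aggregate counts only",
        irb_requirement="none",
    ),
    "de-identified": ModeRequirement(
        data_level="individual-level data without HIPAA identifiers",
        irb_requirement='IRBMED "not regulated" determination',
    ),
    "phi": ModeRequirement(
        data_level="individual-level data including PHI",
        irb_requirement='IRBMED approval or "exempt" determination',
    ),
}


def irb_requirement(mode: str) -> str:
    """Return the IRB requirement for a mode name (case-insensitive)."""
    key = mode.strip().lower()
    if key not in REQUIREMENTS:
        raise ValueError(f"Unknown mode: {mode!r}")
    return REQUIREMENTS[key].irb_requirement


if __name__ == "__main__":
    for name in REQUIREMENTS:
        print(f"{name}: {irb_requirement(name)}")
```

Keeping the mapping in one dictionary means a change in requirements is made in a single place, and an unrecognized mode name fails loudly instead of silently defaulting to "no IRB needed".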
For IRB applications, please reference MGI HUM00071298.
Requests for de-identified data and genomic data on their own are pre-approved by the MGI committee and do not need a specific letter or commitment for submission to the IRB. Biospecimen requests and re-contact of MGI patients require approval from the Precision Health MGI Access Committee.
Contact the Data Office for Clinical & Translational Research (DOCTR) with any IRB-related questions: DataOffice@umich.edu.