Data Science Core

DATA SCIENCE CORE SPECIFIC AIMS

The overall goal of the AK INBRE 5 Data Science Core (DSC) is to provide resources, services, and training in data science for investigators and students. The DSC is a fundamental component of AK INBRE 5 investment in biomedical research infrastructure and human capacity in Alaska where the program can directly support biomedical research activity. One Health, the continuing, broad-based research theme from AK INBRE 3 and 4, recognizes the developing expertise and interconnections among network investigators in the biomedical, animal, and environmental sciences. The AK INBRE 5 DSC uses the One Health theme and the study of health disparities in Alaska Native people as opportunities to support interdisciplinary and clinical and translational research activities that address Alaska’s needs for improving the well-being of individuals, families, and communities. The DSC supports the AK INBRE 5 program (AK INBRE 5 Overall Specific Aim 3) by providing a) Data Infrastructure; b) Modernized Data Ecosystem; c) Data Management, Analytics, and Tools; d) Workforce Development; and e) Data Stewardship and Sustainability in the AK INBRE network, improving capacity for an inclusive research program that addresses the needs of the broader community. AK INBRE 5 is committed to the coordination and sharing of resources among the NIH infrastructure programs in Alaska and continuing with experienced leadership, project administrators, representation throughout the network, and significant institutional support. Devin Drown, (UAF Bioinformatics Core Contact of AK INBRE 4) will lead the DSC. Drown has significant experience working in computational biology, genomics, and bioinformatics and has been working closely with AK INBRE 5 Program Coordinator, Jason Burkhead. To achieve its goal, the AK INBRE 5 DSC has 2 Specific Aims:

Specific Aim 1. Provide access to programming, data analysis, data management and sharing, and data security to facilitate a comprehensive data science project for biomedical researchers. The DSC will provide data science support through local infrastructure and expertise across the AK INBRE 5 network, as well as using the DSC distributed core model to include expertise missing within our own network through external resources in RAIN and across the IDeA network. The DSC will enhance analysis of a wide variety of data types including but not limited to genomic, small molecule and elemental, and isotopic data. This will include supporting data scientists with expertise in the types of data generated by the facilities in the RAC. The DSC will support enhanced access to HPC resources. THE DSC will emphasize FAIR data practices while also incorporating the complementary CARE principles important for indigenous data.

Specific Aim 2. Support career development and data science training for students and biomedical researchers in the AK INBRE network. The DSC will host outside speakers as well as provide funding for users to gain data science expertise (programming, data analysis, data management, data security, and access to big data and cloud computing). The DSC will support the development of undergraduate curricula (e.g., modules) for existing courses to enhance training in data science. The DSC will build on the expertise in emerging bioinformatics methods from AK INBRE 4 with short format workshops (e.g., codeathons) targeting the next generation of data scientists. These innovative Alaska-based workshops will target data science expertise as well as emerging data science disciplines (e.g., machine learning, artificial intelligence). These training events will focus on specific student learning and utilize the train-the-trainer model.

Execution of these aims is in alignment with the NIH Strategic Plan for Data Science. The DSC will enhance Alaska-based resources, promote new expertise, and support access to resources outside of Alaska where gaps in infrastructure or expertise exist. These activities will maximize NIH and AK institutional investments and provide training essential for data science in biomedical research. Execution will promote best practices in data management, the utilization of biomedical data repositories, and enhanced opportunities for cloud computing. The DSC will collaborate with other INBRE Data Science Cores to provide richer opportunities for training and research.

Dovin Drow

dmdrown@alaska.edu Data Science Core Director University of Alaska Fairbanks