Presentation Title

Kroak: A Metadata Collection System for Longterm Microbial Community Monitoring

Format of Presentation

15-minute lecture to be presented April 1, 2017

Location

IB 1010

Start Date

1-4-2017 2:00 PM

End Date

4-2017 2:15 PM

Abstract

In many studies of microbial communities various datasets, both detailing the microbial communities themselves and their environments, are gathered in an ad hoc fashion without consideration for how this data will be eventually integrated. Many datasets are in fact a collection of disjointed datasets made up of a collection of flat files such comma-separated values (CSV) files or Microsoft Excel files. Prior to analysis of these files, they must be joined into one or more cohesive tables. This process often involves time, consuming manual labor, and provides opportunities for mistakes which could potentially affect the researcher’s scientific conclusions. As a result, few studies go beyond the analysis of more than two dimensions of data. Having all the data in a database allows for integrated analyses. Once the database is created, data analysis is faster as researchers are able to access the data in an organized fashion. Additionally, fewer data handling mistakes are made which reduces the number of analyses which need to be redone. Kroak is a database system for the collection and tagging per-sample microbial community information with industrial environmental metadata to assist in the analysis of how microbial communities influence industrial processes.

Department

Biological Sciences

Faculty Advisor

Jonathan Van Hamme

This document is currently not available here.

Share

COinS
 
Apr 1st, 2:00 PM Apr 1st, 2:15 PM

Kroak: A Metadata Collection System for Longterm Microbial Community Monitoring

IB 1010

In many studies of microbial communities various datasets, both detailing the microbial communities themselves and their environments, are gathered in an ad hoc fashion without consideration for how this data will be eventually integrated. Many datasets are in fact a collection of disjointed datasets made up of a collection of flat files such comma-separated values (CSV) files or Microsoft Excel files. Prior to analysis of these files, they must be joined into one or more cohesive tables. This process often involves time, consuming manual labor, and provides opportunities for mistakes which could potentially affect the researcher’s scientific conclusions. As a result, few studies go beyond the analysis of more than two dimensions of data. Having all the data in a database allows for integrated analyses. Once the database is created, data analysis is faster as researchers are able to access the data in an organized fashion. Additionally, fewer data handling mistakes are made which reduces the number of analyses which need to be redone. Kroak is a database system for the collection and tagging per-sample microbial community information with industrial environmental metadata to assist in the analysis of how microbial communities influence industrial processes.