Show simple item record

dc.contributor.advisorKoestler, Devin C
dc.contributor.authorMeier, Richard
dc.date.accessioned2021-06-07T21:40:26Z
dc.date.available2021-06-07T21:40:26Z
dc.date.issued2020-05-31
dc.date.submitted2020
dc.identifier.otherhttp://dissertations.umi.com/ku:16995
dc.identifier.urihttp://hdl.handle.net/1808/31677
dc.description.abstractCompositional data, in which a vector of observed variables are constrained by a sum total, imposes a unique correlation structure among its components. Considering the abundance of components in a biological sample is inherently compositional, that is to say it is constrained by the amount of collected biomass, it is not surprising that compositional data frequently arises in the field of biomedical research. Failing to account for compositional effects compromises statistical inference, and may lead to spurious results that are not reproducible. Development and application of statistical techniques that honor compositionality in the context of biomedical research is therefore of great importance. In this dissertation, we first investigate microbial composition in the pancreatic microbiome (not well characterized prior to this research) and surrounding tissue using a variety of different statistical methods. We identify similarities between tissue types and differences between tissues from subjects with different types of pancreatic cancer and tissues from non-cancer subjects. Identification of microbes commonly found in oral cavities then motivates the question whether consistent patterns of the microbial landscape with respect to disease can be found between the mouth and the gut. Since there is no established method to test for these patterns in microbiome data, we continue by presenting a suitable Bayesian testing framework that is able to address the unique challenges posed by microbial abundance data. We elaborate how the method simultaneously applies to a variety of different data models and different types of estimates of microbial abundance, and demonstrate its ability to detect desired associations via simulation studies. Further, analysis of microbiome profiles derived from gut and oral cavity samples collected from pancreatic cancer cases are used to successfully identify microbes that exhibit consistent patterns of interest. This dissertation closes with methodological developments for the analysis of DNA methylation levels of bulk samples with heterogeneous cell composition. Building on novel modelling approaches that are able to detect cell type specific methylation based on bulk samples, we introduce a Bayesian hierarchical modelling strategy that leverages spatial correlation of proximal CpG dinucleotides. We elaborate how our method was empirically motivated by whole blood methylation data of isolated cell types and demonstrate its performance improvement in terms of prediction accuracy and statistical power compared to non-spatial models.
dc.format.extent161 pages
dc.language.isoen
dc.publisherUniversity of Kansas
dc.rightsCopyright held by the author.
dc.subjectBiostatistics
dc.subjectBayesian
dc.subjectcompositional data
dc.subjectconsistent pattern
dc.subjectDNA methylation
dc.subjectmicrobiome
dc.subjectspatial correlation
dc.titleMethodological Developments for the Analysis of Biological Samples in the Presence of Compositional Effects
dc.typeDissertation
dc.contributor.cmtememberGajewski, Byron J
dc.contributor.cmtememberThompson, Jeffrey
dc.contributor.cmtememberChalise, Prabhakar
dc.contributor.cmtememberMarkiewicz, Mary A
dc.thesis.degreeDisciplineBiostatistics
dc.thesis.degreeLevelPh.D.
dc.identifier.orcidhttps://orcid.org/0000-0003-1062-1444
dc.rights.accessrightsopenAccess


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record