• Media type: Doctoral Thesis; Electronic Thesis; E-Book
  • Title: Split Analysis Methods and Parametric Bootstrapping in Molecular Phylogenetics : Taking a closer look at model adequacy
  • Contributor: Meid, Sandra A. [Author]
  • Published: Universitäts- und Landesbibliothek Bonn, 2015-06-08
  • Language: English
  • DOI: https://doi.org/20.500.11811/6474
  • Keywords: model adequacy ; Modellmissspezifikation ; phylogenetische Sequenzanalysen ; model-based tree reconstruction ; Modelladäquatheit ; bioinformatics ; split analysis ; Maximum Likelihood ; Methoden der Bioinformatik ; Split-Analysen ; molekulare Phylogenetik ; modellbasierte Stammbaumrekonstruktion ; phylogenetic sequence analysis ; Goldman-Cox-Test ; molecular phylogenetics ; model misspecification
  • Origination:
  • Footnote: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.
  • Description: Even though the size of datasets in molecular analyses increased rapidly during the last years, undetected systematic errors as well as unsolved problems concerning the evaluation of data quality and adequate substitution model selection still persist. This not only hampers the correct analysis of these datasets but leads to undetectable effects in phylogenetic tree reconstruction. Model-based tree reconstruction methods like maximum likelihood estimation and Bayesian inference have become the methods of choice for reconstruction of phylogenetic trees. Although maximum likelihood methods are known to be consistent if all necessary conditions are met, it depends strongly on the quality of the multiple sequence alignment and the ability of the chosen evolutionary model to reflect the underlying historical processes. This thesis addresses the assessment of model adequacy of estimated evolutionary models to multiple sequence alignments in the light of parametric bootstrapping and aims to find new methods for detection of model misspecifications with the help of split analyses. The second chapter focuses on the influence of the number of gamma rate categories used in modelling among-site rate variation when trying to assess model adequacy using an absolute goodness-of-fit test. The analyses of simulated alignments show that the Goldmann-Cox test rejects models which were only approximated by four discrete gamma rate categories for various tree shapes and branch length setups, if they were simulated with a continuous gamma distribution. Increasing the number of discrete rate categories leads to an acceptance of model adequacy for stationary datasets and a correct detection of non-stationarity and inhomogenetity in simulated data. The results illustrate that the application of the proposed Goldmann-Cox test to evaluate model adequacy might be too strict and rigorous with empirical data, in particular for large phylogenomic datasets. Approaches such as the Goldman-Cox test evaluate the absolute fit of data and model ...
  • Access State: Open Access
  • Rights information: In Copyright