Abstract
While Bayesian methods have become very popular in phylogenetic systematics, the foundations of this approach remain controversial. The star-tree paradox in Bayesian phylogenetics refers to the phenomenon that a particular binary phylogenetic tree sometimes has a very high posterior probability even though a star tree generates the data. I argue that this phenomenon reveals an unattractive feature of the Bayesian approach to scientific inference and discuss two proposals for how to address the star-tree paradox. In particular, I defend the polytomy prior as a solution (or rather dissolution) of the paradox and argue that it is preferable to a data-size dependent branch lengths prior from a methodological perspective. However, while this reply dissolves the star-tree paradox, the general challenge to Bayesian confirmation theory remains unmet.
Similar content being viewed by others
Notes
The external nodes of the tree represent present-day species (e.g., human, chimpanzee and gorilla), whereas internal nodes represent extinct ancestors (i.e., ancestors for which no sequence data are available). The degree of a node is the number of branches connected to the node. The root of tree T 1 has degree 2; the tree is therefore called binary. If the root of a tree has a degree larger than 2 or a non-root node has a degree larger than 3, the node represents a polytomy. Also note that the terms ‘tree’ and ‘tree topology’ will be used interchangeably.
While here ‘branch length’ is understood as measuring physical time, this term also has a different meaning in the biological literature. In graphical representations of a phylogenetic tree the length of a branch is seen as a measure of the amount of evolutionary change occurring along the branch. In that case ‘branch length’ means the expected number of substitutions, which is a function of time and the rate of substitutions.
A resolved tree does not contain any multifurcations (or, to use a different term for the same phenomenon, polytomies).
Yang also points out that the polytomy prior is difficult to code in current computer algorithms for Bayesian phylogenetic inference, such as ‘MrBayes’ (Huelsenbeck and Ronquist 2001; Ronquist and Huelsenbeck 2003). The focus of this paper, however, is on conceptual issues in Bayesian methodology rather than on practical issues associated with the coding of Bayesian phylogenetic methods. So, when assessing proposals of how to assign prior probabilities in Bayesian phylogenetics, only conceptual aspects will be taken into account and questions of how to practically implement these priors in current computer software are set aside.
References
Berger J, Wolpert R (1984) The likelihood principle. IMS, Hayward
Carnap R (1962) Logical foundations of probability. University of Chicago Press, Chicago
Dawid AP (1999) The trouble with Bayes factors. Research Report 202. Department of Statistical Science. University College London, London
Durand DJ, Bianco PG, Laroche J, Gilles A (2003) Insight into the origin of endemic mediterranean ichtyofauna: phyleography of Chondrostoma genus (Teleostei, Cyprinidae). J Hered 94:315–328
Falconnet M (2010) Priors for the Bayesian star paradox. Math Biosci 228:90–99
Felsenstein J (2004) Inferring phylogenies. Sinauer, Sunderland
Hajek A, Hartmann S (2010) Bayesian epistemology. In: Dancy J et al (eds) A companion to epistemology. Blackwell, Oxford, pp 93–106
Hawthorne J (2011) Confirmation theory. In: Bandyopadyay PS, Forster MR (eds) Handbook of the philosophy of science volume 7: philosophy of statistics. Elsevier, Amsterdam
Hoelzer GA, Melnick DJ (1994) Patterns of speciation and limits to phylogenetic resolution. Trends Ecol Evol 9:104–107
Huelsenbeck JP, Ronquist F (2001) MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17:754–755
Lewis PO, Holder MT, Holsinger KE (2005) Polytomies and Bayesian phylogenetic inference. Syst Biol 54:241–253
Ronquist F, Huelsenbeck JP (2003) MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19:1572–1574
Shimony A (1970) Scientific inference. Reprinted in Shimony A (1993) Search for a naturalistic world view. Cambridge University Press, Cambridge, pp 183–273
Stanford K (2006) Exceeding our grasp: science, history, and the problem of unconceived alternatives. Oxford University Press, Oxford
Steel M, Matsen FA (2007) The Bayesian “star paradox” persists for long finite sequences. Mol Biol Evol 24:1075–1079
Susko E (2008) On the distributions of bootstrap support and posterior distributions for a star tree. Syst Biol 57:602–612
Suzuki Y, Glazko GV, Nei M (2002) Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics. Proc Natl Acad Sci USA 99:16138–16143
Yang Z (2007) Fair-balance paradox, star-tree paradox and Bayesian phylogenetics. Mol Biol Evol 24:1639–1655
Yang Z, Rannala B (2005) Branch-length prior influences Bayesian posterior probability of phylogeny. Syst Biol 54:455–470
Acknowledgments
I would like to thank Samir Okasha and two anonymous reviewers for their helpful comments on earlier versions of the article. Financial support from the British Academy Postdoctoral Fellowship scheme is gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Autzen, B. Dissolving the star-tree paradox. Biol Philos 31, 409–419 (2016). https://doi.org/10.1007/s10539-015-9502-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10539-015-9502-2