7.2 What Is a Phylogenetic Tree? Structure and Interpretation
7.2 What Is a Phylogenetic Tree? Structure and Interpretation
Section titled “7.2 What Is a Phylogenetic Tree? Structure and Interpretation”Learning Objectives
Section titled “Learning Objectives”After reading this section, you should be able to:
- describe the structural components of a phylogenetic tree
- distinguish between rooted and unrooted trees
- interpret internal nodes as hypothetical ancestors
- differentiate between tree topology and branch lengths
- correctly read and interpret phylogenetic trees without common misconceptions
From Idea to Representation
Section titled “From Idea to Representation”In the previous section, we introduced phylogenetic trees as models that describe evolutionary relationships. We now make this representation more precise.
At an abstract level, a phylogenetic tree expresses a simple idea: biological entities diverge over time from common ancestors. This process creates a branching structure, which we represent as a tree. While this idea is intuitive, its correct interpretation requires some care. The meaning of a phylogenetic tree does not lie in how it is drawn, but in the relationships it encodes.
Understanding this distinction is essential before we begin constructing trees algorithmically.
Nodes and Branches
Section titled “Nodes and Branches”A phylogenetic tree consists of nodes connected by branches. Each of these elements has a specific interpretation.
The terminal nodes, often called leaves, correspond to the observed entities. These may be species, genes, or sequences. They represent the data we start from.
The internal nodes represent inferred entities. They correspond to common ancestors from which descendant lineages have diverged. These ancestors are not observed directly. Their existence is inferred because they provide a coherent explanation for the observed similarities among sequences .
Branches connect nodes and represent evolutionary lineages. Moving along a branch corresponds to the passage of evolutionary time or the accumulation of changes, depending on the interpretation of the model.
Topology as the Core of the Tree
Section titled “Topology as the Core of the Tree”The most important aspect of a phylogenetic tree is its topology, that is, the pattern of branching relationships.
Topology determines which taxa share a more recent common ancestor. This defines evolutionary relatedness. Two sequences are considered closely related if they join at a recent internal node.
A crucial point is that the topology is independent of how the tree is drawn. Branches can be rotated around internal nodes without changing the relationships. As a result, the left-to-right order of leaves has no meaning.
What matters is not visual proximity, but shared ancestry.
Rooted and Unrooted Trees
Section titled “Rooted and Unrooted Trees”Phylogenetic trees can be either rooted or unrooted, and this distinction fundamentally affects their interpretation.
A rooted tree contains a designated root node that represents the most recent common ancestor of all taxa in the tree. The root introduces a direction, from past to present. It allows us to interpret the order of divergence events.
An unrooted tree, in contrast, represents relationships without specifying a direction of evolution. It tells us which taxa are more closely related, but not which lineage diverged first .
Many reconstruction methods initially produce unrooted trees. Additional information or assumptions are required to determine the root.
Branch Lengths and Their Meaning
Section titled “Branch Lengths and Their Meaning”In some phylogenetic trees, branch lengths carry quantitative meaning. In others, they do not.
When branch lengths are included, they often represent evolutionary distance, such as the expected number of substitutions along a lineage. The distance between two taxa is then obtained by summing the lengths of the branches that connect them.
In other cases, branch lengths are omitted or arbitrary, and only the topology is considered. It is therefore essential to distinguish between the structure of the tree and the interpretation of its branch lengths.
A common mistake is to assume that visually longer branches always indicate greater evolutionary distance. This is only true if the tree has been constructed with meaningful branch lengths.
Resolution and Uncertainty
Section titled “Resolution and Uncertainty”Not all phylogenetic trees are fully resolved.
In a fully resolved tree, each internal node splits into exactly two descendants. This corresponds to a precise hypothesis about the order of divergence events.
In contrast, a node may give rise to more than two descendants. Such a structure is called a polytomy. It indicates that the data do not provide sufficient information to resolve the relationships among these taxa.
Polytomies are therefore not merely graphical features. They reflect uncertainty in the reconstruction.
Groups and Evolutionary Interpretation
Section titled “Groups and Evolutionary Interpretation”Phylogenetic trees allow us to define biologically meaningful groups.
A monophyletic group consists of a common ancestor and all of its descendants. Such groups correspond to natural evolutionary units.
A paraphyletic group includes a common ancestor but excludes some of its descendants. These groups often arise from historical classifications that do not fully reflect evolutionary relationships .
Understanding these distinctions is essential when interpreting trees in a biological context.
Reading Trees Correctly
Section titled “Reading Trees Correctly”Because phylogenetic trees are visual representations, they are often misinterpreted.
One common mistake is to assume that taxa that are close together in the diagram are closely related. As discussed earlier, this is not necessarily true. Only the topology determines relatedness.
Another misconception is to interpret internal nodes as directly observed entities. In reality, they represent inferred ancestors. Their purpose is explanatory, not observational.
It is also important to remember that a tree is only one possible model. Alternative trees may explain the data equally well, especially when the available information is limited.
Conceptual Summary
Section titled “Conceptual Summary”A phylogenetic tree is a structured representation of evolutionary relationships. Its meaning lies in its topology and, when present, in its branch lengths. It encodes hypotheses about common ancestry and divergence, not direct observations.
Understanding how to read and interpret such trees is a prerequisite for constructing them and for evaluating the models they represent.
Self-Check Questions
Section titled “Self-Check Questions”- What is the role of internal nodes in a phylogenetic tree?
- Why does the left-to-right order of leaves not matter?
- What is the difference between rooted and unrooted trees?
- How should branch lengths be interpreted?
- What does a polytomy indicate?
- What distinguishes a monophyletic group from a paraphyletic group?