In ovo omnia: diversification by duplication in fish and other vertebrates
© BioMed Central Ltd 2009
Published: 5 March 2009
Skip to main content
© BioMed Central Ltd 2009
Published: 5 March 2009
Gene and genome duplications are considered to be the main evolutionary mechanisms contributing to the unrivalled biodiversity of bony fish. New studies of vitellogenin yolk proteins, including a report in BMC Evolutionary Biology, reveal that the genes underlying key evolutionary innovations and adaptations have undergone complex patterns of duplication and functional evolution.
Since the publication of Charles Darwin's The Origin of Species a century and a half ago, evolutionary biologists have been concerned with the identification of the processes that govern the emergence of new species and, thus, of organismal diversity. Because of variation in the rate of speciation and extinction, evolution inevitably leads to an unequal distribution of morphological diversity and species-richness across taxonomic lineages. Some lineages have remained morphologically uniform and are species-poor, whereas others have diversified rapidly. It is these more 'successful' and species-rich lineages in particular that enable insights into the process of diversification.
In vertebrates, the most species-rich group is that of the fishes: at least one in two vertebrate species is a fish, or – more precisely – a teleost fish. There are at least 26,000 living teleost species , which show a remarkable variety of ecological, morphological and behavioral adaptations. Among the characteristics that distinguish the teleost cohort from the only 50 or so species of basal ray-finned fishes and the rest of the vertebrates are genomic features such as gene and genome duplications and higher rates of chromosomal rearrangements and molecular evolution .
A fish-specific genome duplication (also known as the 3R duplication) occurred in an ancestor of the teleost lineage around 300–350 million years ago . This event, which endowed teleosts with additional new genes, has been hypothesized to be at least partly responsible for their biodiversity and species richness [2, 4, 5]. Not all genes that emerged from the duplication are still present, however. In fact, the majority of duplicated genes (about 70–90%) have since been degraded and/or lost (a process termed nonfunctionalization). But because this massive post-duplication gene loss followed different routes, different teleost lineages now have different complements of paralogous genes derived from the original genome duplication. This process is called divergent resolution [4, 5]. Empirical support for divergent resolution between teleost lineages that diverged very early comes from a recent comparative genome-wide analysis of paralog loss in zebrafish and the green spotted pufferfish .
In many cases where both copies have been maintained in a genome, the functions of the ancestral gene are now distributed among the duplicates – a process called subfunctionalization. Given that retention of duplication-derived gene copies also followed different routes and that subfunctionalization can be neutral and stochastic, the partitioning of gene functions can also occur lineage-specifically. Finally, it is possible that one of the duplicates continues to fulfill the ancestral functions while the other acquires a completely new function (neofunctionalization). Differential functional evolution between teleost lineages has so far been shown for zebrafish, stickleback and medaka .
Together, the fish-specific genome duplication and the divergent resolution, subfunctionalization and neofunctionalization that followed it created a large evolutionary playground within teleost genomes. The duplication-diversification hypothesis predicts that gene and genome duplication and subsequent reciprocal gene loss and/or differential paralog evolution in divergent populations leads to genomic incompatibilities between isolated populations and, consequently, to postzygotic isolation and speciation. That is how the fish-specific genome duplication might have facilitated the radiation of teleosts [4, 5].
Besides the overall impact of gene and genome duplication on reproductive isolation and thus on speciation, neo-functionalization of a duplicated gene copy can lead to the origination of a key evolutionary innovation that enables a group to radiate, for example in a new environment. In two new articles, one in BMC Evolutionary Biology  and the other in Molecular Biology and Evolution , Finn and colleagues examine an example of a cluster of genes that emerged by duplication and that apparently has enabled a whole group of fishes to diversify.
Finn and Kristoffersen had already in earlier studies  reconstructed the evolution of the vitellogenin (vtg) gene family in teleost fishes. Vitellogenins are yolk proteins synthesized in the liver and deposited in the maturing oocyte. Finn and Kristoffersen  suggested that neo-functionalization of the vtgAa gene in acanthomorphs, the most species-rich group of teleosts (comprising about 16,000 species, 78% of which are marine), was an important step towards adapting to a new spawning strategy in the marine realm. Proteolysis of the VtgAa yolk protein leads to an increase in the levels of free amino acids in the maturing oocyte and causes water influx. In this way, the hydrated eggs are protected against leakage of water into the hyperosmolar marine environment, so that the eggs float on the water surface. This is an important adaptation that makes pelagic ('floating') spawning strategies possible.
The initial phylogenetic analysis of teleost vitellogenins  suggested that the three vtg genes in acanthomorphs, vtgAa, vtgAb and vtgC, evolved through a progressive series of gene duplications and subsequent gene losses, involving the fish-specific genome duplication and the two earlier rounds of whole genome duplication in vertebrates (called 1R and 2R), and also an acanthomorph-specific duplication of the vtgA gene that generated the vtgAa and vtgAb duplicates. According to this scenario, lineage-specific neofunctionalization of the newly arising vtgAa paralog in acanthomorphs facilitated their conquest of the marine ecosystem from their original habitats in freshwaters.
In theory, phylogenetic reconstruction of the vitellogenin gene or protein family should reveal these three ancestral gene duplications. However, published vitellogenin phylogenies [1, 7, 8, 10] consistently suggest that the different vertebrate Vtg2 and Vtg3 genes have been generated in parallel but independently through lineage-specific tandem duplications (Figure 1b). One explanation for the failure of phylogenies to reconstruct the common duplication of the Vtg2/3 precursor could be that gene conversion has occurred between Vtg2 and Vtg3, keeping them alike. The new results by Finn et al.  and Babin  therefore illustrate how important it is to include synteny data for the correct inference of gene family evolution.
The evolutionary significance of vitellogenins is further substantiated by the high frequency of true lineage-specific duplication events in teleost fishes. In acanthomorphs, Vtg2/vtgAa has been duplicated in medaka, whereas Vtg3/vtgAb has multiple copies in marine labrids (wrasses). In the zebrafish, an ostariophysian, both Vtg3/vtgAb and Vtg2/vtgAa have been duplicated, the latter being present in as many as five copies [7–9]. Nevertheless, acanthomorphs are special in their processing of the Vtg2/vtgAa protein and the exceptionally high expression of Vtg2/vtgAa in marine, pelagically spawning species . Although yolk proteolysis evolved before the divergence of Acanthomorpha and Otocephala (such as zebrafish and herring), it was not until the neofunctionalization of Vtg2/vtgAa in the acanthomorph lineage that highly hydrated marine pelagic eggs were made possible, thereby triggering the teleost radiation in the oceans. This happened at least 400 million years after the evolution of the Vtg2/vtgAa gene itself [7, 8].