A Caenorhabditis elegans genetic-interaction map wiggles into view

Systematic mapping of genetic-interaction networks will provide an essential foundation for understanding complex genetic disorders, mechanisms of genetic buffering and principles of robustness and evolvability. A recent study of signaling pathways in Caenorhabditis elegans lays the next row of bricks in this foundation.

One of the enduring challenges in biology is to learn how the amazing complexity and diversity of life forms arise from a limited repertoire of heritable factors. To understand the emergent properties of biological systems, it is necessary to first map the functional organization of the complex biological networks that underlie them. Many levels of function will need to be analyzed systematically to arrive at this goal. Mapping molecular interactions such as proteinprotein, protein-DNA, and RNA-RNA interactions will help define structural and regulatory relationships. However, understanding organizational principles that determine how different parts of these networks are coordinated will require uncovering functional dependencies that may not be reflected in direct physical interactions, for example between actin-and tubulin-dependent cellular processes [1]. Large-scale mapping of genetic interactions in model organisms offers a powerful approach to tackle this challenge. A recent genetic-interaction study published in Journal of Biology by Byrne et al. [2], focusing on signaling pathways of the nematode worm Caenorhabditis elegans, pushes the envelope of genetic-interaction mapping in a multicellular organism by developing a novel approach to defining networks of genetic interactions based on interaction strength, and integrating these networks with other dimensions of genome-scale data in order to reveal global patterns of functional relationships. U Un nr ra av ve el li in ng g t th he e f fu un nc ct ti io on na al l o or rg ga an ni iz za at ti io on n o of f b bi io ol lo og gi ic ca al l n ne et tw wo or rk ks s Why is it important to gain a global view of genetic interactions? One simple reason is to help assign functions to the many nonessential genes whose in vivo requirements remain obscure. Genetic and reverse genetic studies in Saccharomyces cerevisiae [3], C. elegans [4][5][6][7], and Drosophila melanogaster [8] indicate that the majority of genes (around 75-85%) in both single-celled eukaryotes and metazoans appear to be dispensable for survival; moreover, only about half of protein-coding genes in yeast [3,9] and about 25% in the worm [10] give rise on their own to any discernable phenotype in vivo. However, genetic modifier screens for enhancement or suppression of specific phenotypes have been used with great success in model organisms to identify genes with related functions and to order genes within pathways involved in numerous biological processes (for a review see [11]). Many genetic elements identified in this way give rise to detectable phenotypes only when their function is compromised in combination with other genetic loci. In medicine, there is an increasing recognition that the etiology of many diseases involves multiple genetic factors that confound simple genotype-phenotype relationships [12].
Characterizing patterns of genetic interactions can also help us understand how organisms resist or adapt to environ-mental or genetic variation. Biological networks are increasingly seen as modular systems [13], in which coordinated assemblies of components with specialized functions mediate distinct processes that are, to some extent, insulated from other parts of the network. Thus perturbing the activity of a single component is often not catastrophic; instead, systems find ways to compensate. This impressive resilience is thought to reflect fundamental architectural properties of molecular networks that underlie both the robustness and the adaptability of biological systems. Robustness refers to the ability of organisms to maintain phenotypic stability through homeostatic mechanisms that allow them to tolerate fluctuations in environmental conditions or genetic variation [14]. Phenotypic buffering allows the accumulation of mutations in a particular genetic background; when buffering mechanisms break down, this hidden genetic variation may become expressed. This is famously illustrated by the example of HSP90 [15] -which when impaired can release striking morphological diversity in almost any adult structure in the fly -but may be a more general property of genetic networks [16]. The release of phenotypic variation has important implications for evolutionary change [17,18]. Thus, buffering can both promote homeostasis and foster phenotypic plasticity under the right conditions. Identifying functional connections between particular molecules and modules on a global scale will help us both to learn about explicit mechanisms and to develop a theoretical framework for how organisms adjust to variability in external conditions and internal network states.
I In ns si ig gh ht ts s f fr ro om m g ge en ne et ti ic c n ne et tw wo or rk ks s i in n y ye ea as st t The most comprehensive analyses of genetic interactions so far have been performed in S. cerevisiae. High-throughput approaches have been developed in yeast to create qualitative and quantitative maps of genetic interactions, including synthetic sick or lethal (SSL) interactions for essential and nonessential sets of genes, synthetic dosage suppression or lethality, and complex haploinsufficient interactions [19]. These techniques are enabled by the generation of strain libraries with mutations in every gene, allowing large-scale screening of deletions, conditional or hypomorphic alleles and inducible overexpression constructs [19]. These approaches have also been extended to map the sensitivity of yeast to various chemicals, revealing interactions between specific genes and environmental perturbations (see, for example, [20][21][22]).
The growing body of genetic-interaction studies has greatly extended our understanding of the functional organization of biological processes in yeast, in terms of both specific functional relationships and global properties [19]. For example, although the SSL and protein-protein interaction (PPI) maps overlap more than expected by chance (approximately 13% of within-complex PPIs are SSLs, compared with 0.5% expected by chance), the number of overlapping interactions is very small overall (around 1-4% of SSL pairs are also PPIs), pointing to essential differences in the type of information that these networks provide about functional organization within cells [1]. PPIs correspond mainly to physical complexes and pathways, whereas patterns of SSL interactions predominantly reveal between-pathway relationships that expose functional links between related cellular processes; thus genes in the same pathway or complex tend to share many of the same genetic-interaction partners [1].
This body of data has also stimulated significant interest in exploring the types of interactions that can be observed genetically [23] and in defining mathematical models that should be applied to interpret the results of geneticinteraction studies [24]. For example, using a 'min' definition, in which any phenotype worse than either of the single mutants is called a genetic interaction, will yield a different (and much larger) set of interactions than using a 'product' rule, in which the phenotype of a double mutant must be worse than the product of either single mutant alone [24]. Considering synergistic genetic interactions in yeast, alternative definitions differ with respect to identifying functional relationships and can lead to different conclusions regarding the underlying biology [24]. This issue also has significant implications for the interpretation of genetic interactions in other organisms.
M Ma ap pp pi in ng g g ge en ne et ti ic c n ne et tw wo or rk ks s i in n C C. . e el le eg ga an ns s Similar approaches now need to be extended to study complex interactions in multicellular organisms. As described in Byrne et al. [2], a collaborative study between the groups of Peter Roy and Josh Stuart takes a significant new step in this direction. Although the analysis of genetic interactions for individual genes of interest has long been a mainstay of genetics in metazoan model organisms such as the worm and the fly, large-scale systematic efforts have lagged far behind those in yeast, mainly because of technical limitations: comprehensive libraries of deletion strains do not yet exist, and selecting and analyzing progeny from the 200 million or so possible mutant crosses using forward genetic methods is a logistical nightmare. With few reported exceptions [25], a purely reverse genetic approach using combinatorial RNA interference (RNAi) to target two genes simultaneously in the same animal has not met with great success in most worm labs. However, a hybrid strategy, in which individual genetic alleles are screened against a library of genes depleted one at a time by RNAi, has proved an effective alternative in studies of increasing scale [26][27][28][29]. Using a hybrid genetic-RNAi approach, Byrne et al. [2] report a network of 1,246 genetic interactions between genetic alleles of 11 'query' genes (primarily involved in conserved signaling pathways specific to metazoans) and genes from a library of 858 'target' genes depleted individually by RNAi. The target gene set was split between 372 genes likely to be involved in signal transduction (based on functional annotations) and 486 genes on linkage group III (which may contain new, previously unidentified signaling targets).
Although the total number of interactions tested was not significantly larger than several recent studies [27,[29][30][31], the work by Byrne et al. [2] stands out in its attempt to provide a more quantitative assessment of the strength of genetic interactions and in its novel use of a global dataanalysis approach designed to identify interacting pairs in an unbiased fashion. The experimental design involved estimating numbers of progeny on solid agar over several days using a graded scoring scheme in blind triplicate assays. From these data the authors constructed a large compendium matrix of 56,347 scores and inferred 51 unique sets of genetic interactions by varying six parameters (for example, deviation between experimental and control samples, number of days with an observed deviation and reproducibility). They then chose two network variants that corresponded best to shared Gene Ontology (GO) terms [32]: a 'high confidence' variant containing 656 unique interactions among 253 genes, and a larger variant with slightly higher recall containing 1,246 interactions among 461 genes.
What lessons did Byrne et al. [2] learn from this study? To evaluate their results, the authors analyzed their geneticinteraction networks in a variety of ways, both independently and in combination with other datasets. First, they identified many potential new functional links and confirmed a number of previously noted links within and between specific signaling pathways (for example, transforming growth factor β ↔ Wnt/β-catenin; fibroblast growth factor ↔ epidermal growth factor). These links provide many hypotheses for follow-up studies to determine their potential significance in development. Second, based on comparisons with a variety of other datasets, Byrne et al. concluded that their approach resulted in much higher detection sensitivity than most previous screens, which they attributed to their ability to detect both strong and weak interactions and their novel method of identifying interacting pairs. Third, by overlaying their geneticinteraction network with protein-protein interactions, coexpression and co-phenotype data, the authors found that there is little overlap between datasets, suggesting that the genetic interactions they identified are revealing novel functional relationships. Even though the PPI and phenotype data are still relatively sparse with respect to the entire genome, and the level of specificity provided by the phenotype and expression links is limited, this result is consistent with studies in yeast.
Within the superimposed network, the authors identified highly connected subnetworks, which in at least one example revealed a significant enrichment for similar RNAi phenotypes and previously undocumented genetic interactions upon retesting. Many of these subnetworks were enriched for shared functional annotations, and a significant number were bridged by genetic interactions (Figure 1), supporting the idea that genetic interactions connect different functional modules. This observation is curious in light of the fact that the final genetic-interaction F Fi ig gu ur re e 1 1 Adapted from Byrne et al. [2], a superimposed network composed of different types of functional linkages contains subnetworks of genes that are highly interconnected based on one type of data: coexpression (blue), co-phenotype (green), or eukaryotic protein-protein interactions ('interolog'; purple). Byrne et al. found that these subnetworks were bridged by genetic interactions (pink) more often than expected by chance. Many such subnetworks were enriched for genes with shared functional annotations, supporting the idea that enhancing genetic interactions (identified by reduced function of a pair of genes) tend to bridge distinct functional modules.

Co-phenotype SGI Coexpression
Interolog network was selected to maximize shared GO terms, and possibly suggests that this standard may not be the optimal measure to evaluate the fine structure of functional relationships within a cell or organism. Alternatively, refining the functional neighborhoods used for this analysis ('broad subnetworks' based on a single mode of interaction, such as coexpression, and containing dozens or hundreds of genes) may provide a higher level of resolution that would bring these relationships into better focus. Finally, when the authors compared the connectivity of yeast [1,33] and worm [2,27] geneticinteraction networks, they found no significant evidence for conservation of synthetic genetic interactions between species. Thus, as in yeast [1], genetic interactions identified in the worm appear to reveal higher-level intermodule functional relationships (see Figure 1); however, the specific patterns of connectivity between modules may not be evolutionarily conserved.
L Lo oo ok ki in ng g t to o t th he e f fu ut tu ur re e These are very early days for systematic genetic interaction studies in metazoans, and many questions -both theoretical and technical -remain unresolved. A notably unglamorous but important set of technical considerations is that differences in methodology between different studies in the same organism will heavily influence both the composition of reported datasets and conclusions drawn from them. Chief among these considerations, as illustrated by the 51 network variants identified by Byrne et al. [2] and comparisons with results from a similar study by Lehner et al. [27], is that differences in experimental design, scoring methods and models used to define genetic interactions [24] will necessarily result in different sets of reported interactions. It is not yet clear how to evaluate these differences. Notably, both Byrne et al. and Lehner et al. achieved high technical reproducibility (83% and more than 90%, respectively); in contrast, when genetic alleles and RNAi for query-target pairs were reversed, only 40% (6/15) of reciprocal tests by Byrne et al. interacted. This indicates that these screens may be far from saturation, as RNAi does not always phenocopy genetic alleles and can carry considerable false-negative rates [34]. Unlike Lehner et al. [27], who placed a lower estimate of 32% on their detection rate for previously reported genetic interactions (some of which, for example suppressors, would not be expected to be detected as synthetic lethals), Byrne et al. [2] did not compare their results with a 'gold standard' of genetic interactions from the literature. Instead, they evaluated functional cohesion by precision and recall of shared GO terms, achieving somewhat lower precision but much higher recall (as well as a higher total number of interactions) among pairs tested in both studies. This and other comparisons suggest that the detection methods used by Lehner et al. [27] were more stringent, resulting in a bias toward stronger genetic interactions, and that Byrne et al. [2] cast a much wider net for recovery of genetic interactions.
A further improvement over the semi-quantitative scoring approach used by Byrne et al. [2], which was based on binned ranges of estimated survival rates, would be to precisely measure lethality in these assays. Currently, one of the biggest technical limitations for large-scale RNAi-based screens in C. elegans is the lack of efficient high-throughput methods to quantitate lethality, growth rates, and other morphological phenotypes, which limits the extent to which issues surrounding the quantitative definition of genetic interactions [23,24] can be explored. Over time, as technical approaches evolve and further large-scale screens and in-depth studies accumulate, it will be interesting to revisit these comparisons.
A more profound question is, to what extent will patterns of genetic interactions be conserved across species? Answers to this question will inform how we use crossspecies inferences to guide studies in less experimentally tractable systems. A preliminary comparison between worm and yeast [2] suggested that, in contrast to PPIs, there is little conservation of genetic interactions between these two organisms. This conclusion is clouded, however, by caveats on several levels. For example, it is not clear if this comparison considered whether all of the positive genetic interaction pairs in C. elegans were actually tested in yeast. Since the set of gene pairs that has been tested differs substantially between yeast and worm, tests of conservation should be made only for subsets of gene pairs that have been systematically tested in both organisms. More obvious is the dichotomy between unicellular and multicellular organisms: yeast are directly exposed to the environment, and must modulate their internal states accordingly, whereas metazoans comprise many different cell types with distinct internal states and external contacts. Measuring survival and growth rates thus provides a relatively direct readout of cell status in yeast, whereas the types of phenotypic assay performed in metazoans will heavily influence our ability to detect different patterns of genetic interactions. The interpretation of negative results in whole-animal assays is further complicated by the possibility -given a particular experimental setup or phenotypic assay -that two potentially interacting components may not become limiting in the same cell types, or that interactions in a subset of cells will not give rise to obvious organismal phenotypes. Studies of mammalian and Drosophila cells in culture have begun to report genome-wide genetic requirements for specific cellular functions [35,36], but these cannot reveal how biological systems as a whole adapt to the loss of specific genetic determinants. Thus, the answer to whether geneticinteraction studies in model systems will provide practical insights into human biology and disease mechanisms awaits further studies. Good reason for optimism stems from the deep conservation of many developmental signaling pathways and the fact that many human disease processes can be effectively studied in these models (the fly and worm, for example, even provide model systems to study mechanisms underlying Alzheimer's disease [37]).
What's next? Extending systematic genetic-interaction maps to other metazoan systems, including alleviating (suppressing) as well as synthetic (enhancing) interactions, using more specific high-throughput assays (for example, those that allow tissue-specific readouts [38]), and developing quantitative assays, will greatly expand our understanding of molecular network organization in complex multicellular organisms. These approaches could also be combined with chemical genetic profiling, as pioneered in yeast [21,22], to develop therapeutic strategies based on multiple molecular targets within the cell. Experimental approaches for mapping genetic interactions will both inform and be guided by efforts to generate predictive models for both gene function and functional associations between genes (for example [39,40]): the continued accumulation of large unbiased training sets will help develop better predictive methods, which in turn will help fill out neighborhoods of interactions and reduce the combinatorial search space for studies directed at specific pathways. Finally, it will be interesting to compare the spectrum of phenotypes and genetic interactions identified in systematic studies of genetic alleles and RNAi with those arising from variation in natural populations (for example, see [41]). Building on knowledge gained from decades of studying specific genes and pathways, global analysis of genetic-interaction networks promises to reveal new insights that will broadly influence our thinking about both applications to medicine and the relationship between network architecture and biological function.
A Ac ck kn no ow wl le ed dg ge em me en nt ts s I wish to thank F. Roth, M. Siegal, F. Piano and A. Fernandez for constructive comments on the manuscript and NIH (HD046236 and HG004276), US Department of the Army (W23RYX-3275-N605), and NYSTAR (C040066) for research support.