Insulators as mediators of intra- and inter-chromosomal interactions: a common evolutionary theme
© BioMed Central Ltd 2009
Published: 27 August 2009
Skip to main content
© BioMed Central Ltd 2009
Published: 27 August 2009
Insulator elements mediate intra- and inter-chromosomal interactions. The insulator protein CCCTC-binding factor (CTCF) is important for insulator function in several animals but a report in BMC Molecular Biology shows that Caenorhabditis elegans, yeast and plants lack CTCF. Alternative proteins may have a similar function in these organisms.
Eukaryotic genomes have developed a variety of strategies for efficiently orchestrating the complex patterns of gene expression required for proper cellular differentiation. Comparative genome analyses suggest that developmental evolution is largely driven by the increase in the complexity of these expression patterns . Consistent with this hypothesis, recent studies indicate that transcription factor-coding genes tend to be under greater positive evolutionary selection compared with other genes . To establish and maintain cell-specific patterns of gene expression, regions of the genome are kept in a silenced state while immediately adjacent regions are transcriptionally active because of the presence of promiscuous enhancer elements that can act over large distances. Insulators were originally described as DNA regulatory elements that ensure the progress of an accurate transcriptional program by keeping in check communication between enhancers and promoters and creating boundaries that prevent inappropriate interactions between adjacent chromatin domains. Accu mu lating evidence suggests that these properties of insulators arise from their ability to mediate intra- and inter-chromosomal interactions, which result in the formation of chromatin loops through clustering of multiple insulator sites . Depending on the complexity of the genome, the capability to mediate long-range interactions with other protein complexes may allow insulator proteins to carry out a variety of functions in the nucleus .
CCCTC-binding factor (CTCF) is the only known insulator protein necessary for establishing patterns of nuclear architecture and transcriptional control in vertebrates . This protein is also found in invertebrates such as Anopheles gambiae, Aedes aegypti and Drosophila melanogaster . A recent study by Heger et al. in BMC Molecular Biology  has shown that the gene encoding CTCF is not present in the genomes of several model organisms, including Saccharomyces cerevisiae, Schizosaccharomyces pombe, Arabidopsis thaliana and Caenorhabditis elegans. Because of the widespread presence of insulators and the essential role of CTCF in a wide variety of eukaryotic organisms, this absence of the gene in other organisms raises the possibility that other regulatory mechanisms might have evolved to replace the function of this protein. Here, we provide a brief overview of how insulator proteins work in Drosophila and vertebrates, as well as how plants and fungi may have adapted different proteins to accomplish insulator function. We also discuss how insulator proteins such as CTCF may have evolved new functions to handle more complex genomes in animals.
The concept of intra- and inter-chromosomal interaction mediated by insulator proteins in Drosophila seems to be applicable to the CTCF insulator in vertebrates, despite the involvement of a different set of protein complexes. The mechanism of CTCF function in vertebrates is best illustrated by the mouse imprinted Igf2-H19 locus , where four CTCF-binding sites are located at the imprinted control region (ICR) that lies between the Igf2 gene and its downstream enhancers (Figure 1b). CTCF binds to these sites on the maternally inherited allele but not on the methylated paternal copy. Chromatin conformation capture (3C) experiments revealed distinct long-range chromosomal interactions that are specific to the parent of origin (Figure 1b). On the maternal allele, a CTCF-dependent loop formed by contacts between DNA methylated region 1 (DMR1) and the ICR allows downstream enhancers to turn on the H19 gene. However, on the paternal allele, contacts between DMR2 and ICR allow downstream enhancers to activate the Igf2 gene. Given that CP190 protein has been shown to interact with CTCF in Drosophila, what proteins could then mediate CTCF-depen dent looping of chromatin in vertebrates? Recent data indicate that cohesin might be required for CTCF insulator function . Cohesin complexes mediate cohesion between sister chromatids by connecting two distinct DNA molecules physically. It is therefore plausible that cohesin can create or stabilize DNA loops during interphase by physically connecting different CTCF-binding sites on the same or different DNA molecules, in a manner similar to CP190 and Mod(mdg4) proteins in Drosophila.
If CTCF or functionally similar proteins have a role in establishing patterns of nuclear organization by mediating intra- and inter-chromosomal interactions, how do organisms that lack CTCF homologs accomplish the same goal? In S. pombe and S. cerevisiae, the transcription factor TFIIIC seems to have this role. In fission yeast, binding of TFIIIC to B-box sequences in the inverted repeat boundary elements can prevent the spreading of heterochromatin from the silenced mating-type loci to neighboring euchromatic regions . Detailed genome-wide analyses reveal that TFIIIC associates with RNA polymerase (Pol) III on all tRNA genes, which are mostly found at pericentromeric heterochromatin domain boundaries. In addition, TFIIIC binds to many sites between divergent promoters in the absence of Pol III and acts as a chromosome-organizing clamp (COC) by tethering distant loci to the nuclear periphery  (Figure 1c). Similarly, TFIIIC recruited to tRNA genes in budding yeast can act as both an enhancer-blocking insulator and a heterochromatin barrier by preventing ectopic spreading of Sir protein-mediated silencing . These results uncover a general mechanism of genome organization involving the conserved TFIIIC complex in yeast.
Studies of the process by which KNOTTED1-like homeobox (KNOX) genes are silenced during organogenesis suggest that A. thaliana may also use chromatin looping as a way of regulating gene expression . Stable KNOX gene silencing requires the DNA-binding proteins ASYMMETRIC LEAVES1 (AS1) and AS2 and the chromatin-remodeling factor HIRA. AS1 and AS2 form a repressor complex that binds directly to two DNA motif sites that flank the enhancer element of the KNOX genes BREVIPEDICELLUS (BP) and KNOTTED-like Arabidopsis (KNAT2). Interaction between AS1-AS2 complexes at these two sites is required to repress BP expression. These results suggest that AS1-AS2 complexes interact to create a loop in the KNOX promoter and, through recruitment of HIRA, to form a repressive chromatin state that blocks enhancer activity during organogenesis (Figure 1d). This regulatory mechanism, which may be conserved among plants with compound leaves, is conceptually similar to the action of an insulator in Drosophila and vertebrates.
Recent phylogenetic studies using the zinc-finger protein sets from 35 completely sequenced nematodes  has discovered the presence of CTCF-like genes in only three basal nematodes and not in other derived nematodes such as C. elegans. This suggests that CTCF might have been lost during nematode evolution, probably as a result of a switch from gene regulatory mechanisms involving distantly acting elements and chromatin insulation to polycistronic transcriptional units . However, the presence of higher-order genome organization in yeast suggests the possibility that other protein complexes may have evolved to replace CTCF functions in C. elegans.
The underlying theme governing insulator function seems to be the establishment of intra- and inter-chromosomal interactions that bring different sequences in close proximity within the nucleus to accomplish a variety of outcomes . Different eukaryotes may have evolved unique machineries to achieve this. It is also clear that insulator proteins such as CTCF may have acquired additional functions with increased complexity of the genome (reviewed in ). In yeast (S. cerevisiae), which has a haploid genome size of 13 megabases, the primary insulator function of TFIIIC seems to be the demarcation of chromatin into distinct domains for blockage of heterochromatin silencing. In A. thaliana, in which genes are only infrequently interrupted by repetitive elements outside the centromeric regions, AS1-AS2 complexes may mainly act to regulate enhancer-promoter interactions. Long-range interactions mediated by insulator proteins have wider functional implications for Drosophila and mammals. In Drosophila, different insulators have diverse DNA occupancy patterns with respect to gene features, suggesting that the various insulator functions have diversified by using different insulator DNA-binding proteins with a common interacting partner . Interestingly, vertebrate cells, which contain a larger genome that requires more complex forms of regulation, seem to require CTCF to have a wider set of regulatory roles. These include transcriptional regulation of gene expression at the major histocompatibility complex class II, β-globin and interferon-γ loci, V(D)J recombination at the immunoglobulin-encoding Igh and Igk loci, mono-allelic expression of imprinted genes and X-chromosome inactivation . The ability to have such varied roles must rely on context-dependent interactions with a variety of partners. Their identification remains one of the future challenges for the field.
Work in the authors' laboratory is supported by Public Health Service Award GM35463 from the National Institutes of Health.