The Replication Fork: Understanding the Eukaryotic Replication Machinery and the Challenges to Genome Duplication

* Authors to whom correspondence should be addressed; E-Mails: ude.ekud@881la (A.R.L.); ude.demlexerd@ihcugon.ihsie (E.N.); Tel.: +1-215-762-4825 (E.N.); Fax: +1-215-762-4452 (E.N.).

This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Abstract

Eukaryotic cells must accurately and efficiently duplicate their genomes during each round of the cell cycle. Multiple linear chromosomes, an abundance of regulatory elements, and chromosome packaging are all challenges that the eukaryotic DNA replication machinery must successfully overcome. The replication machinery, the “replisome” complex, is composed of many specialized proteins with functions in supporting replication by DNA polymerases. Efficient replisome progression relies on tight coordination between the various factors of the replisome. Further, replisome progression must occur on less than ideal templates at various genomic loci. Here, we describe the functions of the major replisome components, as well as some of the obstacles to efficient DNA replication that the replisome confronts. Together, this review summarizes current understanding of the vastly complicated task of replicating eukaryotic DNA.

Keywords: DNA replication, replisome, replication fork, genome stability, checkpoint, fork barriers, difficult-to-replicate sites

1. Introduction: Basics of DNA Replication

DNA replication, at its most fundamental, is the action of DNA polymerases synthesizing a DNA strand complementary to the original template strand. To synthesize DNA, the double-stranded DNA is unwound by DNA helicases ahead of polymerases, forming a replication fork containing two single-stranded templates. Replication processes permit the copying of a single DNA double helix into two DNA helices, which are divided into the daughter cells at mitosis. The major enzymatic functions carried out at the replication fork are well conserved from prokaryotes to eukaryotes. In practice, the replication machinery is a massive complex coordinating many proteins all working at the site of replication, forming the “replisome”.

The replisome is responsible for copying the entirety of genomic DNA in each proliferative cell. This process allows for the high-fidelity passage of hereditary/genetic information from parental cell to daughter cell and is thus essential to all organisms. Much of the cell cycle is built around ensuring that DNA replication occurs without errors. DNA replication is an energetically costly process. In G₁ phase of the cell cycle, many of the DNA replication regulatory processes are initiated. In eukaryotes, the vast majority of DNA synthesis occurs during S phase of the cell cycle, and the entire genome must be unwound and duplicated to form two daughter copies. During G₂, any damaged DNA or replication errors are corrected. Finally, one copy of the genomes is segregated to each daughter cell at Mitosis or M phase. These daughter copies each contain one strand from the parental duplex DNA and one nascent antiparallel strand. This mechanism is conserved from prokaryotes to eukaryotes and is known as semiconservative DNA replication [1]. The process of semiconservative replication suggested a geometry for the site of DNA replication, a fork-like DNA structure, where the DNA helix is open, or unwound, exposing unpaired DNA nucleotides for recognition and base pairing for the incorporation of free nucleotides into double-stranded DNA ( Figure 1 ).

An external file that holds a picture, illustration, etc. Object name is genes-04-00001-g001.jpg

The replication fork. Leading-strand synthesis proceeds continuously in the 5' to 3' direction. Lagging-strand synthesis also occurs in the 5' to 3' direction, but in a discontinuous manner. An RNA/DNA primer (labeled in green) initiates leading-strand synthesis and every Okazaki fragment on the lagging strand.

The base pairing and chain formation reactions, which form the daughter helix, are catalyzed by DNA polymerases [2,3]. These enzymes move along single-stranded DNA (ssDNA) and allow for the extension of the nascent DNA strand by “reading” the template strand and allowing for incorporation of the proper purine (adenine and guanine) and pyrimidine (thymidine and cytosine) nucleobases. Activated free deoxyribonucleotides exist in the cell as deoxyribonucleotide triphosphates (dNTPs). These free nucleotides are added to an exposed 3'-hydroxyl group on the last incorporated nucleotide. In this reaction, a pyrophosphate is released from the free dNTP, generating energy for the polymerization reaction and exposing the 5' monophosphate, which is then covalently bonded to the 3' oxygen. Additionally, incorrectly inserted nucleotides can be removed and replaced by the correct nucleotides in an energetically favorable reaction. This property is vital to proper proofreading and repair of errors that occur during DNA replication.

Owing to the antiparallel nature of duplex DNA, DNA replication occurs in opposite directions between the two new strands at the replication fork. However, all DNA polymerases synthesize DNA in the 5' to 3' direction. Consequently, further coordination is required during DNA replication. Two replicative polymerases synthesize DNA in opposing orientations ( Figure 1 ). Polymerase ε (epsilon) synthesizes DNA in a continuous fashion, as it is “pointed” in the same direction as DNA unwinding. This strand is known as the “leading strand.” In contrast, polymerase δ (delta) synthesizes DNA on the opposite template strand in a fragmented, or discontinuous, manner and this strand is termed the “lagging strand” [4]. The discontinuous stretches of DNA replication products on the lagging strand are known as Okazaki fragments and are about 100 to 200 bases in length at eukaryotic replication forks. Owing to the “lagging” nature, the lagging strand generally contains a longer stretch of ssDNA that is coated by single-stranded binding proteins, which stabilizes ssDNA templates by preventing secondary structure formation or other transactions at the exposed ssDNA. In eukaryotes, ssDNA stabilization is maintained by the heterotrimeric complex known as replication protein A (RPA) ( Figure 2 ) [5,6]. Each Okazaki fragment is preceded by an RNA primer, which is displaced by the procession of the next Okazaki fragment during synthesis. In eukaryotic cells, a small amount of the DNA segment immediately upstream of the RNA primer is also displaced, creating a flap structure. This flap is then cleaved by endonucleases (such as Fen1, discussed later). At the replication fork, the gap in DNA after removal of the flap is sealed by DNA ligase I [7,8,9]. Owing to the relatively short nature of the eukaryotic Okazaki fragment, DNA replication synthesis occurring discontinuously on the lagging strand is less efficient and more time consuming than leading-strand synthesis.

An external file that holds a picture, illustration, etc. Object name is genes-04-00001-g002.jpg

The eukaryotic replisome complex coordinates DNA replication. Replication on the leading and lagging strands is performed by Pol ε and Pol δ, respectively. Many replisome factors (including the FPC [fork protection complex], Claspin, And1, and RFC [the replication factor C clamp loader]) are charged with regulating polymerase functions and coordinating DNA synthesis with unwinding of the template strand by Cdc45-MCM [mini-chromosome maintenance]-GINS [go-ichi-ni-san]. The replisome also associates with checkpoint proteins as DNA replication and genome integrity surveillance mechanisms.

2. DNA Polymerases and Helicases

The two basic processes of DNA replication are unwinding of the template strand and polymerization of the daughter strands. Therefore, there are two main “workhorse” enzymes in the replisome, the replicative helicase and polymerases. The replicative helicase is responsible for unwinding the parental duplex DNA, exposing two ssDNA templates. The template is then utilized by the replicative polymerases to generate two copies of the parental genome. Recent results have yielded new information on the specific role of each polymerase at the replication fork and while building on our knowledge of the replicative helicase. Understanding the main players in DNA replication allows us to further appreciate their regulation by other replisome factors.

2.1. Replicative DNA Polymerases: Further Subdivision of Labor

DNA polymerase function is highly specialized. From viruses through mammals, defined DNA polymerases accomplish replication on specific templates and in narrow localizations. At the eukaryotic replication fork, three distinct replicative polymerase complexes contribute to canonical DNA replication: α, δ, and ε. These three polymerases are essential for viability of the cell [10,11,12,13]. Because DNA polymerases require a primer on which to begin DNA synthesis, first, polymerase α (Pol α) acts as a replicative primase. Pol α is associated with an RNA primase and this complex accomplishes the priming task by synthesizing a primer that contains a short ~10-nucleotide RNA stretch followed by 10 to 20 DNA bases [14,15]. Importantly, this priming action occurs at replication initiation at origins to begin leading-strand synthesis and also at the 5' end of each Okazaki fragment on the lagging strand ( Figure 1 ).

However, Pol α is not able to continue DNA replication. From in vitro studies using SV40 T-antigen-dependent replication and reconstituted components, it was observed that DNA replication must be “handed off” to another polymerase to continue synthesis [9,16]. The polymerase switching requires clamp loaders (which will be discussed in detail later in this article) [17]. Initially, it was thought that Pol δ performed leading-strand replication and that Pol α completed each Okazaki fragment on the lagging strand [17,18]. Using mutator polymerase variants and mapping nucleotide misincorporation events, Kunkel and colleagues found that Pol ε and Pol δ mutations lead to mismatched nucleotide incorporation only on the leading and lagging strands, respectively [19,20,21]. Thus, normal DNA replication requires the coordinated actions of three DNA polymerases: Pol α for prime synthesis, Pol ε for leading-strand replication, and the constantly loaded Pol δ for generating Okazaki fragments during lagging-strand synthesis ( Figure 2 ).

2.2. Helicases Unwind DNA for Replication

For DNA polymerases to function, the double-stranded helix must be unwound to expose a single-stranded template. This activity is performed by the replicative helicase. In eukaryotes, the replicative helicase is a hexameric complex comprised of the mini-chromosome maintenance proteins (Mcm2-7: Mcm2, Mcm3, Mcm4, Mcm5, Mcm6 and Mcm7). The MCM helicase is an AAA + ATPase, a superfamily of protein complexes that process substrates through a central pore using energy release from ATP hydrolysis [22].

MCM activity is required throughout S phase for DNA replication [23,24]. The MCM proteins are recruited to replication origins (during G₁ phase and before DNA replication) then redistributed throughout genomic DNA during S phase, indicative of their localization to the replication fork [25]. Although it was known that MCM proteins are required for DNA replication initiation and progression, it was not originally clear what the enzymatic function of the MCM complex could be [26]. In a study using purified MCM homologue from archaea, ATP-driven helicase activity was detected in fractions corresponding to double hexamer forms of the complex [27]. Further, purified complexes of Mcm4/6/7 have ATP-dependent helicase activity in vitro, unwinding DNA in the 3' to 5' direction [28,29]. These results, coupled with localization and knockout studies strongly favor the model suggesting that the Mcm2-7 hexamer is the core of the replicative helicase. However, the full replicative helicase responsible for replication in eukaryotic cells requires additional factors, including the go-ichi-ni-san (GINS) complex and Cdc45, which together form the Cdc45-MCM-GINS (CMG) complex ( Figure 2 ). The function of the CMG complex will be discussed in detail later in this article.

3. Control of DNA Replication at the Replication Fork

Although the enzymatic processes of DNA replication such as unwinding, template generation/stabilization, and daughter strand synthesis are largely determined, the control of DNA replication in vivo is not. DNA replication requires multiple processes to coordinate and regulate highly accurate and timely duplication of genomic DNA during S phase. In addition to primase, replicative polymerases, and helicases, the DNA replication fork requires the use of accessory proteins to facilitate efficient initiation and replication fork progression. The cooperative protein complexes that participate in DNA replication are known as the replisome ( Figure 2 ). New findings continue to suggest that the size and complexity of the replisome is greater than once thought.

3.1. Replication Initiation at Origins

To completely duplicate the genome in a reasonable time during the cell cycle, eukaryotic cells initiate DNA replication at multiple sites during DNA replication, whereas prokaryotic replication initiates at a single locus. Replication initiator sites are known as origins of replication (Oris) and are recognized by the origin recognition complex (ORC) of proteins in eukaryotic cells. ORCs are found associated with DNA throughout the genome and form the markers to which replication forks are recruited in a highly regulated manner (reviewed in [30] and [31]). In some eukaryotes, such as budding yeast, origins are defined by conserved nucleotide sequences, known as autonomous replication sequences (ARSs) that mark Oris. However, in most other model eukaryotes and in all metazoans, replication origins are less well defined (reviewed in [32]). Origin usage in metazoans can be dynamic, with origin firing at different sites depending on cell type and developmental stage. Nevertheless, the mechanism of replisome assembly and origin firing is highly conserved.

During late mitosis and G₁ phase, Cdt1 and Cdc6 (Cdc18 in fission yeast) proteins associate with Ori sites throughout the genome and recruit Mcm2-7 ( Figure 3 A) [25,33,34,35,36,37]. At this time, double hexamers of the Mcm2-7 complex are loaded at replication origins [38,39]. This generates a prereplication complex (pre-RC). Origins with an associated pre-RC are considered licensed for replication. Licensed replication origins can then be “fired,” when replication actually initiates at Oris. Origin firing is brought about by multiple phosphorylation events carried out by cyclin E-CDK2 at the onset of S phase and by Cdc7-Dbf4 (DDK kinase or Dbf4-dependent kinase) prior to individual origin firing ( Figure 3 B) [40,41,42,43,44]. At this point, origin melting occurs and DNA unwinding by the helicase generates ssDNA, exposing a template for replication ( Figure 3 C). The replisome then begins to form with the localization of replisome factors such as Cdc45 [45,46]. Interestingly, the assembly of the CMG complex at origins requires the non-catalytic domain of Pol ε, suggesting a codependence between the replicative helicase and polymerase during replisome assembly [47,48]. Further recruitment assembles the remainder of replisome proteins. DNA synthesis begins on the melted template, and the replication machinery translocates away from the origin in a bidirectional manner.

An external file that holds a picture, illustration, etc. Object name is genes-04-00001-g003.jpg

Mcm2-7 loads onto DNA at replication origins during G1 and unwinds DNA ahead of replicative polymerases. (A) The combined activities of Cdc6 and Cdt1 bring MCM complexes to replication origins. (B) CDK/DDK-dependent phosphorylation of pre-RC components leads to replisome assembly and origin firing. Cdc6 and Cdt1 are no longer required and are removed from the nucleus or degraded (C) MCMs and associated proteins (GINS and Cdc45 are shown) unwind DNA to expose template DNA. At this point replisome assembly can be completed and replication initiated. “P” indicates phosphorylation.

To prevent multiple rounds of DNA replication during a single cell cycle, mechanisms have evolved to regulate limiting initiation factors: ORC, Cdt1, and Cdc6 (reviewed in [49]). After origin firing, Orc1, the largest ORC subunit, is ubiquitinated and degraded [50]. Phosphorylation of human Orc2 leads to the dissociation of Orc2-6 from chromatin to further prohibit re-replication, while the yeast complex appears to remain on DNA throughout the cell cycle [51,52,53]. Once phosphorylated at the onset of S phase, Cdc6 is rapidly exported from the nucleus [54]. Geminin is a specific inhibitor of Cdt1. Geminin protein levels increase throughout S phase, blocking Cdt1 from localizing to replication origins [55]. It is also well understood that Cdt1 and Cdc6 undergo rapid proteasome-dependent degradation (reviewed in [56,57]). Thus, duplicating the genome once and only once per cell cycle is controlled at the level of licensing. These multiple and redundant mechanisms prevent origin assembly to restrict replication licensing once per cell cycle. These controls emphasize the importance of preventing overreplication of chromosomal DNA, in order to preserve genomic integrity.

3.2. The DNA Sliding Clamp: PCNA

DNA polymerases require additional factors to support DNA replication in vivo. DNA polymerases have a semiclosed hand structure, which allows them to load onto DNA and translocate. This structure permits DNA polymerase to hold the single-stranded template, incorporate dNTPs at the active site, and release the newly formed double strand. However, the conformation of DNA polymerases does not allow for their stable interaction with the template DNA. To strengthen the interaction between template and polymerase, DNA sliding clamps have evolved, promoting the processivity of replicative polymerases. In eukaryotes, this sliding clamp is a homotrimer known as proliferating cell nuclear antigen (PCNA), which form a ring structure. The PCNA ring has polarity with a surface that interacts with DNA polymerases and tethers them securely to DNA. PCNA-dependent stabilization of DNA polymerases has a significant effect on DNA replication because it enhances polymerase processivity up to 1,000-fold [58,59]. Such stimulation of replication activity makes PCNA an essential cofactor in vivo. PCNA also has the distinction of being one of the most common interaction platforms in the replisome to accommodate multiple processes at the replication fork [60]. Therefore, PCNA is often viewed as a regulatory cofactor for DNA polymerases ( Figure 4 ).

An external file that holds a picture, illustration, etc. Object name is genes-04-00001-g004.jpg

PCNA is a node of protein-protein interaction in the replisome. The PCNA homotrimer interacts with proteins in diverse processes. Many of the protein-protein interactions occur through a PCNA binding motif known as the PIP box, which binds a central domain on PCNA (orange).

Various PCNA modifications regulate the replisome through specific circumstances during DNA replication. The modifications of PCNA have dramatic effects on its function. Although there are some species-specific modifications of PCNA throughout eukaryota, the principles remain conserved. Upon DNA damage, PCNA is monoubiquitinated, which changes PCNA’s affinity from replicative polymerases to the damage-tolerant translesion synthesis (TLS) polymerases [61,62]. PCNA ubiquitination is dependent on the DNA damage checkpoint pathway and regulates dynamic changes in the replication fork. This process allows for bypass of bulky DNA damage that would otherwise prevent replication fork progression, although this method of damage bypass is error prone [62,63]. In contrast, polyubiquitination of the same site directs the cell towards DNA damage bypass by poorly characterized, but essentially error-free mechanisms [64,65,66]. PCNA can also be SUMOylated ( s mall u biquitin-like mo difier) at the same site in yeast, and SUMOylated PCNA exists in vertebrates [67,68]. This modification is thought to suppress ubiquitination of PCNA, therefore inhibiting TLS and other DNA repair pathways, which are potentially harmful to the cell because they can introduce mutations and genome rearrangements [69,70,71]. In addition, acetylation of PCNA may play a role in enhancing the processivity of associated polymerases [72]. These findings indicate that PCNA modifications play critical roles in controlling pathway selection for DNA damage management during DNA replication.

3.3. Clamp Loaders

PCNA fully encircles DNA and must be loaded onto DNA at the replication fork. At the leading strand, this is a relatively infrequent process, because DNA replication is essentially continuous until replication is terminated. However, at the lagging strand, Pol δ is continually loaded at the start of each Okazaki fragment. This constant initiation of Okazaki fragment synthesis requires repeated PCNA loading for efficient DNA replication. PCNA loading is accomplished by the replication factor C (RFC) complex, which is comprised of five AAA + ATPases [73]. RFC recognizes primer-template junctions and loads PCNA at these sites [74,75]. The PCNA homotrimer is opened by RFC using energy from ATP hydrolysis and is then loaded onto DNA in the proper orientation to facilitate its association with the polymerase [76,77]. Clamp loaders can also unload PCNA from DNA, a mechanism that becomes important when replication must be terminated [77].

Eukaryotes have evolved multiple clamp loading complexes, each of which appears to function in a separate pathway. The canonical clamp loader essential for DNA replication is RFC and includes Rfc1, Rfc2, Rfc3, Rfc4 and Rfc5. At least three RFC-like complexes exist in eukaryotic cells. RFC Ctf18 , which contains Ctf18 in place of Rfc1, promotes sister chromatid cohesion and regulates replication speed [78,79,80,81]. RFC Elg1 , which contains Elg1, is thought to unload SUMOylated PCNA in the presence of DNA damage to allow for replication progression through damaged DNA templates [71]. The RFC Rad17/Rad24 clamp does not load PCNA, but loads the 9-1-1 complex at DNA damage sites during the replication checkpoint response [82]. Thus, DNA replication can be regulated at the level of PCNA clamp loading, in order to accommodate multiple processes that take place during DNA replication ( Figure 4 ).

3.4. Okazaki Fragment Maturation

DNA replication on the lagging strand is discontinuous. At the end of Okazaki fragment synthesis, Pol δ runs into the previous Okazaki fragment and displaces its 5' end containing the RNA primer and a small segment of DNA. This generates an RNA-DNA single strand flap, which must be cleaved, and the nick between the two Okazaki fragments must be sealed by DNA ligase I. This process is known as Okazaki fragment maturation and can be handled in two ways: one mechanism is designed to process short flaps, while the other deals with long flaps [9]. Pol δ is able to displace up to 2 to 3 nucleotides of DNA or RNA ahead of its polymerization, generating a short “flap” substrate for Fen1, which can remove nucleotides from the flap, about one nucleotide at a time. By repeating cycles of this process, Pol δ and Fen1 can coordinate the removal of RNA primers and leave a DNA nick at the lagging strand [83]. It has been proposed that this iterative process is preferable to the cell because it is tightly regulated and does not generate large flaps that need to be excised [84]. In the event of deregulated Fen1/Pol δ activity, the cell uses an alternative mechanism to generate and process long flaps by using Dna2, which has both helicase and nuclease activities [85]. Dna2 is localized to Okazaki fragments [86,87], and importantly, Dna2 is thought to act on Okazaki fragment ends that are not optimal for Fen1 activity [88]. For example, DNA flaps containing secondary structure and long flaps are both poor Fen1 substrates. The nuclease activity of Dna2 is required for removing these long flaps, leaving a shorter flap to be processed by Fen1. Electron microscopy studies indicate that nucleosome loading on the lagging strand occurs very close to the site of synthesis [89]. Thus, Okazaki fragment maturation is an efficient process that occurs immediately after the nascent DNA is synthesized. These redundant and proficient mechanisms showcase the importance of Okazaki fragment maturation in protecting genomic integrity.

3.5. Linking Helicase and Polymerase Functions

The DNA helicases and polymerases must remain in close contact at the replication fork ( Figure 2 ). If unwinding occurs too far in advance of synthesis, large tracts of ssDNA are exposed. This can activate DNA damage signaling or induce aberrant DNA repair processes. To thwart these problems, the eukaryotic replisome contains specialized proteins that are designed to regulate the helicase activity ahead of the replication fork. These proteins also provide docking sites for physical interaction between helicases and polymerases, thereby ensuring that duplex unwinding is coupled with DNA synthesis.

There is evidence that Cdc45 has such a role. It has been shown that Cdc45 coordinates MCM progression with the replisome [90]. Before DNA replication initiates, S-phase kinase DDK phosphorylates Mcm4, which allows stable Cdc45-MCM complex formation. This is an important step in replisome assembly at origins and is essential for origin firing [91]. Furthermore, during S phase, Cdc45 depletion prevents DNA replication stress signaling through downregulation of active DNA synthesis, reinforcing the notion that Cdc45 is an important regulatory protein, required for DNA replication [92].

During the past decade, a protein complex termed GINS has drawn much attention. The GINS complex is comprised of the Sld5 (go), Psf1 (ichi), Psf2 (ni), and Psf3 (san) proteins and (named for 5, 1, 2, 3 in Japanese) was originally identified in budding yeast and in Xenopus laevis extracts [93,94]. GINS has been shown to regulate the interaction of the MCM subunits with Cdc45 [95]. The high-molecular-weight complex consisting of the MCM, GINS, and Cdc45 is now known as the CMG complex and can be observed in human cells [96,97]. The GINS complex may provide additional regulation of the MCM helicase during replication progression and in response to DNA replication stress. Importantly, the full CMG complex is required for DNA unwinding, demonstrating that the complex of Cdc45-MCM-GINS is the functional DNA helicase machinery in eukaryotic cells ( Figure 2 ) [98,99]. It is also worth noting that the Psf3 subunit is phosphorylated by DNA damage signaling kinases ATM and ATR, suggesting a possible mechanism of MCM regulation upon DNA damage during DNA replication [100].

While the identification of the CMG complex adds to our understanding of the regulation of the MCM helicase activity, how the CMG complex interacts with the bulk of the replisome is still emerging. Recent studies have shown that the Ctf4/And1 protein interacts with both the CMG complex and Pol α [101,102]. Ctf4 is a Pol α accessory factor, which is required for the recruitment of Pol α to replication origins [103]. Indeed, human Ctf4/And1 stimulates Pol α function in vitro, whereas depletion of the protein greatly reduces replication fork progression rates in human cells [101]. Therefore, the interaction of CMG and Ctf4 suggests the critical role of CMG in tethering Pol α to the replication fork. Considering that Pol α is continually required on the lagging strand and that CMG has helicase activity, CMG may couple DNA unwinding with primer synthesis on the lagging strand. Such a coupling mechanism may help to minimize the length of the exposed ssDNA tract at the replication fork, thereby promoting efficient DNA replication and ensuring genomic integrity.

How, then, is leading-strand synthesis coupled with unwinding activity? Mrc1/Claspin appears to be the best candidate for this task. In budding yeast, Mrc1 interacts with Pol ε as well as MCM proteins in unperturbed S phase and under genotoxic stress [104]. The importance of this direct link between the helicase and the leading-strand polymerase is underscored by results in cultured human cells, where Mrc1/Claspin is required for efficient replication fork progression [105]. These results suggest that efficient DNA replication also requires the coupling of helicases and leading-strand synthesis.

Taken together, it appears that the CMG complex has a central role in coupling template unwinding with DNA polymerases. Importantly, CMG interacts with both leading- and lagging-strand polymerases via Mrc1/Claspin and Ctf4/And-1, respectively ( Figure 2 ). This setup also helps coordinate leading- and lagging-strand synthesis, integrating required processes at the replication fork into a centralized replisome. In this model, protein-protein interactions allow for crosstalk among multiple enzymatic processes essential for DNA replication. For instance, leading-strand synthesis can be restrained to closely match the rate of the lagging-strand synthesis. Considering that replication efficiency is reduced when Ctf4/And-1 or Mrc1/Claspin is lost, the coordination of multiple activities plays a critical role in replisome progression.

Additional factors are required for maintaining proper replisome structure and function. One such factor is the fork protection complex (FPC), which is comprised of Timeless and Tipin (Tof1-Csm3 in budding yeast and Swi1-Swi3 in fission yeast). FPC is a replisome component and appears to play a key role in efficient DNA replication (reviewed in [106]). As neither FPC protein appears to have any enzymatic function, it is likely that their role is to maintain or stabilize the replisome during DNA replication. Indeed, in the absence of functional FPC proteins, DNA replication takes longer to complete and results in shorter replication tracts, indicating that the FPC is required for efficient replisome progression [107,108]. In addition, the FPC has been shown to associate with nonreplisome proteins involved in multiple genome maintenance mechanisms, suggesting that the FPC mediates interactions between the replisome and other activities that take place on DNA during replication (reviewed in [106]). Given that loss of Claspin, Ctf4/And-1, and the FPC leads to inefficient replication, maintaining proper protein-protein interactions in the replisome is vital to coordinating the variety of processes occurring at the replication fork during DNA replication.

3.6. Replication Checkpoint Proteins

In order to preserve genetic information every time the cell divides, DNA replication must be completed with high fidelity. To achieve this task, eukaryotic cells are equipped with genome surveillance systems that detect errors or problems during DNA replication. Proteins required for this mechanism, termed the DNA replication checkpoint, have several mechanisms to preserve genomic integrity. To ensure that DNA is faithfully repaired and replicated, checkpoint proteins function to arrest the cell cycle to allow time for DNA repair and prevent cells from entering mitosis. Checkpoint proteins also directly facilitate some DNA repair pathways, while they stabilize replication fork structures to prevent further damage. These mechanisms are essential to avoid passing down mutations or other chromosome aberrations to offspring.

The checkpoint proteins are well conserved throughout the domain Eukaryota. At the central node of the checkpoint networks are two phosphatidyl inositotide-3' kinase (PI3K)-like kinases, ATR and ATM. ATM and ATR share a consensus target phosphorylation sequence, the SQ/TQ motif. However, their targets and their roles in the cells differ substantially. ATR is an essential mammalian gene, whereas ATM is not [109,110]. This is thought to be due to the specificity of each kinase for different types of DNA damage. ATM arrests the cell cycle in response to DNA double strand breaks (DSBs), which are a relatively rare event in the cell. Conversely, ATR and its obligate checkpoint partner, ATR-interacting-protein (ATRIP), are responsive to stretches of ssDNA that are coated by RPA ( Figure 2 ) [111,112]. Importantly, ssDNA accumulation occurs frequently during DNA replication. Long stretches of ssDNA are also present during replication stress. Therefore, ATR-ATRIP is heavily relied upon to arrest the cell cycle and to preserve genome integrity during DNA replication. Indeed, ATR is found on chromatin during S phase in a manner similar to RPA and the checkpoint mediator Claspin [113]. Moreover, studies in budding yeast demonstrated that Mec1 (the ATR homologue) is required for normal DNA synthesis during S phase and that mec1 deficiency causes dramatically longer S phase [114]. These findings reinforce the importance of checkpoint proteins during S phase.

The generation of ssDNA tracts is important in initiating the checkpoint pathways downstream of replication-associated damage. Once sufficiently long, RPA coated ssDNA stretches are able to recruit ATR-ATRIP [112,115]. However, to become fully active, the ATR kinase relies on sensor proteins that “sense” whether the checkpoint proteins are localized to a valid site of DNA replication stress. The 9-1-1 (Rad9-Hus1-Rad1) heterotrimeric clamp and its clamp loader RFC Rad17 recognize the damaged DNA. It has been demonstrated in vitro that RFC Rad17 recognizes gapped or nicked DNA and loads the 9-1-1 clamp onto DNA [82]. The presence of 9-1-1 on DNA is sufficient to facilitate the interaction between ATR-ATRIP and a group of proteins termed checkpoint mediators, such as TopBP1 and Mrc1/Claspin. TopBP1 interacts with (and recruits) the phosphorylated Rad9 component of 9-1-1 and binds ATR-ATRIP, which phosphorylates Chk1 [116,117,118,119,120]. Furthermore, Mrc1/Claspin is required for the complete activation of ATR-ATRIP that phosphorylates Chk1, the major downstream checkpoint effector kinase [121,122,123]. Claspin is a component of the replisome and contains a domain for docking with Chk1, revealing a specific function of Claspin during DNA replication: the promotion of checkpoint signaling at the replisome ( Figure 2 ) [124].

Chk1 signaling is vital for arresting the cell cycle and preventing cells from entering mitosis with incomplete DNA replication or DNA damage. In metazoans, active Chk1 induces a robust G₂/M checkpoint by phosphorylating Cdc25A protein, a phosphatase that removes inhibitory phosphorylations on Cdk1 and Cdk2. Once phosphorylated, Cdc25A is targeted for proteasome-dependent degradation by the ubiquitin ligase complex containing the F-box protein β-TrCP [125]. Thus, Chk1 maintains inhibitory phosphorylation of metazoan Cdk1 and Cdk2 to block mitotic progression and origin firing, respectively [126,127,128]. The Chk1-dependent Cdk inhibition may constitute the single most important function of the ATR-Chk1 checkpoint, to arrest the cell cycle and allow sufficient time for completion of DNA repair mechanisms, which in turn prevents the inheritance of damaged DNA. In addition, Chk1-dependent Cdk inhibition plays a critical role in inhibiting origin firing during S phase. This mechanism prevents continued DNA synthesis and is required for the protection of the genome in the presence of replication stress and potential genotoxic conditions [129,130,131]. Thus, ATR-Chk1 activity further prevents potential replication problems at the level of single replication origins by inhibiting initiation of replication throughout the genome, until the signaling cascade maintaining cell-cycle arrest is turned off.

3.7. Replication through Nucleosomes

Eukaryotic genomes are substantially more complicated than the smaller and unadorned prokaryotic genomes. Eukaryotic cells have multiple noncontiguous DNA components, chromosomes, each of which must be compacted to allow packaging within the confined space of a nucleus. Chromosomes are packaged by wrapping ~147 nucleotides (at intervals averaging 200 nucleotides) around an octamer of histone proteins, forming the nucleosome [132,133,134,135]. The histone octamer includes two copies each of histone H2A, H2B, H3, and H4. Histone proteins are subject to a variety of modifications, including phosphorylation, acetylation, methylation, and ubiquitination that represent vital epigenetic marks. The tight association of histone proteins with DNA in nucleosomes suggests that eukaryotic cells possess proteins that are designed to remodel histones ahead of the replication fork, in order to allow smooth progression of the replisome. It is also essential to reassemble histones behind the fork to reestablish the nucleosome confirmation. Furthermore, it is important to transmit the epigenetic information found on the parental nucleosomes to the daughter nucleosomes, in order to preserve the same chromatin state. In other words, the same histone modifications should be present on the daughter nucleosomes as were on the parental nucleosomes. This must all be done while doubling the amount of chromatin, which requires incorporation of newly synthesized histone proteins. This process is accomplished by “histone chaperones” and “histone remodelers,” which are discussed below ( Figure 5 ).

An external file that holds a picture, illustration, etc. Object name is genes-04-00001-g005.jpg

Nucleosome displacement and deposition during DNA replication. Histones are removed from chromatin ahead of the replication fork. FACT may facilitate this process. Asf1 recruits histone H3-H4 dimers to the replication fork. CAF-1 and Rtt106 load newly synthesized (light purple) histones to establish chromatin behind the fork. Previously loaded histones (dark purple) are also deposited on both daughter DNA strands. The histone chaperones involved in these processes are associated with replisome proteins: CAF-1/Rtt106 with PCNA and FACT/Asf1 with MCMs.

Several histone chaperones are known to be involved in replication-coupled nucleosome assembly. The FACT complex components were originally identified as proteins that greatly stimulate transcription by RNA polymerase II (reviewed in [136]). In budding yeast, FACT was found to interact with DNA Pol α-primase complex, and the FACT subunits were found to interact genetically with replication factors [137,138,139]. More recently, studies showed that FACT facilitates DNA replication in vivo and is associated with the replisome in budding yeast and human cells [95,140,141,142]. The FACT complex is a heterodimer that does not hydrolyze ATP, but facilitates the “loosening” of histones in nucleosomes [143]. How the FACT complex relieves the tight association of histones for DNA in nucleosomes is unanswered. Further, the mechanisms and factors facilitating nucleosome removal during DNA replication remain unknown.

Another histone chaperone associating with the replisome is Asf1, which interacts with MCM in a manner dependent on histone H3-H4 dimers [144]. It is hypothesized that Asf1 can pass newly synthesized H3-H4 dimers to deposition factors behind the replication fork [145,146,147]. This activity makes histone H3-H4 dimers available at the site of histone deposition just after replication ( Figure 5 ). Another highly related problem for the replication fork in the context of chromatin is the deposition of histones on nascent DNA. Chromatin formation proteins deposit histones onto both newly replicated DNA strands to form chromatin [148]. The heterotrimeric chaperone chromatin assembly factor 1 (CAF-1) is involved in this mechanism [149,150]. CAF-1 contains a PCNA-binding motif called PIP-box, allowing CAF-1 to associate with the replisome through PCNA and to deposit histone H3-H4 dimers onto newly synthesized DNA [151,152] ( Figure 5 ). The Rtt106 chaperone also participates in this process, associating with CAF-1 and H3/H4 dimers during chromatin formation [153,154]. These processes load newly synthesized histones onto DNA. How the existing chromatin marks are transferred from parental to daughter strands is not characterized. Daughter strands contain H3-H4 from parental DNA, but the mechanism of histone transfer is a continuing field of study [155].

At this point, nucleosomes form by the association of histone H2A/H2B with the deposited histone H3-H4. This process may occur through the FACT complex, since it is associated with the replisome and binds free H2A-H2B; or possibly another histone H2A-H2B chaperone, Nap1 [156]. Electron microscopy studies show that this occurs very quickly, as nucleosomes can be observed forming just a few hundred base pairs after the replication fork [157]. Therefore, the entire process of forming new nucleosomes takes place just after replication due to the coupling of histone chaperones to the replisome.

4. Challenges to Replication

The varied chromatin landscape within the eukaryotic genome requires a “toolkit” of proteins that facilitate replisome progression through a variety of genomic regions that are difficult to replicate. Defects in this mechanism result in replication fork collapse and lead to genomic instability. In this section, analysis will focus on a subset of challenging genomic regions for DNA replication ( Figure 6 ). It has become apparent that specialized mechanisms have evolved for the replisome to replicate DNA through these regions.

An external file that holds a picture, illustration, etc. Object name is genes-04-00001-g006.jpg

Replication Fork Barriers (RFBs) control DNA progression to protect genomic integrity. RFBs allow for the coordination of DNA replication with important processes on chromatin. (A) In budding yeast, Fob1 blocks replication fork progression in the opposing direction to 35S ribosome transcription. (B) In fission yeast, collisions between the transcription machinery and the replication fork are prevented by Reb1 and Sap1, which bind to the RFBs and block replisome progression near the 3' end of the 35S ribosome genes. (C) At the mating-type (mat1) locus in fission yeast, Rtf1 bound to the RTS1 site prevents replication fork progression at the 5' side of mat1. Only the fork in the 3' direction can progress through the region. This fork pauses at MPS1 and generates an imprint, probably a DNA strand discontinuity, which is required for a specific recombination event that allows for mating-type switching.

4.1. Replication Fork Barriers

In prokaryotes, such as the Escherichia coli bacterium, bidirectional replication initiates at a single replication origin on the circular chromosome and terminates at a site approximately opposed from the origin [158]. This replication terminator region contains DNA sequences known as Ter sites, polar replication terminators that are bound by the Tus protein. The Ter-Tus complex counteracts helicase activity, resulting in replication termination [159]. In this way, prokaryotic replication forks pause and terminate in a predictable manner during each round of DNA replication.

In eukaryotes, the situation differs. Replication termination typically occurs by the collision of two replication forks anywhere between two active replication origins. The location of the collision can vary based on the replication rate of each of the forks and the timing of origin firing. Often, if a replication fork is stalled or collapsed at a specific site, replication of the site can be rescued when a replisome traveling in the opposite direction completes copying the region. However, there are numerous programmed replication fork barriers (RFBs) and replication “challenges” throughout the genome. To efficiently terminate or pause replication forks, some fork barriers are bound by RFB proteins in a manner analogous to E. coli Tus [158]. In these circumstances, the replisome and the RFB proteins must specifically interact to stop replication fork progression.

4.2. Fork Barriers at rDNA Arrays in Eukaryotic Cells

Certain RFB proteins function to prevent collisions between replication forks and the RNA transcription machinery. One of the best characterized RFBs is situated near the 3' end of the 35S ribosomal RNA gene (rDNA) in budding yeast [160]. Budding yeast chromosome XII contains an array of approximately 150 rDNA repeats, which are highly transcribed in order to produce the ribosomes necessary for translation. Therefore, transcription forks are exceedingly common in rDNA regions, which can lead to head-on collisions between replication and transcription forks, possibly leading to collapse of one or both forks. To prevent these collisions, replisome progression opposing the direction of ribosome transcription is inhibited at rDNA repeats, thus allowing for replication and transcription to proceed in the same direction through the rDNA loci [161]. How is this polar replication block achieved?

In budding yeast, the Fob1 protein is responsible for the rDNA barrier ( Figure 6 A). Fob1 binds RFBs located at the 3' non-transcribed region of each rDNA gene [162]. Fob1 operates by looping or wrapping the DNA around itself to dramatically alter the local chromatin state and block replication from the 3' direction [163]. This Fob1-dependent chromatin structural change not only causes polar replication fork pausing but also leads to increased recombination at the RFB region [164]. Additional factors such as Tof1 and Csm3, components of the FPC are required to ensure stable fork pausing at rDNA loci. Tof1 and Csm3 appear to work together with Fob1 to counteract Rrm3, a DNA helicase required for replication through the rDNA RFB regions. It is proposed that Rrm3 displaces Fob1 to facilitate fork progression [165]. In addition, Rrm3 is thought to be required for replication restart in other genomic loci requiring displacement of nonhistone proteins from DNA [166,167,168,169,170]. Interestingly, Rrm3 translocates with the replication fork and physically interacts with PCNA [171,172]. Therefore, similar protein-protein interactions between RFB proteins and Rrm3 may allow for polar passage of replication forks through multiple fork block sites throughout the genome.

Although Fob1 is a budding yeast-specific protein, similar 3' specific replication termination occurs in the rDNA array in fission yeast and mammalian cells. The fission yeast rDNA unit has four RFBs (Ter1-4), three of which are known to use RFB proteins and the FPC to achieve polar fork arrest ( Figure 6 B). The DNA binding protein Sap1, which also recognizes the mating-type locus, causes polar fork arrest at Ter1, while fork arrest at Ter2 and Ter3 is regulated by another RFB protein, Reb1, which is homologous to mammalian transcription terminator factor 1 (TTF-1) [173,174,175,176]. Human TTF-1 also localizes to the rDNA RFB. TTF-1 performs “double duty” at the RFB: TTF-1 promotes polar replication fork arrest and also mediates transcription termination at the rDNA, preventing rDNA transcription machinery from running past the RFB [177,178]. Interestingly, fission yeast Reb1 and mammalian TTF-1 share the Myb-like DNA-binding domain, which is critical for DNA binding and protein function [179]. These proteins act to specifically inhibit polar replication progression, coordinating DNA replication with transcription to minimize fork collisions and genome instability.

4.3. Replication Termination at the Fission Yeast Mating-Type Locus

In addition to site-specific fork pausing required to prevent collision between replication and transcription machinery, fork pausing also allows for programmed cellular events that are coordinated with replication of specific genomic loci. For example, the fission yeast genome contains two genetically programmed fork pausing sites near the mating-type (mat1) locus: the mat1 pausing sites 1 (MPS1) and replication termination site 1 (RTS1). A strong polar fork arrest at RTS1 blocks one replication fork moving into the mat1 locus, allowing only the opposing fork to migrate into the mat1 locus. This fork pauses at MPS1, generating an imprint that initiates a replication-coupled recombination event, leading to mating-type switching ( Figure 6 C) [180,181,182,183,184,185].

The RTS1-dependent fork arrest is DNA sequence-specific, because inserting the RTS1 sequence into other genomic loci in Schizosaccharomyces pombe also results in polar fork arrest and increased recombination at those sites [186,187]. The RTS1 DNA sequence was used to identify a site-specific replication terminator factor, Rtf1. Rtf1 contains an Myb-like DNA-binding domain and directly binds DNA to block replisome progression. Rtf1 mutations disrupt polar fork arrest, indicating that Rtf1 is required for unidirectional replication termination at the RTS1 loci [188,189]. Interestingly, efficient pausing at RTS1 also requires the presence of specific replisome proteins. The Pol α gene (swi7) in S. pombe is required to initiate recombination at the mat1 locus. Additionally, the swi1 and swi3 genes, encoding the fission yeast FPC subunits, are required for both MPS1-pausing and RTS1-fork arrest [190,191]. The requirement for both replication termination factors on the DNA and replisome components suggests that these proteins may interact to regulate replication pausing or termination at the mat1 locus. Together, these RFB sites provide insights into the role of chromatin-bound proteins in regulating replisome progression at specific loci.

4.4. Replication Slow Zones

Examining DNA replication on a global level, it appears that some loci are more sensitive to DNA replication stress than others. Analysis of chromosomal DNA using two-dimensional agarose gel electrophoresis, showed that checkpoint mutants accumulate replication forks at specific genomic regions [114]. This led to the suggestion that certain genomic regions take longer to replicate because they are more difficult to replicate when compared with the rest of the genome. The replication fork-enriched loci were coined replication slow zones (RSZs). DNA replication through RSZs requires the checkpoint protein Mec1 (the budding yeast ATR homologue), implying that checkpoint pathways play an important role during replication of RSZs [114]. This is consistent with the observation that functional checkpoint proteins are generally required to replicate DNA during stress [192]. Early hypotheses proposed that RSZs were advantageous for general replication, allowing for the replisome to coordinate with other processes (e.g., slowing replication to enhance chromatid pairing or nucleosome loading) [114]. Recent work has suggested that mec1 mutant cells fail to replicate through RSZs due to a partial inability to upregulate dNTP levels (relieving inhibition of ribonucleotide reductase II by Sml1). This paucity of dNTPs leads to replication fork collapse in RSZs, because replication of RSZs requires higher local levels of dNTPs [193,194]. Therefore, the primary function of Mec1 may be to act as a sensor at the replication fork, upregulating nucleotide synthesis when there is a demand for increased dNTP levels during DNA replication. Considering that the checkpoint is required even in the absence of exogenous replication stressors, something inherent to RSZs causes replication stress.

As described above, DNA replication through these challenging regions requires checkpoint proteins in budding yeast. This is consistent with replication in the presence of exogenous genotoxic agents such as methyl methanesulphonate (MMS: an alkylating agent, which arrests forks) or hydroxyurea (HU: a ribonucleotide reductase II inhibitor, which depletes cellular dNTP pools and arrests forks). In the absence of Rad53 checkpoint kinase, budding yeast cells accumulate unusual replication fork structures in response to MMS or HU; thus cells are unable to complete replication [195,196]. In contrast, wild-type cells are eventually able to complete replication in the presence of genotoxic agents, albeit slowly and with continued activation of repair and checkpoint pathways.

In MMS-treated cells, DNA replication stress is induced globally, and replication is generally slowed by the checkpoints. In the RSZ model, the stress is induced by replication of the specific genomic region. The replisome encounters difficulties at these regions, resulting in fork arrest and generation of DNA structures that are recognized by checkpoint proteins. Whether the source of replication stress is exogenous or endogenous, checkpoint proteins recognize stalled forks, upregulate nucleotide synthesis, activate repair pathways, and arrest the cell cycle. By associating with the replisome, checkpoint kinases are well positioned to send signals to cope with fork arrest in response to a multitude of replication stressors, including endogenous obstacles on DNA sequences and exogenous genotoxic agents ( Figure 7 ).