Phyton-International Journal of Experimental Botany |
DOI: 10.32604/phyton.2021.014967
REVIEW
Non-Canonical Functions of the E2F/DP Pathway with Emphasis in Plants
1Facultad de Química, Departamento de Bioquímica, Universidad Nacional Autónoma de México, Avenida Universidad y Copilco, Ciudad de México, 04510, México
2Plant Development and (Epi) Genetics Group, Swammerdam Institute for Life Sciences, Universiteit van Amsterdam, Amsterdam, 1098XH, Netherlands
*Corresponding Author: Jorge M. Vázquez-Ramos. Email: jorman@unam.mx
Received: 12 November 2020; Accepted: 19 January 2021
Abstract: The E2F/DP pathway is a widely conserved regulatory mechanism in pluricellular organisms. The family of E2F and DP transcription factors was originally described having a role in the transition from the G1 to the S phase of the cell cycle. However, the discovery of hundreds of possible gene targets and their involvement in many other biochemical processes, soon showed that they participated in cell development and differentiation, chromatin remodeling, DNA repair and others. The E2F/DP transcription factors can act as either activators or repressors of transcription depending on their association to other regulatory proteins, particularly the retinoblastoma protein, or even depending to their protein structure that can define their role. In plants the E2F/DP pathway also regulates endoreduplication, a process present along the life cycle, from organ elongation to root differentiation and reproduction. These transcription factors also help plant cells to respond to environmental disturbances such as those caused by different types of radiation, or by pathogens. This review focuses on the “so called” non-canonical functions of the E2F/DP family proteins in animal and plant cells, that are in fact essential activities that connect regulatory circuits among multiple metabolic pathways by means of their atypical functions.
Keywords: E2F; DP; transcription factors; atypical functions; plant cells
Cell theory establishes that all biological organisms are composed of cells, the fundamental units of life, and postulates that all form of life originates from pre-existent life, i.e., every cell comes from another cell. Pluricellular organisms are composed of a great variety of cell types arising from a unique cell, or zygote, and thus have the same genetic information. Therefore, the diversity of cells in the human body is the result of a differential expression of the genome in the different cell lineages, expression that must be finely coordinated with cell proliferation [1]. The cell cycle is the process that will provide the new cells that will enter the distinct developmental programs for a cell to perform a determined role within an organism.
The E2F protein family is a group of transcription factors that have been widely studied, particularly for their role in the G1/S transition of the cell cycle. This review focuses on the non-canonical functions of the E2F family members, both in mammals and plants, highlighting the nodes in which these proteins connect regulatory circuits among multiple metabolic pathways by means of their atypical functions.
2 A Brief History of the E2F Family
Three decades have passed since the first reports on the E2F family of proteins. The presence of a nuclear factor in adenovirus infected HeLa cells was able to stimulate the early expression of the E2 viral gene, and thus was named E2 PROMOTER BINDING FACTOR (E2F) [2]. In parallel, Lee and Green (1987) identified a cell factor involved in the expression of diverse adenovirus genes, like E1A, E2A, E3 and E4 and was named E4F1 (E4 FACTOR 1) [3]. Later studies concluded that both groups characterized the same factor and that this was also present in non-infected cells, although in low proportion [4,5]. Two duplicated elements in cis were found in the promoter of the E2 gene that were necessary for its expression, and this depended on E2F [2,6]. Years later, it was found that the E2F factor was formed by two proteins. One of them, E2F1, was responsible for both gene transactivation and interaction with the hypophosphorylated form of the retinoblastoma (pRb) protein, whose function will be mentioned below; the second protein, DP (DRTF1/E2F-POLYPEPTIDE or DIMERIZATION PARTNER), was able to interact with E2F sites in the promoter of the E2 viral gene [7]. The DP human gene was cloned and the interaction of DP-E2F proteins was shown to potentiate both the transcriptional activity and the binding to pRb, demonstrating that the activity of this transcription factor depends on the cooperation of the E2F and DP subunits [8].
Initially, studies were directed to understand the influence of E2F-DP in the expression of genes required for entry and progression of the S phase of the cell cycle. However, most recent studies have shown that cell cycle regulation is not the only function as E2F-DP participates in multiple biological processes as mitosis, response to and repair of damaged DNA, differentiation, development and apoptosis [9–13].
3 The E2F/DP Family in Mammals
During the G1/S phase transition, the E2F-DP activity is regulated by proteins of the pocket family, integrated in mammals by the pRb, p107 and p130 proteins, that are efficient repressors of the E2F transcription factor (TF). In response to mitogenic stimuli, there is an increase in the kinase activity of complexes formed by the proteins named Cyclins and Cyclin Dependent Kinases (Cyc-CDK complexes) [12]. Pocket proteins are among the targets of these kinases that, when hyperphosphorylated, suffer structural changes, loosing affinity and releasing the E2F TF [14,15], thereby allowing activation of essential genes for the establishment and advance of the S phase of the cell cycle. Within the E2F family there are members related to either activation or repression of gene expression. In humans, E2F1, E2F2 and E2F3a have been characterized as activators, while E2F3b, E2F4, E2F5 and E2F6 are related to transcriptional repression and all of them form complexes with DP, a feature that identifies all of them as Typical E2Fs. A special type of E2Fs with transcriptional repression properties was described in Arabidopsis thaliana [16,17] and later homologues were found in animals. These E2Fs were named E2F7 and E2F8, and differently from the other members of the family, did not associate with DP, instead binding DNA as monomers, and were considered as Atypical E2Fs [18,19].
The typical E2F1-6 proteins share a similar domain organization. In the amino end they all possess a nuclear localization signal, in the central region a DNA binding domain and a DP heterodimerization domain in which a leucine zipper and a region called mark box are localized. The transactivation domain is found in the carboxyl end, where the capacity of gene expression regulation is found; the motif for interaction with the pocket proteins is embedded within this domain (DYXXXLXXXEGIXDLFD, where X represents any amino acid) [15,20]. The DP subunit of the complex consists of a nuclear localization signal in the amino end, in the central region a DNA binding domain and an E2F heterodimerization domain in the carboxyl end (Fig. 1). Atypical members E2F7 and E2F8 contain two DNA binding domains in their structure that confer a structural folding that allow them to contact the E2F motif in the DNA with high affinity, without the need of a DP subunit [21].
3.1 Opposing Interests within the E2F Family
As stated above, there are activators and repressors within the E2F family. E2F1, E2F2 and E2F3a induce proliferation in quiescent cells when overexpressed, while E2F3b, E2F4, E2F5, E2F6, E2F7 and E2F8 promote cell cycle arrest in serum-stimulated cell cultures [12] (Fig. 2A). However, when every member is analyzed individually, highly specialized and contrasting functions can be observed. As stated, E2F1-3a activates genes necessary for the S phase. However, high levels of E2F1 also promote apoptosis through two mechanisms. On one side, E2F1 induces the expression of pro-apoptotic genes as p14Arf (Arf) and caspases [22–25]; on the other side, it represses the expression of the anti-apoptotic gene Mcl-1 by binding to its promoter. In contrast, over-expression of the E2F4 repressor does not affect Mcl-1 [26], indicating target selectivity within the E2F family and opposing functions for the same protein (Fig. 2B). For E2F3 there are two isoforms coming from the same locus, E2F3a and E2F3b. The main difference between these isoforms is in the amino end since they are the product of the alternative use of promoters making the first exon different [27,28]. The E2f3 gene null mice dies soon after birth and cells with an interrupted E2f3 gene show proliferation defects due to a decrease in the activation of G1/S genes. The loss of E2F3b has no phenotype, whereas specifically silencing E2F3a causes a slight negative effect in cell proliferation, indicating that the two isoforms are partially redundant. However, silencing E2f3a in combination with E2f1 mutation causes severe proliferation defects, neonatal lethality, and defects in cartilage development, but not silencing of E2F3b, indicating a differential role for both isoforms [29–31].
The molecular mechanism by which E2F proteins repress gene expression is not known. However, they may function as scaffolding to recruit pocket proteins or associated co-repressors. Some of these co-repressors modify chromatin structure to silence transcription, as is the case of histone deacetylases (HDAC1 and HDAC2), histone demethylases (RBP2), DNA methyltransferases (DNMT1), nucleosome remodelers of the SWI/SNF type (Brg1, Brm), histone methyltransferases (Suv39, Suv4) and chromatin binding proteins like HP1 [32].
The predominant form of E2F3 is isoform b, and it has been considered a repressor since most of the time during the cell cycle is associated to pRb. Specific targets of the E2F3b-pRb complex are genes related to exit to cell differentiation and pro-apoptotic genes like Ink4/Arf, specifically the isoform p19Arf (murine homologue of Arf), that activates the p53-p21Cip1 pathway [33]. In this way, the repressive function of E2F3b allows cell cycle advance under normal conditions [34,35].
Similarly, E2F4 in complex with pocket proteins (mainly p130) represses the expression of G1/S genes during G0 (i.e., quiescence, senescence and differentiated states) and early G1, avoiding premature entrance into the cycle [36]. This TF possesses two nuclear export signals (NES) that prevent it from a stable permanence within the nucleus, and its residence in the nucleus, and repressive activity, depend on its association with pocket proteins [37]. E2F4, as other typical E2Fs, contains a transactivation domain; in vitro assays have shown that this protein can activate the expression of reporter genes, although weakly, and its overexpression does not allow entry into the S phase in quiescent cells. However, when co-expressed with DP1, this TF shows a higher capacity to stimulate cell cycle progression [38], but differently to E2F1, E2F4-DP1 overexpression (E2F4-DP1OE) does not induce neither the Arf expression nor the apoptosis process [39,40]. In contrast, in highly proliferating tissues as stem cells or tissue generating cells, E2F4 seems to substitute E2F1 as a primary activator, since it is the form predominantly expressed and it is localized in the cell nucleus, while E2F1 shows a more diffused expression pattern [41,42]. Loss of E2F4 reduces the rate of cell division in these tissues; nonetheless, it is not known if in this context, the E2F targets are cell cycle genes. In cancer models, E2F4 acts as an oncogene, probably through its non-canonical role as an activator and not as a repressor. Consistently, E2F4-DP1OE induces skin tumors in mice [43]. On the other hand, its function as repressor could also be related to cancer progression, since E2F4 increases the survival of transformed cells, repressing the expression of pro-apoptotic genes as E2f1 and Apaf-1 [44]. In addition, E2F4 in complex with p130 could promote genomic instability repressing genes essential for DNA damage repair by homologous recombination (HR) like BRCA1 and RAD51, increasing tumor cell survival [45,46]. These data indicate that E2F4 function, besides its interaction with pocket proteins, responds to other factors that depend on the cell context/ tissue in which it is expressed. Consistently, E2F4 overexpression, besides reducing the expression of certain genes, induces the expression of many others, indicating a dual function in gene expression regulation [47] (Fig. 2B).
E2F5, similarly to E2F4, associates to p107 and p130 to repress cell cycle genes transcription. Overexpression of E2F5 (E2F5OE) arrests cell cycle and can promote differentiation; however, the mechanism is still unknown [48–50]. The E2F pathway regulates not only adenovirus infection but also infection by other viruses as the human papillomavirus (HPV) [51]. The variety 18 (HPV18) is one of the more aggressive with a high probability of progression into carcinogenesis [52], and the high transformation of cells after infection resides in two viral proteins, E6 and E7, encoded in the same viral gene, and are a product of alternative splicing [53]. E6 protein stimulates p53 degradation via proteasome, while E7 dissociates pRb from E2F, thereby stimulating the cell cycle progression [54]. E6/E7 gene promoter is regulated by E2F through E2F sites bound by E2F2 and E2F5 proteins. Silencing of E2f1, E2f2 and E2f3 genes (individual or multiple) does not alter significantly the E6/E7 expression, but silencing E2f5 highly reduces expression [55]. Interestingly, this atypical E2F5 function as gene activator has only been described for cells infected by HPV18 and specifically on the E6/E7 promoter. The high rate of carcinogenesis after HPV18 infection could be because the high expression of E6/E7 (through E2F5) that produces a positive feedback loop, in which on one side stimulates the expression of the S phase genes by sequestering pRb and on the other side, halts the activation of apoptosis via p53 [55] (Fig. 2B).
Structure and function of the E2F6 protein differs from the other typical members of the E2F family due to the lack of the transactivation domain and of the pocket protein interaction motif, besides being an active repressor of transcription [56–58]. E2F6 overexpression (E2F6OE) differs from the activator E2Fs, as it induces no cell proliferation; on the contrary, the cell cycle is arrested independently of the association to pocket proteins. E2F6 is able to repress E2F target genes even under conditions in which E2F1-DP1 is overexpressed, reverting the high proliferation phenotype associated to this TF. Interestingly, E2F6OE cells are not arrested in the G1/S but in the S phase, indicating a possible role in the S phase exit, the opposite function to that of other members of the family [56] (Fig. 2A).
The main activity of this complex is the displacement of nucleosomes in the DNA to regulate transcription. Also, E2F6 is part of the POLYCOMB REPRESSIVE COMPLEX 1 (PRC1), and it associates to the core subunits of the MBI-1/Pcgf4 and RING1B E3 ubiquitin ligase. Another type of PcG complex related to E2F6 is E2F6PRC2, containing the catalytic subunit of PRC2, ENHANCER OF ZESTE 2 (EZH2), and the EPC1 protein, the anchoring protein. PRC2 trimethylates lysine 27 of histone H3 (H3K27me3), a mark related to transcription repression. The transcription repression activity of E2F6 has been linked to the function of the polycomb repressor complex (PcG), that is associated to chromatin structure. A possible mechanism to explain the repression of E2F1 target genes is through the interaction with the BRG1 subunit of the SWI/SNF remodeling complex [59]. During G1/S both E2F6 and BRG1 associate to promoters of E2F1 target genes, as E2f1 and PCNA [60]. PRC1 monoubiquitinates lysine 119 of histone H2A (H2AK119ub1), a chromatin mark related to gene expression silencing [61,62]. During G0, E2F6PRC2 associates to promoters of cell cycle genes and when cells are serum-stimulated to enter the cycle, the E2F6PRC2 complex becomes undetectable [63]. E2f6 null mice are viable and apparently healthy; however, they show homeotic transformations in the axial skeleton, similar to the phenotype of mice lacking the PcG components, indicating that this gene is part of the same genetic pathway [64].
The polycomb group proteins cannot bind DNA directly and so other TF could perform this function. This view is compatible with evidence that E2F6 can be localized in the same loci than other PcG proteins, both in G0 and in advanced stages of development and pattern formation, perhaps acting as a protein scaffolding to direct the PcG activity to specific loci [65,66].
The other two members of the E2F family, E2F7 and E2F8, known as atypical, are transcriptional repressors independent of pRb since they do not possess the interaction motif to pocket proteins. Overexpression of these TF represses the expression of cell cycle genes promoting its arrest [11,18,67]. Under normal conditions, expression of the E2f7 and E2f8 genes increases during G1/S via E2F1-3 and after the transition E2F7 and E2F8 proteins accumulate and repress the E2f1 expression, creating a transcriptional wave in G1/S by a negative feedback loop. Mutations in E2f7 or E2f8 do not show evident defects, indicating that they perform redundant functions; deletion of both genes causes an increase in apoptosis. This increase in cell death is in part due to a deregulated activity of E2F1 that, as mentioned before, stimulates the expression of proapoptotic genes. Consistently, E2F1 activity extended to the S phase exit and G2 promotes the activation of the p53 pathway and cell death [68]. Some of the factors that naturally limit E2F1 activity are association to pRb, phosphorylation by CycA/CDK2 in the S phase, ubiquitination and binding of E2F sites by repressor complexes associated to E2F6 [11,69,70]. Anyhow, none of these mechanisms is enough to counter the E2F activity in the absence of E2F7 and E2F8, pointing that the antagonistic role against E2F1 is not the only action they perform [68,71]. Another characteristic inherent to these atypical E2Fs is their capacity to form either homo or heterodimers among them. The predominant dimeric form is E2F7/E2F7, E2F7/E2F8 is the second most representative, and E2F8/E2F8 is the least common. Mice with genotype E2f7+/- E2f8-/- develop normally, while E2f7-/- E2f8+/- mice present multiple defects in postnatal development, indicating that their function is not at all that redundant [68]; the way this happens is not known although it is hypothesized that, since they all recognize the same response element on DNA, could compete for the sites with other E2Fs. Since E2F7 can form homodimers and both E2F7 and E2F8 can heterodimerize with E2F1, all these complexes are able to bind adjacent E2F sites on DNA in a cooperative manner [72]. As with DP [73], E2F1 interacts with E2F7/8 by the DNA binding domain. In vivo, E2F1 binds selectively to one of the two E2F sites in its own promoter, suggesting that during G1/S E2F1 activates its own expression, and when E2F7/8 accumulate, they can compete for the E2F sites forming a stable dimer with E2F1 on DNA displacing it. On the other hand, ChIP-reChIP experiments showed that E2F7/E2F7 can form complexes in the E2F sites with the transcriptional regulator CtBP2, whose silencing reverts the E2F7-dependent repression of the E2f1 gene [72]. This evidence shows that part of the repressive function of atypical E2Fs is related to competition for DNA sites and with chromatin remodeling of target genes via CtBP2 [74].
In human cells, the E2F dimerization partner (DP) subgroup is formed by three members, DP1, DP2/DP3 and TFDP3/DP4. Nomenclature can be confusing because there are multiple alternative splicing forms of DP2, and also, the murine homologue of DP2 in humans is known as DP3 [75]. When the third DP (TFDP3) was reported [76], another report appeared on a protein named DP4 [77], both referring to the same gene. So, from now on, we will refer to DP2/DP3 as DP2, a protein structurally and functionally related to DP1, while TFDP3/DP4 will be DP3, whose function is antagonistic to DP1 and DP2 [76]. DP1 shows the highest levels of expression of the family, followed by DP2 and then DP3; DP1 is considered as the main E2F interactor and its function is redundant with DP2; overexpression of DP1 stimulates entry to the S phase whereas DP3 promotes arrest in G1 [77]. All these proteins heterodimerize with E2F, but only DP1 and DP2 are related to activation of E2F target genes; meanwhile, DP3 inhibits both association to DNA and capacity of E2F activation [76,77]. Despite the low levels of DP3, abundance can be greatly increased after DNA damage by chemical agents that provoke double strand breaks (DSB), and its localization changes to the nucleus; this expression pattern is the opposite to that of DP1, whose levels decrease in response to DNA damage [78]. Overexpression of DP3 causes the displacement of DP1 from the complex with E2F1 substituting it, and so inhibiting the E2F function [76,78]. This unusual role can be seen as an inhibitory mechanism of p53-mediated E2F1 induced apoptosis [79]. However, Komori et al. [39] have reported that expression of the proapoptotic Arf gene is increased when E2F1 activity is deregulated by pRb inhibition as a consequence of the expression of the viral E1A protein. Interestingly, Arf expression or entry into apoptosis are not affected by silencing of the DP members, despite detention of the cell cycle in G1, indicating that E2F1 is acting independently of DP [39]. Evidence points to the interaction of E2F1 to CG repeats similar to the core E2F DNA motif (TTTCGCGCG), without the need of DP, defying the classical model of interaction as a heterodimer [73,80]. Besides Arf, expression of BIM, another proapoptotic gene, is increased when the E2F1 activity is deregulated [81]. Both genes lack canonical E2F sites in their promoter, but they contain CG repetitions, that are bound by this TF [39]. The classical model of E2F-induced apoptosis implies the existence of an E2F activity threshold that, when exceeded, more than inducing proliferation, stimulates entry into cell death. Experimental evidence challenges this model since in the absence of DP, cell cycle gene expression stops, while expression of proapoptotic genes continue. It is important to recall that the proapoptotic activity of E2F1 takes place mainly when the E2F pathway is altered, i.e., under conditions in which the pRb protein cannot repress its activity.
These functions of the atypical E2F proteins could work as a reserve mechanism to contend with unfavorable situations in which genome integrity is at risk, first, arresting the cell cycle and then, in case of major damage, activating processes as programmed cell death to maintain organism homeostasis.
3.2 Small Non-Coding RNAs as New Regulatory Nodes in the Cell Cycle Network
In recent years, the discovery of small non-coding RNAs (or miRNA) involved in cell cycle regulation opened an exciting new field in cell cycle research [82–84]. A prototypical miRNA, after several processing steps, consists of 21 to 24 nucleotide-long RNA; the sequence of a mature miRNA is partially complementary to the 3’ UTR of target mRNA(s) [85]. In cooperation with the RNA-induced silencing complex (RISC), miRNAs can direct either the cleavage of the target mRNA or interfere with the translation process, thus decreasing protein levels of their targets in a post-transcriptional manner [85]. miRNAs are transcribed by RNA pol II/III and they can exist both hosted by other genes (e.g., within introns), and as independent genetic units regulated by their own promoters [85]. miRNA function is crucial for cell cycle control, since they have the potential to decrease the levels of important proteins as oncogenes or tumor suppressors. For example, the CDK repressors p27kip1 and p57kip2 are targeted by miR221/222 and miR-181 [86–88]. Overexpression of the mir-221-222 cluster provokes the ectopic activation of CDK2 and enhanced tumor growth [89]. After mitogenic stimuli for cell cycle reentry, the activation of E2F-dependent transcription takes place; among the hundreds of E2F target genes there are several miRNA such as the mir-17-92 cluster which, paradoxically, target E2F1-3 mRNAs. This leads to a decrease in the activator-E2F levels, forming a negative feedback loop that can fine tune the abundance of E2F1-3 after the S phase establishment [90–92]. Other clusters like mir-15b-16-2, mir-106b-25, let-7a-d, and let-7i are also transcriptionally stimulated by E2F1-3. This miRNA clusters target central regulators of the G1/S transition like D-type cyclins, CDK4, CDK6, CDC25A, among others [93]. In the absence of these miRNAs the augmented E2F activity induces a strong DNA replication activity that results in replicative stress and DNA damage [93]. Altogether, these observations provide an exquisite model of regulation of E2F activity through several positive and negative feedback loops promoted by itself, supported in part by the activity of miRNAs, which in recent years have started to be regarded as an important tool for therapy in diseases such as cancer.
4 The E2F Family and the Green Ancestry
The first reports on the plant E2F/DP family were published 20 years ago. Cloning of maize cDNAs encoding pRb type proteins suggested conservation of the E2F/Rb pathway in plants [94,95]. The first plant E2F was reported in wheat [96,97] and then DP was also reported [98,99]. The discovery of the E2F pathway in plants was followed by studies in multiple species, including A. thaliana [17,100], tobacco [101], carrot [102], rice [103] and algae like Chlamydomonas reinhardtii [104] and Ostreococcus tauri [105], among others.
The E2F family of Arabidopsis has been the most widely studied; six E2F-type proteins and two DP-type proteins have been described. There are three typical E2Fs (i.e., E2Fa, E2Fb and E2Fc) and they have a domain organization similar to the human E2F1-5; it means, a nuclear localization signal, a DNA binding domain, a DP dimerization domain, a leucine zipper and a transactivation domain within which the RBR (Retinoblastoma-Related) motif is embedded, towards the carboxyl end [17,100]. Also, there are three atypical members that in plants are known as DEL (DP-E2F-LIKE), that include DEL1/E2Fe, DEL2/E2Fd and DEL3/E2Ff. Atypical E2Fs, similarly to animal E2F7/8, have two in tandem DNA binding domains, that allow them to bind DNA as monomers and do not contain DP interaction or transactivation domains [16]. DEL proteins are considered as transcriptional repressors [106,107]. Arabidopsis also has two DP-type proteins (DPa and DPb), that interact with E2Fa-c to form functional heterodimers with capacity to bind DNA [17,98]. These DP proteins have a nuclear localization signal, a DNA binding domain and an E2F dimerization domain. Finally, E2Fa and E2Fb present properties as transcriptional activators, while E2Fc works as a repressor [17].
In mammals, the consensus DNA binding sequence both for typical and atypical E2F is TTTSSCGS (S = nucleotides C or CG), regardless the direction with respect to the transcription initiation site [73,108,109]. Ramírez-Parra et al. [110] determined a slightly larger motif in Arabidopsis, TYTCCCGCC (where Y = T or C) [110], with a dinucleotide CG in positions 6 and 7 considered as indispensable for recognition by E2F/DP. Vandepoele et al. [111] reported a similar consensus, with some variations, WTTSSCSS (W = A or T) [111]. Consistent with the similarities observed for plant and animal DNA sequences, DNA binding domains of plant E2F, DEL and DP proteins in several plant species conserve the signature RRXYD (where X represents any amino acid) that allows them to contact the CG-rich core of the E2F motif [73,110,112–114]. The high conservation of the E2F DNA binding domain among eukaryotes is illustrated by its occurrence spanning major clades such as metazoan (616 sp.), viridiplantae (169 sp.) and fungi (17 sp.) when searched within the non-redundant DNA collection of NCBI (unpublished data).
5 Function of the Plant E2F/DP Family
In plants, as in animals, the E2F pathway regulates expression of cell cycle genes, mainly during the G1/S transition [115]. In Arabidopsis, E2Fa and E2Fb stimulate expression of genes related to start and advance of the S phase, while repressors E2Fc and DEL induce cell cycle arrest in G2 or G2/M, probably negatively regulating some of the E2F target genes for G1/S [107,116].
Besides the mitotic cell cycle, there is an alternative cell cycle that has been conserved from fungi to humans. This is known as endoreduplication and its main characteristic is that cells duplicate their genome in the absence of cell division, resulting in polyploid cells [117]. In humans, endoreduplication has been related to terminal differentiation and commonly occurs in highly specialized tissues such as hepatocytes or in the placenta [118]. Unlike animals, endoreduplication in plants is a process present along the life cycle. For instance, it is a strategy for organ elongation in leaves and hypocotyl and as part of the genetic program for root differentiation [119,120]. Polyploidy is also present during the reproductive stage.
5.1 Function of E2F in Endocycles
Flowering plants go through a process of double fertilization by which the embryo (2n) and a triploid endosperm are originated, which are the result of the fusion of the maternal (2n) and the paternal (n) nuclei [121]. Additionally, in some species as maize, the endosperm goes through several rounds of acytokinetic mitoses that form a multinucleated syncytium, followed by cellularization by mitosis [122–124]. Once this is finished, endosperm cells start synthesizing storage compounds and go through multiple rounds of endoreduplication, reaching a ploidy up to 192c, to finally enter the programmed cell death as part of the seed maturation process [122–125]. During this endoreduplicative stage, target genes of the E2F pathway as the subunits of the MCM2-7 replicative helicase, or the DNA polymerase processivity factor PCNA, greatly increase expression compared with the mitotic stage, pointing towards the participation of E2F in this process. On the other hand, silencing of one of the four genes coding for the maize pRb-type repressors, RBR1, the most expressed form, results in an increase in ploidy of endosperm cells, connecting the E2F/RBR pathway with control of ploidy in this developmental stage [125].
During mitosis (M), chromosomes condense and align with the mitotic spindle and then migrate (segregate) towards the poles to finally give way to cytokinesis creating two new daughter cells. Accumulation of the mitotic B-type cyclins (hereafter named as CycB) in the G2/M transition, and the increase in the associated kinase activity, leads to the establishment of the M phase; later, degradation of CycB is necessary for chromosome segregation and exit from M to happen. The anaphase promoting complex (APC) ubiquitinates mitotic cyclins, labelling them for degradation via 26S proteasome to allow M phase advance. The APC complex is composed of 11 subunits in vertebrates and 13 in yeast [126]; Arabidopsis APC has at least 13 subunits in the complex [127], of which the homologs of human CDH1 in Arabidopsis, CELL CYCLE SWITCH52 (CCS52) A and B, activate APC and show a peak of expression during late M and in G2/M, respectively [128].
The E2Fa/RBR1 complex binds to E2F sites in the CCS52A promoter to repress its expression during early G2, allowing CycB accumulation and the subsequent advance to the M phase [129]. Plants overexpressing E2Fa-DPa (E2Fa-DPaOE) show high levels of endoreduplication [130], probably the consequence of arrest in G2 due to the ectopic activation of CCS52A expression and the concomitant degradation of CycB. Additionally, arrest in G2/M could be enhanced by the increased expression of the E2Fc repressor protein in E2Fa-DPaOE plants [111,116]. RBR1 also plays an important role in the establishment of endoreduplication, since e2fa mutant plants, or plants expressing an E2Fa version lacking the RBR interacting motif (E2Fa∆RBR) are unable to repress CCS52A expression, increasing the arrest in G2 and endoreduplication [129].
The atypical E2F proteins also control the G2/M transition through repression of CCS52A expression, since DEL1OE plants show lower levels of ploidy, possibly due to arrest in the G1/S transition, promoting mitosis advance by regulating CCS52A. On the contrary, del1 mutant plants accumulate copies of the genome and ectopically express replication genes [106,107], that could direct entry into the endocycle. DEL proteins have no transactivation domain and therefore, it is a hypothesis that they antagonize the function of E2F-DP competing for the E2F site in target promoters, preventing its occupation by activator E2Fs [107]. However, most recent studies have shown that a subunit of an important expression regulating complex, MEDIATOR (MED), subunit 16 (MED16), genetically and physically interacts with DEL1. The joint activity of DEL1-MED16 stimulates transcriptional repression of CCS52A, and as a result, a decrease in ploidy, although the molecular mechanism has not been elucidated (Fig. 3) [131].
5.2 Function of E2F in Response to Environment
All living things are constantly exposed to agents that can compromise genome integrity. One of these agents, ultraviolet radiation (UV) is part of the solar light that, paradoxically, is the main energy source for photosynthetic organisms. The UV spectrum is divided in UV-C (λ < 280 nm), UV-B (λ 280 nm–315 nm) and UV-A (λ 315 nm–400 nm), of which, fraction UV-C is retained by the ozone layer in the stratosphere, while UV-B (5%–10%) and UV-A (>90%) reach the earth surface [132]. UV-C and UV-B are absorbed by DNA causing genotoxic damage in the form of pyrimidine-cyclobutanes (CPDs) [133], whereas UV-A can produce elevated levels of reactive oxygen species (ROS) when they impinge on biomolecules in cells [134]. The presence of CPDs on DNA can alter both replication and transcription and thus, cell cycle advance [135]. Arabidopsis plant leaves exposed to UV-B stress show a considerable increase in CPDs, besides an increase in ploidy. This damage can be repaired by photoreactivation for which the enzyme photolyase PHR1 is required, and its expression is stimulated by UV-B treatment. Under normal conditions, PHR1 expression is restricted by the association of DEL1 to its promoter, but UV-B decreases the abundance of DEL1 messenger RNA, releasing such repression [136]; since DEL1 is a negative regulator of endoreduplication, when its levels decrease, accumulation of DNA is favored due to the increase in CCS52A expression (Fig. 3).
Other genes in plants that could be influenced via the E2F pathway during DNA damage are those codifying for the three subunits of the chromatin assembly factor (CAF-1). The largest subunit, FASCIATA1 (FAS1), catalyzes the transference of histones H3/H4 to the recently synthesized DNA during replication and also participates in chromatin compaction and in HR [137]. The FAS1 promoter possesses two E2F sites that permit to increase or repress its expression, respectively. Under normal conditions, during G1/S, FAS1 forms part of the genes activated by E2Fa and E2Fb because of the transition to the S phase [130,138] and its expression diminishes in late S phase, maybe in response to the activity of E2Fc or DEL1 [139], allowing the advance of the cycle. Nonetheless, a replicative stress activates the expression of the G2 checkpoint genes and the subsequent cell cycle arrest. In a similar way, loss of the FAS1 gene provokes a delay of the S phase and the constitutive expression of DNA integrity checkpoint genes like RAD51 and BRCA1, possibly due to the increase in acetylation of histone H3 (H3ac) in their promoters, and also produces a systemic increase in ploidy [140]. DSB induction with chemical agents like zeocine, or others that cause replicative stress, partially phenocopy loss of FAS1 [140]. Altogether, these data connect the function of several members of the E2F family with the process of endoreduplication, either as part of the normal development of plants or in response to stress in the DNA [141].
In plants, the functional analog of the tumor suppressor p53 is known as SUPPRESSOR OF GAMMA RESPONSE 1 (SOG1). SOG1 is the TF responsible of integrating the signals arising from the DNA damage checkpoint kinases ATM or ATR when there is DNA damage to generate a transcriptional response [142]. Some of the genes whose expression is increased via SOG1 are related to cell cycle arrest like the CDK inhibitors of the SIAMESSE RELATED (SMR) group and of the KIP RELATED PROTEIN (KRP) group, besides inducing the expression of the CDK repressing kinase WEE1. SOG1 function also promotes an increase in ploidy [141], perhaps by stimulating the expression of WRKY25, a TF that associates to the promoter of DEL1 to repress its expression [143,144]. In parallel, SOG1 function promotes the expression of numerous genes concerned with DNA repair like CYCB1;1, RAD51 or BRCA1 [143].
E2F not only participates in endoreduplication regulation, but also directly acts in response to DNA damage, as evidenced by the transcriptional regulation of BRCA1, and the promotion of localization of RBR-BRCA1 in genomic sites where lesions accumulate [145], signaled by the presence of the phosphorylated form of histone H2A.X (γH2A.X), a marker of damage in DNA. On the contrary, while RAD51 expression can be regulated by E2Fa and RBR under normal conditions, its expression is independent of the E2F pathway when DNA damage is produced. However, the RBR protein, as with BRCA1, forms pools in genomic sites with lesions and co-localizes with RAD51, but in an E2F independent manner, as e2fa mutations illustrate, with an enhanced sensitivity to DNA damage, but unaffected RBR-RAD51 pools [146].
Other proteins not related to RBR can also repress the E2F activity, such is the case of SNI1, one of the eight subunits of the STRUCTURAL MAINTENANCE OF CHROMOSOMES SMC5/6 complex [147,148]. One of the best-known functions of this complex is the resolution of complex structures in DNA during DNA damage repair by HR, and during the blockage of replication forks during the S phase. It has recently been discovered that Arabidopsis SNI1 works similarly to RBR, masking the E2F transactivation domain and also recruiting histone deacetylases to repress expression of its replication target genes, including SNI1 itself [149]. This new function creates a link between the DNA integrity checkpoint and the cell cycle. On the other hand, it is known that pathogen-infected plants trigger the Salicylic Acid (SA) pathway to mediate the immune response, which apart from inducing expression of immunity genes, also promotes an increase in plant DNA damage, in the absence of genotoxic agents; accumulation of DNA damage, in turn, stimulates genes of the HR pathway as ATR and RAD17 [150]. SNI1 gene was discovered when sni1 mutants showed an increased immune response in the absence of stimuli, indicating that it was a repressor of the pathway. It is thought that SA-produced DNA damage contributes to signal and stimulate the systemic resistant pathway, since the resistance phenotype of sni1 plants is reverted by mutation of RAD17 and ATR sensors [150]. Additionally, e2fabc mutant plants are more susceptible to infections and are unable to express some of the immune response genes [151]. DEL1 also participates in the immune response directly regulating the expression of the EDS5 gene, a SA transporter (Fig. 3). Consistently, del1 plants show increased immunity in the absence of a pathogenic challenge [152], recalling the phenotype of SNI1 repressor absence. All this evidence shows a tight connection between the E2F pathway, the response to stress by pathogens and the DNA repair pathway.
In addition to SNI1, another protein of the SMC5/6 complex, SUMO E3 ligase MMS21, negatively regulates E2F activity. There are two mechanisms by which MMS21 regulates the cell cycle. The first consists in promoting dissociation of the E2F-DP complex by competing for DPa; the second is consequence of DP SUMOylation, potentiating heterodimer separation. DPa SUMOylation also decreases TF nuclear localization, provoking alterations in the regulation of its target genes. Therefore, mutant plants lacking MMS21 gene, a mutant showing an increased expression of G1/S E2F target genes, have higher expression of PCNA1, MCM3 and ORC2, implying that this E3 ligase could participate in the regulation of the S phase [153]. MMS21 mutation also affects expression of G2/M genes as CDKB1 and CYCB1 but, differently to G1/S genes, expression of G2/M genes decreases, arresting the cell cycle and causing an increase in ploidy. Overexpression of MMS21 attenuates the polyploid phenotype in E2Fa-DPaOE plants, suggesting an additional participation in the regulation of the cell cycle, independently of mediating the response to DNA damage (Fig. 3) [153,154].
During the cell cycle, the γ-tubulin cytoskeleton has a vital role in the reorganization of cell structures. In animal cells centrosomes are the organelles responsible of nucleating and polarizing microtubules starting from two centrioles that during interphase (probably in G1/S) are duplicated and help to organize cell organelles. During mitosis, these microtubules will form the mitotic spindle, the scaffolding in which chromosomes will anchor to be segregated later towards the opposite poles of the spindle; microtubules will also participate in the separation of the two new cells during cytokinesis. A structural role of γ-tubulin is not the only contribution to the cell cycle; apparently, it can also physically regulate the position of the genome within the nucleus in collaboration with the nuclear envelope and can also associate to transcriptional complexes, although these functions are not well understood [155].
There are no centrosomes in plants, however, microtubules form a structure known as microtubule organizing center (MTOC), whose function is like that of centrioles. In Arabidopsis, besides being part of the cytoskeleton, γ-tubulin associates to E2Fa-c in the nucleus and blocks their activity, but differently to MMS21, tubulin neither dissociates the E2F-DP complex nor blocks the DNA association. The ternary E2F-DP-tubulin complex, associated to chromatin, promotes cell cycle arrest, becoming another type of repressor complex for both G1/S and G2/M, preventing the increase in ploidy. This atypical function of γ-tubulin could coexist with the RBR pathway, since it interacts with the amino end and the DNA binding domain of E2Fa-c [156]. Finally, although in mammals γ-tubulin can interact with activating E2Fs (E2F1-3), ternary complex formation is circumscribed to G1 and G1/S, while in plants it also occurs in S and G2, having a unique role in endocycle regulation [157,158].
5.3 Seeds, Cell Cycle and the E2F Pathway
Seed development is a key innovation in the history of plant evolution that allowed them to colonize predominantly dry continental environments. Seeds can originate by ovule fertilization (sexual reproduction), or by apomixis (asexual reproduction), and are composed of three main structures: i) The zygote (2n), resulting from the fusion of a spermatic nucleus (n) with the arquegonium egg cell (n), ii) The endosperm (3n) coming from the fusion of another spermatic nucleus from pollen with two polar nuclei of the embryo sac central cell; and iii) Testa, a protecting layer of maternal origin [159].
In cereals, the development and growth of a seed is composed of three phases: Phase 1, starting with fertilization and includes cellularization of both the embryo and the endosperm. Phase 2, in which there is a notable increase in mass, attributed to cell elongation and massive accumulation of storage compounds; increase in ploidy of endosperm cells by endoreduplication also occurs in this phase. In Phase 3, or maturation phase, seeds enter a process of drying, loosing up to 80% water and in some species the dormancy process begins [160].
During embryogenesis, disturbing the G1/S CDKA;1 or CycA3 axis, or loss of APC/C components like HOBBIT (CDC27 homolog) slow down the cell cycle affecting the formation of embryo patterns, morphogenesis and tissue specification, like the root meristem [161–163]. Null cdka;1 mutations in Arabidopsis show a reduced number of cells, together with a drastic delay in embryogenesis, while rbr mutants reverse the post-embryonic defects of cdka;1 mutants, indicating the participation of both genes in the control of cell proliferation in the embryo [164]. Maize seeds with low levels of RBR1 show an increase in the expression of E2F/DP target genes, and an increase in the rate of cell division and DNA content in mature endosperm cells [125]. However, reserve material content and seed size are not affected, suggesting a functional redundancy with other members of the RBR pathway or other compensatory mechanisms [165].
In Arabidopsis, the seed maturation stage causes a reduction in nuclear size and a higher chromatin condensation. Nonetheless, there is expression of certain genes like those required for dormancy [166,167]. e2fa and e2fb mutant plants do not show alterations in cellularization during the embryo formation stage, but exhibit premature expression of seed maturation genes, suggesting that these E2Fs could participate in the balance between proliferation and differentiation during seed development performing the atypical function of maturation gene repression [168].
After maize seed maturation, during water imbibition, reactivation of the cell cycle would be necessary for proper germination to occur [169]. In maize, most of the cells in the embryo axis are blocked in the G1 phase of the cycle, and interestingly, they seem to have in store most of the components of the cell cycle machinery, like cyclins, CDKs, CDK inhibitors (ICK/KRP), active cyclin-CDK complexes, PCNA, among many others [170–178]. Recently, Sánchez-Camargo et al. [114] reported the existence of 12 genes of the E2F/DP pathway and similarly to other cell cycle players, mRNAs of 9 out of the 12 genes are present in embryo axes of dry seeds, and their levels fluctuate during germination. Detailed studies show that maize E2Fa/b1;1 can act as a transcriptional activator, while E2Fc does not have this capacity and evidence suggests that it is a transcriptional repressor. In many plant species E2Fc proteins have a truncated RBR interaction domain, whereas activating proteins have an additional motif towards the carboxyl end. Maize E2Fc could have lost this motif during evolution and therefore its transactivating function (Romero-Rodríguez and Sánchez-Camargo, data not published), thus becoming a repressor competing for E2F response elements or recruiting RBR to E2F target sites.
In dry maize seeds and during germination, replication genes like MCM3;1, RPA2 and PCNA1 show open chromatin marks as H3K9/K14ac and H3K4me3. However, their expression levels vary during germination. Immunoprecipitation experiments of chromatin associated E2F in dry seeds, indicate that the repressor E2Fc and RBR1 are bound to S phase promoters and after imbibition, these are replaced by activator E2Fa/b1;1, together with an increase in the expression of target genes [114]. At later germination times, when the first cell cycle round has finished [173], RBR1, E2Fa/b1;1 and E2Fc can be detected bound to chromatin, possibly forming repressor complexes in DNA. In this way, E2Fc-RBR1 could prevent the start of a premature cell cycle, and then RBR could regulate E2Fa/b1;1 function both for reentry to the next cell cycle and for exit to differentiation (Fig. 4) [114]. Evidence accumulated for years tends to indicate that seeds could possess some kind of biochemical memory formed, among other factors, by the Cyc/CDK/KRP signaling pathway, the E2F/DP/RBR factors and the chromatin state in the embryos. The proper communication among these pathways would allow the coordination of seed germination and the first round of cell cycles in the embryo axis, providing the physiological and environmental conditions are propitious.
Understanding cell cycle control has been of utmost interest for a long time since in multicellular organisms the precise balance between cell proliferation and differentiation is determinant for an adequate life cycle. Proof of that is the high incidence of oncogenesis in mammals when the cell cycle homeostasis is disturbed. For similar reasons, knowing the dynamics of the cycle in plants, with a genetic plasticity that causes amazement, can offer new perspectives to generate strategies, both with therapeutic purposes and with the purpose of increasing the productivity of agricultural products. Therefore, understanding the role of the E2F family as the core of transcriptional regulation of the cell cycle is imperative. Both, regulation of the E2F/RB pathway and its function illustrates the enormous complexity inherent to the coordination of proliferation, growth, and development. Likewise, the concept of E2F as a control module of multiple pathways apparently biochemically distant, like endoreduplication, DNA repair, immunity, cell death and even the dynamics of cell structure, is crucial to have an integral vision, that goes from understanding cell proliferation control to ascertain into the evolutionary processes that eukaryotes have gone through to travel from an unicellular form to the constitution of complex multicellular organisms, and how the environment has had an influence.
Funding Statement: This work was supported by Consejo Nacional de Ciencia y Tecnología [Grant No. CB220661, Postgraduate and Mobility grants to V.A.S.C. and SNI III grant to S.R.R.] and Universidad Nacional Autónoma de México [DGAPA-PAPIIT IN215316, PAIP 5000-9124, PAIP 5000-9130 and PAEP-UNAM grant to V.A.S.C.].
Conflicts of Interest: The authors declare that they have no conflicts of interest regarding this study.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |