The complexity of a living organism is not driven by gene number but gene regulation. Controlling which genes to express and to what extent dictates the subsequent cell identity. Transcription, the critical initial stage in gene expression, is regulated delicately to maintain the cell status. Recent developments in the genomic approaches provided unparalleled coverage of the study of transcription. Still, basic molecular biology and biochemistry are providing mechanistic insights into how the regulation is achieved. In this feature “Regulation of transcription: mechanisms and biological functions”, the latest advances in epigenetics, mRNA processing, RNA quality control, and human immunodeficiency virus (HIV) transactivation are discussed.
. Control transcription, control gene expression
A gene has been traditionally viewed as the basic molecular unit of heredity (Crick,
1958;
1970). In the form of DNA or RNA, it carries the raw genetic information that can be turned into functional products, usually proteins. However, the number of genes does not reflect the complexity of the organism. For example, a human has about 20 000 protein-coding genes, which is ∼6000 more than a fruit fly, ∼2000 more than
Caenorhabditis elegans, and ∼14 000 more than budding yeast, but ∼10 000 less than a lab mouse, ∼5000 less than the model plant
Arabidopsis, and ∼17 000 less than rice. Clearly, the level of complexity of the organism is achieved by regulating available genes, not simply by introducing more genes.
The central dogma of gene expression includes two sequential steps: transcription (DNA to RNA) and translation (RNA to protein). Transcription is the key step that controls the “on and off” of genes and subsequently underlines the identity and the status of the cell (Young,
2011; Lee and Young,
2013). For example, when I compare a string of my hair and my finger tip, they appear so different, as if they are made from different genetic materials. However, the fact is that all the different tissues and cells in my body contain the same DNA and it is the differential expression profiles that created the functional diversity. Understanding the mechanism of gene expression will help us to understand the formation and evolution of life and to find possible cures for diseases.
. In this issue
We have invited scientists on the leading edge in their respective fields to share their expertise and perspectives. Ma et al. (
2014) focused on the mechanism that opens and closes specific regions in the chromatin. The issue starts with the packaging of human DNA. If stretched out, the total length of DNA from a human cell is about 3 m. In a living cell, DNA is packaged into a highly compact form called chromatin. Genes are not active when they are in the compacted form. Factors involved in opening and closing specific regions control the accessibility of genes (Guertin and Lis,
2012). A set of factors called polycomb group (PcG) proteins is involved in modifying the chromatin structure and subsequently regulating a large number of genes. The dynamic nature of the multisubunit protein complex and its complex functional impact are described in detail.
Gene regulation does not stop at finding the gene. The transcription machinery finds an accessible promoter and assembles a multi-megadalton protein complex to initiate transcription (Kornberg,
2007). As soon as the nascent transcript emerges from the body of RNA polymerase II (Pol II), the enzyme that transcribes protein coding genes, the RNA is protected by the addition of a 5′ cap (Shatkin,
1976). A text-book would describe the capping to be a unanimous phenomenon that takes place on every molecule of the RNA transcripts. However, Zhai and Xiang (
2014) summarize the recent discovery that improperly capped RNA is subjected to quality control and degradation. Previously, removal of the cap and subsequent RNA degradation are thought to only take place after the RNA is exported to the cytoplasm and translation has been initiated. This discovery changed the way that mRNA capping and de-capping are viewed in time and space.
RNA degradation or turnover has a significant place in gene expression (Houseley and Tollervey,
2009). Compared to DNA, RNA is a very short-lived species in cells. Its relatively rapid turnover provides the flexibility for organisms to react to the environment and adapt quickly to a change in gene expression profile. Historically, mRNA degradation is only thought to occur once at least a round of translation has taken place. Liu H. et al. (
2014) provide an updated review of the many RNA degradation pathways taking place in the nucleus. Degrading nascent RNA transcripts in the nucleus has been viewed as a quality control mechanism to quickly get rid of undesired RNAs before they are exported to cytoplasm and waste the effort of ribosomes in translating wrong proteins.
The start of transcription relies on promoters. The termination signal typically relies on sequence information at the end of the genes (Richard and Manley,
2009; Guo et al.,
2011). Davis and Shi (
2014) review the current understanding of the sequence requirements and the protein factors involved in this process called mRNA 3′ end processing. It cuts the RNA off the moving Pol II and adds a stretch of adenines to the RNA. The sequence driving this process is called polyadenylation signal or polyA signal (Proudfoot,
1991). It has been known that the promiscuous nature of the polyA signal sequences leads to the selective use of multiple polyA signals (Tian and Manley,
2013). This feature, described as alternative polyadenylation, has been linked with many aspects of gene expression: pre-mature termination, RNA splicing, RNA stability and expression level (Mayr and Bartel,
2009). However, the exact mechanism that dictates alternative polyA signal usage is unclear. This article also provides a simple yet broadly applicable model to characterize how the choice among many polyA signals is made.
The process of making RNA nucleotide by nucleotide, transcription elongation, is tightly controlled and interlaced with all above events. Indeed, elongation control is a critical step in determining transcriptional output. Instead of looking for an open promoter to start transcription, an initiated and paused Pol II can be found on all expressed human genes (Core et al.,
2012; Kwak et al.,
2013). This paused Pol II population serves as a reserve for transcription to rapidly respond to activation signals (Guo and Price,
2013). Upon activation, positive transcription elongation factor P-TEFb triggers the fast-paced productive elongation that leads to mRNA production. P-TEFb is therefore seen as the key for rapid amplification of gene expression. Indeed, this is the exact factor virus hijack to promote the activation of themselves (Zhou et al.,
2012). Liu R.D. et al. (
2014) provide the latest understanding of factors involved in the transition into productive elongation and how HIV finds its niche to achieve replication effectively and in a deadly way.
. On the journal
The
Journal of Zhejiang University-SCIENCE B (Biomedicine & Biotechnology) has been at the frontier of peer-reviewed scientific journals based in China. Since its establishment in 2005, the editorial office has been dedicated to publishing high quality and high impact research and review articles.
In this special issue, we invited primary authors of recent high impact research articles to provide the most up-to-date view of their area of expertise. The studies they led were groundbreaking and provided significant milestones for their field. Justly, the invitees have all recently started their own labs with generous funding. Demonstrating their dedication to nurturing future scientists, they have all invited their trainees and colleagues to join in the adventure of composing the articles. We believe these cutting edge articles will promote discussions among established scientists and benefit the training of junior researchers.
References
[1] Core, L.J., Waterfall, J.J., Gilchrist, D.A., 2012. Defining the status of RNA polymerase at promoters.
Cell Rep, 2(4):1025-1035.
[2] Crick, F.H., 1958. On protein synthesis.
Symp Soc Exp Biol, 12:138-163.
[3] Crick, F., 1970. Central dogma of molecular biology.
Nature, 227(5258):561-563.
[4] Davis, R., Shi, Y., 2014. The polyadenylation code: a unified model for the regulation of mRNA alternative polyadenylation.
J Zhejiang Univ-Sci B (Biomed & Biotechnol), 15(5):429-437.
[5] Guertin, M.J., Lis, J.T., 2013. Mechanisms by which transcription factors gain access to target sequence elements in chromatin.
Curr Opin Genet Dev, 23(2):116-123.
[6] Guo, J., Garrett, M., Micklem, G., 2011. Poly(A) signals located near the 5′ end of genes are silenced by a general mechanism that prevents premature 3′-end processing.
Mol Cell Biol, 31(4):639-651.
[7] Guo, J., Price, D.H., 2013. RNA polymerase II transcription elongation control.
Chem Rev, 113(11):8583-8603.
[8] Houseley, J., Tollervey, D., 2009. The many pathways of RNA degradation.
Cell, 136(4):763-776.
[9] Kornberg, R.D., 2007. The molecular basis of eukaryotic transcription.
PNAS, 104(32):12955-12961.
[10] Kwak, H., Fuda, N.J., Core, L.J., 2013. Precise maps of RNA polymerase reveal how promoters direct initiation and pausing.
Science, 339(6122):950-953.
[11] Lee, T.I., Young, R.A., 2013. Transcriptional regulation and its misregulation in disease.
Cell, 152(6):1237-1251.
[12] Liu, H., Luo, M., Wen, J.K., 2014. mRNA stability in the nucleus.
J Zhejiang Univ-Sci B (Biomed & Biotechnol), 15(5):444-454.
[13] Liu, R.D., Wu, J., Shao, R., 2014. Mechanism and factors that control HIV-1 transcription and latency activation.
J Zhejiang Univ-Sci B (Biomed & Biotechnol), 15(5):455-465.
[14] Ma, R.G., Zhang, Y., Sun, T.T., 2014. Epigenetic regulation by polycomb group complexes: focus on roles of CBX proteins.
J Zhejiang Univ-Sci B (Biomed & Biotechnol), 15(5):412-428.
[15] Mayr, C., Bartel, D.P., 2009. Widespread shortening of 3′UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells.
Cell, 138(4):673-684.
[16] Proudfoot, N., 1991. Poly(A) signals.
Cell, 64(4):671-674.
[17] Richard, P., Manley, J.L., 2009. Transcription termination by nuclear RNA polymerases.
Genes Dev, 23(11):1247-1269.
[18] Shatkin, A.J., 1976. Capping of eucaryotic mRNAs.
Cell, 9(4):645-653.
[19] Tian, B., Manley, J.L., 2013. Alternative cleavage and polyadenylation: the long and short of it.
Trends Biochem Sci, 38(6):312-320.
[20] Young, R.A., 2011. Control of the embryonic stem cell state.
Cell, 144(6):940-954.
[21] Zhai, L.T., Xiang, S., 2014. mRNA quality control at the 5′ end.
J Zhejiang Univ-Sci B (Biomed & Biotechnol), 15(5):438-443.
[22] Zhou, Q., Li, T., Price, D.H., 2012. RNA polymerase II elongation control.
Annu Rev Biochem, 81(1):119-143.
Open peer comments: Debate/Discuss/Question/Opinion
<1>