USF1
This article may require copy editing for grammar, style, cohesion, tone, or spelling. (December 2023) |
Ensembl | |||||||||
---|---|---|---|---|---|---|---|---|---|
UniProt | |||||||||
RefSeq (mRNA) | |||||||||
RefSeq (protein) | |||||||||
Location (UCSC) | Chr 1: 161.04 – 161.05 Mb | Chr 1: 171.24 – 171.25 Mb | |||||||
PubMed search | [3] | [4] |
View/Edit Human | View/Edit Mouse |
Upstream stimulatory factor 1 is a protein that in humans is encoded by the USF1 gene.[5][6]
Gene
The upstream stimulatory factor gene encodes a
Isoforms
USF comprises two major isoforms: USF1 and USF2. USF1 gene locates on the chromosome region 1q22-q23 in both human and mice; USF2 gene locates on the chromosome 19q13 in human and chromosome 19q7 in mice, respectively.[9] Both USF1 and USF2 transcripts comprise 10 exons and can undergo exon 4-excision during alternative splicing.[7][9] From an auto-regulation perspective, these exon 4-excision products act as dominant negative regulators and are found to suppress USF-dependent gene expression.[7][9]
Protein
Although USF1 and USF2 share 70% of the amino acid sequence in their bHLH-LZ region, only 40% of similarity is found in their full-length proteins. In addition, USF1 and USF2 exhibit different protein abundances in a cell type-specific manner.[7] It has been found that USF1 and USF2 expression increases during the differentiation of erythroid cells.[10] Despite the ubiquitous expression of both isoforms, USF1 and USF2 mediate different biological processes and functions in cells. While USF1 modulates metabolism, immune response, and tissue protection, USF2 primarily controls embryonic development, brain function, iron metabolism, and fertility.[7] Structurally, the highly conserved bHLH-LZ structure on the C-terminus of USF yields high binding specificity and promotes the formation of USF1 homodimers or USF1-USF2 heterodimers for DNA binding.[7][9][11] The USF-specific region (USR) on the N-terminal region, on the other hand, facilitates the nuclear translocation and activation of USF1.
Function
This gene encodes a member of the
A study of mice suggested reduced USF1 levels increase metabolism in
Regulation
Modulation of DNA binding affinity
The symmetrical E-box motif is the main target of bHLH-LZ transcription factors, and USF1 has a high binding affinity for the core sequence CACGTG in the motif.[9] USF1-DNA binding activity can be modulated by cell type-specific DNA methylation and acetylation on the E-box motif or by post-transcriptional modifications of the USF1 protein. For example, CpG methylation on the central E-box motif inhibits the complex formation of USF1 with its co-transcription factors and therefore decreases the corresponding gene expression in mouse lymphosarcoma cells.[9] In contrast, phosphorylation of USF1 by p38 mitogen-activated protein kinases, protein kinase A or protein kinase C increases its binding to the E-box motif and activate gene transcription.[9]
Phosphorylation
Mitogen-activated protein kinase (MAPKs) phosphorylates serine and threonine residues of substrate proteins and convert extracellular signals induced by growth factors, mitogens or cytokines into intracellular phosphorylation cascades, which regulate cell proliferation, differentiation, stress responses and apoptosis (programmed cell death).[7]
Phosphorylation by MAPKs induces a conformational change of the USF protein and exposes its DNA-binding domain for interaction. This increased structural exposure enhances DNA binding and therefore the transcriptional activity of USF.[13]
- ERK1 (also known as MAPK3) and ERK2 (also known as MAPK1) phosphorylate USF1 in response to TFG-β signaling in vascular smooth muscle cells.[13] SMAD2 and SMAD3 signaling following the TFG-β receptor activation can also cooperate with EGFR / ERK pathways to activate USF1, which in turn regulates the gene expression of plasminogen activator inhibitor-1 (PAI-1), a significant biomarker and predictor of cardiovascular disease-related death[13] and a marker of poor prognosis in breast cancer.[7]
- Casein kinase 2 or CK-II (CK2) is a tetrameric enzyme composed of two catalytic and two regulatory subunits. In pancreatic cells, CK2 phosphorylates USF1, PDX1 and MST1 to suppress insulin expression.[14]
Proteins mediating USF1 modification | |
---|---|
Phosphorylation | p38, pKA and pKC,[9] ERK1/2,[13] DNA-PK [11] |
Acetylation | PCAF [11] |
Methylation' | SET7/92[9] |
USF1-interacting proteins | |
Transcription co-factors | USF2,[7] SP1, PEA3, MTF1,[9] SREBP1-c,[11] MED17,[15] BAF60[15] |
Gene transcription
- Transforming growth factor β 1 (TGF beta 1) is encoded by the TFGB1 gene that contains an E-box within the promoter region and has been implicated in excessive extracellular matrix accumulation under a high-glucose condition.[8] Overexpression of either USF1 or USF2 is found to elevate the TFGB1 promoter activity in human embryonic kidney cells. However, only USF1 overexpression leads to increased TGF-β1 secretion.
- Thrombospondin 1 (TSP1) is involved in the development of diabetic nephropathy. USF1/2 binds to the E-box motif (CAGATG) on the human THBS1 promoter and regulates high-glucose-induced TSP1 expression in mesangial cells.[8] USF2 overexpression has been found to augment THBS1 promoter activity and TSP1 expression. The resulting increase in TSP1 expression further promotes the formation of active TGF-β.[8]
- AP-1 transcription factor (AP-1) refers to a complex of dimeric transcription factors composed of c-Jun, c-Fos or activating transcriptionfactor (ATF) that bind to the AP-1 binding site on DNA.[16] cJun-cJun / cJun-cFos dimers preferentially bind to the phorbol 12-O-Tetradecanoylphorbol-13-acetate (TPA)-responsive element (TRE region, TGACTCA), whereas cJun-ATF dimers and ATF homodimers preferentially bind to the cAMP-responsive element (CRE, TGACGTCA).[16] The AP-1 complex becomes activated in response to high glucose, oxidative stress, low-density lipoprotein(LDL) and oxidised LDL. It has been reported that a high glucose level upregulates USF and AP-1 binding activities, as well as the protein level of cFos.[8]
Interaction between USF1 and other transcription factors, including SP1, PEA3 (also known as ETV4) and MTF1, also leads to cooperative transcriptional regulation. For instance, the leucine zipper motif of USF1 recruits PEA3 to form a ternary complex and co-regulates the transcription of BAX, an apoptosis regulator.[9] Another USF1-regulated target is topoisomerase III (hTOP3⍺), which catalyzes the topological changes of DNA, modifies DNA supercoil structures, and increases the chromatin accessibility for gene expression.[9] Similar interactions exist between USF1 and JMJD1C or H3K9 demethylase, in which the molecular interactions change chromatin accessibility and elevate the transcription of a series of lipogenic genes, including FASN, ACC, ACLY, and SREBP1.[15]
Chromosome boundary by USF
Chromosomes are generally classified into euchromatin and heterochromatin with distinct histone modifications, compaction levels, and the resulting gene expression patterns. Heterochromatin is a tightly condensed and transcriptionally repressed chromatin domain that is characterized by distinct combinations of histone post-translational modifications.[17] Heterochromatin is required for genome stability and gene expression regulation. However, it can spread into neighboring DNA regions and inactivate gene expression.[17][18] Chromosome boundary elements are thus necessary to block such stochastic spreads of heterochromatin and maintain stable gene expression.[19] USF1 and USF2 have been found to recruit various histone-modifying complexes, including the histone H3 methyltransferase Set1 complex and the H4 arginine 3 methyltransferase PRMT1, with the latter known to establish active chromatin domains.[19] USF1/USF2 binding deposits a high level of activating histone modifications on adjacent nucleosomes and thus prevents the propagation of chromatin silencing modifications from the heterochromatin, such as H3K9 and K27 methylation.[19]
Other USF1/USF2-related chromatin modifications include the recruitment of the E3 ubiquitin ligase, RNF20, to moniubiquitinate histone H2B.[19] The loss of RNF20 is found to cause an extension of the silencing modifications from the 16 kb heterochromatic domain into the β-globin locus.[19] Moreover, USF1 and USF2 can bind to the 5' DNase I hypersensitive site HS4 and recruit an H3 acetyltransferase, PCAF, which blocks the heterochromatin spread into the β-globin locus.[18]
FASN transactivates for lipogenesis
USF is known to bind the L-type pyruvate kinase promoter on DNA at high glucose and insulin levels. Excessive insulin activates kinases and phosphatases that post-translationally modify USF, sterol regulatory element-binding protein 1C (SREBP1C), Carbohydrate-responsive element-binding protein (ChREBP), and Liver X receptor (LXRs).[11] With insulin stimulation, USF1 and USF2 bind to the E-boxes at -332 and -65 in the promoter region of FASN that encodes Fatty acid synthase (FAS) for lipogenesis.[11]
Various post-translational modifications of USF1 determine its activity and signaling pathways and can affect the lipogenesis process. An abnormal increase in the USF-mediated de novo fatty acid synthesis is found to cause intracellular fatty acid accumulation and deregulate gene expression and cellular processes like tumor cell survival.[20]
Lipogenic pathways
- In response to insulin elevation, DNA-protein kinase (DNA-PK) involved in DNA damage repair becomes dephosphorylated and activated.[11] The active form of DNA-PK indirectly phosphorylates USF1 at S262 through AMP-activated protein kinase (AMPK). The S262 phosphorylation increases USF1 interaction with SREBP1C near the sterol regulatory element (SRE) and facilitates the synergistic activation of SREBP1C and transcription of the downstream lipogenic genes.[11]
- USF1 S262 phosphorylation also recruits PCAF to acetylate USF1 at the site K237. Both S262 phosphorylation and K237 acetylation enhance USF1 activities and the subsequent transcriptional activation of the fatty acid synthase gene (FASN).[11] Fatty acid synthase (FAS), together with Acetyl-CoA carboxylase (ACC), produces malonyl-CoA, converts it to long-chain fatty acids, and promotes the de novo fatty-acid synthesis for energy provision and protein lipidation.[21][20]
- USF1 modified with S262 phosphorylation an K237 acetylation also recruits BGR1 (also known as SMARCA4)-associated factor 60c (BAF60c).[15] BAF60c is then phosphorylated by atypical protein kinase C (aPKC) at S257, allowing it to form a LipoBAF complex at promoters of lipogenic genes to regulate chromatin structure and gene transcription.[15]
- In contrast, HDAC9 deacetylates USF1 during cell fasting, prevents the recruitment of USF1-interacting factors, and suppresses the transcriptional activation of lipogenic genes.[11]
In early embryonic development
USF1 transcription undergoes active dynamics during cell meiosis, in which the USF1 mRNA first increases significantly during 2-8 cells and then decreases to an undetectable level at the blastocyst stage, indicating its role in the embryo genome activation.[22] USF1 siRNA knockout has been shown to compromise the blastocyst rate and deregulate the transcripts of twist-related protein 2 (increased), growth differentiation factor-9 and follistatin (decreased) by affecting their promoter-binding element E-box region during oocyte maturation.[22]
Clinical significance
Diabetic kidney disease
Diabetic kidney disease (DKD) (or Diabetic nephropathy) is a progressive microalbuminuria disease with a slight loss of albumin in the urine (30–300 mg per day); DKD has been viewed as a diabetic complication-related microvascular disorder in a renal manifestation.[23] In kidney biopsy, DKD is characterized by glomerular and tubular basement thickening, mesangial expansion, glomerulosclerosis, podocyte effacement (histology) and nephron loss.[24] DKD occurs in 30%-50% of the diabetic patient population and leads to kidney failures in up to 20% of the type 1 diabetic patients.[24] However, a substantial portion of DKD patients do not manifest albuminuria.[24] DKD pathogenesis is attributed to the dysregulated glucose transport at a higher glucose level and the excessive influx of intracellular glucose into endothelial cells.[23] The elevated glucose level is sustained along with multiple metabolic phenotypes such as excess fatty acids and oxidative stress, as well as shear stresses induced by hypertension and hyperfusion, and can lead to microvascular rarefaction, hypoxia and maladaptation in glomerular neoangiogenesis.[23]
USF1 as an insulin-sensitive transcription factor that becomes active in response to a high glucose level promotes the transactivation of genes involved in lipid metabolism, including
Cancer
Increased FASN-mediated de novo lipid synthesis
Cancer cells exhibit a set of phenotypes, including a highlighted increase in aerobic glycolysis, lactic acid production (known as the Warburg effect), elevated protein and DNA synthesis, and increased de novo or endogenous fatty acid synthesis by fatty acid synthase (FAS).[20] FAS synthesizes primarily palmitate from malonyl-CoA, which is further esterified to triglycerides for energy storage. Normally, FASN is active during embryogenesis and in fetal lungs for lubricant production; however, it is physiologically low-expressed in non-cancerous adult cells. In contrast, abnormal FASN overexpression is detected in multiple cancer types, spanning breast cancer, colorectal cancer, prostate cancer, pancreatic cancer and ovarian cancer.[26] FASN-mediated de novo lipid synthesis accounts for more than 93% of triglycerides in tumor cells.[20] Specifically, tumor cells prefer glycolysis over oxidation for energy consumption and re-direct the glycolytic products towards de novo fatty acid synthesis to supply lipids for membrane production and protein lipidation for fast cell proliferation.[20] For example, PI3K-AKT pathway is found to increase in LNCaP prostate cancer cells to stimulate FASN overexpression. Concurrently, fatty acid synthase overexpression is also post-translationally sustained by USP2a-mediated ubiquitination reduction, stabilizing FAS for constitutive signal transduction.[20] In addition to de novo lipogenesis, FAS promotes the localization of VEGFR-2 to the lipid raft of the endothelial cell membrane and thus enhances angiogenesis in tumor development.[26] Meanwhile, mutual activation between FAS and ERBB2 (HER2) signaling also potentiates tumorigenesis, in which ERBB2 amplification is associated with elevated survival and proliferation of cancer cells and poor prognosis in breast and gastric cancers; an ERBB2 increase, especially, contributes to 18-25% of breast cancers.[27] In prostate cancer cells and promyelocytic leukemia cells, USF1 activation also attains a high-level of PAI-1 expression and inhibits spontaneous or camptothecin-induced apoptosis.[13]
Decreased USF1-p53 interaction and increased p53 instability
The poor prognosis of gastric cancers is associated with low expression of USF1 and p53.[28] Among gastric cancer patients, 88% of the patients are diagnosed with H. pylori infection, and half of the patients show lower USF1 expression in tumor tissues. Mechanistically, H. pylori induces DNA hypermethylation in the promoter regions of USF1 and USF2 and inhibits expression. Decreased expression reduces the interaction between USF1 and p53 when DNA damage occurs, rendering p53 to associate more frequently with the E3-ubiquitin ligase HDM2 (also known as MDM2) and increasing p53 instability in cancer cells.[28]
Familial combined hyperlipidemia
Familial combined hyperlipidemia (FCHL) was first used to describe lipid abnormalities in 47 Seattle pedigree-containing members with hypercholesterolemia and hypertriglyceridemia.[29] The core FCHL lipid profiles feature high serum cholesterol/triglyceride, apolipoprotein B (APOB) and LDL levels. Genetic evidence has suggested a FCHL-related locus on the human chromosome 1q21-q23, which is linked to metabolic syndromes.[30] Fine-mapping of those linked regions identifies USF1 as the first positionally cloned gene for FCHL and a target for FCHL treatment. In addition, hepatocyte nuclear factor 4 alpha (HNF4A) is also implicated in high lipid levels and metabolic syndromes. Cooperative effects of USF1 and HNF4A have been shown to regulate the expression of apolipoprotein A-II (APOA2) and apolipoprotein C-III (APOC3).[30] Mutations in USF1, HNF4A and apolipoproteins also increase patients' susceptibility to FCHL.[30] Additional genes subjected to USF1 regulation and involved in glucose/lipid metabolism include apolipoprotein A5 (APOA5), apolipoprotein E (APOE), hormone-sensitive lipase (LIPE), hepatic lipase (LIPC), glucokinase (GCK), islet-specific glucose-6-phosphatase catalytic-subunit-related protein (IGRP), insulin, glucagon receptor (GCGR) and ATP-binding cassette transporter A1 (ABCA1).[30]
Interactions
USF1 (human gene) has been shown to
References
- ^ a b c GRCh38: Ensembl release 89: ENSG00000158773 – Ensembl, May 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000026641 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- PMID 8486371.
- ^ a b "Entrez Gene: USF1 upstream transcription factor 1".
- ^ PMID 25741280.
- ^ S2CID 206329507.
- ^ PMID 16162174.
- PMID 21282467.
- ^ PMID 26490400.
- S2CID 24737539.
- ^ PMID 19132220.
- PMID 33994545.
- ^ PMID 32198196.
- ^ PMID 9069263.
- ^ PMID 29235574.
- ^ PMID 25192661.
- ^ PMID 22326678.
- ^ S2CID 205468233.
- PMID 19352381.
- ^ PMID 38003209.
- ^ PMID 27188921.
- ^ PMID 37324271.
- ^ PMID 34584272.
- ^ PMID 36934087.
- S2CID 23150684.
- ^ PMID 36209270.
- PMID 14764618.
- ^ PMID 16938803.
- PMID 17353931.
- PMID 8576131.
- PMID 9160889.
- PMID 9384587.
- S2CID 4260885.
Further reading
- Corre S, Galibert MD (January 2006). "[USF as a key regulatory element of gene expression]". Médecine/Sciences. 22 (1): 62–67. PMID 16386222.
- Lee JC, Lusis AJ, Pajukanta P (April 2006). "Familial combined hyperlipidemia: upstream transcription factor 1 and beyond". Current Opinion in Lipidology. 17 (2): 101–109. S2CID 20122462.
- Roy AL, Meisterernst M, Pognonec P, Roeder RG (November 1991). "Cooperative interaction of an initiator-binding transcription initiation factor and the helix-loop-helix activator USF". Nature. 354 (6350): 245–248. S2CID 4260885.
- Gregor PD, Sawadogo M, Roeder RG (October 1990). "The adenovirus major late transcription factor USF is a member of the helix-loop-helix group of regulatory proteins and binds to DNA as a dimer". Genes & Development. 4 (10): 1730–1740. PMID 2249772.
- Henrion AA, Martinez A, Mattei MG, Kahn A, Raymondjean M (January 1995). "Structure, sequence, and chromosomal location of the gene for USF2 transcription factors in mouse". Genomics. 25 (1): 36–43. PMID 7774954.
- Ferré-D'Amaré AR, Pognonec P, Roeder RG, Burley SK (January 1994). "Structure and function of the b/HLH/Z domain of USF". The EMBO Journal. 13 (1): 180–189. PMID 8306960.
- Viollet B, Lefrançois-Martinez AM, Henrion A, Kahn A, Raymondjean M, Martinez A (January 1996). "Immunochemical characterization and transacting properties of upstream stimulatory factor isoforms". The Journal of Biological Chemistry. 271 (3): 1405–1415. PMID 8576131.
- Ghosh AK, Datta PK, Jacob ST (February 1997). "The dual role of helix-loop--helix-zipper protein USF in ribosomal RNA gene transcription in vivo". Oncogene. 14 (5): 589–594. S2CID 23764497.
- Pognonec P, Boulukos KE, Aperlo C, Fujimoto M, Ariga H, Nomoto A, Kato H (May 1997). "Cross-family interaction between the bHLHZip USF and bZip Fra1 proteins results in down-regulation of AP1 activity". Oncogene. 14 (17): 2091–2098. PMID 9160889.
- Roy AL, Du H, Gregor PD, Novina CD, Martinez E, Roeder RG (December 1997). "Cloning of an inr- and E-box-binding protein, TFII-I, that interacts physically and functionally with USF1". The EMBO Journal. 16 (23): 7091–7104. PMID 9384587.
- Pajukanta P, Nuotio I, Terwilliger JD, Porkka KV, Ylitalo K, Pihlajamäki J, et al. (April 1998). "Linkage of familial combined hyperlipidaemia to chromosome 1q21-q23". Nature Genetics. 18 (4): 369–373. S2CID 5905269.
- Hartley JL, Temple GF, Brasch MA (November 2000). "DNA cloning using in vitro site-specific recombination". Genome Research. 10 (11): 1788–1795. PMID 11076863.
- Wiemann S, Weil B, Wellenreuther R, Gassenhuber J, Glassl S, Ansorge W, et al. (March 2001). "Toward a catalog of human genes and proteins: sequencing and analysis of 500 novel complete protein coding human cDNAs". Genome Research. 11 (3): 422–435. PMID 11230166.
- Bengtsson SH, Madeyski-Bengtson K, Nilsson J, Bjursell G (July 2002). "Transcriptional regulation of the human carboxyl ester lipase gene in THP-1 monocytes: an E-box required for activation binds upstream stimulatory factors 1 and 2". The Biochemical Journal. 365 (Pt 2): 481–488. PMID 11945176.
- Villavicencio EH, Yoon JW, Frank DJ, Füchtbauer EM, Walterhouse DO, Iannaccone PM (April 2002). "Cooperative E-box regulation of human GLI1 by TWIST and USF". Genesis. 32 (4): 247–258. S2CID 12132097.
- Coulson JM, Edgson JL, Marshall-Jones ZV, Mulgrew R, Quinn JP, Woll PJ (February 2003). "Upstream stimulatory factor activates the vasopressin promoter via multiple motifs, including a non-canonical E-box". The Biochemical Journal. 369 (Pt 3): 549–561. PMID 12403649.
- Salero E, Giménez C, Zafra F (March 2003). "Identification of a non-canonical E-box motif as a regulatory element in the proximal promoter region of the apolipoprotein E gene". The Biochemical Journal. 370 (Pt 3): 979–986. PMID 12444925.
- Pickwell GV, Shih H, Quattrochi LC (April 2003). "Interaction of upstream stimulatory factor proteins with an E-box located within the human CYP1A2 5'-flanking gene contributes to basal transcriptional gene activation". Biochemical Pharmacology. 65 (7): 1087–1096. PMID 12663044.