Protein:NUP98 |
Protein Summary |
Gene summary |
| Gene name: NUP98 | ASpdb.0 ID: 4928 | Gene | Gene symbol | NUP98 | Gene ID | 4928 |
| Gene name | nucleoporin 98 and 96 precursor |
| Synonyms | ADIR2|NUP196|NUP96|Nup98-96 |
| Cytomap | 11p15.4 |
| Type of gene | protein-coding |
| Description | nuclear pore complex protein Nup98-Nup96nuclear pore complex protein Nup98GLFG-repeat containing nucleoporinNUP98/PHF23 fusion 2 proteinNup98-Nup96nucleoporin 96nucleoporin 98kDnucleoporin 98kDa |
| Modification date | 20240413 |
| UniProtAcc | P52948 |
Gene ontology of this gene with evidence of Inferred from Direct Assay (IDA) from Entrez |
| Partner | Gene | GO ID | GO term | PubMed ID |
| Gene | NUP98 | GO:0000776 | kinetochore | 15146057 |
| Gene | NUP98 | GO:0005635 | nuclear envelope | 15146057|28221134 |
| Gene | NUP98 | GO:0005643 | nuclear pore | 9348540|12802065|15229283 |
| Gene | NUP98 | GO:0005654 | nucleoplasm | 28221134 |
| Gene | NUP98 | GO:0016604 | nuclear body | 28221134 |
| Gene | NUP98 | GO:0031080 | nuclear pore outer ring | 17360435 |
| Gene | NUP98 | GO:0031965 | nuclear membrane | 11839768|12802065|15229283|20407419 |
| Gene | NUP98 | GO:0034399 | nuclear periphery | 11839768|15229283 |
| Gene | NUP98 | GO:0042405 | nuclear inclusion body | 11839768 |
| Gene | NUP98 | GO:0044615 | nuclear pore nuclear basket | 11839768 |
| Gene | NUP98 | GO:0140693 | molecular condensate scaffold activity | 25562883 |
AS Summary |
Information of the canonical protein with experimentally identified structure from PDB (2023). |
| UniProt Acc | File name | PDB ID | Method | Resolution | Chain | Start | End |
| P52948-1 | P52948-1_5a9q_5.pdb | 5A9Q | EM | 23.0 | 5 | 1157 | 1631 |
ASpdb's canonical and alternatively spliced isoform information. |
| accession_id | gene_name | canonical_id | alternative_id | canonical_length | alternative_length | canonical_start | canonical_end | type | originalSEQ | variationSEQ | alternative_start | alternative_end |
| P52948 | NUP98 | P52948-1 | P52948-3 | 1817 | 937 | 932 | 937 | Substitution | SQSPEV | VEKKGQ | 932 | 937 |
| P52948 | NUP98 | P52948-1 | P52948-3 | 1817 | 937 | 938 | 1817 | Deletion | none | none | 937 | 937 |
| P52948 | NUP98 | P52948-1 | P52948-4 | 1817 | 920 | 393 | 409 | Deletion | none | none | 392 | 392 |
| P52948 | NUP98 | P52948-1 | P52948-4 | 1817 | 920 | 932 | 937 | Substitution | SQSPEV | VEKKGQ | 915 | 920 |
| P52948 | NUP98 | P52948-1 | P52948-4 | 1817 | 920 | 938 | 1817 | Deletion | none | none | 920 | 920 |
Multiple sequence alignment of our canonical and alternatively spliced NUP98 |
Matched gene isoform IDs with Ensembl and RefSeq of our canonical and alternative spliced genes of NUP98 |
| UniProt-id | ENSG | ENST | ENSP |
| P52948-1 | ENSG00000110713.18 | ENST00000359171.8 | ENSP00000352091.5 |
| P52948-3 | ENSG00000110713.18 | ENST00000397007.10 | ENSP00000380202.4 |
| P52948-4 | ENSG00000110713.18 | ENST00000397004.9 | ENSP00000380199.4 |
| P52948-4 | ENSG00000110713.18 | ENST00000700606.1 | ENSP00000515094.1 |
| UniProt-id | NM ID | NP ID |
| P52948-3 | NM_005387.6 | NP_005378.4 |
| P52948-4 | NM_139131.4 | NP_624357.1 |
Amino acid sequences of our canonical and alternatively spliced NUP98 |
| accession_id | Protein sequence |
| P52948-1 | MFNKSFGTPFGGGTGGFGTTSTFGQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANT LFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST NISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNP GGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAV GSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGNKPTLTLGTNTNTSNFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFG NNQPKIGGPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQAVLQQHINSLTYSPFGDSPLFRNPMSDPKKKEERLKPTNPAA QKALTTPTHYKLTPRPATRVRPKALQTTGTAKSHLFDGLDDDEPSLANGAFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASPSEYPEN GERFSFLSKPVDENHQQDGDEDSLVSHFYTNPIAKPIPQTPESAGNKHSNSNSVDDTIVALNMRAALRNGLEGSSEETSFHDESLQDDRE EIENNSYHMHPAGIILTKVGYYTIPSMDDLAKITNEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDIVHIRRKEVVVYLDDNQKPP VGEGLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEAVSRKQGAQFKEYRPETGSWVFKVSHFSKYGLQDSDEEEEEHPSKTS TKKLKTAPLPPASQTTPLQMALNGKPAPPPQSQSPEVEQLGRVVELDSDMVDITQEPVLDTMLEESMPEDQEPVSASTHIASSLGINPHV LQIMKASLLTDEEDVDMALDQRFSRLPSKADTSQEICSPRLPISASHSSKTRSLVGGLLQSKFTSGAFLSPSVSVQECRTPRAASLMNIP STSSWSVPPPLTSVFTMPSPAPEVPLKTVGTRRQLGLVPREKSVTYGKGKLLMDMALFMGRSFRVGWGPNWTLANSGEQLNGSHELENHQ IADSMEFGFLPNPVAVKPLTESPFKVHLEKLSLRQRKPDEDMKLYQTPLELKLKHSTVHVDELCPLIVPNLGVAVIHDYADWVKEASGDL PEAQIVKHWSLTWTLCEALWGHLKELDSQLNEPREYIQILERRRAFSRWLSCTATPQIEEEVSLTQKNSPVEAVFSYLTGKRISEACSLA QQSGDHRLALLLSQFVGSQSVRELLTMQLVDWHQLQADSFIQDERLRIFALLAGKPVWQLSEKKQINVCSQLDWKRSLAIHLWYLLPPTA SISRALSMYEEAFQNTSDSDRYACSPLPSYLEGSGCVIAEEQNSQTPLRDVCFHLLKLYSDRHYDLNQLLEPRSITADPLDYRLSWHLWE VLRALNYTHLSAQCEGVLQASYAGQLESEGLWEWAIFVLLHIDNSGIREKAVRELLTRHCQLLETPESWAKETFLTQKLRVPAKWIHEAK AVRAHMESDKHLEALCLFKAEHWNRCHKLIIRHLASDAIINENYDYLKGFLEDLAPPERSSLIQDWETSGLVYLDYIRVIEMLRHIQQVD CSGNDLEQLHIKVTSLCSRIEQIQCYSAKDRLAQSDMAKRVANLLRVVLSLHHPPDRTSDSTPDPQRVPLRLLAPHIGRLPMPEDYAMDE |
| P52948-3 | MFNKSFGTPFGGGTGGFGTTSTFGQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANT LFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST NISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNP GGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAV GSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGNKPTLTLGTNTNTSNFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFG NNQPKIGGPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQAVLQQHINSLTYSPFGDSPLFRNPMSDPKKKEERLKPTNPAA QKALTTPTHYKLTPRPATRVRPKALQTTGTAKSHLFDGLDDDEPSLANGAFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASPSEYPEN GERFSFLSKPVDENHQQDGDEDSLVSHFYTNPIAKPIPQTPESAGNKHSNSNSVDDTIVALNMRAALRNGLEGSSEETSFHDESLQDDRE EIENNSYHMHPAGIILTKVGYYTIPSMDDLAKITNEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDIVHIRRKEVVVYLDDNQKPP VGEGLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEAVSRKQGAQFKEYRPETGSWVFKVSHFSKYGLQDSDEEEEEHPSKTS |
| P52948-4 | MFNKSFGTPFGGGTGGFGTTSTFGQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANT LFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST NISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNP GGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAV GSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIGGPLGTGAFGA PGFNTTTATLGFGAPQAPVALTDPNASAAQQAVLQQHINSLTYSPFGDSPLFRNPMSDPKKKEERLKPTNPAAQKALTTPTHYKLTPRPA TRVRPKALQTTGTAKSHLFDGLDDDEPSLANGAFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASPSEYPENGERFSFLSKPVDENHQQ DGDEDSLVSHFYTNPIAKPIPQTPESAGNKHSNSNSVDDTIVALNMRAALRNGLEGSSEETSFHDESLQDDREEIENNSYHMHPAGIILT KVGYYTIPSMDDLAKITNEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDIVHIRRKEVVVYLDDNQKPPVGEGLNRKAEVTLDGVW PTDKTSRCLIKSPDRLADINYEGRLEAVSRKQGAQFKEYRPETGSWVFKVSHFSKYGLQDSDEEEEEHPSKTSTKKLKTAPLPPASQTTP |
Protein Functional Features |
Main function of this protein. (from UniProt) |
| NUP98 (go to UniProt):P52948 |
Retention analysis result of protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at * Minus value of BPloci means that the break pointn is located before the CDS. |
| - Retained protein feature among the 13 regional features. |
| Accession_id | Subsection | Start | End | Funcitonal feature | Splicing information |
| P52948 | Region | 214 | 480 | Note=FG repeats 2 | Type=Deletion;Start=393;End=409 |
| P52948 | Region | 886 | 937 | Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-lite | Type=Substitution;Start=932;End=937 |
| P52948 | Region | 886 | 937 | Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-lite | Type=Substitution;Start=932;End=937 |
Gene Isoform Structures and Expression Levels for NUP98 |
Gene structures of our canonical and alternative spliced genes of NUP98* Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Expression levels of gene isoforms across GTEx. |
Expression levels of gene isoforms across TCGA. |
Protein Structures |
PDB and CIF files of the predicted protein structures * Here we show the 3D structure of the proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format. |
| 3D view using mol* of P52948-1 |
| 3D view using mol* of P52948-3 |
| 3D view using mol* of P52948-4 |
pLDDT Score Distribution |
pLDDT score distribution of the predicted protein structures from AlphaFold2* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. |
| pLDDT distribution across the protein length of P52948-1 |
![]() |
| pLDDT distribution across the protein length of P52948-3 |
![]() |
| pLDDT distribution across the protein length of P52948-4 |
![]() |
Ramachandran Plot of Protein Structures |
Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this protein peptide. |
| Ramachandran plot of P52948-1 |
![]() |
Potential Active Site Information |
The potential binding sites of these proteins were identified using SiteMap, a module of the Schrodinger suite. |
| UniProt-id | Site score | Size | D score | Volume | Exposure | Enclosure | Contact | Phobic | Philic | Balance | Don/Acc | Residues |
| P52948-1 | 1.045 | 150 | 1.117 | 490.147 | 0.67 | 0.658 | 0.805 | 0.819 | 0.67 | 1.222 | 1.28 | 1206,1247,1264,1265,1267,1268,1271,1274,1277,1278, 1282,1283,1284,1285,1301,1304,1305,1308,1309,1310, 1312,1313,1314,1316,1317,1423,1425,1426,1468,1469, 1470,1472,1515,1516,1517 |
| P52948-3 | 0.891 | 86 | 0.914 | 222.264 | 0.68 | 0.592 | 0.749 | 0.273 | 0.959 | 0.284 | 0.514 | 796,797,798,828,829,830,831,832,834,837,842,846,85 1,882,885,886,887,888,890,891,893,894 |
| P52948-4 | 0.873 | 76 | 0.876 | 208.201 | 0.659 | 0.634 | 0.818 | 0.527 | 0.992 | 0.531 | 0.592 | 780,811,812,813,814,815,817,820,825,828,829,831,83 4,868,869,870,873,874,876,877 |
Protein Structure and Feature Comparision |
Protein Structure Comparision Using Template Modeling Scores (TM-score). |
![]() |
Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Canonical validated structure (PDB)(green) |
| 3D view using mol* of P52948-1_P52948-1_5a9q_5.pdb |
Protein Structure Comparision Visualization with mol*. between Canonical validated structure (PDB)(orange) vs Alternative predicted structure (AF2)(green) |
| 3D view using mol* of P52948-1_5a9q_5_P52948-3.pdb |
| 3D view using mol* of P52948-1_5a9q_5_P52948-4.pdb |
Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Alternative predicted structure (AF2)(green) |
| 3D view using mol* of P52948-1_P52948-3.pdb |
| 3D view using mol* of P52948-1_P52948-4.pdb |
Protein Feature Comparison of the protein sequendary structures among the protiens. |
| ./stats/secondary_structure/figure/P52948-1_vs_P52948-3.png |
< |
| ./stats/secondary_structure/figure/P52948-1_vs_P52948-4.png |
< |
Protein Feature Comparison of the relative accessible surface area (ASA) among the protiens. |
| ./stats/relative_asa/P52948-1_vs_P52948-3.png |
< |
| ./stats/relative_asa/P52948-1_vs_P52948-4.png |
< |
Protein-Protein Interaction |
Interactors from UniProt. |
| Accession_id | Subsection | Start | End | Funcitonal feature | Splicing information |
Interactors from STRING. |
| Gene name | Interactors |
Related Drugs to NUP98 |
Drugs targeting this gene/protein. (DrugBank) |
| UniProt accession | Gene name | DrugBank ID | Drug name | Drug group | Actions |
Related Diseases to NUP98 |
Previous studies relating to the alternative splicing of NUP98 and disease information from the MeSH term (PubMed) |
| Gene | PMID | Title | Abstract | MeSH ID | MeSH term |
| NUP98 | 22103895 | Expression of the novel NUP98/PSIP1 fusion transcripts in myelodysplastic syndrome with t(9;11)(p22;p15). | The t(9;11)(p22;p15) is a very rare but recurrent translocation in acute myeloid leukemia (AML) and chronic myeloid leukemia (CML) blast crisis. The translocation results in a fusion gene between NUP98 at 11p15 and PSIP1 encoding two transcriptional coactivators, p52 and p75, at 9p22. Here, we describe the first case of myelodysplastic syndrome (MDS) with t(9;11)(p22;p15). | D009190 | Myelodysplastic Syndromes |
| NUP98 | 22103895 | Expression of the novel NUP98/PSIP1 fusion transcripts in myelodysplastic syndrome with t(9;11)(p22;p15). | The t(9;11)(p22;p15) is a very rare but recurrent translocation in acute myeloid leukemia (AML) and chronic myeloid leukemia (CML) blast crisis. The translocation results in a fusion gene between NUP98 at 11p15 and PSIP1 encoding two transcriptional coactivators, p52 and p75, at 9p22. Here, we describe the first case of myelodysplastic syndrome (MDS) with t(9;11)(p22;p15). | D014178 | Translocation, Genetic |
| NUP98 | 24711643 | Identifying biological pathways that underlie primordial short stature using network analysis. | Mutations in CUL7, OBSL1 and CCDC8, leading to disordered ubiquitination, cause one of the commonest primordial growth disorders, 3-M syndrome. This condition is associated with i) abnormal p53 function, ii) GH and/or IGF1 resistance, which may relate to failure to recycle signalling molecules, and iii) cellular IGF2 deficiency. However the exact molecular mechanisms that may link these abnormalities generating growth restriction remain undefined. In this study, we have used immunoprecipitation/mass spectrometry and transcriptomic studies to generate a 3-M 'interactome', to define key cellular pathways and biological functions associated with growth failure seen in 3-M. We identified 189 proteins which interacted with CUL7, OBSL1 and CCDC8, from which a network including 176 of these proteins was generated. To strengthen the association to 3-M syndrome, these proteins were compared with an inferred network generated from the genes that were differentially expressed in 3-M fibroblasts compared with controls. This resulted in a final 3-M network of 131 proteins, with the most significant biological pathway within the network being mRNA splicing/processing. We have shown using an exogenous insulin receptor (INSR) minigene system that alternative splicing of exon 11 is significantly changed in HEK293 cells with altered expression of CUL7, OBSL1 and CCDC8 and in 3-M fibroblasts. The net result is a reduction in the expression of the mitogenic INSR isoform in 3-M syndrome. From these preliminary data, we hypothesise that disordered ubiquitination could result in aberrant mRNA splicing in 3-M; however, further investigation is required to determine whether this contributes to growth failure. | D004392 | Dwarfism |
| NUP98 | 24711643 | Identifying biological pathways that underlie primordial short stature using network analysis. | Mutations in CUL7, OBSL1 and CCDC8, leading to disordered ubiquitination, cause one of the commonest primordial growth disorders, 3-M syndrome. This condition is associated with i) abnormal p53 function, ii) GH and/or IGF1 resistance, which may relate to failure to recycle signalling molecules, and iii) cellular IGF2 deficiency. However the exact molecular mechanisms that may link these abnormalities generating growth restriction remain undefined. In this study, we have used immunoprecipitation/mass spectrometry and transcriptomic studies to generate a 3-M 'interactome', to define key cellular pathways and biological functions associated with growth failure seen in 3-M. We identified 189 proteins which interacted with CUL7, OBSL1 and CCDC8, from which a network including 176 of these proteins was generated. To strengthen the association to 3-M syndrome, these proteins were compared with an inferred network generated from the genes that were differentially expressed in 3-M fibroblasts compared with controls. This resulted in a final 3-M network of 131 proteins, with the most significant biological pathway within the network being mRNA splicing/processing. We have shown using an exogenous insulin receptor (INSR) minigene system that alternative splicing of exon 11 is significantly changed in HEK293 cells with altered expression of CUL7, OBSL1 and CCDC8 and in 3-M fibroblasts. The net result is a reduction in the expression of the mitogenic INSR isoform in 3-M syndrome. From these preliminary data, we hypothesise that disordered ubiquitination could result in aberrant mRNA splicing in 3-M; however, further investigation is required to determine whether this contributes to growth failure. | D006130 | Growth Disorders |
| NUP98 | 24711643 | Identifying biological pathways that underlie primordial short stature using network analysis. | Mutations in CUL7, OBSL1 and CCDC8, leading to disordered ubiquitination, cause one of the commonest primordial growth disorders, 3-M syndrome. This condition is associated with i) abnormal p53 function, ii) GH and/or IGF1 resistance, which may relate to failure to recycle signalling molecules, and iii) cellular IGF2 deficiency. However the exact molecular mechanisms that may link these abnormalities generating growth restriction remain undefined. In this study, we have used immunoprecipitation/mass spectrometry and transcriptomic studies to generate a 3-M 'interactome', to define key cellular pathways and biological functions associated with growth failure seen in 3-M. We identified 189 proteins which interacted with CUL7, OBSL1 and CCDC8, from which a network including 176 of these proteins was generated. To strengthen the association to 3-M syndrome, these proteins were compared with an inferred network generated from the genes that were differentially expressed in 3-M fibroblasts compared with controls. This resulted in a final 3-M network of 131 proteins, with the most significant biological pathway within the network being mRNA splicing/processing. We have shown using an exogenous insulin receptor (INSR) minigene system that alternative splicing of exon 11 is significantly changed in HEK293 cells with altered expression of CUL7, OBSL1 and CCDC8 and in 3-M fibroblasts. The net result is a reduction in the expression of the mitogenic INSR isoform in 3-M syndrome. From these preliminary data, we hypothesise that disordered ubiquitination could result in aberrant mRNA splicing in 3-M; however, further investigation is required to determine whether this contributes to growth failure. | D009123 | Muscle Hypotonia |
Clinically important variants in NUP98 |
(ClinVar, 04/20/2024) |
| accession_id | uniprot_id | gene_name | Type | Variant | Clinical_significance |
|
|