ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures
ASpdb Logo

Home

Download

Statistics

Examples

Help

Contact

Terms of Use

Center for Computational Systems Medicine
leaf

Protein Summary

leaf

AS Summary

leaf

Protein Functional Features

leaf

Gene Isoform Structures and Expression Levels

leaf

Protein Structures

leaf

pLDDT Score Distribution

leaf

Ramachandran Plot of Protein Structures

leaf

Potential Active Site Information

leaf

Protein Structure and Feature Comparision

leaf

Protein-Protein Interaction

leaf

Related Drugs

leaf

Related Diseases

leaf

Clinically Important Variants

Protein:NPM1

Protein Summary

check button Gene summary
Gene name: NPM1
ASpdb.0 ID: 4869
Gene
Gene symbol

NPM1

Gene ID

4869

Gene namenucleophosmin 1
SynonymsB23|NPM
Cytomap

5q35.1

Type of geneprotein-coding
Descriptionnucleophosminnucleolar protein NO38nucleophosmin (nucleolar phosphoprotein B23, numatrin)nucleophosmin/nucleoplasmin family, member 1testicular tissue protein Li 128
Modification date20240411
UniProtAcc

P06748


check button Gene ontology of this gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
GeneNPM1

GO:0001046

core promoter sequence-specific DNA binding

19160485

GeneNPM1

GO:0003713

transcription coactivator activity

15087454|19160485

GeneNPM1

GO:0003723

RNA binding

12058066

GeneNPM1

GO:0004860

protein kinase inhibitor activity

12882984

GeneNPM1

GO:0005634

nucleus

12080348|22720776

GeneNPM1

GO:0005654

nucleoplasm

12058066|19160485

GeneNPM1

GO:0005730

nucleolus

11420665|12080348|16041368|17475909|18809582|19160485|19188445|19208757|22528486|25956029

GeneNPM1

GO:0005737

cytoplasm

9121481

GeneNPM1

GO:0005813

centrosome

11051553

GeneNPM1

GO:0006281

DNA repair

19188445

GeneNPM1

GO:0006334

nucleosome assembly

11602260

GeneNPM1

GO:0006913

nucleocytoplasmic transport

16041368

GeneNPM1

GO:0008104

protein localization

18420587

GeneNPM1

GO:0008284

positive regulation of cell population proliferation

22528486

GeneNPM1

GO:0030957

Tat protein binding

9094689

GeneNPM1

GO:0031616

spindle pole centrosome

16041368

GeneNPM1

GO:0032071

regulation of endodeoxyribonuclease activity

19188445

GeneNPM1

GO:0032991

protein-containing complex

19160485

GeneNPM1

GO:0032993

protein-DNA complex

19160485

GeneNPM1

GO:0034644

cellular response to UV

19160485

GeneNPM1

GO:0042393

histone binding

11602260

GeneNPM1

GO:0042803

protein homodimerization activity

9121481

GeneNPM1

GO:0043023

ribosomal large subunit binding

18809582

GeneNPM1

GO:0043024

ribosomal small subunit binding

18809582

GeneNPM1

GO:0043066

negative regulation of apoptotic process

12882984

GeneNPM1

GO:0044387

negative regulation of protein kinase activity by regulation of protein phosphorylation

12882984

GeneNPM1

GO:0045727

positive regulation of translation

12882984

GeneNPM1

GO:0045893

positive regulation of DNA-templated transcription

22528486

GeneNPM1

GO:0045944

positive regulation of transcription by RNA polymerase II

19160485

GeneNPM1

GO:0051059

NF-kappaB binding

15087454

GeneNPM1

GO:0051082

unfolded protein binding

10211837

GeneNPM1

GO:0060699

regulation of endoribonuclease activity

19188445

GeneNPM1

GO:0060735

regulation of eIF2 alpha phosphorylation by dsRNA

12882984

GeneNPM1

GO:1902751

positive regulation of cell cycle G2/M phase transition

22528486

GeneNPM1

GO:1990904

ribonucleoprotein complex

18809582



AS Summary

check button Information of the canonical protein with experimentally identified structure from PDB (2023).
UniProt AccFile namePDB IDMethodResolutionChainStartEnd
P06748-1P06748-1_5ehd_C.pdb5EHDX-ray2.55C13119

check button ASpdb's canonical and alternatively spliced isoform information.
accession_idgene_namecanonical_idalternative_idcanonical_lengthalternative_lengthcanonical_startcanonical_endtypeoriginalSEQvariationSEQalternative_startalternative_end
P06748NPM1P06748-1P06748-2294265195223Deletionnonenone194194
P06748NPM1P06748-1P06748-3294259258294SubstitutionGGSLPKVEAKFINYVKNCFRMTDQEAIQDLWQWRKSLAH258259

check buttonMultiple sequence alignment of our canonical and alternatively spliced NPM1

check button Matched gene isoform IDs with Ensembl and RefSeq of our canonical and alternative spliced genes of NPM1
UniProt-idENSGENSTENSP
P06748-1ENSG00000181163.14ENST00000296930.10ENSP00000296930.5
P06748-1ENSG00000181163.14ENST00000517671.5ENSP00000428755.1
P06748-2ENSG00000181163.14ENST00000351986.10ENSP00000341168.6
P06748-3ENSG00000181163.14ENST00000393820.2ENSP00000377408.2
P06748-3ENSG00000181163.14ENST00000678280.1ENSP00000503235.1

UniProt-idNM IDNP ID
P06748-1NM_002520.6NP_002511.1
P06748-2NM_199185.3NP_954654.1
P06748-3NM_001037738.2NP_001032827.1

check buttonAmino acid sequences of our canonical and alternatively spliced NPM1
accession_idProtein sequence
P06748-1MEDSMDMDMSPLRPQNYLFGCELKADKDYHFKVDNDENEHQLSLRTVSLGAGAKDELHIVEAEAMNYEGSPIKVTLATLKMSVQPTVSLG
GFEITPPVVLRLKCGSGPVHISGQHLVAVEEDAESEDEEEEDVKLLSISGKRSAPGGGSKVPQKKVKLAADEDDDDDDEEDDDEDDDDDD
FDDEEAEEKAPVKKSIRDTPAKNAQKSNQNGKDSKPSSTPRSKGQESFKKQEKTPKTPKGPSSVEDIKAKMQASIEKGGSLPKVEAKFIN
P06748-2MEDSMDMDMSPLRPQNYLFGCELKADKDYHFKVDNDENEHQLSLRTVSLGAGAKDELHIVEAEAMNYEGSPIKVTLATLKMSVQPTVSLG
GFEITPPVVLRLKCGSGPVHISGQHLVAVEEDAESEDEEEEDVKLLSISGKRSAPGGGSKVPQKKVKLAADEDDDDDDEEDDDEDDDDDD
P06748-3MEDSMDMDMSPLRPQNYLFGCELKADKDYHFKVDNDENEHQLSLRTVSLGAGAKDELHIVEAEAMNYEGSPIKVTLATLKMSVQPTVSLG
GFEITPPVVLRLKCGSGPVHISGQHLVAVEEDAESEDEEEEDVKLLSISGKRSAPGGGSKVPQKKVKLAADEDDDDDDEEDDDEDDDDDD

Protein Functional Features

check buttonMain function of this protein. (from UniProt)
NPM1 (go to UniProt):P06748

check buttonRetention analysis result of protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

* Minus value of BPloci means that the break pointn is located before the CDS.
- Retained protein feature among the 13 regional features.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
P06748Region120247Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=195;End=223
P06748Region243294Note=Required for nucleolar localizationType=Substitution;Start=258;End=294
P06748Motif191197Note=Nuclear localization signal;Ontology_term=ECO:0000255;evidence=ECO:0000255Type=Deletion;Start=195;End=223
P06748Compositional bias201224Note=Polar residues;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=195;End=223


Gene Isoform Structures and Expression Levels for NPM1

check buttonGene structures of our canonical and alternative spliced genes of NPM1
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
gene isoform structure of NPM1

check button Expression levels of gene isoforms across GTEx.
gtex expression

check button Expression levels of gene isoforms across TCGA.
tcga expression


Protein Structures

check button PDB and CIF files of the predicted protein structures
* Here we show the 3D structure of the proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format.
3D view using mol* of P06748-1
3D view using mol* of P06748-2
3D view using mol* of P06748-3


pLDDT Score Distribution

check button pLDDT score distribution of the predicted protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
pLDDT distribution across the protein length of P06748-1
all structure
pLDDT distribution across the protein length of P06748-2
all structure
pLDDT distribution across the protein length of P06748-3
all structure


Ramachandran Plot of Protein Structures


check button Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this protein peptide.
Ramachandran plot of P06748-1
all structure
Ramachandran plot of P06748-3
all structure

Potential Active Site Information


check button The potential binding sites of these proteins were identified using SiteMap, a module of the Schrodinger suite.
UniProt-idSite scoreSizeD scoreVolumeExposureEnclosureContactPhobicPhilicBalanceDon/AccResidues
P06748-10.9811380.968533.0220.6420.670.8160.3141.1480.2741.11232,33,34,35,36,37,38,62,63,64,65,66,69,72,74,92,94
,95,96,97,239,240,241,242,250,254,257,259,260,261,
271,274,275,276,277
P06748-21.071701.087525.8190.4310.7951.0660.771.0120.7610.82430,32,33,34,35,36,38,39,40,63,64,65,69,70,71,95,96
,97,98,99,221,222,224,225,226,228,230,231,232,233,
238,241,242,245,246,247
P06748-30.713430.696140.630.7050.5890.8130.3490.8410.4161.4129,10,11,12,118,119,120,121,122,123,125,126

Protein Structure and Feature Comparision


check button Protein Structure Comparision Using Template Modeling Scores (TM-score).
all structure

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Canonical validated structure (PDB)(green)
3D view using mol* of P06748-1_P06748-1_5ehd_C.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical validated structure (PDB)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of P06748-1_5ehd_C_P06748-2.pdb
3D view using mol* of P06748-1_5ehd_C_P06748-3.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of P06748-1_P06748-2.pdb
3D view using mol* of P06748-1_P06748-3.pdb

check button Protein Feature Comparison of the protein sequendary structures among the protiens.
./stats/secondary_structure/figure/P06748-1_vs_P06748-2.png
all structure<
./stats/secondary_structure/figure/P06748-1_vs_P06748-3.png
all structure<

check button Protein Feature Comparison of the relative accessible surface area (ASA) among the protiens.
./stats/relative_asa/P06748-1_vs_P06748-2.png
all structure<
./stats/relative_asa/P06748-1_vs_P06748-3.png
all structure<


Protein-Protein Interaction


check button Interactors from UniProt.
Accession_idSubsectionStartEndFuncitonal featureSplicing information


check button Interactors from STRING.
Gene nameInteractors


Related Drugs to NPM1


check button Drugs targeting this gene/protein.
(DrugBank)
UniProt accessionGene nameDrugBank IDDrug nameDrug groupActions
P06748NPM1DB11638Artenimolapproved, experimental, investigationalligand

Related Diseases to NPM1


check button Previous studies relating to the alternative splicing of NPM1 and disease information from the MeSH term (PubMed)
GenePMIDTitleAbstractMeSH IDMeSH term
NPM124711643Identifying biological pathways that underlie primordial short stature using network analysis.Mutations in CUL7, OBSL1 and CCDC8, leading to disordered ubiquitination, cause one of the commonest primordial growth disorders, 3-M syndrome. This condition is associated with i) abnormal p53 function, ii) GH and/or IGF1 resistance, which may relate to failure to recycle signalling molecules, and iii) cellular IGF2 deficiency. However the exact molecular mechanisms that may link these abnormalities generating growth restriction remain undefined. In this study, we have used immunoprecipitation/mass spectrometry and transcriptomic studies to generate a 3-M 'interactome', to define key cellular pathways and biological functions associated with growth failure seen in 3-M. We identified 189 proteins which interacted with CUL7, OBSL1 and CCDC8, from which a network including 176 of these proteins was generated. To strengthen the association to 3-M syndrome, these proteins were compared with an inferred network generated from the genes that were differentially expressed in 3-M fibroblasts compared with controls. This resulted in a final 3-M network of 131 proteins, with the most significant biological pathway within the network being mRNA splicing/processing. We have shown using an exogenous insulin receptor (INSR) minigene system that alternative splicing of exon 11 is significantly changed in HEK293 cells with altered expression of CUL7, OBSL1 and CCDC8 and in 3-M fibroblasts. The net result is a reduction in the expression of the mitogenic INSR isoform in 3-M syndrome. From these preliminary data, we hypothesise that disordered ubiquitination could result in aberrant mRNA splicing in 3-M; however, further investigation is required to determine whether this contributes to growth failure.D004392Dwarfism
NPM124711643Identifying biological pathways that underlie primordial short stature using network analysis.Mutations in CUL7, OBSL1 and CCDC8, leading to disordered ubiquitination, cause one of the commonest primordial growth disorders, 3-M syndrome. This condition is associated with i) abnormal p53 function, ii) GH and/or IGF1 resistance, which may relate to failure to recycle signalling molecules, and iii) cellular IGF2 deficiency. However the exact molecular mechanisms that may link these abnormalities generating growth restriction remain undefined. In this study, we have used immunoprecipitation/mass spectrometry and transcriptomic studies to generate a 3-M 'interactome', to define key cellular pathways and biological functions associated with growth failure seen in 3-M. We identified 189 proteins which interacted with CUL7, OBSL1 and CCDC8, from which a network including 176 of these proteins was generated. To strengthen the association to 3-M syndrome, these proteins were compared with an inferred network generated from the genes that were differentially expressed in 3-M fibroblasts compared with controls. This resulted in a final 3-M network of 131 proteins, with the most significant biological pathway within the network being mRNA splicing/processing. We have shown using an exogenous insulin receptor (INSR) minigene system that alternative splicing of exon 11 is significantly changed in HEK293 cells with altered expression of CUL7, OBSL1 and CCDC8 and in 3-M fibroblasts. The net result is a reduction in the expression of the mitogenic INSR isoform in 3-M syndrome. From these preliminary data, we hypothesise that disordered ubiquitination could result in aberrant mRNA splicing in 3-M; however, further investigation is required to determine whether this contributes to growth failure.D006130Growth Disorders
NPM124711643Identifying biological pathways that underlie primordial short stature using network analysis.Mutations in CUL7, OBSL1 and CCDC8, leading to disordered ubiquitination, cause one of the commonest primordial growth disorders, 3-M syndrome. This condition is associated with i) abnormal p53 function, ii) GH and/or IGF1 resistance, which may relate to failure to recycle signalling molecules, and iii) cellular IGF2 deficiency. However the exact molecular mechanisms that may link these abnormalities generating growth restriction remain undefined. In this study, we have used immunoprecipitation/mass spectrometry and transcriptomic studies to generate a 3-M 'interactome', to define key cellular pathways and biological functions associated with growth failure seen in 3-M. We identified 189 proteins which interacted with CUL7, OBSL1 and CCDC8, from which a network including 176 of these proteins was generated. To strengthen the association to 3-M syndrome, these proteins were compared with an inferred network generated from the genes that were differentially expressed in 3-M fibroblasts compared with controls. This resulted in a final 3-M network of 131 proteins, with the most significant biological pathway within the network being mRNA splicing/processing. We have shown using an exogenous insulin receptor (INSR) minigene system that alternative splicing of exon 11 is significantly changed in HEK293 cells with altered expression of CUL7, OBSL1 and CCDC8 and in 3-M fibroblasts. The net result is a reduction in the expression of the mitogenic INSR isoform in 3-M syndrome. From these preliminary data, we hypothesise that disordered ubiquitination could result in aberrant mRNA splicing in 3-M; however, further investigation is required to determine whether this contributes to growth failure.D009123Muscle Hypotonia


Clinically important variants in NPM1


check button (ClinVar, 04/20/2024)
accession_iduniprot_idgene_nameTypeVariantClinical_significance