ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures
ASpdb Logo

Home

Download

Statistics

Examples

Help

Contact

Terms of Use

Center for Computational Systems Medicine
leaf

Protein Summary

leaf

AS Summary

leaf

Protein Functional Features

leaf

Gene Isoform Structures and Expression Levels

leaf

Protein Structures

leaf

pLDDT Score Distribution

leaf

Ramachandran Plot of Protein Structures

leaf

Potential Active Site Information

leaf

Protein Structure and Feature Comparision

leaf

Protein-Protein Interaction

leaf

Related Drugs

leaf

Related Diseases

leaf

Clinically Important Variants

Protein:HNRNPM

Protein Summary

check button Gene summary
Gene name: HNRNPM
ASpdb.0 ID: 4670
Gene
Gene symbol

HNRNPM

Gene ID

4670

Gene nameheterogeneous nuclear ribonucleoprotein M
SynonymsCEAR|HNRNPM4|HNRPM|HNRPM4|HTGR1|NAGR1|hnRNP M
Cytomap

19p13.2

Type of geneprotein-coding
Descriptionheterogeneous nuclear ribonucleoprotein MCEA receptorN-acetylglucosamine receptor 1heterogenous nuclear ribonucleoprotein M4hnRNA-binding protein M4
Modification date20240407
UniProtAcc

P52272


check button Gene ontology of this gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
GeneHNRNPM

GO:0005654

nucleoplasm

19874820

GeneHNRNPM

GO:0005681

spliceosomal complex

9731529

GeneHNRNPM

GO:0016363

nuclear matrix

19874820

GeneHNRNPM

GO:0042382

paraspeckles

19874820

GeneHNRNPM

GO:0071013

catalytic step 2 spliceosome

11991638



AS Summary

check button Information of the canonical protein with experimentally identified structure from PDB (2023).
UniProt AccFile namePDB IDMethodResolutionChainStartEnd
P52272-1P52272-1_2ot8_D.pdb2OT8X-ray3.1D5069

check button ASpdb's canonical and alternatively spliced isoform information.
accession_idgene_namecanonical_idalternative_idcanonical_lengthalternative_lengthcanonical_startcanonical_endtypeoriginalSEQvariationSEQalternative_startalternative_end
P52272HNRNPMP52272-1P52272-2730691160198Deletionnonenone159159

check buttonMultiple sequence alignment of our canonical and alternatively spliced HNRNPM

check button Matched gene isoform IDs with Ensembl and RefSeq of our canonical and alternative spliced genes of HNRNPM
UniProt-idENSGENSTENSP
P52272-1ENSG00000099783.13ENST00000325495.9ENSP00000325376.2
P52272-2ENSG00000099783.13ENST00000348943.7ENSP00000325732.2

UniProt-idNM IDNP ID
P52272-1NM_005968.4NP_005959.2
P52272-2NM_031203.3NP_112480.2

check buttonAmino acid sequences of our canonical and alternatively spliced HNRNPM
accession_idProtein sequence
P52272-1MAAGVEAAAEVAATEIKMEEESGAPGVPSGNGAPGPKGEGERPAQNEKRKEKNIKRGGNRFEPYANPTKRYRAFITNIPFDVKWQSLKDL
VKEKVGEVTYVELLMDAEGKSRGCAVVEFKMEESMKKAAEVLNKHSLSGRPLKVKEDPDGEHARRAMQKVMATTGGMGMGPGGPGMITIP
PSILNNPNIPNEIIHALQAGRLGSTVFVANLDYKVGWKKLKEVFSMAGVVVRADILEDKDGKSRGIGTVTFEQSIEAVQAISMFNGQLLF
DRPMHVKMDERALPKGDFFPPERPQQLPHGLGGIGMGLGPGGQPIDANHLNKGIGMGNIGPAGMGMEGIGFGINKMGGMEGPFGGGMENM
GRFGSGMNMGRINEILSNALKRGEIIAKQGGGGGGGSVPGIERMGPGIDRLGGAGMERMGAGLGHGMDRVGSEIERMGLVMDRMGSVERM
GSGIERMGPLGLDHMASSIERMGQTMERIGSGVERMGAGMGFGLERMAAPIDRVGQTIERMGSGVERMGPAIERMGLSMERMVPAGMGAG
LERMGPVMDRMATGLERMGANNLERMGLERMGANSLERMGLERMGANSLERMGPAMGPALGAGIERMGLAMGGGGGASFDRAIEMERGNF
GGSFAGSFGGAGGHAPGVARKACQIFVRNLPFDFTWKMLKDKFNECGHVLYADIKMENGKSKGCGVVKFESPEVAERACRMMNGMKLSGR
P52272-2MAAGVEAAAEVAATEIKMEEESGAPGVPSGNGAPGPKGEGERPAQNEKRKEKNIKRGGNRFEPYANPTKRYRAFITNIPFDVKWQSLKDL
VKEKVGEVTYVELLMDAEGKSRGCAVVEFKMEESMKKAAEVLNKHSLSGRPLKVKEDPDGEHARRAMQKAGRLGSTVFVANLDYKVGWKK
LKEVFSMAGVVVRADILEDKDGKSRGIGTVTFEQSIEAVQAISMFNGQLLFDRPMHVKMDERALPKGDFFPPERPQQLPHGLGGIGMGLG
PGGQPIDANHLNKGIGMGNIGPAGMGMEGIGFGINKMGGMEGPFGGGMENMGRFGSGMNMGRINEILSNALKRGEIIAKQGGGGGGGSVP
GIERMGPGIDRLGGAGMERMGAGLGHGMDRVGSEIERMGLVMDRMGSVERMGSGIERMGPLGLDHMASSIERMGQTMERIGSGVERMGAG
MGFGLERMAAPIDRVGQTIERMGSGVERMGPAIERMGLSMERMVPAGMGAGLERMGPVMDRMATGLERMGANNLERMGLERMGANSLERM
GLERMGANSLERMGPAMGPALGAGIERMGLAMGGGGGASFDRAIEMERGNFGGSFAGSFGGAGGHAPGVARKACQIFVRNLPFDFTWKML

Protein Functional Features

check buttonMain function of this protein. (from UniProt)
HNRNPM (go to UniProt):P52272

check buttonRetention analysis result of protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

* Minus value of BPloci means that the break pointn is located before the CDS.
- Retained protein feature among the 13 regional features.
Accession_idSubsectionStartEndFuncitonal featureSplicing information


Gene Isoform Structures and Expression Levels for HNRNPM

check buttonGene structures of our canonical and alternative spliced genes of HNRNPM
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
gene isoform structure of HNRNPM

check button Expression levels of gene isoforms across GTEx.
gtex expression

check button Expression levels of gene isoforms across TCGA.
tcga expression


Protein Structures

check button PDB and CIF files of the predicted protein structures
* Here we show the 3D structure of the proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format.
3D view using mol* of P52272-1
3D view using mol* of P52272-2


pLDDT Score Distribution

check button pLDDT score distribution of the predicted protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
pLDDT distribution across the protein length of P52272-1
all structure
pLDDT distribution across the protein length of P52272-2
all structure


Ramachandran Plot of Protein Structures


check button Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this protein peptide.
Ramachandran plot of P52272-1
all structure

Potential Active Site Information


check button The potential binding sites of these proteins were identified using SiteMap, a module of the Schrodinger suite.
UniProt-idSite scoreSizeD scoreVolumeExposureEnclosureContactPhobicPhilicBalanceDon/AccResidues
P52272-10.837590.822143.0310.5460.6810.9020.430.9470.4540.96761,62,63,64,65,66,67,69,70,71,99,118,119,120
P52272-20.722450.688179.7320.7660.6230.7290.3090.9720.3181.287155,158,159,162,163,164,165,166,211,215,242,243,24
4,245,246,247,248

Protein Structure and Feature Comparision


check button Protein Structure Comparision Using Template Modeling Scores (TM-score).
all structure

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Canonical validated structure (PDB)(green)
3D view using mol* of P52272-1_P52272-1_2ot8_D.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical validated structure (PDB)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of P52272-1_2ot8_D_P52272-2.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of P52272-1_P52272-2.pdb

check button Protein Feature Comparison of the protein sequendary structures among the protiens.
./stats/secondary_structure/figure/P52272-1_vs_P52272-2.png
all structure<

check button Protein Feature Comparison of the relative accessible surface area (ASA) among the protiens.
./stats/relative_asa/P52272-1_vs_P52272-2.png
all structure<


Protein-Protein Interaction


check button Interactors from UniProt.
Accession_idSubsectionStartEndFuncitonal featureSplicing information


check button Interactors from STRING.
Gene nameInteractors


Related Drugs to HNRNPM


check button Drugs targeting this gene/protein.
(DrugBank)
UniProt accessionGene nameDrugBank IDDrug nameDrug groupActions

Related Diseases to HNRNPM


check button Previous studies relating to the alternative splicing of HNRNPM and disease information from the MeSH term (PubMed)
GenePMIDTitleAbstractMeSH IDMeSH term
HNRNPM14597422Surface expression of heterogeneous nuclear RNA binding protein M4 on Kupffer cell relates to its function as a carcinoembryonic antigen receptor.Elevated concentrations of carcinoembryonic antigen (CEA) in the blood are associated with the development of hepatic metastases from colorectal cancers. Clearance of circulating CEA occurs through endocytosis by liver macrophages, Kupffer cells. Previously we identified heterogeneous nuclear ribonucleoproteins M4 (hnRNP M4) as a receptor (CEAR) for CEA. HnRNP M4 has two isoform proteins (p80, p76), the full-length hnRNP M4 (CEARL) and a truncated form (CEARS) with a deletion of 39 amino acids between RNA binding domains 1 and 2, generated by alternative splicing. The present study was undertaken to clarify any isoform-specific differences in terms of their function as CEA receptor and localization. We develop anti-CEAR isoform-specific antibodies and show that both CEAR splicing isoforms are expressed on the surface of Kupffer cells and can function as CEA receptor. Alternatively, in P388D1 macrophages CEARS protein has nuclear and CEARL has cytoplasmic localization. In MIP101 colon cancer and HeLa cells the CEARS protein is localized to the nucleus and CEARL to the cytoplasm. These findings imply that different functions are assigned to CEAR isoforms depending on the cell type. The search of 39 amino acids deleted region against the Prosite data base revealed the presence of N-myristylation signal PGGPGMITIP that may be involved in protein targeting to the plasma membrane. Overall, this report demonstrates that the cellular distribution, level of expression, and relative amount of CEARL and CEARS isoforms determine specificity for CEA binding and the expression of alternative spliced forms of CEAR is regulated in a tissue-specific manner.D002277Carcinoma
HNRNPM14597422Surface expression of heterogeneous nuclear RNA binding protein M4 on Kupffer cell relates to its function as a carcinoembryonic antigen receptor.Elevated concentrations of carcinoembryonic antigen (CEA) in the blood are associated with the development of hepatic metastases from colorectal cancers. Clearance of circulating CEA occurs through endocytosis by liver macrophages, Kupffer cells. Previously we identified heterogeneous nuclear ribonucleoproteins M4 (hnRNP M4) as a receptor (CEAR) for CEA. HnRNP M4 has two isoform proteins (p80, p76), the full-length hnRNP M4 (CEARL) and a truncated form (CEARS) with a deletion of 39 amino acids between RNA binding domains 1 and 2, generated by alternative splicing. The present study was undertaken to clarify any isoform-specific differences in terms of their function as CEA receptor and localization. We develop anti-CEAR isoform-specific antibodies and show that both CEAR splicing isoforms are expressed on the surface of Kupffer cells and can function as CEA receptor. Alternatively, in P388D1 macrophages CEARS protein has nuclear and CEARL has cytoplasmic localization. In MIP101 colon cancer and HeLa cells the CEARS protein is localized to the nucleus and CEARL to the cytoplasm. These findings imply that different functions are assigned to CEAR isoforms depending on the cell type. The search of 39 amino acids deleted region against the Prosite data base revealed the presence of N-myristylation signal PGGPGMITIP that may be involved in protein targeting to the plasma membrane. Overall, this report demonstrates that the cellular distribution, level of expression, and relative amount of CEARL and CEARS isoforms determine specificity for CEA binding and the expression of alternative spliced forms of CEAR is regulated in a tissue-specific manner.D015179Colorectal Neoplasms
HNRNPM14597422Surface expression of heterogeneous nuclear RNA binding protein M4 on Kupffer cell relates to its function as a carcinoembryonic antigen receptor.Elevated concentrations of carcinoembryonic antigen (CEA) in the blood are associated with the development of hepatic metastases from colorectal cancers. Clearance of circulating CEA occurs through endocytosis by liver macrophages, Kupffer cells. Previously we identified heterogeneous nuclear ribonucleoproteins M4 (hnRNP M4) as a receptor (CEAR) for CEA. HnRNP M4 has two isoform proteins (p80, p76), the full-length hnRNP M4 (CEARL) and a truncated form (CEARS) with a deletion of 39 amino acids between RNA binding domains 1 and 2, generated by alternative splicing. The present study was undertaken to clarify any isoform-specific differences in terms of their function as CEA receptor and localization. We develop anti-CEAR isoform-specific antibodies and show that both CEAR splicing isoforms are expressed on the surface of Kupffer cells and can function as CEA receptor. Alternatively, in P388D1 macrophages CEARS protein has nuclear and CEARL has cytoplasmic localization. In MIP101 colon cancer and HeLa cells the CEARS protein is localized to the nucleus and CEARL to the cytoplasm. These findings imply that different functions are assigned to CEAR isoforms depending on the cell type. The search of 39 amino acids deleted region against the Prosite data base revealed the presence of N-myristylation signal PGGPGMITIP that may be involved in protein targeting to the plasma membrane. Overall, this report demonstrates that the cellular distribution, level of expression, and relative amount of CEARL and CEARS isoforms determine specificity for CEA binding and the expression of alternative spliced forms of CEAR is regulated in a tissue-specific manner.D008113Liver Neoplasms
HNRNPM16230076Beta-catenin interacts with the FUS proto-oncogene product and regulates pre-mRNA splicing.beta-Catenin is a downstream effector of the Wnt signaling pathway and is believed to exert its oncogenic function by activating T-cell factor (TCF)/lymphoid enhancer factor (LEF) family transcriptional factors. However, it is still uncertain whether the diverse effects of beta-catenin are caused solely by aberrant gene transactivation. In this study, we used a proteomics approach to obtain further insight into the functional properties of nuclear beta-catenin.D015179Colorectal Neoplasms
HNRNPM24840202Cell type-restricted activity of hnRNPM promotes breast cancer metastasis via regulating alternative splicing.Tumor metastasis remains the major cause of cancer-related death, but its molecular basis is still not well understood. Here we uncovered a splicing-mediated pathway that is essential for breast cancer metastasis. We show that the RNA-binding protein heterogeneous nuclear ribonucleoprotein M (hnRNPM) promotes breast cancer metastasis by activating the switch of alternative splicing that occurs during epithelial-mesenchymal transition (EMT). Genome-wide deep sequencing analysis suggests that hnRNPM potentiates TGFβ signaling and identifies CD44 as a key downstream target of hnRNPM. hnRNPM ablation prevents TGFβ-induced EMT and inhibits breast cancer metastasis in mice, whereas enforced expression of the specific CD44 standard (CD44s) splice isoform overrides the loss of hnRNPM and permits EMT and metastasis. Mechanistically, we demonstrate that the ubiquitously expressed hnRNPM acts in a mesenchymal-specific manner to precisely control CD44 splice isoform switching during EMT. This restricted cell-type activity of hnRNPM is achieved by competition with ESRP1, an epithelial splicing regulator that binds to the same cis-regulatory RNA elements as hnRNPM and is repressed during EMT. Importantly, hnRNPM is associated with aggressive breast cancer and correlates with increased CD44s in patient specimens. These findings demonstrate a novel molecular mechanism through which tumor metastasis is endowed by the hnRNPM-mediated splicing program.D001943Breast Neoplasms
HNRNPM24840202Cell type-restricted activity of hnRNPM promotes breast cancer metastasis via regulating alternative splicing.Tumor metastasis remains the major cause of cancer-related death, but its molecular basis is still not well understood. Here we uncovered a splicing-mediated pathway that is essential for breast cancer metastasis. We show that the RNA-binding protein heterogeneous nuclear ribonucleoprotein M (hnRNPM) promotes breast cancer metastasis by activating the switch of alternative splicing that occurs during epithelial-mesenchymal transition (EMT). Genome-wide deep sequencing analysis suggests that hnRNPM potentiates TGFβ signaling and identifies CD44 as a key downstream target of hnRNPM. hnRNPM ablation prevents TGFβ-induced EMT and inhibits breast cancer metastasis in mice, whereas enforced expression of the specific CD44 standard (CD44s) splice isoform overrides the loss of hnRNPM and permits EMT and metastasis. Mechanistically, we demonstrate that the ubiquitously expressed hnRNPM acts in a mesenchymal-specific manner to precisely control CD44 splice isoform switching during EMT. This restricted cell-type activity of hnRNPM is achieved by competition with ESRP1, an epithelial splicing regulator that binds to the same cis-regulatory RNA elements as hnRNPM and is repressed during EMT. Importantly, hnRNPM is associated with aggressive breast cancer and correlates with increased CD44s in patient specimens. These findings demonstrate a novel molecular mechanism through which tumor metastasis is endowed by the hnRNPM-mediated splicing program.D009362Neoplasm Metastasis


Clinically important variants in HNRNPM


check button (ClinVar, 04/20/2024)
accession_iduniprot_idgene_nameTypeVariantClinical_significance