ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures
ASpdb Logo

Home

Download

Statistics

Examples

Help

Contact

Terms of Use

Center for Computational Systems Medicine
leaf

Protein Summary

leaf

AS Summary

leaf

Protein Functional Features

leaf

Gene Isoform Structures and Expression Levels

leaf

Protein Structures

leaf

pLDDT Score Distribution

leaf

Ramachandran Plot of Protein Structures

leaf

Potential Active Site Information

leaf

Protein Structure and Feature Comparision

leaf

Protein-Protein Interaction

leaf

Related Drugs

leaf

Related Diseases

leaf

Clinically Important Variants

Protein:EZH2

Protein Summary

check button Gene summary
Gene name: EZH2
ASpdb.0 ID: 2146
Gene
Gene symbol

EZH2

Gene ID

2146

Gene nameenhancer of zeste 2 polycomb repressive complex 2 subunit
SynonymsENX-1|ENX1|EZH2b|KMT6|KMT6A|WVS|WVS2
Cytomap

7q36.1

Type of geneprotein-coding
Descriptionhistone-lysine N-methyltransferase EZH2enhancer of zeste homolog 2lysine N-methyltransferase 6
Modification date20240411
UniProtAcc

Q15910


check button Gene ontology of this gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
GeneEZH2

GO:0000122

negative regulation of transcription by RNA polymerase II

20154697

GeneEZH2

GO:0000785

chromatin

19734898

GeneEZH2

GO:0003682

chromatin binding

20154697

GeneEZH2

GO:0005634

nucleus

19144645

GeneEZH2

GO:0005654

nucleoplasm

-

GeneEZH2

GO:0010718

positive regulation of epithelial to mesenchymal transition

20154697

GeneEZH2

GO:0016279

protein-lysine N-methyltransferase activity

19144645

GeneEZH2

GO:0031490

chromatin DNA binding

19734898

GeneEZH2

GO:0035098

ESC/E(Z) complex

15385962|20075857|23104054|23273982|24474760

GeneEZH2

GO:0042054

histone methyltransferase activity

15385962

GeneEZH2

GO:0043406

positive regulation of MAP kinase activity

20154697

GeneEZH2

GO:0043547

positive regulation of GTPase activity

20154697

GeneEZH2

GO:0045814

negative regulation of gene expression, epigenetic

20154697

GeneEZH2

GO:0046976

histone H3K27 methyltransferase activity

22323599|30923826

GeneEZH2

GO:0071902

positive regulation of protein serine/threonine kinase activity

20154697

GeneEZH2

GO:1990841

promoter-specific chromatin binding

24623306



AS Summary

check button Information of the canonical protein with experimentally identified structure from PDB (2023).
UniProt AccFile namePDB IDMethodResolutionChainStartEnd
Q15910-1Q15910-1_5hyn_A.pdb5HYNX-ray2.95A10740

check button ASpdb's canonical and alternatively spliced isoform information.
accession_idgene_namecanonical_idalternative_idcanonical_lengthalternative_lengthcanonical_startcanonical_endtypeoriginalSEQvariationSEQalternative_startalternative_end
Q15910EZH2Q15910-1Q15910-2746751297298SubstitutionHPHRKCNYS297303
Q15910EZH2Q15910-1Q15910-374670783121Deletionnonenone8282
Q15910EZH2Q15910-1Q15910-47467377482Deletionnonenone7373
Q15910EZH2Q15910-1Q15910-57466957482Deletionnonenone7373
Q15910EZH2Q15910-1Q15910-5746695511553SubstitutionDGSSNHVYNYQPCDHPRQPCDSSCPCVIAQNFCEKFCQCSSECG502502

check buttonMultiple sequence alignment of our canonical and alternatively spliced EZH2

check button Matched gene isoform IDs with Ensembl and RefSeq of our canonical and alternative spliced genes of EZH2
UniProt-idENSGENSTENSP
Q15910-1ENSG00000106462.12ENST00000460911.5ENSP00000419711.1
Q15910-2ENSG00000106462.12ENST00000320356.7ENSP00000320147.2
Q15910-3ENSG00000106462.12ENST00000350995.6ENSP00000223193.2
Q15910-4ENSG00000106462.12ENST00000483967.5ENSP00000419856.1
Q15910-5ENSG00000106462.12ENST00000476773.5ENSP00000419050.1
Q15910-5ENSG00000106462.12ENST00000478654.5ENSP00000417062.1

UniProt-idNM IDNP ID
Q15910-1NM_001203247.1NP_001190176.1
Q15910-2NM_004456.4NP_004447.2
Q15910-3NM_152998.2NP_694543.1
Q15910-4NM_001203248.1NP_001190177.1
Q15910-5NM_001203249.1NP_001190178.1

check buttonAmino acid sequences of our canonical and alternatively spliced EZH2
accession_idProtein sequence
Q15910-1MGQTGKKSEKGPVCWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEILNQEWKQRRIQPVHILTSVSSLRGTRECSVTSDLD
FPTQVIPLKTLNAVASVPIMYSWSPLQQNFMVEDETVLHNIPYMGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQ
YNDDDDDDDGDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNIDGPNAK
SVQREQSLHSFHTLFCRRCFKYDCFLHPFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRRGRLPN
NSSRPSTPTINVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSEANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDN
FCAIARLIGTKTCRQVYEFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDHPRQPCDSSCPCVIAQ
NFCEKFCQCSSECQNRFPGCRCKAQCNTKQCPCYLAVRECDPDLCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKD
PVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGE
Q15910-2MGQTGKKSEKGPVCWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEILNQEWKQRRIQPVHILTSVSSLRGTRECSVTSDLD
FPTQVIPLKTLNAVASVPIMYSWSPLQQNFMVEDETVLHNIPYMGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQ
YNDDDDDDDGDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNIDGPNAK
SVQREQSLHSFHTLFCRRCFKYDCFLHRKCNYSFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRR
GRLPNNSSRPSTPTINVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSEANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIG
TYYDNFCAIARLIGTKTCRQVYEFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDHPRQPCDSSCP
CVIAQNFCEKFCQCSSECQNRFPGCRCKAQCNTKQCPCYLAVRECDPDLCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWG
IFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRA
Q15910-3MGQTGKKSEKGPVCWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEILNQEWKQRRIQPVHILTSVSSLRGTREVEDETVLH
NIPYMGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQYNDDDDDDDGDDPEEREEKQKDLEDHRDDKESRPPRKFP
SDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNIDGPNAKSVQREQSLHSFHTLFCRRCFKYDCFLHPFHATPNTYKRK
NTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRRGRLPNNSSRPSTPTINVLESKDTDSDREAGTETGGENNDKEEEE
KKDETSSSSEANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDNFCAIARLIGTKTCRQVYEFRVKESSIIAPAPAEDVDTPP
RKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDHPRQPCDSSCPCVIAQNFCEKFCQCSSECQNRFPGCRCKAQCNTKQCPCYLAVRE
CDPDLCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNL
Q15910-4MGQTGKKSEKGPVCWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEILNQEWKQRRIQPVHILTSCSVTSDLDFPTQVIPLK
TLNAVASVPIMYSWSPLQQNFMVEDETVLHNIPYMGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQYNDDDDDDD
GDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNIDGPNAKSVQREQSLH
SFHTLFCRRCFKYDCFLHPFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRRGRLPNNSSRPSTPT
INVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSEANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDNFCAIARLIG
TKTCRQVYEFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDHPRQPCDSSCPCVIAQNFCEKFCQC
SSECQNRFPGCRCKAQCNTKQCPCYLAVRECDPDLCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKDPVQKNEFIS
EYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFDYRYS
Q15910-5MGQTGKKSEKGPVCWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEILNQEWKQRRIQPVHILTSCSVTSDLDFPTQVIPLK
TLNAVASVPIMYSWSPLQQNFMVEDETVLHNIPYMGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQYNDDDDDDD
GDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNIDGPNAKSVQREQSLH
SFHTLFCRRCFKYDCFLHPFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRRGRLPNNSSRPSTPT
INVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSEANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDNFCAIARLIG
TKTCRQVYEFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKGQNRFPGCRCKAQCNTKQCPCYLAVRECDPDLCLTCGAA
DHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKG

Protein Functional Features

check buttonMain function of this protein. (from UniProt)
EZH2 (go to UniProt):Q15910

check buttonRetention analysis result of protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

* Minus value of BPloci means that the break pointn is located before the CDS.
- Retained protein feature among the 13 regional features.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
Q15910Domain503605Note=CXC;Ontology_term=ECO:0000255;evidence=ECO:0000255|PROSITE-ProRule:PRU00970Type=Substitution;Start=511;End=553
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Substitution;Start=297;End=298
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Deletion;Start=83;End=121
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Deletion;Start=74;End=82
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Deletion;Start=74;End=82
Q15910Region329522Note=Interaction with CDYL;Ontology_term=ECO:0000269;evidence=ECO:0000269|PubMed:22009739;Dbxref=PMID:22009739Type=Substitution;Start=511;End=553


Gene Isoform Structures and Expression Levels for EZH2

check buttonGene structures of our canonical and alternative spliced genes of EZH2
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
gene isoform structure of EZH2

check button Expression levels of gene isoforms across GTEx.
gtex expression

check button Expression levels of gene isoforms across TCGA.
tcga expression


Protein Structures

check button PDB and CIF files of the predicted protein structures
* Here we show the 3D structure of the proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format.
3D view using mol* of Q15910-1
3D view using mol* of Q15910-2
3D view using mol* of Q15910-3
3D view using mol* of Q15910-4
3D view using mol* of Q15910-5


pLDDT Score Distribution

check button pLDDT score distribution of the predicted protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
pLDDT distribution across the protein length of Q15910-1
all structure
pLDDT distribution across the protein length of Q15910-2
all structure
pLDDT distribution across the protein length of Q15910-3
all structure
pLDDT distribution across the protein length of Q15910-4
all structure
pLDDT distribution across the protein length of Q15910-5
all structure


Ramachandran Plot of Protein Structures


check button Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this protein peptide.
Ramachandran plot of Q15910-1
all structure
Ramachandran plot of Q15910-2
all structure
Ramachandran plot of Q15910-3
all structure
Ramachandran plot of Q15910-4
all structure
Ramachandran plot of Q15910-5
all structure

Potential Active Site Information


check button The potential binding sites of these proteins were identified using SiteMap, a module of the Schrodinger suite.
UniProt-idSite scoreSizeD scoreVolumeExposureEnclosureContactPhobicPhilicBalanceDon/AccResidues
Q15910-11.0154041.0351419.6770.6050.7150.9020.541.0230.5271.233108,109,110,111,113,125,127,128,129,130,132,149,15
3,506,507,509,510,511,514,515,516,517,518,537,538,
540,541,542,544,555,576,577,578,622,623,624,648,65
1,652,654,655,657,658,659,661,663,664,665,666,667,
668,669,670,671,674,678,679,685,686,688,689,696,69
7,698,699,700,701,706,708,710,724,725,726,727,728,
729,732,735,736
Q15910-21.0532031.078681.1980.5230.7560.9530.7790.970.8030.844108,109,110,111,113,125,127,128,129,130,132,153,62
6,627,628,629,655,659,662,663,666,667,668,669,670,
683,684,690,691,693,694,731,733,740,741,742,743,74
4,745
Q15910-31.0431351.003274.7430.3840.7631.1050.4071.2110.3361.023528,535,540,558,559,560,562,563,564,565,566,569,57
1,573,592,593,596,597,598,657,659,661,672
Q15910-41.0481971.073642.4390.5570.7490.9450.6940.9710.7150.82499,100,101,102,103,104,116,118,119,120,121,123,144
,612,613,614,615,641,645,648,649,652,653,654,655,6
56,657,669,670,676,677,679,680,717,719,726,727,728
,729,730,732
Q15910-51.0481981.071609.1680.5230.7530.9670.750.9820.7640.74299,100,101,102,104,116,118,119,120,121,123,144,570
,571,572,573,603,606,607,610,611,612,613,614,627,6
28,634,635,637,638,675,677,684,685,686,687,688,690


Protein Structure and Feature Comparision


check button Protein Structure Comparision Using Template Modeling Scores (TM-score).
all structure

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Canonical validated structure (PDB)(green)
3D view using mol* of Q15910-1_Q15910-1_5hyn_A.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical validated structure (PDB)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of Q15910-1_5hyn_A_Q15910-2.pdb
3D view using mol* of Q15910-1_5hyn_A_Q15910-3.pdb
3D view using mol* of Q15910-1_5hyn_A_Q15910-4.pdb
3D view using mol* of Q15910-1_5hyn_A_Q15910-5.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of Q15910-1_Q15910-2.pdb
3D view using mol* of Q15910-1_Q15910-3.pdb
3D view using mol* of Q15910-1_Q15910-4.pdb
3D view using mol* of Q15910-1_Q15910-5.pdb

check button Protein Feature Comparison of the protein sequendary structures among the protiens.
./stats/secondary_structure/figure/Q15910-1_vs_Q15910-2.png
all structure<
./stats/secondary_structure/figure/Q15910-1_vs_Q15910-3.png
all structure<
./stats/secondary_structure/figure/Q15910-1_vs_Q15910-4.png
all structure<
./stats/secondary_structure/figure/Q15910-1_vs_Q15910-5.png
all structure<

check button Protein Feature Comparison of the relative accessible surface area (ASA) among the protiens.
./stats/relative_asa/Q15910-1_vs_Q15910-2.png
all structure<
./stats/relative_asa/Q15910-1_vs_Q15910-3.png
all structure<
./stats/relative_asa/Q15910-1_vs_Q15910-4.png
all structure<
./stats/relative_asa/Q15910-1_vs_Q15910-5.png
all structure<


Protein-Protein Interaction


check button Interactors from UniProt.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Substitution;Start=297;End=298
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Deletion;Start=83;End=121
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Deletion;Start=74;End=82
Q15910Region1340Note=Interaction with DNMT1%2C DNMT3A and DNMT3BType=Deletion;Start=74;End=82
Q15910Region329522Note=Interaction with CDYL;Ontology_term=ECO:0000269;evidence=ECO:0000269|PubMed:22009739;Dbxref=PMID:22009739Type=Substitution;Start=511;End=553


check button Interactors from STRING.
Gene nameInteractors


Related Drugs to EZH2


check button Drugs targeting this gene/protein.
(DrugBank)
UniProt accessionGene nameDrugBank IDDrug nameDrug groupActions
Q15910EZH2DB14581CPI-1205investigationalinhibitor
Q15910EZH2DB12887Tazemetostatapproved, investigationalinhibitor

Related Diseases to EZH2


check button Previous studies relating to the alternative splicing of EZH2 and disease information from the MeSH term (PubMed)
GenePMIDTitleAbstractMeSH IDMeSH term
EZH224032713Histone trimethylation of the p53 gene by expression of a constitutively active prolactin receptor in prostate cancer cells.Prolactin (PRL) is a pituitary polypeptide hormone characterized by multiple biological actions including stimulation of growth in the prostate, breast and ovarian epithelial cells. A sizable body of reports has presented evidences to indicate the involvement of PRL in the pathogenic process of cancers of the reproductive system, such as prostate and breast cancers. PRL exerts its effects by dimerizing its receptor (PRLR) on the plasma membrane, and initiating cellular Jak-Stat signal pathway. We have previously cloned from prostate cancer cells a natural variant of PRLR in which the S2 subdomain of the extracellular domain is missing (ΔS2). Our preliminary data showed that ΔS2 PRLR was able to dimerize and to constitutively activate the β-casein promoter (in the absence of its ligand, PRL) in breast and prostate epithelial cells. Enhancer of zeste homologue 2 (EZH2), an important histone-modifying enzyme, is able to trimethylate histone 3 on lysine 27 (H3K27Me3), consequently leading to gene silencing, especially silencing of tumor suppressor genes such as p53. We hypothesized that ΔS2 PRLR played an important pathogenic role in prostate cancer through, at least partly, alterations in the expression of EZH2 and the trimethylation of histone 3 on lysine 27. In the present study, overexpression of ΔS2 PRLR in prostate epithelial cells was achieved by infection with an adenoviral vector carrying the cDNA. The viable cell number overexpressing ΔS2 PRLR was assessed using MTS reagent. Western blot, chromatin immunoprecipitation (ChIP) assay and acid histone extraction were applied to detect expression of EZH2 as well as trimethylation of histone 3, respectively. In prostate epithelial cells, overexpression of ΔS2 PRLR increased the levels of EZH2 methyltransferase mRNA and protein, induced EZH2 methyltransferase recruitment to chromatin, increased the trimethylation of histone 3 on lysine 27, and decreased expression of the p53 gene. We concluded that ΔS2 PRLR plays an important pathogenic role in prostate cancer through epigenetic covalent modification leading to chromatin remodeling. Hypertrimethylation on H3K27 of the p53 gene promoter region due to elevated expression of ΔS2 PRLR by alternative splicing of the pre-mRNA in its full-length form might serve as a new mechanism underlying human prostate cancer.D011471Prostatic Neoplasms


Clinically important variants in EZH2


check button (ClinVar, 04/20/2024)
accession_iduniprot_idgene_nameTypeVariantClinical_significance