ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures
ASpdb Logo

Home

Download

Statistics

Examples

Help

Contact

Terms of Use

Center for Computational Systems Medicine
leaf

Protein Summary

leaf

AS Summary

leaf

Protein Functional Features

leaf

Gene Isoform Structures and Expression Levels

leaf

Protein Structures

leaf

pLDDT Score Distribution

leaf

Ramachandran Plot of Protein Structures

leaf

Potential Active Site Information

leaf

Protein Structure and Feature Comparision

leaf

Protein-Protein Interaction

leaf

Related Drugs

leaf

Related Diseases

leaf

Clinically Important Variants

Protein:CELF2

Protein Summary

check button Gene summary
Gene name: CELF2
ASpdb.0 ID: 10659
Gene
Gene symbol

CELF2

Gene ID

10659

Gene nameCUGBP Elav-like family member 2
SynonymsBRUNOL3|CELF-2|CUG-BP2|CUGBP2|DEE97|ETR-3|ETR3|NAPOR
Cytomap

10p14

Type of geneprotein-coding
DescriptionCUGBP Elav-like family member 2CUG triplet repeat RNA-binding protein 2CUG-BP- and ETR-3-like factor 2ELAV-type RNA-binding protein 3KDM2B/CELF2 fusionRNA-binding protein BRUNOL-3bruno-like protein 3neuroblastoma apoptosis-related RNA-binding prote
Modification date20240403
UniProtAcc

O95319


check button Gene ontology of this gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
GeneCELF2

GO:0005634

nucleus

11158314

GeneCELF2

GO:0005654

nucleoplasm

-

GeneCELF2

GO:0006376

mRNA splice site recognition

11158314

GeneCELF2

GO:0036002

pre-mRNA binding

11158314

GeneCELF2

GO:0043231

intracellular membrane-bounded organelle

-

GeneCELF2

GO:0090543

Flemming body

-



AS Summary

check button Information of the canonical protein with experimentally identified structure from PDB (2023).
UniProt AccFile namePDB IDMethodResolutionChainStartEnd
O95319-1O95319-1_4lmz_A.pdb4LMZX-ray2.78A36211

check button ASpdb's canonical and alternatively spliced isoform information.
accession_idgene_namecanonical_idalternative_idcanonical_lengthalternative_lengthcanonical_startcanonical_endtypeoriginalSEQvariationSEQalternative_startalternative_end
O95319CELF2O95319-1O95319-2508490124Deletionnonenone00
O95319CELF2O95319-1O95319-2508490358358SubstitutionAAVAQMLS334340
O95319CELF2O95319-1O95319-3508521118SubstitutionMRCPKSAVTMRNEELLLSMMVEGRLLVPDRI113
O95319CELF2O95319-1O95319-3508521358358SubstitutionAAGTINTPRSKRLLLPKDNN353371
O95319CELF2O95319-1O95319-4508509118SubstitutionMRCPKSAVTMRNEELLLSMMVEGRLLVPDRI113
O95319CELF2O95319-1O95319-4508509358358SubstitutionAAVAQMLS353359
O95319CELF2O95319-1O95319-5508488124Deletionnonenone00
O95319CELF2O95319-1O95319-5508488359359SubstitutionGGTINS335339

check buttonMultiple sequence alignment of our canonical and alternatively spliced CELF2

check button Matched gene isoform IDs with Ensembl and RefSeq of our canonical and alternative spliced genes of CELF2
UniProt-idENSGENSTENSP
O95319-1ENSG00000048740.19ENST00000416382.6ENSP00000406451.2
O95319-1ENSG00000048740.19ENST00000631460.1ENSP00000488582.1
O95319-2ENSG00000048740.19ENST00000399850.7ENSP00000382743.3
O95319-2ENSG00000048740.19ENST00000417956.6ENSP00000404834.3
O95319-2ENSG00000048740.19ENST00000632728.1ENSP00000487802.1
O95319-2ENSG00000048740.19ENST00000638035.1ENSP00000490401.1
O95319-5ENSG00000048740.19ENST00000608830.5ENSP00000476999.1

UniProt-idNM IDNP ID
O95319-1NM_001025077.2NP_001020248.1
O95319-1NM_001326332.1NP_001313261.1
O95319-2NM_001025076.2NP_001020247.1
O95319-2NM_001326318.1NP_001313247.1
O95319-2NM_001326320.1NP_001313249.1
O95319-2NM_001326324.1NP_001313253.1
O95319-2NM_001326328.1NP_001313257.1
O95319-2NM_001326330.1NP_001313259.1
O95319-2NM_001326334.1NP_001313263.1
O95319-2NM_001326345.1NP_001313274.1
O95319-2NM_001326349.1NP_001313278.1
O95319-5NM_001083591.1NP_001077060.1

check buttonAmino acid sequences of our canonical and alternatively spliced CELF2
accession_idProtein sequence
O95319-1MRCPKSAVTMRNEELLLSNGTANKMNGALDHSDQPDPDAIKMFVGQIPRSWSEKELKELFEPYGAVYQINVLRDRSQNPPQSKGCCFVTF
YTRKAALEAQNALHNIKTLPGMHHPIQMKPADSEKSNAVEDRKLFIGMVSKKCNENDIRVMFSPFGQIEECRILRGPDGLSRGCAFVTFS
TRAMAQNAIKAMHQSQTMEGCSSPIVVKFADTQKDKEQRRLQQQLAQQMQQLNTATWGNLTGLGGLTPQYLALLQQATSSSNLGAFSGIQ
QMAGMNALQLQNLATLAAAAAAAQTSATSTNANPLSTTSSALGALTSPVAASTPNSTAGAAMNSLTSLGTLQGLAGATVGLNNINALAGM
AALNGGLGATGLTNGTAGTMDALTQAYSGIQQYAAAALPTLYSQSLLQQQSAAGSQKEGPEGANLFIYHLPQEFGDQDILQMFMPFGNVI
O95319-2MNGALDHSDQPDPDAIKMFVGQIPRSWSEKELKELFEPYGAVYQINVLRDRSQNPPQSKGCCFVTFYTRKAALEAQNALHNIKTLPGMHH
PIQMKPADSEKSNAVEDRKLFIGMVSKKCNENDIRVMFSPFGQIEECRILRGPDGLSRGCAFVTFSTRAMAQNAIKAMHQSQTMEGCSSP
IVVKFADTQKDKEQRRLQQQLAQQMQQLNTATWGNLTGLGGLTPQYLALLQQATSSSNLGAFSGIQQMAGMNALQLQNLATLAAAAAAAQ
TSATSTNANPLSTTSSALGALTSPVAASTPNSTAGAAMNSLTSLGTLQGLAGATVGLNNINALAVAQMLSGMAALNGGLGATGLTNGTAG
TMDALTQAYSGIQQYAAAALPTLYSQSLLQQQSAAGSQKEGPEGANLFIYHLPQEFGDQDILQMFMPFGNVISAKVFIDKQTNLSKCFGF
O95319-3MMVEGRLLVPDRINGTANKMNGALDHSDQPDPDAIKMFVGQIPRSWSEKELKELFEPYGAVYQINVLRDRSQNPPQSKGCCFVTFYTRKA
ALEAQNALHNIKTLPGMHHPIQMKPADSEKSNAVEDRKLFIGMVSKKCNENDIRVMFSPFGQIEECRILRGPDGLSRGCAFVTFSTRAMA
QNAIKAMHQSQTMEGCSSPIVVKFADTQKDKEQRRLQQQLAQQMQQLNTATWGNLTGLGGLTPQYLALLQQATSSSNLGAFSGIQQMAGM
NALQLQNLATLAAAAAAAQTSATSTNANPLSTTSSALGALTSPVAASTPNSTAGAAMNSLTSLGTLQGLAGATVGLNNINALAGTINTPR
SKRLLLPKDNNGMAALNGGLGATGLTNGTAGTMDALTQAYSGIQQYAAAALPTLYSQSLLQQQSAAGSQKEGPEGANLFIYHLPQEFGDQ
O95319-4MMVEGRLLVPDRINGTANKMNGALDHSDQPDPDAIKMFVGQIPRSWSEKELKELFEPYGAVYQINVLRDRSQNPPQSKGCCFVTFYTRKA
ALEAQNALHNIKTLPGMHHPIQMKPADSEKSNAVEDRKLFIGMVSKKCNENDIRVMFSPFGQIEECRILRGPDGLSRGCAFVTFSTRAMA
QNAIKAMHQSQTMEGCSSPIVVKFADTQKDKEQRRLQQQLAQQMQQLNTATWGNLTGLGGLTPQYLALLQQATSSSNLGAFSGIQQMAGM
NALQLQNLATLAAAAAAAQTSATSTNANPLSTTSSALGALTSPVAASTPNSTAGAAMNSLTSLGTLQGLAGATVGLNNINALAVAQMLSG
MAALNGGLGATGLTNGTAGTMDALTQAYSGIQQYAAAALPTLYSQSLLQQQSAAGSQKEGPEGANLFIYHLPQEFGDQDILQMFMPFGNV
O95319-5MNGALDHSDQPDPDAIKMFVGQIPRSWSEKELKELFEPYGAVYQINVLRDRSQNPPQSKGCCFVTFYTRKAALEAQNALHNIKTLPGMHH
PIQMKPADSEKSNAVEDRKLFIGMVSKKCNENDIRVMFSPFGQIEECRILRGPDGLSRGCAFVTFSTRAMAQNAIKAMHQSQTMEGCSSP
IVVKFADTQKDKEQRRLQQQLAQQMQQLNTATWGNLTGLGGLTPQYLALLQQATSSSNLGAFSGIQQMAGMNALQLQNLATLAAAAAAAQ
TSATSTNANPLSTTSSALGALTSPVAASTPNSTAGAAMNSLTSLGTLQGLAGATVGLNNINALAGTINSMAALNGGLGATGLTNGTAGTM
DALTQAYSGIQQYAAAALPTLYSQSLLQQQSAAGSQKEGPEGANLFIYHLPQEFGDQDILQMFMPFGNVISAKVFIDKQTNLSKCFGFVS

Protein Functional Features

check buttonMain function of this protein. (from UniProt)
CELF2 (go to UniProt):O95319

check buttonRetention analysis result of protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

* Minus value of BPloci means that the break pointn is located before the CDS.
- Retained protein feature among the 13 regional features.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
O95319Region1283Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Deletion;Start=1;End=24
O95319Region1283Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Substitution;Start=1;End=18
O95319Region1283Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Substitution;Start=1;End=18
O95319Region1283Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Deletion;Start=1;End=24
O95319Region357508Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Substitution;Start=358;End=358
O95319Region357508Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Substitution;Start=358;End=358
O95319Region357508Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Substitution;Start=358;End=358
O95319Region357508Note=Necessary for RNA-binding%2C TNNT2 exon 5 and NMDA R1 exon 21 inclusionType=Substitution;Start=359;End=359


Gene Isoform Structures and Expression Levels for CELF2

check buttonGene structures of our canonical and alternative spliced genes of CELF2
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
gene isoform structure of CELF2

check button Expression levels of gene isoforms across GTEx.
gtex expression

check button Expression levels of gene isoforms across TCGA.
tcga expression


Protein Structures

check button PDB and CIF files of the predicted protein structures
* Here we show the 3D structure of the proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format.
3D view using mol* of O95319-1
3D view using mol* of O95319-2
3D view using mol* of O95319-3
3D view using mol* of O95319-4
3D view using mol* of O95319-5


pLDDT Score Distribution

check button pLDDT score distribution of the predicted protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
pLDDT distribution across the protein length of O95319-1
all structure
pLDDT distribution across the protein length of O95319-2
all structure
pLDDT distribution across the protein length of O95319-3
all structure
pLDDT distribution across the protein length of O95319-4
all structure
pLDDT distribution across the protein length of O95319-5
all structure


Ramachandran Plot of Protein Structures


check button Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this protein peptide.
Ramachandran plot of O95319-1
all structure
Ramachandran plot of O95319-2
all structure
Ramachandran plot of O95319-3
all structure

Potential Active Site Information


check button The potential binding sites of these proteins were identified using SiteMap, a module of the Schrodinger suite.
UniProt-idSite scoreSizeD scoreVolumeExposureEnclosureContactPhobicPhilicBalanceDon/AccResidues
O95319-10.613320.46665.5130.6890.5950.8180.0781.2960.0610.421418,421,422,423,424,498,499,500,501,502
O95319-20.726570.704117.9920.7410.5560.7330.1061.040.1020.502403,404,406,408,410,479,480,481,482,483,484,486
O95319-30.718330.702164.9830.8020.640.7060.3890.6830.5690.927422,423,424,425,427,428,429,430,449,450,453,462,46
3,464,465,466,481
O95319-40.66160.39554.880.6520.8471.23801.50100.531417,418,419,425,452,454,469,471,500
O95319-50.6260.533110.1030.7430.6170.8110.2460.9720.2531.621106,109,111,152,185,186,187,188,189

Protein Structure and Feature Comparision


check button Protein Structure Comparision Using Template Modeling Scores (TM-score).
all structure

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Canonical validated structure (PDB)(green)
3D view using mol* of O95319-1_O95319-1_4lmz_A.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical validated structure (PDB)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of O95319-1_4lmz_A_O95319-2.pdb
3D view using mol* of O95319-1_4lmz_A_O95319-3.pdb
3D view using mol* of O95319-1_4lmz_A_O95319-4.pdb
3D view using mol* of O95319-1_4lmz_A_O95319-5.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of O95319-1_O95319-2.pdb
3D view using mol* of O95319-1_O95319-3.pdb
3D view using mol* of O95319-1_O95319-4.pdb
3D view using mol* of O95319-1_O95319-5.pdb

check button Protein Feature Comparison of the protein sequendary structures among the protiens.
./stats/secondary_structure/figure/O95319-1_vs_O95319-2.png
all structure<
./stats/secondary_structure/figure/O95319-1_vs_O95319-3.png
all structure<
./stats/secondary_structure/figure/O95319-1_vs_O95319-4.png
all structure<
./stats/secondary_structure/figure/O95319-1_vs_O95319-5.png
all structure<

check button Protein Feature Comparison of the relative accessible surface area (ASA) among the protiens.
./stats/relative_asa/O95319-1_vs_O95319-2.png
all structure<
./stats/relative_asa/O95319-1_vs_O95319-3.png
all structure<
./stats/relative_asa/O95319-1_vs_O95319-4.png
all structure<
./stats/relative_asa/O95319-1_vs_O95319-5.png
all structure<


Protein-Protein Interaction


check button Interactors from UniProt.
Accession_idSubsectionStartEndFuncitonal featureSplicing information


check button Interactors from STRING.
Gene nameInteractors


Related Drugs to CELF2


check button Drugs targeting this gene/protein.
(DrugBank)
UniProt accessionGene nameDrugBank IDDrug nameDrug groupActions

Related Diseases to CELF2


check button Previous studies relating to the alternative splicing of CELF2 and disease information from the MeSH term (PubMed)
GenePMIDTitleAbstractMeSH IDMeSH term
CELF29887331Cardiac elav-type RNA-binding protein (ETR-3) binds to RNA CUG repeats expanded in myotonic dystrophy.Myotonic dystrophy (DM) is a neuromuscular disorder associated with CTG triplet repeat expansion in the myotonin protein kinase gene ( DMPK ). We previously proposed a hypothesis suggesting that the expanded CUG repeats sequester specific RNA-binding proteins and that such a sequestration results in abnormal RNA processing of several RNAs containing CUG repeats in multiple tissues affected in patients with DM. One of the members of the CUG-binding proteins, CUG-BP, has been identified previously. Here we describe the second member of this family, elav -type ribonucleoprotein (ETR-3), which is highly expressed in heart and is able to interact with CUG repeats. Screening of a mouse liver cDNA library with a CUG-BP probe identified two mETR-3 cDNAs. Two additional cDNAs from mouse heart were amplified by RT-PCR. These cDNAs differ by several insertions/deletions and might be generated via alternative splicing. Mouse ETR-3 has a mol. wt of 50 kDa and displays a high level of homology to CUG-BP protein. The organization of the RNA-binding domains (RBDs) within the ETR-3 molecule is similar to one within CUG-BP. A study of mETR-3 RNA-binding activity showed that the mETR-3 binds to (CUG)8repeats. Sequence analysis of mETR-3 indicates the presence of several CUG repeats within the mETR-3 mRNA. Both CUG-BP and mETR-3 bind to mETR-3 mRNA via CUG repeats, suggesting the possible involvement of CUG-BP-like proteins in the regulation of mETR-3 processing. Analysis of the tissue distribution of ETR-3 showed that in human cells, ETR-3 mRNA is highly expressed in heart, but is undetectable in other tissues examined. Our results suggest the existence of a family of proteins that bind to CUG repeats and might be affected in DM by expansion of CUG repeats.D009223Myotonic Dystrophy


Clinically important variants in CELF2


check button (ClinVar, 04/20/2024)
accession_iduniprot_idgene_nameTypeVariantClinical_significance