ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures
ASpdb Logo

Home

Download

Statistics

Examples

Help

Contact

Terms of Use

Center for Computational Systems Medicine
leaf

Protein Summary

leaf

AS Summary

leaf

Protein Functional Features

leaf

Gene Isoform Structures and Expression Levels

leaf

Protein Structures

leaf

pLDDT Score Distribution

leaf

Ramachandran Plot of Protein Structures

leaf

Potential Active Site Information

leaf

Protein Structure and Feature Comparision

leaf

Protein-Protein Interaction

leaf

Related Drugs

leaf

Related Diseases

leaf

Clinically Important Variants

Protein:XPC

Protein Summary

check button Gene summary
Gene name: XPC
ASpdb.0 ID: 7508
Gene
Gene symbol

XPC

Gene ID

7508

Gene nameXPC complex subunit, DNA damage recognition and repair factor
SynonymsRAD4|XP3|XPCC|p125
Cytomap

3p25.1

Type of geneprotein-coding
DescriptionDNA repair protein complementing XP-C cellsmutant xeroderma pigmentosum group Cxeroderma pigmentosum, complementation group C
Modification date20240403
UniProtAcc

Q01831


check button Gene ontology of this gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
GeneXPC

GO:0000109

nucleotide-excision repair complex

11259578

GeneXPC

GO:0003684

damaged DNA binding

12509299

GeneXPC

GO:0003697

single-stranded DNA binding

12509299

GeneXPC

GO:0003713

transcription coactivator activity

29973595|31527837

GeneXPC

GO:0005634

nucleus

18682493

GeneXPC

GO:0005654

nucleoplasm

-

GeneXPC

GO:0005730

nucleolus

-

GeneXPC

GO:0005737

cytoplasm

18682493

GeneXPC

GO:0005739

mitochondrion

-

GeneXPC

GO:0005829

cytosol

-

GeneXPC

GO:0005886

plasma membrane

-

GeneXPC

GO:0006289

nucleotide-excision repair

8168482|9734359|11259578|19941824

GeneXPC

GO:0043231

intracellular membrane-bounded organelle

-

GeneXPC

GO:0044877

protein-containing complex binding

11259578

GeneXPC

GO:0045893

positive regulation of DNA-templated transcription

29973595|31527837

GeneXPC

GO:0070914

UV-damage excision repair

8077226

GeneXPC

GO:0071942

XPC complex

11279143|31527837

GeneXPC

GO:0140612

DNA damage sensor activity

10873465|19941824



AS Summary

check button Information of the canonical protein with experimentally identified structure from PDB (2023).
UniProt AccFile namePDB IDMethodResolutionChainStartEnd
Q01831-1Q01831-1_2obh_D.pdb2OBHX-ray1.8D847863

check button ASpdb's canonical and alternatively spliced isoform information.
accession_idgene_namecanonical_idalternative_idcanonical_lengthalternative_lengthcanonical_startcanonical_endtypeoriginalSEQvariationSEQalternative_startalternative_end
Q01831XPCQ01831-1Q01831-2940903136172Deletionnonenone135135
Q01831XPCQ01831-1Q01831-3940140138140SubstitutionELSVKR138140
Q01831XPCQ01831-1Q01831-3940140141940Deletionnonenone140140

check buttonMultiple sequence alignment of our canonical and alternatively spliced XPC

check button Matched gene isoform IDs with Ensembl and RefSeq of our canonical and alternative spliced genes of XPC
UniProt-idENSGENSTENSP
Q01831-1ENSG00000154767.15ENST00000285021.12ENSP00000285021.8
Q01831-3ENSG00000154767.15ENST00000476581.6ENSP00000424548.1

UniProt-idNM IDNP ID
Q01831-1NM_004628.4NP_004619.3

check buttonAmino acid sequences of our canonical and alternatively spliced XPC
accession_idProtein sequence
Q01831-1MARKRAAGGEPRGRELRSQKSKAKSKARREEEEEDAFEDEKPPKKSLLSKVSQGKRKRGCSHPGGSADGPAKKKVAKVTVKSENLKVIKD
EALSDGDDLRDFPSDLKKAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRSLLPVKPVEIEIETPEQAKTRERSE
KIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAE
LSASEQDNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKPSKERLTADPGGSSETSSQVLENHTKP
KTSKGTKQEETFAKGTCRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSG
EASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEE
KWVCVDCVHGVVGQPLTCYKYATKPMTYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLEFQAKHM
DQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLA
EPQLREENDLGLFGYWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGYSHPVTDG
YIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARIL
Q01831-2MARKRAAGGEPRGRELRSQKSKAKSKARREEEEEDAFEDEKPPKKSLLSKVSQGKRKRGCSHPGGSADGPAKKKVAKVTVKSENLKVIKD
EALSDGDDLRDFPSDLKKAHHLKRGATMNEDSNEEEEESENDWEEAKTRERSEKIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANG
FYRNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAELSASEQDNLQTTLERRFAIYSARDDEELVHIFLLILR
ALQLLTRLVLSLQPIPLKSATAKGKKPSKERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETFAKGTCRPSAKGKRNKGGRKKRSKPS
SSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSGEASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTH
RGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVVGQPLTCYKYATKPMTYVVGIDSDGW
VRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLEFQAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAIL
GYCRGEAVYSRDCVHTLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGLFGYWQTEEYQPPVAVDGKVPRNEFG
NVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGN
WKLLAKGLLIRERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARILAASWPQNREDEEKQKLKGGPKKTKREKKAAASHLFPF
Q01831-3MARKRAAGGEPRGRELRSQKSKAKSKARREEEEEDAFEDEKPPKKSLLSKVSQGKRKRGCSHPGGSADGPAKKKVAKVTVKSENLKVIKD

Protein Functional Features

check buttonMain function of this protein. (from UniProt)
XPC (go to UniProt):Q01831

check buttonRetention analysis result of protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

* Minus value of BPloci means that the break pointn is located before the CDS.
- Retained protein feature among the 13 regional features.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
Q01831Region111136Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=136;End=172
Q01831Region327519Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=141;End=940
Q01831Region496734Note=Interaction with RAD23BType=Deletion;Start=141;End=940
Q01831Region607766Note=Minimal sensor domain involved in damage recognitionType=Deletion;Start=141;End=940
Q01831Region607741Note=DNA-binding%3B preference for heteroduplex DNAType=Deletion;Start=141;End=940
Q01831Region767831Note=DNA-binding%3B preference for single stranded DNA%3B required for formation of stable nucleoprotein complexType=Deletion;Start=141;End=940
Q01831Region816940Note=Interaction with ERCC2 and GTF2H1;Ontology_term=ECO:0000269;evidence=ECO:0000269|PubMed:12509233;Dbxref=PMID:12509233Type=Deletion;Start=141;End=940
Q01831Region847866Note=Interaction with CETN2Type=Deletion;Start=141;End=940
Q01831Region866940Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=141;End=940
Q01831Motif390395Note=Nuclear localization signal;Ontology_term=ECO:0000255;evidence=ECO:0000255Type=Deletion;Start=141;End=940
Q01831Compositional bias121136Note=Acidic residues;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=136;End=172
Q01831Compositional bias342359Note=Polar residues;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=141;End=940
Q01831Compositional bias393438Note=Basic and acidic residues;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=141;End=940
Q01831Compositional bias496510Note=Polar residues;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=141;End=940
Q01831Compositional bias905919Note=Basic and acidic residues;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=141;End=940


Gene Isoform Structures and Expression Levels for XPC

check buttonGene structures of our canonical and alternative spliced genes of XPC
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
gene isoform structure of XPC

check button Expression levels of gene isoforms across GTEx.
gtex expression

check button Expression levels of gene isoforms across TCGA.
tcga expression


Protein Structures

check button PDB and CIF files of the predicted protein structures
* Here we show the 3D structure of the proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format.
3D view using mol* of Q01831-1
3D view using mol* of Q01831-2
3D view using mol* of Q01831-3


pLDDT Score Distribution

check button pLDDT score distribution of the predicted protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
pLDDT distribution across the protein length of Q01831-1
all structure
pLDDT distribution across the protein length of Q01831-2
all structure


Ramachandran Plot of Protein Structures


check button Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this protein peptide.
Ramachandran plot of Q01831-1
all structure

Potential Active Site Information


check button The potential binding sites of these proteins were identified using SiteMap, a module of the Schrodinger suite.
UniProt-idSite scoreSizeD scoreVolumeExposureEnclosureContactPhobicPhilicBalanceDon/AccResidues
Q01831-10.9851881.007874.650.6310.6720.8460.321.030.310.927118,119,120,121,122,123,124,126,127,129,130,131,64
1,642,643,644,647,648,649,652,653,654,655,656,657,
678,681,683,685,686,688,689,692,693,709,710,730,73
2,761,762,763,764,773,795,796,797,799,809
Q01831-20.974930.928218.8340.4890.6990.9710.0771.2210.0630.375376,377,378,379,380,738,739,740,741,743,744,747,75
5,771,772,773,774

Protein Structure and Feature Comparision


check button Protein Structure Comparision Using Template Modeling Scores (TM-score).
all structure

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Canonical validated structure (PDB)(green)
3D view using mol* of Q01831-1_Q01831-1_2obh_D.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical validated structure (PDB)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of Q01831-1_2obh_D_Q01831-2.pdb
3D view using mol* of Q01831-1_2obh_D_Q01831-3.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of Q01831-1_Q01831-2.pdb
3D view using mol* of Q01831-1_Q01831-3.pdb

check button Protein Feature Comparison of the protein sequendary structures among the protiens.
./stats/secondary_structure/figure/Q01831-1_vs_Q01831-2.png
all structure<
./stats/secondary_structure/figure/Q01831-1_vs_Q01831-3.png
all structure<

check button Protein Feature Comparison of the relative accessible surface area (ASA) among the protiens.
./stats/relative_asa/Q01831-1_vs_Q01831-2.png
all structure<
./stats/relative_asa/Q01831-1_vs_Q01831-3.png
all structure<


Protein-Protein Interaction


check button Interactors from UniProt.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
Q01831Region496734Note=Interaction with RAD23BType=Deletion;Start=141;End=940
Q01831Region816940Note=Interaction with ERCC2 and GTF2H1;Ontology_term=ECO:0000269;evidence=ECO:0000269|PubMed:12509233;Dbxref=PMID:12509233Type=Deletion;Start=141;End=940
Q01831Region847866Note=Interaction with CETN2Type=Deletion;Start=141;End=940


check button Interactors from STRING.
Gene nameInteractors


Related Drugs to XPC


check button Drugs targeting this gene/protein.
(DrugBank)
UniProt accessionGene nameDrugBank IDDrug nameDrug groupActions

Related Diseases to XPC


check button Previous studies relating to the alternative splicing of XPC and disease information from the MeSH term (PubMed)
GenePMIDTitleAbstractMeSH IDMeSH term
XPC12177305The human XPC DNA repair gene: arrangement, splice site information content and influence of a single nucleotide polymorphism in a splice acceptor site on alternative splicing and function.XPC DNA repair gene mutations result in the cancer-prone disorder xeroderma pigmentosum. The XPC gene spans 33 kb and has 16 exons (82-882 bp) and 15 introns (0.08-5.4 kb). A 1.6 kb intron was found within exon 5. Sensitive real- time quantitative reverse transcription-polymerase chain reaction methods were developed to measure full-length XPC mRNA (the predominant form) and isoforms that skipped exons 4, 7 or 12. Exon 7 was skipped in approximately 0.07% of XPC mRNAs, consistent with the high information content of the exon 7 splice acceptor and donor sites (12.3 and 10.4 bits). In contrast, exon 4 was skipped in approximately 0.7% of the XPC mRNAs, consistent with the low information content of the exon 4 splice acceptor (-0.1 bits). A new common C/A single nucleotide polymorphism in the XPC intron 11 splice acceptor site (58% C in 97 normals) decreased its information content from 7.5 to 5.1 bits. Fibroblasts homozygous for A/A had significantly higher levels (approximately 2.6-fold) of the XPC mRNA isoform that skipped exon 12 than those homozygous for C/C. This abnormally spliced XPC mRNA isoform has diminished DNA repair function and may contribute to cancer susceptibility.D009369Neoplasms


Clinically important variants in XPC


check button (ClinVar, 04/20/2024)
accession_iduniprot_idgene_nameTypeVariantClinical_significance