ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures
ASpdb Logo

Home

Download

Statistics

Examples

Help

Contact

Terms of Use

Center for Computational Systems Medicine
leaf

Protein Summary

leaf

AS Summary

leaf

Protein Functional Features

leaf

Gene Isoform Structures and Expression Levels

leaf

Protein Structures

leaf

pLDDT Score Distribution

leaf

Ramachandran Plot of Protein Structures

leaf

Potential Active Site Information

leaf

Protein Structure and Feature Comparision

leaf

Protein-Protein Interaction

leaf

Related Drugs

leaf

Related Diseases

leaf

Clinically Important Variants

Protein:IRF7

Protein Summary

check button Gene summary
Gene name: IRF7
ASpdb.0 ID: 3665
Gene
Gene symbol

IRF7

Gene ID

3665

Gene nameinterferon regulatory factor 7
SynonymsIMD39|IRF-7|IRF-7H|IRF7A|IRF7B|IRF7C|IRF7H
Cytomap

11p15.5

Type of geneprotein-coding
Descriptioninterferon regulatory factor 7interferon regulatory factor-7H
Modification date20240403
UniProtAcc

Q92985


check button Gene ontology of this gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
GeneIRF7

GO:0000785

chromatin

11473119

GeneIRF7

GO:0000978

RNA polymerase II cis-regulatory region sequence-specific DNA binding

17404045

GeneIRF7

GO:0000981

DNA-binding transcription factor activity, RNA polymerase II-specific

17404045

GeneIRF7

GO:0002753

cytoplasmic pattern recognition receptor signaling pathway

21931555

GeneIRF7

GO:0002819

regulation of adaptive immune response

17404045

GeneIRF7

GO:0005634

nucleus

22065573

GeneIRF7

GO:0005654

nucleoplasm

-

GeneIRF7

GO:0005737

cytoplasm

19176627

GeneIRF7

GO:0005829

cytosol

-

GeneIRF7

GO:0032481

positive regulation of type I interferon production

17404045

GeneIRF7

GO:0032727

positive regulation of interferon-alpha production

16127453

GeneIRF7

GO:0045944

positive regulation of transcription by RNA polymerase II

16127453|17404045

GeneIRF7

GO:0051607

defense response to virus

21478870

GeneIRF7

GO:1990837

sequence-specific double-stranded DNA binding

28473536



AS Summary

check button Information of the canonical protein with experimentally identified structure from PDB (2023).
UniProt AccFile namePDB IDMethodResolutionChainStartEnd
Q92985-1Q92985-1_2o61_A.pdb2O61X-ray2.8A20125

check button ASpdb's canonical and alternatively spliced isoform information.
accession_idgene_namecanonical_idalternative_idcanonical_lengthalternative_lengthcanonical_startcanonical_endtypeoriginalSEQvariationSEQalternative_startalternative_end
Q92985IRF7Q92985-1Q92985-2503474228256Deletionnonenone227227
Q92985IRF7Q92985-1Q92985-3503164152164SubstitutionGGPPGPFLAHTHAAQGSLLGSCTGGQ152164
Q92985IRF7Q92985-1Q92985-3503164165503Deletionnonenone164164
Q92985IRF7Q92985-1Q92985-450351616SubstitutionMALAPEMPVPERPAAGPDSPRPGTR119

check buttonMultiple sequence alignment of our canonical and alternatively spliced IRF7

check button Matched gene isoform IDs with Ensembl and RefSeq of our canonical and alternative spliced genes of IRF7
UniProt-idENSGENSTENSP
Q92985-1ENSG00000185507.21ENST00000525445.6ENSP00000434009.2
Q92985-1ENSG00000276561.4ENST00000621391.4ENSP00000480358.1
Q92985-2ENSG00000185507.21ENST00000348655.11ENSP00000331803.9
Q92985-2ENSG00000276561.4ENST00000633943.1ENSP00000488666.1
Q92985-3ENSG00000185507.21ENST00000469048.6ENSP00000434607.1
Q92985-3ENSG00000185507.21ENST00000533182.5ENSP00000433903.1
Q92985-3ENSG00000276561.4ENST00000633274.1ENSP00000488591.1
Q92985-3ENSG00000276561.4ENST00000634105.1ENSP00000488581.1
Q92985-4ENSG00000185507.21ENST00000330243.9ENSP00000329411.5
Q92985-4ENSG00000185507.21ENST00000397566.5ENSP00000380697.1
Q92985-4ENSG00000276561.4ENST00000612534.4ENSP00000479615.1
Q92985-4ENSG00000276561.4ENST00000632827.1ENSP00000488039.1

UniProt-idNM IDNP ID
Q92985-1NM_001572.3NP_001563.2
Q92985-2NM_004029.2NP_004020.1
Q92985-4NM_004031.2NP_004022.2

check buttonAmino acid sequences of our canonical and alternatively spliced IRF7
accession_idProtein sequence
Q92985-1MALAPERAAPRVLFGEWLLGEISSGCYEGLQWLDEARTCFRVPWKHFARKDLSEADARIFKAWAVARGRWPPSSRGGGPPPEAETAERAG
WKTNFRCALRSTRRFVMLRDNSGDPADPHKVYALSRELCWREGPGTDQTEAEAPAAVPPPQGGPPGPFLAHTHAGLQAPGPLPAPAGDKG
DLLLQAVQQSCLADHLLTASWGADPVPTKAPGEGQEGLPLTGACAGGPGLPAGELYGWAVETTPSPGPQPAALTTGEAAAPESPHQAEPY
LSPSPSACTAVQEPSPGALDVTIMYKGRTVLQKVVGHPSCTFLYGPPDPAVRATDPQQVAFPSPAELPDQKQLRYTEELLRHVAPGLHLE
LRGPQLWARRMGKCKVYWEVGGPPGSASPSTPACLLPRNCDTPIFDFRVFFQELVEFRARQRRGSPRYTIYLGFGQDLSAGRPKEKSLVL
Q92985-2MALAPERAAPRVLFGEWLLGEISSGCYEGLQWLDEARTCFRVPWKHFARKDLSEADARIFKAWAVARGRWPPSSRGGGPPPEAETAERAG
WKTNFRCALRSTRRFVMLRDNSGDPADPHKVYALSRELCWREGPGTDQTEAEAPAAVPPPQGGPPGPFLAHTHAGLQAPGPLPAPAGDKG
DLLLQAVQQSCLADHLLTASWGADPVPTKAPGEGQEGLPLTGACAGGEAAAPESPHQAEPYLSPSPSACTAVQEPSPGALDVTIMYKGRT
VLQKVVGHPSCTFLYGPPDPAVRATDPQQVAFPSPAELPDQKQLRYTEELLRHVAPGLHLELRGPQLWARRMGKCKVYWEVGGPPGSASP
STPACLLPRNCDTPIFDFRVFFQELVEFRARQRRGSPRYTIYLGFGQDLSAGRPKEKSLVLVKLEPWLCRVHLEGTQREGVSSLDSSSLS
Q92985-3MALAPERAAPRVLFGEWLLGEISSGCYEGLQWLDEARTCFRVPWKHFARKDLSEADARIFKAWAVARGRWPPSSRGGGPPPEAETAERAG
Q92985-4MPVPERPAAGPDSPRPGTRRAAPRVLFGEWLLGEISSGCYEGLQWLDEARTCFRVPWKHFARKDLSEADARIFKAWAVARGRWPPSSRGG
GPPPEAETAERAGWKTNFRCALRSTRRFVMLRDNSGDPADPHKVYALSRELCWREGPGTDQTEAEAPAAVPPPQGGPPGPFLAHTHAGLQ
APGPLPAPAGDKGDLLLQAVQQSCLADHLLTASWGADPVPTKAPGEGQEGLPLTGACAGGPGLPAGELYGWAVETTPSPGPQPAALTTGE
AAAPESPHQAEPYLSPSPSACTAVQEPSPGALDVTIMYKGRTVLQKVVGHPSCTFLYGPPDPAVRATDPQQVAFPSPAELPDQKQLRYTE
ELLRHVAPGLHLELRGPQLWARRMGKCKVYWEVGGPPGSASPSTPACLLPRNCDTPIFDFRVFFQELVEFRARQRRGSPRYTIYLGFGQD

Protein Functional Features

check buttonMain function of this protein. (from UniProt)
IRF7 (go to UniProt):Q92985

check buttonRetention analysis result of protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

* Minus value of BPloci means that the break pointn is located before the CDS.
- Retained protein feature among the 13 regional features.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
Q92985Region133156Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Substitution;Start=152;End=164
Q92985Region242277Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=228;End=256
Q92985Region242277Note=Disordered;Ontology_term=ECO:0000256;evidence=ECO:0000256|SAM:MobiDB-liteType=Deletion;Start=165;End=503
Q92985Region284456Note=Necessary for the interaction with NMI;Ontology_term=ECO:0000250;evidence=ECO:0000250|UniProtKB:P70434Type=Deletion;Start=165;End=503


Gene Isoform Structures and Expression Levels for IRF7

check buttonGene structures of our canonical and alternative spliced genes of IRF7
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
gene isoform structure of IRF7

check button Expression levels of gene isoforms across GTEx.
gtex expression

check button Expression levels of gene isoforms across TCGA.
tcga expression


Protein Structures

check button PDB and CIF files of the predicted protein structures
* Here we show the 3D structure of the proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format.
3D view using mol* of Q92985-1
3D view using mol* of Q92985-2
3D view using mol* of Q92985-3
3D view using mol* of Q92985-4


pLDDT Score Distribution

check button pLDDT score distribution of the predicted protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
pLDDT distribution across the protein length of Q92985-1
all structure
pLDDT distribution across the protein length of Q92985-2
all structure
pLDDT distribution across the protein length of Q92985-3
all structure
pLDDT distribution across the protein length of Q92985-4
all structure


Ramachandran Plot of Protein Structures


check button Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this protein peptide.
Ramachandran plot of Q92985-1
all structure
Ramachandran plot of Q92985-3
all structure

Potential Active Site Information


check button The potential binding sites of these proteins were identified using SiteMap, a module of the Schrodinger suite.
UniProt-idSite scoreSizeD scoreVolumeExposureEnclosureContactPhobicPhilicBalanceDon/AccResidues
Q92985-11.041391.093460.3060.5940.6870.8560.9050.7931.1411.56283,286,418,456,457,459,460,461,463,464,465,467,46
8,471,475,477,478,480,481,482,483,484,485,487,488,
491,494,495,498
Q92985-21.0421591.07623.5740.5350.7381.0060.9220.9610.9590.68615,19,102,103,104,124,127,128,129,130,131,132,182,
185,186,189,190,335,336,349,350,351,362,363,365,37
3,374,375,376,377,379,380,381,382,383,384,386,397,
398
Q92985-30.792550.711133.770.5740.670.9480.0911.1890.0760.72827,28,29,41,43,44,45,47,55,56,58,59,60,116,117,118
,119
Q92985-40.9533490.915829.7170.6190.6280.8460.1671.2310.1360.90821,22,23,24,25,26,27,28,32,36,45,49,50,51,52,76,78
,79,80,81,82,111,114,115,117,119,136,137,138,139,1
40,141,142,143,144,145,146,390,392,394,395,396,397
,398,399,401,402,403,404,405,406,407,408,430,433,4
34,435,436,440,444,450,451,452,453,454,455,456,457
,458,482,483,484,485,486,487

Protein Structure and Feature Comparision


check button Protein Structure Comparision Using Template Modeling Scores (TM-score).
all structure

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Canonical validated structure (PDB)(green)
3D view using mol* of Q92985-1_Q92985-1_2o61_A.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical validated structure (PDB)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of Q92985-1_2o61_A_Q92985-2.pdb
3D view using mol* of Q92985-1_2o61_A_Q92985-3.pdb
3D view using mol* of Q92985-1_2o61_A_Q92985-4.pdb

check button Protein Structure Comparision Visualization with mol*. between Canonical predicted structure (AF2)(orange) vs Alternative predicted structure (AF2)(green)
3D view using mol* of Q92985-1_Q92985-2.pdb
3D view using mol* of Q92985-1_Q92985-3.pdb
3D view using mol* of Q92985-1_Q92985-4.pdb

check button Protein Feature Comparison of the protein sequendary structures among the protiens.
./stats/secondary_structure/figure/Q92985-1_vs_Q92985-2.png
all structure<
./stats/secondary_structure/figure/Q92985-1_vs_Q92985-3.png
all structure<
./stats/secondary_structure/figure/Q92985-1_vs_Q92985-4.png
all structure<

check button Protein Feature Comparison of the relative accessible surface area (ASA) among the protiens.
./stats/relative_asa/Q92985-1_vs_Q92985-2.png
all structure<
./stats/relative_asa/Q92985-1_vs_Q92985-3.png
all structure<
./stats/relative_asa/Q92985-1_vs_Q92985-4.png
all structure<


Protein-Protein Interaction


check button Interactors from UniProt.
Accession_idSubsectionStartEndFuncitonal featureSplicing information
Q92985Region284456Note=Necessary for the interaction with NMI;Ontology_term=ECO:0000250;evidence=ECO:0000250|UniProtKB:P70434Type=Deletion;Start=165;End=503


check button Interactors from STRING.
Gene nameInteractors


Related Drugs to IRF7


check button Drugs targeting this gene/protein.
(DrugBank)
UniProt accessionGene nameDrugBank IDDrug nameDrug groupActions

Related Diseases to IRF7


check button Previous studies relating to the alternative splicing of IRF7 and disease information from the MeSH term (PubMed)
GenePMIDTitleAbstractMeSH IDMeSH term
IRF720209099Dual functions of interferon regulatory factors 7C in Epstein-Barr virus-mediated transformation of human B lymphocytes.Epstein-Barr virus (EBV) infection is associated with several human malignancies. Interferon (IFN) regulatory factor 7 (IRF-7) has several splicing variants, and at least the major splicing variant (IRF-7A) has oncogenic potential and is associated with EBV transformation processes. IRF-7C is an alternative splicing variant with only the DNA-binding domain of IRF-7. Whether IRF-7C is present under physiological conditions and its functions in viral transformation are unknown. In this report, we prove the existence of IRF-7C protein and RNA in certain cells under physiological conditions, and find that high levels of IRF-7C are associated with EBV transformation of human primary B cells in vitro as well as EBV type III latency. EBV latent membrane protein 1 (LMP-1) stimulates IRF-7C expression in B lymphocytes. IRF-7C has oncogenic potential in rodent cells and partially restores the growth properties of EBV-transformed cells under a growth-inhibition condition. A tumor array experiment has identified six primary tumor specimens with high levels of IRF-7C protein--all of them are lymphomas. Furthermore, we show that the expression of IRF-7C is apparently closely associated with other IRF-7 splicing variants. IRF-7C inhibits the function of IRF-7 in transcriptional regulation of IFN genes. These data suggest that EBV may use splicing variants of IRF-7 for its transformation process in two strategies: to use oncogenic properties of various IRF-7 splicing variants, but use one of its splicing variants (IRF-7C) to block the IFN-induction function of IRF-7 that is detrimental for viral transformation. The work provides a novel relation of host/virus interactions, and has expanded our knowledge about IRFs in EBV transformation.D002472Cell Transformation, Viral


Clinically important variants in IRF7


check button (ClinVar, 04/20/2024)
accession_iduniprot_idgene_nameTypeVariantClinical_significance