ID W9YQ60_9EURO Unreviewed; 1044 AA.
AC W9YQ60;
DT 14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT 14-MAY-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=Xeroderma pigmentosum group C-complementing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A1O1_04473 {ECO:0000313|EMBL:EXJ91361.1};
OS Capronia coronata CBS 617.96.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae; Capronia.
OX NCBI_TaxID=1182541 {ECO:0000313|EMBL:EXJ91361.1, ECO:0000313|Proteomes:UP000019484};
RN [1] {ECO:0000313|EMBL:EXJ91361.1, ECO:0000313|Proteomes:UP000019484}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 617.96 {ECO:0000313|EMBL:EXJ91361.1,
RC ECO:0000313|Proteomes:UP000019484};
RG The Broad Institute Genomics Platform;
RA Cuomo C., de Hoog S., Gorbushina A., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Capronia coronata CBS 617.96.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the XPC family. {ECO:0000256|ARBA:ARBA00009525}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EXJ91361.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMWN01000003; EXJ91361.1; -; Genomic_DNA.
DR RefSeq; XP_007723555.1; XM_007725365.1.
DR AlphaFoldDB; W9YQ60; -.
DR STRING; 1182541.W9YQ60; -.
DR GeneID; 19159354; -.
DR eggNOG; KOG2179; Eukaryota.
DR HOGENOM; CLU_003639_1_1_1; -.
DR OrthoDB; 181129at2759; -.
DR Proteomes; UP000019484; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR Gene3D; 2.20.20.110; Rad4, beta-hairpin domain BHD1; 1.
DR Gene3D; 3.30.60.290; Rad4, beta-hairpin domain BHD2; 1.
DR Gene3D; 3.30.70.2460; Rad4, beta-hairpin domain BHD3; 1.
DR Gene3D; 3.90.260.10; Transglutaminase-like; 1.
DR InterPro; IPR018327; BHD_2.
DR InterPro; IPR004583; DNA_repair_Rad4.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR018325; Rad4/PNGase_transGLS-fold.
DR InterPro; IPR018326; Rad4_beta-hairpin_dom1.
DR InterPro; IPR018328; Rad4_beta-hairpin_dom3.
DR InterPro; IPR042488; Rad4_BHD3_sf.
DR InterPro; IPR036985; Transglutaminase-like_sf.
DR PANTHER; PTHR12135:SF3; DNA REPAIR PROTEIN RAD4; 1.
DR PANTHER; PTHR12135; DNA REPAIR PROTEIN XP-C / RAD4; 1.
DR Pfam; PF10403; BHD_1; 1.
DR Pfam; PF10404; BHD_2; 1.
DR Pfam; PF10405; BHD_3; 1.
DR Pfam; PF03835; Rad4; 1.
DR SMART; SM01030; BHD_1; 1.
DR SMART; SM01031; BHD_2; 1.
DR SMART; SM01032; BHD_3; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000019484}.
FT DOMAIN 480..538
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01030"
FT DOMAIN 540..602
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01031"
FT DOMAIN 609..683
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01032"
FT REGION 1..31
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 43..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 120..166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 321..357
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 556..581
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 724..772
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 813..1044
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 152..166
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 335..351
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 556..575
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 729..748
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 837..851
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 852..867
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 904..925
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 954..973
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 979..993
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1005..1022
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1044 AA; 117298 MW; B1AA37B47AC6384E CRC64;
MPPKKNLPAR SSSRKTRQST RRREVLDDGI PSVYGEMVAE AVAEERDLSS QLRSSKRRKI
SEESSSKIEL DFDLFGTPEV GADGDPIPAT ADGTSPKLQQ VVYDDFEGSD ESDVEFEDVE
LEPAEDDEDG PSSKPEQQSL ELDLSRSTLE PSRRGPQRRK PVGPAEKKIR LEVHKAHLVC
LLAHLSSRNR WCESESVQGL LKPLVPRKVI SLLHVEESQP QYQRSHSFMK GIEELCALWK
QLWTITARGM QRAYWKEDVD AIKESDDTED LIDFDEFKAA ARSRSGSRDL GAQLFCALLR
SVAVDTRLVC SLQVLPFSGV AKGQTPEKPK SQYYHAPPQN YTQNSTISRP APTTRKKKRK
IIESPYPIFW VEVYSPSTST WIPLDPIVRN TINKPKTGFE PPAADRLNSM TYVIALEDDG
SAKDVTRRYA QWYNAKTRRQ RVESTKGGEE WWTRLMGALQ KPFDEFRDEI EDANLLRRAE
SEAMPRNVQD FKGHPVYVLE RHLRMNEVIH PKHEVGKVST GSGKNAKLES VFRRRDVHVC
RTADAWYRRG RDVKQGEQPL KRVMPKRRRT PEDQDLDDDD HAETSEGMAL YAEYQTKLYE
PPPVVDGKIP KNGYGNLDVY VPSMIPAGAV HIRHPLAAEA ARVLGIDYTD AVTGFVFKGR
QGTAVIDGVV VSMHMCNAMI NVIEGLESQA TEEAEQARSR VILALWKRWL TALRVRERVH
RDYGDREEDT TNVAEDDDEE EEDSTYRDDG EDYGGGFMPD VDQEQTSQVN VEPDTDAISA
LPHLKPVDQL IPPEVVHQAI VVVRSPHKLH QWESQELLNP EPEDANIGGE AGEDQADAGG
FIDEDDEQEA GGFFQFQDHE HETGGFIPDD TNKQQAPSHH PPFPAMDNDD AGAGGFFPED
TDMTSPPEPK HGDTKNGKDE DFEDSAAGGF LLEDDETDTP RLLDPSASVN HRQSAEEDQI
RKKHEDKTAN PDPSLVVKDE TNPISNNQPA PLEPRRPSSS PPTSNPRSSP TSLRSQSQTS
LLSHDPDEED AEPEWLLNSL GEME
//