ID C1GS01_PARBA Unreviewed; 1058 AA.
AC C1GS01;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 04-FEB-2015, sequence version 2.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=DNA repair protein rhp41 {ECO:0000313|EMBL:EEH38375.2};
GN ORFNames=PAAG_01296 {ECO:0000313|EMBL:EEH38375.2};
OS Paracoccidioides lutzii (strain ATCC MYA-826 / Pb01) (Paracoccidioides
OS brasiliensis).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Onygenales incertae sedis; Paracoccidioides.
OX NCBI_TaxID=502779 {ECO:0000313|EMBL:EEH38375.2, ECO:0000313|Proteomes:UP000002059};
RN [1] {ECO:0000313|EMBL:EEH38375.2, ECO:0000313|Proteomes:UP000002059}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-826 / Pb01 {ECO:0000313|Proteomes:UP000002059};
RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345;
RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., Goldberg J.,
RA Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., Grynberg M.,
RA Gujja S., Heiman D.I., Henn M.R., Kodira C.D., Leon-Narvaez H.,
RA Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., Morais F.V., Pereira M.,
RA Rodriguez-Brito S., Sakthikumar S., Salem-Izacc S.M., Sykes S.M.,
RA Teixeira M.M., Vallejo M.C., Walter M.E., Yandava C., Young S., Zeng Q.,
RA Zucker J., Felipe M.S., Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G.,
RA Puccia R., San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.;
RT "Comparative genomic analysis of human fungal pathogens causing
RT paracoccidioidomycosis.";
RL PLoS Genet. 7:E1002345-E1002345(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the XPC family. {ECO:0000256|ARBA:ARBA00009525}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN293993; EEH38375.2; -; Genomic_DNA.
DR RefSeq; XP_015701085.1; XM_015844304.1.
DR AlphaFoldDB; C1GS01; -.
DR STRING; 502779.C1GS01; -.
DR GeneID; 9100305; -.
DR KEGG; pbl:PAAG_01296; -.
DR VEuPathDB; FungiDB:PAAG_01296; -.
DR eggNOG; KOG2179; Eukaryota.
DR HOGENOM; CLU_003639_1_1_1; -.
DR OMA; KPSKFEP; -.
DR OrthoDB; 181129at2759; -.
DR Proteomes; UP000002059; Partially assembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR Gene3D; 2.20.20.110; Rad4, beta-hairpin domain BHD1; 1.
DR Gene3D; 3.30.60.290; Rad4, beta-hairpin domain BHD2; 1.
DR Gene3D; 3.30.70.2460; Rad4, beta-hairpin domain BHD3; 1.
DR Gene3D; 3.90.260.10; Transglutaminase-like; 1.
DR InterPro; IPR018327; BHD_2.
DR InterPro; IPR004583; DNA_repair_Rad4.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR018325; Rad4/PNGase_transGLS-fold.
DR InterPro; IPR018326; Rad4_beta-hairpin_dom1.
DR InterPro; IPR018328; Rad4_beta-hairpin_dom3.
DR InterPro; IPR042488; Rad4_BHD3_sf.
DR InterPro; IPR036985; Transglutaminase-like_sf.
DR PANTHER; PTHR12135:SF3; DNA REPAIR PROTEIN RAD4; 1.
DR PANTHER; PTHR12135; DNA REPAIR PROTEIN XP-C / RAD4; 1.
DR Pfam; PF10403; BHD_1; 1.
DR Pfam; PF10404; BHD_2; 1.
DR Pfam; PF10405; BHD_3; 1.
DR Pfam; PF03835; Rad4; 1.
DR SMART; SM01030; BHD_1; 1.
DR SMART; SM01031; BHD_2; 1.
DR SMART; SM01032; BHD_3; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002059}.
FT DOMAIN 562..624
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01030"
FT DOMAIN 626..688
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01031"
FT DOMAIN 695..769
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01032"
FT REGION 1..50
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 67..125
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 161..183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 386..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 811..843
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 925..1058
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..108
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..419
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 815..840
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 925..1011
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1012..1044
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1058 AA; 116648 MW; 08C2B12FF960818E CRC64;
MRKSSIQSRG KGKTPVRRTA SESVSAHGRV RGLGVEQRVR STRLAPEASN GVPQVYRDML
AEIAGRAGSP LSGSASNGGD ERPLKRRRVG ETRSGTEVDV ERGRSGVGVD VQRGEGGVEG
HGSRPVQTVF DADASDDESE MEWEDVEVSA AELSFAPGYD GAAEGQEPNH HQGVETEESE
GSLQITLDKP MERGKQRAAG MRRKPVTAQE RRWRLDIHKM HVLCLLGHVQ RRNLWCNDDG
VQDALKRILS KHTIMCLNPK ANLPQFTRST TFADGLKQAG DAFKQRFKVT TPGMKRPYWL
EHPNDIKDPT SLLDTAEILS CKQDFLKQAV ALQGSRDLGA QLFCALLRAV GTDARLVCSL
QVLPFTGVTK RTMPLKPDRE YIVLSEDDEV RSSSDAGKGS PTQAKPGNAQ PQSRMRRIGQ
PRFSPSPVAA SPKPKRAIGL PRGFAESPFP VFWVEVFNEA MQKWVSVDPL VTNSVGKTSK
FEPPASDRYN NMSYVIAFED DASARDVTKR YTKSFNSKTR KQRVESTKNG EEWWARTMRF
FEKPFLDDRD QVEIGELTAK SAAEAMPRNV QDFKDHPVYA LERHLRRNEV IFPKREIGKV
GLSKVSINKK NPPLESVYRR GDVHVVKSAD GWYRLGREVK MGEQPLKRFP ISRPKWAFER
REETSDYEEE LQETPMYAIH QTELYKPPPV VDNRVVKNAF GNIDVYTPTM VPEGGFHLSH
GEAARAARIL GIDYADAVTG FHFKGRHGTA VVQGIVASVE YREALYAVID ALEDERVLAE
QGRKAAEALR MWKLLLLKLR VAERVRRYAF EGEEEPDDVS SDGDIDGGGG DEDGDEDVEA
AGGFICEGGK GGITSPNGGF AGFDDLSGAS GFIPETTGEE ASLGGGFMPP TTTILDEARP
STSISAVGNA RIRPRDRSLY TLLVTPGGTS SQQQPASTRP SIEMNDPEPH TSCPMTSTQT
CGLQRQHQQL HTQQPQPQSI STPQLPQPST TAAEGSPTAP ITVPSSSGDE PSTSVEEKRE
LPDLVHETDT DSEIDQCSML SHDPEDDDAE PDWLLSDN
//