ID W2SZ88_NECAM Unreviewed; 986 AA.
AC W2SZ88;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE SubName: Full=DNA repair protein Rad4 {ECO:0000313|EMBL:ETN74923.1};
GN ORFNames=NECAME_12620 {ECO:0000313|EMBL:ETN74923.1};
OS Necator americanus (Human hookworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Strongylida;
OC Ancylostomatoidea; Ancylostomatidae; Bunostominae; Necator.
OX NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN74923.1, ECO:0000313|Proteomes:UP000053676};
RN [1] {ECO:0000313|Proteomes:UP000053676}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24441737; DOI=10.1038/ng.2875;
RA Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT "Genome of the human hookworm Necator americanus.";
RL Nat. Genet. 46:261-269(2014).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the XPC family. {ECO:0000256|ARBA:ARBA00009525}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI660325; ETN74923.1; -; Genomic_DNA.
DR RefSeq; XP_013297150.1; XM_013441696.1.
DR AlphaFoldDB; W2SZ88; -.
DR STRING; 51031.W2SZ88; -.
DR EnsemblMetazoa; NECAME_12620; NECAME_12620; NECAME_12620.
DR GeneID; 25352648; -.
DR KEGG; nai:NECAME_12620; -.
DR CTD; 25352648; -.
DR OMA; WIKMARS; -.
DR OrthoDB; 181129at2759; -.
DR Proteomes; UP000053676; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR Gene3D; 2.20.20.110; Rad4, beta-hairpin domain BHD1; 1.
DR Gene3D; 3.30.70.2460; Rad4, beta-hairpin domain BHD3; 1.
DR Gene3D; 3.90.260.10; Transglutaminase-like; 1.
DR InterPro; IPR018327; BHD_2.
DR InterPro; IPR004583; DNA_repair_Rad4.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR018325; Rad4/PNGase_transGLS-fold.
DR InterPro; IPR018326; Rad4_beta-hairpin_dom1.
DR InterPro; IPR018328; Rad4_beta-hairpin_dom3.
DR InterPro; IPR042488; Rad4_BHD3_sf.
DR InterPro; IPR036985; Transglutaminase-like_sf.
DR PANTHER; PTHR12135:SF0; DNA REPAIR PROTEIN COMPLEMENTING XP-C CELLS; 1.
DR PANTHER; PTHR12135; DNA REPAIR PROTEIN XP-C / RAD4; 1.
DR Pfam; PF10403; BHD_1; 1.
DR Pfam; PF10404; BHD_2; 1.
DR Pfam; PF10405; BHD_3; 1.
DR Pfam; PF03835; Rad4; 1.
DR SMART; SM01030; BHD_1; 1.
DR SMART; SM01031; BHD_2; 1.
DR SMART; SM01032; BHD_3; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000053676}.
FT DOMAIN 811..865
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01030"
FT DOMAIN 867..922
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01031"
FT DOMAIN 929..986
FT /note="Rad4 beta-hairpin"
FT /evidence="ECO:0000259|SMART:SM01032"
FT REGION 59..376
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 669..699
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..78
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 108..122
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 186..204
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..219
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 228..261
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..283
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 288..304
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 307..331
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 332..366
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 669..696
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 986 AA; 111376 MW; 46F2E88FA2F84E99 CRC64;
MSIVESCTET GFHLHSEAEK LFLEAGHVMY CSDYSTTVVD MRTLNTCVIC VSAMVATRRS
SKITDNEKEK EKVASKRSTS AKKKNSRKSA VAKASTSPCK VEKQRDTKQS GSLPVQNDAE
ATQKQKRAPR RSLSKGSTNS DLPVRKKPKG PQKQQVSLKE EEVNLQSDLV GKIAEAANDT
AVKNSSSDPK TKEEESDQWE HFGNTSDEND VTNSPVPPKK TKKQGKKKKT SGMESTPDEK
GIVMRRSKRD IVCKDYASKK TNDDISPSSS SADDSNSGNV PDDAKSESES ETDSEEEGSD
SYDSDESFDF VPTKRKKDSQ SGKKRDGPAQ SGKRRNTNQS GSTGSRSSNE KSRNSSKTST
SLPRSYVNKP AKPWQRTSEG VIEGDVVVPP RALRKGEKKM LKFRIIAASL AIGRNYEMTL
EEGSKIVEEM KAQSVAHQAA ISSALDIQQD KANDNGKLFM AYHNTHFISP HHASFLDSSS
EDEWEDMEPV DLNESNAKGV EVTLKREEEK DWWAIYLRQE VNKCVRENWE NAHKVNILCY
IAHLQFLRKI VLEENLIPSL MLSIIPSGYR SLVGESLNVE NIRRIAKWYH NTFKPSGGLV
KYEVGTCGFD ATARLSEMVS QQVFENDADR AALLFAIFVA MECTSRICLN TQPIPRKWDE
DVINSIKNGK DAKLSHSQPK SKERKQKCKE QSRKDSSGHS GYLGSVRDYW IEYWDKKQKR
WICVDPLHGT VDEPNSIEDN LTKPVTYVFA IDNEGGVREV TARYASEFLR PDFRRLRTDQ
KWIADTLKAK FIRANRERGE LEDLHMRQEL VNKPLPTTLS EYKNHPLYVL EKDLLKFEGI
YPKPECQKPL GEVRGHKVYP RSTVYTLQSA LNWIKMARSV KEGEKAYKVV KARSNPRIPA
EQREQRYLDV FGFWQTEPFR PPNVENGRIP RNEFGNVYMY QPTMCPIGAV HLRLPGLPSI
ARRLGGLECV PAVVGWEFNS CSNFPM
//