GenomeNet

Database: UniProt
Entry: W2SZ88_NECAM
ID   W2SZ88_NECAM            Unreviewed;       986 AA.
AC   W2SZ88;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   24-JAN-2024, entry version 38.
DE   SubName: Full=DNA repair protein Rad4 {ECO:0000313|EMBL:ETN74923.1};
GN   ORFNames=NECAME_12620 {ECO:0000313|EMBL:ETN74923.1};
OS   Necator americanus (Human hookworm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Strongylida;
OC   Ancylostomatoidea; Ancylostomatidae; Bunostominae; Necator.
OX   NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN74923.1, ECO:0000313|Proteomes:UP000053676};
RN   [1] {ECO:0000313|Proteomes:UP000053676}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24441737; DOI=10.1038/ng.2875;
RA   Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA   Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA   Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA   Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA   Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT   "Genome of the human hookworm Necator americanus.";
RL   Nat. Genet. 46:261-269(2014).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the XPC family. {ECO:0000256|ARBA:ARBA00009525}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KI660325; ETN74923.1; -; Genomic_DNA.
DR   RefSeq; XP_013297150.1; XM_013441696.1.
DR   AlphaFoldDB; W2SZ88; -.
DR   STRING; 51031.W2SZ88; -.
DR   EnsemblMetazoa; NECAME_12620; NECAME_12620; NECAME_12620.
DR   GeneID; 25352648; -.
DR   KEGG; nai:NECAME_12620; -.
DR   CTD; 25352648; -.
DR   OMA; WIKMARS; -.
DR   OrthoDB; 181129at2759; -.
DR   Proteomes; UP000053676; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:InterPro.
DR   GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR   GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR   Gene3D; 2.20.20.110; Rad4, beta-hairpin domain BHD1; 1.
DR   Gene3D; 3.30.70.2460; Rad4, beta-hairpin domain BHD3; 1.
DR   Gene3D; 3.90.260.10; Transglutaminase-like; 1.
DR   InterPro; IPR018327; BHD_2.
DR   InterPro; IPR004583; DNA_repair_Rad4.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR018325; Rad4/PNGase_transGLS-fold.
DR   InterPro; IPR018326; Rad4_beta-hairpin_dom1.
DR   InterPro; IPR018328; Rad4_beta-hairpin_dom3.
DR   InterPro; IPR042488; Rad4_BHD3_sf.
DR   InterPro; IPR036985; Transglutaminase-like_sf.
DR   PANTHER; PTHR12135:SF0; DNA REPAIR PROTEIN COMPLEMENTING XP-C CELLS; 1.
DR   PANTHER; PTHR12135; DNA REPAIR PROTEIN XP-C / RAD4; 1.
DR   Pfam; PF10403; BHD_1; 1.
DR   Pfam; PF10404; BHD_2; 1.
DR   Pfam; PF10405; BHD_3; 1.
DR   Pfam; PF03835; Rad4; 1.
DR   SMART; SM01030; BHD_1; 1.
DR   SMART; SM01031; BHD_2; 1.
DR   SMART; SM01032; BHD_3; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
PE   3: Inferred from homology;
KW   DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW   DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053676}.
FT   DOMAIN          811..865
FT                   /note="Rad4 beta-hairpin"
FT                   /evidence="ECO:0000259|SMART:SM01030"
FT   DOMAIN          867..922
FT                   /note="Rad4 beta-hairpin"
FT                   /evidence="ECO:0000259|SMART:SM01031"
FT   DOMAIN          929..986
FT                   /note="Rad4 beta-hairpin"
FT                   /evidence="ECO:0000259|SMART:SM01032"
FT   REGION          59..376
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          669..699
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        61..78
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        108..122
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        186..204
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        205..219
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        228..261
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        262..283
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        288..304
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        307..331
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        332..366
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        669..696
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   986 AA;  111376 MW;  46F2E88FA2F84E99 CRC64;
     MSIVESCTET GFHLHSEAEK LFLEAGHVMY CSDYSTTVVD MRTLNTCVIC VSAMVATRRS
     SKITDNEKEK EKVASKRSTS AKKKNSRKSA VAKASTSPCK VEKQRDTKQS GSLPVQNDAE
     ATQKQKRAPR RSLSKGSTNS DLPVRKKPKG PQKQQVSLKE EEVNLQSDLV GKIAEAANDT
     AVKNSSSDPK TKEEESDQWE HFGNTSDEND VTNSPVPPKK TKKQGKKKKT SGMESTPDEK
     GIVMRRSKRD IVCKDYASKK TNDDISPSSS SADDSNSGNV PDDAKSESES ETDSEEEGSD
     SYDSDESFDF VPTKRKKDSQ SGKKRDGPAQ SGKRRNTNQS GSTGSRSSNE KSRNSSKTST
     SLPRSYVNKP AKPWQRTSEG VIEGDVVVPP RALRKGEKKM LKFRIIAASL AIGRNYEMTL
     EEGSKIVEEM KAQSVAHQAA ISSALDIQQD KANDNGKLFM AYHNTHFISP HHASFLDSSS
     EDEWEDMEPV DLNESNAKGV EVTLKREEEK DWWAIYLRQE VNKCVRENWE NAHKVNILCY
     IAHLQFLRKI VLEENLIPSL MLSIIPSGYR SLVGESLNVE NIRRIAKWYH NTFKPSGGLV
     KYEVGTCGFD ATARLSEMVS QQVFENDADR AALLFAIFVA MECTSRICLN TQPIPRKWDE
     DVINSIKNGK DAKLSHSQPK SKERKQKCKE QSRKDSSGHS GYLGSVRDYW IEYWDKKQKR
     WICVDPLHGT VDEPNSIEDN LTKPVTYVFA IDNEGGVREV TARYASEFLR PDFRRLRTDQ
     KWIADTLKAK FIRANRERGE LEDLHMRQEL VNKPLPTTLS EYKNHPLYVL EKDLLKFEGI
     YPKPECQKPL GEVRGHKVYP RSTVYTLQSA LNWIKMARSV KEGEKAYKVV KARSNPRIPA
     EQREQRYLDV FGFWQTEPFR PPNVENGRIP RNEFGNVYMY QPTMCPIGAV HLRLPGLPSI
     ARRLGGLECV PAVVGWEFNS CSNFPM
//