GenomeNet

Database: UniProt
Entry: W2T008_NECAM
ID   W2T008_NECAM            Unreviewed;       984 AA.
AC   W2T008;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 38.
DE   SubName: Full=Papain family cysteine protease {ECO:0000313|EMBL:ETN75330.1};
GN   ORFNames=NECAME_12452 {ECO:0000313|EMBL:ETN75330.1};
OS   Necator americanus (Human hookworm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae; Bunostominae;
OC   Necator.
OX   NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN75330.1, ECO:0000313|Proteomes:UP000053676};
RN   [1] {ECO:0000313|Proteomes:UP000053676}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24441737; DOI=10.1038/ng.2875;
RA   Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA   Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA   Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA   Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA   Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT   "Genome of the human hookworm Necator americanus.";
RL   Nat. Genet. 46:261-269(2014).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KI660305; ETN75330.1; -; Genomic_DNA.
DR   RefSeq; XP_013297557.1; XM_013442103.1.
DR   AlphaFoldDB; W2T008; -.
DR   EnsemblMetazoa; NECAME_12452; NECAME_12452; NECAME_12452.
DR   GeneID; 25352480; -.
DR   KEGG; nai:NECAME_12452; -.
DR   CTD; 25352480; -.
DR   OrthoDB; 3132801at2759; -.
DR   Proteomes; UP000053676; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 3.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 3.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 3.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 3.
DR   SUPFAM; SSF54001; Cysteine proteinases; 3.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 2.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 3.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 3.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000313|EMBL:ETN75330.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053676};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..30
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           31..984
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004825091"
FT   DOMAIN          101..353
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   DOMAIN          392..651
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   DOMAIN          729..981
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   984 AA;  112200 MW;  F0A44F51FCF2FC8A CRC64;
     MSLEVSVLKG KKISLLQMLL FLTLFVAILA ADEKIVEDAV KEESKALTGH ALAEYVRTHQ
     SLFEVEESEE VNDRMKYLLP KHFMVKPKEE DRTKIQLDKE PPEKFDARDA WPYCREIIGH
     VRDQSRCGSC WAVSAASTMS DRLCVQSKGK IKLHVSDTDI LACCGEFCGA GCSGGWPFQA
     WKWVGKYGVC TGGDYRAKGV CKPYSFHPCG NHKNQVYYGE CPKGSWPTPR CEQFCRRGYT
     KPYKRDKFYA KKSYWLPNNE KEIRLDIMKN GPVQAAFDVY EDFKLYKRGI YKHKEGIQTG
     GHAVKIIGWG KENGTDYWLI ANSWSKDWGE SGFFRMVRGE NDCEMEDMIT AGIMMVKESP
     ETAMRMKFLM DKKFATVPDS KYRKEVKVDE EPPEKFDARD EWPECFSIGT IRDQSSCGSC
     WAVSAAEAMS DRLCIQSGGR IKVEISETDI LACCPRPLCG LGCNGGWSFE SWNFMVTHGV
     CTGGKYRQKG VCKPYPFHPC GFKPNQTYYG DCPGRLWATP KCTDFCNRGY VKPYEEDKYY
     GAHSEIIPAT SAYVVANDEK AIRKEIMKYG PVQAVFYTYE DFGFYDGGIY VQTAGQETGA
     HAVKIIGWGE EKGVKYWLIA NSWNFVWGEK GKVKNLLPKH FFDLQMLLFL TLFVAILAAD
     EKIVEDAVKE ESKALTGHAL AEYVRTHQSL FEVEESEEVN VRMKYLLPKH FMVKPKEEDR
     TKIQLDKEPP EKFDARDAWP YCREIIGHVR DQSRCGSCWA VSAASVMSDR LCVQSNGKIK
     LHVSDTDILA CCGEFCGDGC SGGWPFQAWE WVRKYGVCTG GDYRAKGVCK PYAFHPCGNH
     ENQVYYGVCP KGSWPTPRCE KFCQRGYIKP YKKDKFYAKK SYWLPNDEKE IRLDIMKNGP
     VQAAFDVYED FKLYKRGIYK HKEGIQTGGH AVKIIGWGKD NGTDYWLIAN SWSKDWGESG
     FFRMVRGEND CEIEDMITAG IMMV
//