ID W2T008_NECAM Unreviewed; 984 AA.
AC W2T008;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE SubName: Full=Papain family cysteine protease {ECO:0000313|EMBL:ETN75330.1};
GN ORFNames=NECAME_12452 {ECO:0000313|EMBL:ETN75330.1};
OS Necator americanus (Human hookworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae; Bunostominae;
OC Necator.
OX NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN75330.1, ECO:0000313|Proteomes:UP000053676};
RN [1] {ECO:0000313|Proteomes:UP000053676}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24441737; DOI=10.1038/ng.2875;
RA Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT "Genome of the human hookworm Necator americanus.";
RL Nat. Genet. 46:261-269(2014).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI660305; ETN75330.1; -; Genomic_DNA.
DR RefSeq; XP_013297557.1; XM_013442103.1.
DR AlphaFoldDB; W2T008; -.
DR EnsemblMetazoa; NECAME_12452; NECAME_12452; NECAME_12452.
DR GeneID; 25352480; -.
DR KEGG; nai:NECAME_12452; -.
DR CTD; 25352480; -.
DR OrthoDB; 3132801at2759; -.
DR Proteomes; UP000053676; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02620; Peptidase_C1A_CathepsinB; 3.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 3.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 3.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 3.
DR SUPFAM; SSF54001; Cysteine proteinases; 3.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 2.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 3.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 3.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000313|EMBL:ETN75330.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053676};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..984
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004825091"
FT DOMAIN 101..353
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT DOMAIN 392..651
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT DOMAIN 729..981
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 984 AA; 112200 MW; F0A44F51FCF2FC8A CRC64;
MSLEVSVLKG KKISLLQMLL FLTLFVAILA ADEKIVEDAV KEESKALTGH ALAEYVRTHQ
SLFEVEESEE VNDRMKYLLP KHFMVKPKEE DRTKIQLDKE PPEKFDARDA WPYCREIIGH
VRDQSRCGSC WAVSAASTMS DRLCVQSKGK IKLHVSDTDI LACCGEFCGA GCSGGWPFQA
WKWVGKYGVC TGGDYRAKGV CKPYSFHPCG NHKNQVYYGE CPKGSWPTPR CEQFCRRGYT
KPYKRDKFYA KKSYWLPNNE KEIRLDIMKN GPVQAAFDVY EDFKLYKRGI YKHKEGIQTG
GHAVKIIGWG KENGTDYWLI ANSWSKDWGE SGFFRMVRGE NDCEMEDMIT AGIMMVKESP
ETAMRMKFLM DKKFATVPDS KYRKEVKVDE EPPEKFDARD EWPECFSIGT IRDQSSCGSC
WAVSAAEAMS DRLCIQSGGR IKVEISETDI LACCPRPLCG LGCNGGWSFE SWNFMVTHGV
CTGGKYRQKG VCKPYPFHPC GFKPNQTYYG DCPGRLWATP KCTDFCNRGY VKPYEEDKYY
GAHSEIIPAT SAYVVANDEK AIRKEIMKYG PVQAVFYTYE DFGFYDGGIY VQTAGQETGA
HAVKIIGWGE EKGVKYWLIA NSWNFVWGEK GKVKNLLPKH FFDLQMLLFL TLFVAILAAD
EKIVEDAVKE ESKALTGHAL AEYVRTHQSL FEVEESEEVN VRMKYLLPKH FMVKPKEEDR
TKIQLDKEPP EKFDARDAWP YCREIIGHVR DQSRCGSCWA VSAASVMSDR LCVQSNGKIK
LHVSDTDILA CCGEFCGDGC SGGWPFQAWE WVRKYGVCTG GDYRAKGVCK PYAFHPCGNH
ENQVYYGVCP KGSWPTPRCE KFCQRGYIKP YKKDKFYAKK SYWLPNDEKE IRLDIMKNGP
VQAAFDVYED FKLYKRGIYK HKEGIQTGGH AVKIIGWGKD NGTDYWLIAN SWSKDWGESG
FFRMVRGEND CEIEDMITAG IMMV
//