ID A0A016VNU9_9BILA Unreviewed; 1549 AA.
AC A0A016VNU9;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=BPTI/Kunitz inhibitor domain-containing protein {ECO:0000259|PROSITE:PS50279};
GN Name=Acey_s0007.g3557 {ECO:0000313|EMBL:EYC29045.1};
GN ORFNames=Y032_0007g3557 {ECO:0000313|EMBL:EYC29045.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC29045.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC29045.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001343; EYC29045.1; -; Genomic_DNA.
DR STRING; 53326.A0A016VNU9; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:InterPro.
DR CDD; cd00109; Kunitz-type; 4.
DR CDD; cd22635; Kunitz_papilin; 1.
DR Gene3D; 2.60.120.830; -; 1.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 6.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 6.
DR InterPro; IPR013273; ADAMTS/ADAMTS-like.
DR InterPro; IPR010294; ADAMTS_spacer1.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR PANTHER; PTHR13723; ADAMTS A DISINTEGRIN AND METALLOPROTEASE WITH THROMBOSPONDIN MOTIFS PROTEASE; 1.
DR PANTHER; PTHR13723:SF179; PAPILIN; 1.
DR Pfam; PF05986; ADAMTS_spacer1; 1.
DR Pfam; PF00014; Kunitz_BPTI; 6.
DR Pfam; PF19030; TSP1_ADAMTS; 5.
DR Pfam; PF00090; TSP_1; 1.
DR PRINTS; PR01857; ADAMTSFAMILY.
DR PRINTS; PR00759; BASICPTASE.
DR SMART; SM00131; KU; 6.
DR SMART; SM00209; TSP1; 7.
DR SUPFAM; SSF57362; BPTI-like; 6.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 6.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 5.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 6.
DR PROSITE; PS50092; TSP1; 6.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR613273-3};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..1549
FT /note="BPTI/Kunitz inhibitor domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001490357"
FT DOMAIN 1078..1130
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1139..1191
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1253..1303
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1370..1420
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1439..1489
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1496..1546
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT REGION 1319..1342
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1321..1337
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 82..111
FT /evidence="ECO:0000256|PIRSR:PIRSR613273-3"
FT DISULFID 86..116
FT /evidence="ECO:0000256|PIRSR:PIRSR613273-3"
FT DISULFID 97..101
FT /evidence="ECO:0000256|PIRSR:PIRSR613273-3"
SQ SEQUENCE 1549 AA; 169446 MW; F90D17ABBA4BEE09 CRC64;
MKALLISVAL FHALDAFSLS FFSSQPTTPY LHPLSPPETN PASARAKRQA YQVYVDGESS
VSLDKTGQTE SGPWGQWTPE ECSRTCGGGV QIEKRQCSGD CTGPSVRYVS CNLDPCPDGT
DFRAEQCAKH NDDALDGHYH KWQPYKGKNK CELLCKPEGG NFYYKWDEKV VDGTKCDAKG
DDICVDGVCL PVGCDGKLGS ALKVDKCGVC DGDGSKCKTI EGTFDERNLS PGYHDVMRLP
AGATAIKIEE ARPSSNNLAL KNSTDHFFLN GNSMIQVERD VELNGVYFEY DDSKPERIIA
KGPLKEDVTV SVLFRKGNRD AAIKYEFSIP LVEDVDYMYK PGEWSSCSVT CGKGVQTRTP
YCIDTKTQRR VNDALCDNAN YTKPEFEKAC ETVDCEAEWF EGDWEPCSQT CGDQGEQYRV
VYCHQVYANG RRVTVEDGNC TSERPSVRQV CNRFSCPEWQ AGPWSACSEK CGDAFQYRSV
TCRSEKEGEE GKLLPADACD PEKTVESQRS CNLGPCEGLK FFTSEWKLCS KCNDTEETRE
VTCKDNMGRA YPLEKCLTEE EKEIPSDTRA CATQQPCIYE WTASQWSKCS TECGHGHKTR
RVICAIHEEG DITIVDEGLC SGEKPEDKTN CTNEEKCTGT WYSGPWSPCS AECGGGSQER
VAVCLNYDKK PVPEWCDEAT MPVLTQECNV DPCPTCFDSE FGCCPDNTTF ATGDFNQGCS
NCTLSEFGCC ADNYTEATGK NGMGCEEFVE SPLNLEEGKE EEGSGDQDAK PTECQVTNEQ
GEMATVDCAV ANATVTDVDD LFGNGTDTNA TMHCSKSEFG CCPDWFTAAE GPNNAGCPVF
VLGVCNETEY GCCHDDVTLA RGPNLEGCGE PTCAGSLYGC CKDRKTIAFG PHYAGCERSS
FPCELSSFGC CSDGETAALG PNGTGCGENC LTTKYGCCPD GKGIAKGHHN EGCGCVYAQY
GCCPDGKTSA KGAGFYGCPD SCAQSQFGCC PDGKTPARGS HKEGCPCQYT RYGCCPDGET
TALGPRNDGC DDCRYAKYGC CPDGESKAIG PDYAGCPSTT LAPFLLGGTV APSKISSCAL
PQDQGTVCSS GYKLVWYYDT TEGRCSQFWY GGCDGNDNRF ATKEQCETIC VEPPGIGRCY
LPKVEGPLRC DVPQARYWYD YNTKQCAAFW WRGCHGNANN FASWEECSTF CKDVGPFEIP
TTQPPPTQPQ PFIQPEIPEI GHEVVAHPVV DEVPAPRVDP RPRQPMPTIE EVCRSTQDSG
PCQDYSDQYY YDAYKGTCQT FIYGGCGGNL NRFRTEEECM QRCGFLNPTV AAVHAGHQAG
PVHQGHEHQH QQQPHPHEQE MVRPPPPVVP HQQPAHAVQP HSHVKSRQVC HLPLDVGKCQ
GSFDSWYYEM ATGSCVEFKY SGCSGNANRF ASRSDCEATC VRQHDASAGD ASEGATSICD
ESKDTGPCTN FVTKWYYNKA DGTCNRFHYG GCEGTGNRFD NEQSCKAACG NHQDACTLPK
VQGPCSGKHE YFFFNTISQQ CEKFVYGGCL GNTNRFSTQE ECQSRCPRK
//