ID A0A182PGJ0_9DIPT Unreviewed; 2683 AA.
AC A0A182PGJ0;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=Papilin {ECO:0008006|Google:ProtNLM};
OS Anopheles epiroticus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=199890 {ECO:0000313|EnsemblMetazoa:AEPI006049-PA, ECO:0000313|Proteomes:UP000075885};
RN [1] {ECO:0000313|Proteomes:UP000075885}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Epiroticus2 {ECO:0000313|Proteomes:UP000075885};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles epiroticus epiroticus2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AEPI006049-PA}
RP IDENTIFICATION.
RC STRAIN=Epiroticus2 {ECO:0000313|EnsemblMetazoa:AEPI006049-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 199890.A0A182PGJ0; -.
DR EnsemblMetazoa; AEPI006049-RA; AEPI006049-PA; AEPI006049.
DR VEuPathDB; VectorBase:AEPI006049; -.
DR OrthoDB; 2910701at2759; -.
DR Proteomes; UP000075885; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:InterPro.
DR CDD; cd00109; Kunitz-type; 6.
DR CDD; cd22593; Kunitz_conkunitzin; 1.
DR CDD; cd22639; Kunitz_papilin_lacunin-like; 1.
DR CDD; cd00199; WAP; 1.
DR Gene3D; 2.60.120.830; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 10.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 5.
DR InterPro; IPR013273; ADAMTS/ADAMTS-like.
DR InterPro; IPR010294; ADAMTS_spacer1.
DR InterPro; IPR036645; Elafin-like_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR010909; PLAC.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR008197; WAP_dom.
DR PANTHER; PTHR13723; ADAMTS A DISINTEGRIN AND METALLOPROTEASE WITH THROMBOSPONDIN MOTIFS PROTEASE; 1.
DR PANTHER; PTHR13723:SF179; PAPILIN; 1.
DR Pfam; PF05986; ADAMTS_spacer1; 1.
DR Pfam; PF07679; I-set; 2.
DR Pfam; PF13927; Ig_3; 1.
DR Pfam; PF00014; Kunitz_BPTI; 10.
DR Pfam; PF08686; PLAC; 1.
DR Pfam; PF19030; TSP1_ADAMTS; 6.
DR Pfam; PF00090; TSP_1; 1.
DR PRINTS; PR01857; ADAMTSFAMILY.
DR PRINTS; PR00759; BASICPTASE.
DR SMART; SM00409; IG; 3.
DR SMART; SM00408; IGc2; 3.
DR SMART; SM00131; KU; 10.
DR SMART; SM00209; TSP1; 7.
DR SMART; SM00217; WAP; 1.
DR SUPFAM; SSF57362; BPTI-like; 10.
DR SUPFAM; SSF57256; Elafin-like; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 3.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 7.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 6.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 10.
DR PROSITE; PS50835; IG_LIKE; 3.
DR PROSITE; PS50900; PLAC; 1.
DR PROSITE; PS50092; TSP1; 5.
DR PROSITE; PS51390; WAP; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR613273-3}; Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..2683
FT /note="Papilin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008131406"
FT DOMAIN 1473..1523
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1532..1582
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1591..1641
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1651..1701
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1710..1760
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1789..1841
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1849..1899
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1925..1977
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1986..2036
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 2052..2104
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 2241..2286
FT /note="WAP"
FT /evidence="ECO:0000259|PROSITE:PS51390"
FT DOMAIN 2302..2393
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2399..2484
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2533..2629
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2635..2674
FT /note="PLAC"
FT /evidence="ECO:0000259|PROSITE:PS50900"
FT REGION 694..1093
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1224..1262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 706..738
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 789..1080
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1231..1249
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 62..98
FT /evidence="ECO:0000256|PIRSR:PIRSR613273-3"
FT DISULFID 66..103
FT /evidence="ECO:0000256|PIRSR:PIRSR613273-3"
FT DISULFID 77..88
FT /evidence="ECO:0000256|PIRSR:PIRSR613273-3"
SQ SEQUENCE 2683 AA; 290762 MW; 546FFC2B1CB0E0DE CRC64;
RYTKLLVLLV TLQLSFTESR RFGARHKRQH GSNLYQPESF IIPGGEGSPQ DRWSDWGAPS
ECSRTCGGGV AYQERECLDL DHTGAPVCTG GKRKYFSCNT QDCPERELDF RKQQCTEFND
VPFEGHRYQW VPYTRAPNPC ELNCMPVGER FYYRHRAKVT DGTRCNDESF DVCVDGTCQP
VGCDMMLGSQ AKEDKCRICR GDGSSCKTAS GLLDANNLQV GYNDLLLIPS GATNVVISER
APSNNYLAIR NLTGFYHLNG NYRIDFPRTM HFAGSDWHYE RRPQGFAAPD RLTCLGPTTE
SVYLVLLSQD RNVGVNYEYS VSSQSAPVDE PDSYSWMYTA FEPCTVTCGG GVQTRNVTCN
SKGSLRQVDE GLCNLGEKPP TTQKCGQQAC PPRWFEGPWS NCSKPCGNEG KQTREVYCER
VSADGETKKI EDDVCLDQVG NKPATERECN QGIICPEWYV GRWSPCNKLC GEGERSRKVV
CYRKENGRIT VLDDEECITE KPAVSEKCML RPCEGVDYVT SSWSGCDDCG ATVETRTVHC
ASKSGTVYDA SFCADRQMPE LRRECKLTPC EYQWYTSQWS KCSAVCGKGV QTRTVVCGVF
DGQSLKRADD DYKCDPEQKP EKERECEGPP ECPGQWFAGP WTDCDKSCGG GMMSRKVLCL
ANGTVVPETN CDVDKIQLAT ESCNKHDCTE DEVIPVEPRS KQPEVDDDYD EDELCDEDEE
DGGSGSGDDD DDEEGSSVTT EGQDGVMMVT EDTSLAEGIE LGVSGTTESS LETDEMMLSD
ATGFDTGATD PDEGSTTEAT VEGSGDGSSL VTDVSEGSGD EGEATTVQAA SAASEDASSD
AVSSSTEASA STGSSVELDG SSSSTEASAQ SSTESSDAVS PSESSSVEDS QPTTTDSAST
TDASDPTTEV TETSISSTDE ESSTTDASVS STTMDLTSTM DDASTTDVPS TTDGSEAMST
TIASDMATGS SDAASSESST DESSTDSSTK LVDTSKSTEA SLESSTSESA VTGTSVDGTS
ESGPTEASVT GASTDDSSTT VELTEPTVTG STEDFTGSTV TEESVDETTF DIWASSTGST
DEETDDSESS TPYSLTSIIA KEQKPRKCKP RPKVPQCAKS THGCCPDGKT KATGPFLEGC
TLAETCKDTK HGCCPDGVSP AKGPNDKGCP KAECADTLFG CCPDKVTPAE GNDAEGCPVE
TTTTAASCTA GKFGCCPDGA TEAKGPNGKG CPGAKEEDEK EEESEKPSGT EQPSVEGPAG
AEGCSSTEHG CCPDNTTAAS GPDGQGCEPC QREPFGCCPD GKTPAHGYNG EGCCLVTPYG
CCPDNIVPAH GPNLEGCDCQ YAPYGCCPDN KTSARGHDNE GCGCQYAKHG CCPDKETEAT
GPEFEGCPCH AYQFGCCPDG VTAAKGPHNQ GCHCSHSEFK CCSDGKTPAK GPDGEGCTCA
DSMHGCCPDG VSEAQGSKFE GCTDVPESPQ KACALPKDKG PCHNYTVKHF FDVEYGGCGR
FWYGGCEGNN NRFDSAEECK TVCETPTGKD VCQLPKISGP CTGHYNMWYY DAERNMCSQF
TYGGCLGNAN RFEKLEDCKA MCSVDDSKPP CEQPMDAGPC NGTFERWYYD KDSDACHPFY
YGGCKGNKNN YPTEASCGYH CKKPGVHKPS CSEPLEQGSC NAQQARWYFA SDSQKCMPFY
FTGCDGNGNQ FVSRDECEDR CPPKVEKDIC FLPAEIGECQ NYTAHWYFDT KDERCRQFYY
GGCGGNGNNF ADEQACISRC IEEVPKPPAE VPAPAVPAPE RVPFDRSQCQ LPMDNGDREC
SPYVARYYYD AQTGSCSRFT YTGCGGNGNN FQTEEECLQA CGAVPQDVCQ LPDAYGDCTG
NEERWFFEPN EQRCVRFAYS GCGGNGNNFA SQAECERTCP VNRPVVDVDL RGPAEQQPTS
NVSRCEDVPD FGDGEGDGEL ILFYYNAERQ SCERFRYSGA GGNKNRFNSE EECERVCGMY
RGVDVCRDPV ETGPCTGSTQ MYYYDPRALA CYSFNYSGCE GNGNRFTTPE ECEETCLPRR
TNIEDAEKVR VCSLPLQQGS RCKAKSRKRW YYDPERETCF AFRYLGCGGN RNSFPSYKNC
RTYCNIECKS STLLQKMGKW LLLLMVLPLL LLVCLTLGVG CRGMHLVVRG LANLTNRALC
SSFPTVNAVN PCEQYEHECS QLQCQYGIAK SYDPSNGCER CQCNDPCAGY YCPQGSQCVV
DVQAGGAVRG TGSEFVGVCR ESQKPGDCPE LANATYCSTD CYSDADCRGN NKCCQAGCAQ
ICISPVDRPV APTGVQPGAR GPVVLQEVPQ EELDVKSEEG GIATLRCYAT GFPPPSITWR
KGQIMLNTNQ GRYVLTSNGD LQIVQLHRTD SGTYVCVAEN GVGEPVLREV QLTVNDPVPR
DAYIAGSLND TQVVELEGPA TLRCPAGGHP KPIVTWWRET FMMPLKIVNR DYSLYLARVR
LEDLGPYVCQ AYSGAGKGIS RTVTLLGYNP VAPINPLDEK YMKYVVPAPA VRPSLVPVDR
YPSRPRPPVA QVPPVVLRPP APVRVQMYFP QGRDLRPNSN FTVNCTVDGY PRPTVNWFKD
GEMLVPTDRI HITDTHLLIV TGAIPSDSGR YKCLARNEMS EAFQENSVHV EGVYVPPGCT
DNQLLAKCDL IVAGRYCNHK YYARFCCRSC TLAGQINVNN RYR
//