ID A0A182W7M2_9DIPT Unreviewed; 1339 AA.
AC A0A182W7M2;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
OS Anopheles minimus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=112268 {ECO:0000313|EnsemblMetazoa:AMIN006345-PA, ECO:0000313|Proteomes:UP000075920};
RN [1] {ECO:0000313|Proteomes:UP000075920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MINIMUS1 {ECO:0000313|Proteomes:UP000075920};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles minimus MINIMUS1.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AMIN006345-PA}
RP IDENTIFICATION.
RC STRAIN=MINIMUS1 {ECO:0000313|EnsemblMetazoa:AMIN006345-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 112268.A0A182W7M2; -.
DR EnsemblMetazoa; AMIN006345-RA; AMIN006345-PA; AMIN006345.
DR VEuPathDB; VectorBase:AMIN006345; -.
DR OrthoDB; 3035244at2759; -.
DR Proteomes; UP000075920; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0030414; F:peptidase inhibitor activity; IEA:InterPro.
DR CDD; cd00033; CCP; 7.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 11.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR008197; WAP_dom.
DR PANTHER; PTHR19325; COMPLEMENT COMPONENT-RELATED SUSHI DOMAIN-CONTAINING; 1.
DR PANTHER; PTHR19325:SF573; SUSHI DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00084; Sushi; 7.
DR Pfam; PF00095; WAP; 1.
DR SMART; SM00032; CCP; 12.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 10.
DR PROSITE; PS50923; SUSHI; 9.
DR PROSITE; PS51390; WAP; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00302};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1339
FT /note="Sushi, von Willebrand factor type A, EGF and
FT pentraxin domain-containing protein 1"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008141001"
FT DOMAIN 60..113
FT /note="WAP"
FT /evidence="ECO:0000259|PROSITE:PS51390"
FT DOMAIN 107..171
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 173..237
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 238..296
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 300..357
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 358..417
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 493..560
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 561..632
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 985..1052
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1053..1130
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT REGION 34..70
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 781..846
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 939..975
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 34..50
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 792..806
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 939..964
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 142..169
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 328..355
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 388..415
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1101..1128
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
SQ SEQUENCE 1339 AA; 149466 MW; 674EF9F2EC207875 CRC64;
MLLLKKNLLC WLLLSAILVK TVYCVPTAAV TEADDENDSE WDEDEESSEA DDDGRVYKNP
RNSPSTECPR DEEQATLLGQ KCLRKCSSDE DCKSKKKKCL CDGACGMSCI KPDRECPDLA
QPSLGSVTLS GKHFGSRATY TCPHGYHVVG LQSRLCQADG SWAGSEPVCK QNIYCLEPPT
IEHARHSALP EQATFDLDST VQYHCHTGYA TAGFPRAKCL AVDGQASWYG PDISCEPRSC
GQPPDPAHGW HAGESYTFGG KVTYHCGEGY ELVGRAERYC QADGSWTPKE LPTCVLVTSV
QCPSPENPRN GKAIYTSTSY NSVVSYECRY GYTLVGESSR RCGADKRWTG TLPACKEINC
GHPGTLYNGW LENIESGTGL GASIIFRCHP EMLLVGNTSS VCQIDGRWRY PLPQCLAPCV
VPSISQGQVI PIEFDIDVNA TTVVPTSGSS SKVKHGTVLE VICDEHYEFP FSSLSPPTCN
NGTWSVIPRC APARCKTMPK PPKFGMVLAP KTEHGMRARF KCKDGYNLTA PGGKPIPDPN
NYVLICSFGN WTGEMPQCTE VYCQFPGYIP NGKVLLVGNM GLYDYRPYVR KVINNKQIMY
ECDKGYVIET GPPGATCIGG KWSPTELPIC TPGQHPRLRW NRRRRSLDLR RRFHRSNQLK
QHYRYLQRKL EEHDAYQQHK LVKRAAAPYR GQIPSKRHYT FEDFQRFRAK RSIEASRNAM
LRSAFRTAMV RERRELSDVE KAYSKYYERI KAKYRNYVQN LLGFNKARTM PVHDDIHVQD
GRWHSYPESD PVSRNRGMQN GRETEMGQLV GTPAPVPSAT RRPRPHKGKQ GKARKPKLPP
PITVPDINEQ SRYRLDFVES GESNRTTDHL NENDIYSNYF PPPLTGRYHT SWQFASEVTP
SHRNEGNQYR ATQFRGRNRT EPPVDENPFA LMEKLQSQII RRKRDTRDGG SSMEETKRGQ
KANRKAVNQT DVDSMDPARK VKFKGPCEPL ASEPYAQLEI VRPGKDPNET FGPGTIVRVT
CTKGYVSNII NPNATAKCVR GRWKPTKPTC SMKPCFVPST EHGKYYDASV DLTTIDAVRP
TTASLTPMEM IENGKMINFQ CDHGYNTQGP SNLRCWNGEW AVSSLPECLP APCVLPPIMH
AMYQGGYRAG LTIAHGSSVM IQCESGMGSA SSVQMDCALG SLTPETINCG YISSRKSRDD
DSSSIIVLDG NNVTSSEDEN DGRECGPPGK IHGSLVYKNG EQMDEGSDEG FPSGTEITFD
CIASITGEQT TWKIICEDGQ WIGRSMNCDD DDPLFHRLPS ANGSCMFRNN EPHVVSFYND
LEIREDIVEF PPGTTIISR
//