ID W5JTC7_ANODA Unreviewed; 1191 AA.
AC W5JTC7;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=AND_000553 {ECO:0000313|EMBL:ETN67627.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN67627.1};
RN [1] {ECO:0000313|EMBL:ETN67627.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN67627.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN67627.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC000553-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02000140; ETN67627.1; -; Genomic_DNA.
DR AlphaFoldDB; W5JTC7; -.
DR STRING; 43151.W5JTC7; -.
DR EnsemblMetazoa; ADAC000553-RA; ADAC000553-PA; ADAC000553.
DR VEuPathDB; VectorBase:ADAC000553; -.
DR VEuPathDB; VectorBase:ADAR2_007488; -.
DR eggNOG; KOG4297; Eukaryota.
DR HOGENOM; CLU_012987_0_0_1; -.
DR OMA; FPPVCRY; -.
DR OrthoDB; 5306009at2759; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR GO; GO:0042806; F:fucose binding; IEA:UniProt.
DR GO; GO:0010185; P:regulation of cellular defense response; IEA:UniProt.
DR GO; GO:0001868; P:regulation of complement activation, lectin pathway; IEA:UniProt.
DR CDD; cd00033; CCP; 4.
DR CDD; cd00037; CLECT; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 4.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR018378; C-type_lectin_CS.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR006585; FTP1.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR PANTHER; PTHR19325; COMPLEMENT COMPONENT-RELATED SUSHI DOMAIN-CONTAINING; 1.
DR PANTHER; PTHR19325:SF567; LOCOMOTION-RELATED PROTEIN HIKARU GENKI; 1.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF00084; Sushi; 4.
DR SMART; SM00032; CCP; 4.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00607; FTP; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 4.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS00615; C_TYPE_LECTIN_1; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS50923; SUSHI; 4.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00302}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 975..1000
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 49..108
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 274..392
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 396..454
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 455..514
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 515..574
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 589..695
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 722..759
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 774..875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1077..1140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1153..1191
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 608..622
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 644..683
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 730..745
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 837..871
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1077..1137
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 79..106
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 425..452
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 485..512
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 545..572
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
SQ SEQUENCE 1191 AA; 128372 MW; 9C1DDD31AA52D227 CRC64;
MAWQPPSIRS SCNQTSPSIG AAKAAADTEP TVASLVGGPP GSAGRGSGKT CKFPGAPAHG
SVEFTDDALG DGTVATYYCE RGFELLGPSR RVCNDGQWIP EGIPFCGILL NVAAGKAPMQ
ISTEGSGIPQ KAIDGSTSAF FSADSCSLTK LERVPWWYVN LLEPYMVQLV RLDFGKSCCG
NGTPATIVVR VGNNRPDLGT NPICNRFTGT LEEGQPLFLP CNPPMPGAFV SVHLETAAPS
QLSICEAFVY TDQALPIERC PAFRDQPPGA SASYNGKCYI FYNRQPVTLR DALAFCRARG
GTLINESNPA LQGFISWELW RRHRSDTSSQ YWMGAVRDAQ DRNTWKWIGG EEVSVSFWNL
PGGEEDCARY DGSKGWLWSD TNCNTQLNFI CQHQPKACGR PEQPPNSTML APKGFDVGAV
VEYSCDEGHL LVGPQQRTCL ETGFYNEFPP VCRYIECGLP ASIPHGYYDL LNGTVGYLST
VQYRCADGYE MVGRAVLTCD IDERWNGPPP RCELIECDPL PTLFANGVVF APNQTVYGAV
AEVRCNRDYV PDGVPAIRCT ATGQWNHTLP ACIPRAAYGD AAVDDSDDDG FYDPTVVTVR
PVGPPNSSIR PHQTPTPSRP GRPQPNGGQR RPGTVRYTPP TAVPPAASGS SSSSNSNGPM
TAAATTTSTT TTTTTSSPAS TVAIEIDDGT DDDLSAVDHS KYPLYYGQRH PSDTFHYKYH
QHDDINGQRP DNDDDDDDDD DDQNADAHGD SANDNLLTFD DDFDSLEGYF IGTNEDYSLP
PPEVRPGGVG GAVVLREDGP SVRPHGGHYR PSVPPPSVVV LPNGSAGAVS VAPAKPSAKP
PQQPSPPARP VSGRPQAPPL TRRPPTNPVT AAPAPTTVRG RVHEQDILLS QHPQDNEIAG
SVNIRQDQSP KVNVPFAVDN VNTGPVDGPV DGGSGGQRGA GGGAIDIGGT ASGGTNLAGA
TAGDRKESKN AKLNLGAIVA LGAFGGFVFL AAVITTIVIV VRRILWWKHL LLGCSSFSSF
TLLCLLPPSR NRTTNQHYRH RASPDCNTVA SFSSSSSESR NGLNRYYRQA WENLHESASK
SHHSGHSGLK RKETMDAPVN RSRSRENLDV SRSRDLDRSR ENLSAARSRD YGRDSMAMRD
GSEMVVSDVC VKGEKKRHHH HHHKSSHQQR GDFREPNIMG SGNGRREHCH Y
//