ID A0A182K7Z7_9DIPT Unreviewed; 970 AA.
AC A0A182K7Z7;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
OS Anopheles christyi.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43041 {ECO:0000313|EnsemblMetazoa:ACHR006882-PA, ECO:0000313|Proteomes:UP000075881};
RN [1] {ECO:0000313|Proteomes:UP000075881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ACHKN1017 {ECO:0000313|Proteomes:UP000075881};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles christyi ACHKN1017.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ACHR006882-PA}
RP IDENTIFICATION.
RC STRAIN=ACHKN1017 {ECO:0000313|EnsemblMetazoa:ACHR006882-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182K7Z7; -.
DR STRING; 43041.A0A182K7Z7; -.
DR EnsemblMetazoa; ACHR006882-RA; ACHR006882-PA; ACHR006882.
DR VEuPathDB; VectorBase:ACHR006882; -.
DR OrthoDB; 5306009at2759; -.
DR Proteomes; UP000075881; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR CDD; cd00033; CCP; 4.
DR CDD; cd00037; CLECT; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 4.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR018378; C-type_lectin_CS.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR006585; FTP1.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR PANTHER; PTHR19325; COMPLEMENT COMPONENT-RELATED SUSHI DOMAIN-CONTAINING; 1.
DR PANTHER; PTHR19325:SF573; SUSHI DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF00084; Sushi; 4.
DR SMART; SM00032; CCP; 4.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00607; FTP; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 4.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS00615; C_TYPE_LECTIN_1; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS50923; SUSHI; 4.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00302}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius}; Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 942..968
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 21..80
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 244..362
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 366..424
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 425..484
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 485..544
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT REGION 559..621
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 654..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 559..574
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 605..621
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 706..728
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 747..761
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 784..800
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 816..851
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 51..78
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 395..422
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 455..482
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 515..542
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
SQ SEQUENCE 970 AA; 104467 MW; BF82CDB677B4C7C9 CRC64;
MNAQENFTSA SLVAGGASAT KGCPFPGVPA HGSVIFSDDS LPNNTVATYY CERGFELLGP
SRRVCIDSQW IPEGIPFCVL NVAAGKAPMQ ISTEASGIPQ KAIDGSTSAF FSPDTCSLTK
AERVPWWYVN LLEPYMVQLV RLDFGKSCCG KGKPATIVVR VGNNRPDLGT NPICNRFTGS
LEEGQPLFLP CNPPMPGAFV SVHLETNAPS QLSICEAFVY TDQALPIERC PAFRDQPPGA
SASYNGKCYI FYSRQPATLR DALAFCRSRG GTLINESNPA LQGFISWELW RRHRSDTSSQ
YWMGAVRDAQ DRNTWKWIGG EEVSVSFWNL PGGDEDCARY DGSKGWLWSD TNCNTQLNFI
CQHQPKACGR PEQPPNSTMI APKGFDVGAV VEYSCDEGHL LVGPKQRTCL ETGFYNEFPP
VCKYIECGLP ASIPHGYYDL INGTVGYLST VMYRCAEGYE MVGRAVLICD IDERWNGPPP
RCELIECDPL PTLFSNGVIV SSNQTVYGTR AEVHCNRGFI PDSEPEIVCT ASGQWSHPLP
KCVPNPADQN EVPVTARPSV VSTTPSSSRA PPVLIVSPTT PASIGGIPSR RQPTGGRRPV
STVRYPTQAP PQSSTVSTAE STTSIVINMA GPTTAQPSRT TSTTESPLFS IEIDEGMDDD
GSYLPHGGPN RKHTNRDENN DNNENDDDDD DYFYRKPNLH GGYRPDVGTD YDDNDDAHDH
HANDDLPTID DNFDSLEGFF IGTNEDYSLP PPPPPPPSSE VRPGSMGEEA IRPFRPQQPS
VVILPKEPGS SSSSASKPAT PKPTIYEPTN PPAVVSMTPP AQTHRPQVTG TTGRRPVTTT
TTRVRNQHPS QEQDILLSHH PQDNEIPGSV NIRQDQSPKV NVPFAVNNVD DLVPADGSNG
SGGTDGLITS FGGANGGAGG LGGAGGSGAG DRKESKNAKL NLGAIVALGA FGGFVFLAAV
ITTIVIVVRR
//