ID B7P6X9_IXOSC Unreviewed; 2842 AA.
AC B7P6X9;
DT 10-FEB-2009, integrated into UniProtKB/TrEMBL.
DT 10-FEB-2009, sequence version 1.
DT 27-MAR-2024, entry version 96.
DE SubName: Full=Hemolectin, putative {ECO:0000313|EMBL:EEC02351.1, ECO:0000313|EnsemblMetazoa:ISCW001097-PA};
DE Flags: Fragment;
GN ORFNames=IscW_ISCW001097 {ECO:0000313|EMBL:EEC02351.1};
OS Ixodes scapularis (Black-legged tick) (Deer tick).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Acari;
OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes.
OX NCBI_TaxID=6945;
RN [1] {ECO:0000313|EMBL:EEC02351.1, ECO:0000313|Proteomes:UP000001555}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wikel {ECO:0000313|Proteomes:UP000001555}, and Wikel colony
RC {ECO:0000313|EMBL:EEC02351.1};
RG Ixodes scapularis Genome Project Consortium;
RA Caler E., Hannick L.I., Bidwell S., Joardar V., Thiagarajan M., Amedeo P.,
RA Galinsky K.J., Schobel S., Inman J., Hostetler J., Miller J., Hammond M.,
RA Megy K., Lawson D., Kodira C., Sutton G., Meyer J., Hill C.A., Birren B.,
RA Nene V., Collins F., Alarcon-Chaidez F., Wikel S., Strausberg R.;
RT "Annotation of Ixodes scapularis.";
RL Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ISCW001097-PA}
RP IDENTIFICATION.
RC STRAIN=wikel {ECO:0000313|EnsemblMetazoa:ISCW001097-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the serine protease inhibitor-like (TIL domain-
CC containing) family. {ECO:0000256|ARBA:ARBA00007611}.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ABJB010028763; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010089286; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010444156; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010445214; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010592265; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010719149; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010722818; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010858635; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010860603; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ABJB010984592; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; DS648544; EEC02351.1; -; Genomic_DNA.
DR RefSeq; XP_002409356.1; XM_002409312.1.
DR STRING; 6945.B7P6X9; -.
DR PaxDb; 6945-B7P6X9; -.
DR EnsemblMetazoa; ISCW001097-RA; ISCW001097-PA; ISCW001097.
DR KEGG; isc:IscW_ISCW001097; -.
DR VEuPathDB; VectorBase:ISCI001097; -.
DR VEuPathDB; VectorBase:ISCI005085; -.
DR VEuPathDB; VectorBase:ISCI006490; -.
DR VEuPathDB; VectorBase:ISCP_008378; -.
DR VEuPathDB; VectorBase:ISCP_014312; -.
DR VEuPathDB; VectorBase:ISCP_023268; -.
DR VEuPathDB; VectorBase:ISCW001097; -.
DR HOGENOM; CLU_224851_0_0_1; -.
DR InParanoid; B7P6X9; -.
DR OMA; NINWRIE; -.
DR Proteomes; UP000001555; Unassembled WGS sequence.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR CDD; cd00057; FA58C; 2.
DR CDD; cd00112; LDLa; 1.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR InterPro; IPR025155; WxxW_domain.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF373; HEMOLECTIN, ISOFORM A; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR Pfam; PF13330; Mucin2_WxxW; 4.
DR Pfam; PF01826; TIL; 2.
DR Pfam; PF00094; VWD; 3.
DR SMART; SM00832; C8; 3.
DR SMART; SM00231; FA58C; 2.
DR SMART; SM00192; LDLa; 1.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR PROSITE; PS01285; FA58C_1; 1.
DR PROSITE; PS01286; FA58C_2; 1.
DR PROSITE; PS50022; FA58C_3; 2.
DR PROSITE; PS50184; VWFC_2; 1.
DR PROSITE; PS51233; VWFD; 3.
PE 1: Evidence at protein level;
KW Copper {ECO:0000256|ARBA:ARBA00023008};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Proteomics identification {ECO:0007829|PeptideAtlas:B7P6X9};
KW Reference proteome {ECO:0000313|Proteomes:UP000001555};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 152..323
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1216..1375
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1396..1540
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1835..2012
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2163..2340
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2732..2797
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT REGION 688..735
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1231..1254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 696..710
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1238..1254
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EEC02351.1"
FT NON_TER 2842
FT /evidence="ECO:0000313|EMBL:EEC02351.1"
SQ SEQUENCE 2842 AA; 313125 MW; 1987FC700D6F4A13 CRC64;
CHEAVHPEQF FELCIQAACS CASDARDCIC PVLSHYAHAC AQRGHVLDWR NDVPQCGVSC
ENGQSYQSCA SVCDSSCSLI GRNVSCASRC VEGCACGPGH SADSRGHCVP VSECGCLFRG
HEFPAGFKQR RHSEHCICAS GHWSCTQQQC AGTCALWGDS HVTTFDGAAL DFSGVCDYVL
VKGGLPNNQS ISVVVQNVPC GRRASCAKAV TITAGTIFGR VCTEVLVLTE GHPVPPLEPD
SRMQVQTLGL FVGVDVFGGL AVFWDKHARV YVIAGPAWSD KLQGLCGNFN GDGTDDFRAP
SGGPTLPDAI EFVNSWKLHP HCAPAVAQQD GCLAQPDRKK WAASQCQILR EDLFRPCHSE
VDVQPFYDRC VSDTCACDTG GDCECLCTAI GAYAHQCASR GVRVLWRSQE LCPLQCDTCD
EYSACVSFCK PPTCEHPVVH DACKVSPPLC LEGCTPALCP NGMVHRTATD RTCVPVQDCG
VPPETACSIG GLHFKEGQRV ATSNACQSWY VGLIFRVPIS MPGGICFSFC LRGEVRCVGK
PCPIEVQVCV QTGWSDWESM NSPEYQVFEN SSFRLFFFFF SLQAHGDHCP VEHMVAVECR
TVGSKTPWNE TGESVLCSPH TGLRCLREDQ DDRELCSDYE MRVFCRCPEE EKRIVFCQKP
ARESAPTIRS TRYTFTVKLQ NANVPIGPTA ASAATPPAAR SCTSSSPRAS ARPKRAAACP
AASPPTASRT SSTSTKTPAY LCSTVPACCP NEKPPLRLVA FCFPLCDMSL CTAYTPEPEG
GCWTPWFSAD SPEGPGDFED LRPMQAHGKV CFQPAAINCR TVAAERFVSS LFLFVTLHVG
NTDWSQSGQK VTCAVDRGLQ CWNSENSPEQ CQDYEVRFYC PCVESVYSSG VCTCLFVEVT
YLRFGCGLTW KSPGLAFCTD SPKCKHHHSS LLEDIKQKGK NATSSENTDS CVLHQVLQLF
FETVLLLSPF LCPHRIRCFS VCTQQCHSKI GNNHAKMTSF LELESVSSRR VCTCFFVEDT
HPRFGFTWNS PAFVFWTESP KCKHHYKTPT PVPWTPVPEP HCEHGWSSWL NGHSLDALGD
LETLQSARDT GLLQCPSVTA MECREADSQV PWDQSGLVGV QCDLTTGLVC RNKGQPKGRS
CADFEVRFFC DCGAETPPPP PTLPPATPTP AILVETTPPA CSFWSDWVDE NHPGLEGQDG
EREPGLNLRP ELGDFCQEFI RLIDGPEPLK DSLLKASSSR DSKSGPQASR FSSDKQKPFA
LIRDKLYGSW VPHLNDKEFI EAELVRPQTV YGVETRGDSQ LGAWVTSFTV MFSQDGVAYG
QLSNTDGSPK VFSGNHDAQS KNRQLFEHPF QAKFIRLVPK SWEGRIALKW DLLGCSHVSL
HPFLPDGKKP FRCPVCREPM GLQSGLIGDN QMEASSYRDE EHAAHYGRVE QRGWVAGVAD
KQQYLQVDFE DSRNLTAVVT QGRPEIPQWV KSFIVQTSNN GKRWNTIKDK QGGEMVFSGN
FDSSTPVTSV FPKTVSARYI RIVPVTWKNW IALRLEVLGC EHGKYYHGLW ALEESTPTPE
LTAVHCPEVS GENATACGQG CPVGFACDGL RCVHEADCPC VRDGKHFPVG GILETRTCQE
CQCQLGGRSN CLPIACPACP EGRAKILNPD CSCDCGGCPE GQRLCPTSGE CVNEDLWCDG
VVNCPDDEAQ CEAKTPAPCP PVAKVFCGKG QTMTLQTDDQ GCEHFSCGKT YLPRFVWKTL
RKIYIACNFL LSSVGETSCS CTSHSRVFVN VLALRDQPHE VLLLAVAESG ERPIIVPPPP
SKYAVLYFAN SKFNLCTFTA KGHIFPHLYA SSIDASCELV GHHFSTFDGR EFDYSYCHHI
LLQDVVGGNL TISVDRHCSL ENQESCPKRV IVEHGHHRIV LDADLSATVD EDEYSAHQLQ
LLTKRLPDFE LERVGERIHF RSRVHNLVLK LDVRGRVEIE VDSSLKGVLG GLCGFYNELV
SDDLTTPAGH QVATTDEFGD SWATPGTAQD CLPLACPKDT LLQAAKICNR LKEEPLSQCQ
GIEDQLENCL SASCECLQRG NTTAESCACD AYLDAVTQCE KDGGEKVVES LRGWRLEYGC
VPDCPPEMEW LDCGPDCQIT CENFLSGDLS CATKKSCNPG CFCPPGTVLD RDHCKKADEC
ADKVCTGYGD PMIETFDGWK FHVQSNGHFE LVFDREGRFM VDAVTDSCEE RATCIIGLDI
KHENHVAKIR RHKQVLIDNE EYDRDQLPWT GMGVTIFAMP GKVTVVVFTG LGVQIRYNEI
SAAFSIHVPS KTFFNKTAGL CGNCNGKPDD DKTTKNGTVT EELVTFVCSW ETHISEEECE
PEKPPVPGAC SEILDREVFG QCLSLVDTTK FVDQCRHDTS FSVQENTAVC SSLLEMARAC
YTRYMSCHDA CPQSCDAERE NEASSKFVKG SSKTHYHKSD CRNLKVDGCF CPDGQVFEDG
DCKPKDVCDL CDDQGHKLGD VWKTDQCTSC TCKKDGKMDC QHEVCPPDPI CRENQKLLRH
LPHQHLCFRT LPAEDIVEEC PELSLPRCER GDVTRTKTDS KGCPMYYCEC DPALCPPVVW
PMDLEPGQEV EMTPVGCCAT LHAVCRPEKC PQPPHCAPPL ELLETPGVCC RSFKCRPPEK
VCPYAHKFVV VDGKEVAIEP ALQNVSFYEP GQTWRDGLCS NCSCDESETD LMARCNLERC
YESAELPDDK DYFLSIVDVP NRCCPALVRL FCKDEYGNIR EPGDEWQSRD DQCKSHICEQ
TALGEVHKVE RSIICPKCPE NAQALPLGPG ECCPRCQIVA CEEAGVMHPV GSHWNSSSHL
CYRAECVQAG DTARIVYTSP SC
//