ID E2AVY8_CAMFO Unreviewed; 3873 AA.
AC E2AVY8;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=Hemocytin {ECO:0000313|EMBL:EFN62410.1};
GN ORFNames=EAG_01600 {ECO:0000313|EMBL:EFN62410.1};
OS Camponotus floridanus (Florida carpenter ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Formicinae; Camponotus.
OX NCBI_TaxID=104421 {ECO:0000313|Proteomes:UP000000311};
RN [1] {ECO:0000313|EMBL:EFN62410.1, ECO:0000313|Proteomes:UP000000311}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C129 {ECO:0000313|Proteomes:UP000000311};
RX PubMed=20798317; DOI=10.1126/science.1192428;
RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G.,
RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D.,
RA Wang J., Liebig J.;
RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos
RT saltator.";
RL Science 329:1068-1071(2010).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL443213; EFN62410.1; -; Genomic_DNA.
DR STRING; 104421.E2AVY8; -.
DR InParanoid; E2AVY8; -.
DR OMA; PQYICEC; -.
DR Proteomes; UP000000311; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0008061; F:chitin binding; IEA:InterPro.
DR CDD; cd00057; FA58C; 2.
DR CDD; cd19941; TIL; 6.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR InterPro; IPR002557; Chitin-bd_dom.
DR InterPro; IPR036508; Chitin-bd_dom_sf.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF373; HEMOLECTIN, ISOFORM A; 1.
DR Pfam; PF08742; C8; 5.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR Pfam; PF01826; TIL; 5.
DR Pfam; PF00094; VWD; 5.
DR SMART; SM00832; C8; 5.
DR SMART; SM00494; ChtBD2; 1.
DR SMART; SM00041; CT; 1.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00231; FA58C; 2.
DR SMART; SM00214; VWC; 8.
DR SMART; SM00215; VWC_out; 4.
DR SMART; SM00216; VWD; 5.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 5.
DR PROSITE; PS50940; CHIT_BIND_II; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50022; FA58C_3; 2.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
DR PROSITE; PS51233; VWFD; 5.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000000311};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 90..121
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 186..217
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 353..532
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 720..899
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1198..1367
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1699..1763
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 1961..2116
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 2146..2287
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 2600..2777
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2933..3120
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 3264..3332
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 3761..3858
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT DISULFID 93..103
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 111..120
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 189..199
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 207..216
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 3873 AA; 433814 MW; EC5709FB72611AEE CRC64;
MLNDAPDIAI EDSYSVKNRK GGRRIFAGGC ARRPDTPING KLKCSLNSGC TASCAPDYQF
PNGALHLIIT CVDKEWHIEG TEWNSIPHCE PICMPECQNK GICIAPHHCN CPEHFSGPQC
QFENKPCLNY PPPVLNSYKK CNSKTCTVSC MEHFTFPDSS SVANLICKDG NWKPTRSDWV
SIPDCEPVCE PPCQNGGNCL PTNLCQCPQD YRGPQCQYSA NTCDTEKLRF NGGYYCTGDS
ESYSCTLNCP TGVEFEFSPA TAYICTYDKG VFEPQPIPQC KIDNNIKIIS LGTSYNTYVR
ESNHSWSMHD IFGTKHSQGI HDGYYGIHGS DASLYPIQSN GVMIFEMNKP KPKTCFTWSG
AHYKTFDDRI YSFDSDCAHT LLQETRDGIC TIVVLNSPGC KSGSSSRCVK IIKLFVHDKE
YTLTSNEMGM SVFSNSKRSL PIPVYLPGLR VDKSAHFTIV SLDSLGMKLK WDGALLLQIE
ASESMWNKTT GLCGTMNDDQ NDEFLTKSGS YASSIPTLAN SWRVDDLGEI CDDYPSTRHA
CESKNELTRD AFEFCNELLS NHKFKTCANT INFSELTESC LWDYCACEHD DKRKCACDTM
DVYIRQCAHK GIAQLTAWRN NDTCPIFCDG GRVYLSCGPK AEASCFSGIE AKQEISSECE
EGCFCPAGTL EHEGKCISPE ECPCRLRGKL FQPGTSVQKK CNTCTCISGK WVCTQIRCGA
RCAIVGDPHY TTFDGKHYDF MGKCKYYLMK GENYTIESEN VPCSGAISEN LGFTFVGSPS
CTKSVTINFK DTIIKLKQNR QITINGDEIT KFPILFNGAR IRIASSIFVV IRLPNALKVW
WDGVSRVYIN APAEFHDRTK GLCGTFTENQ KDDFITPDGD TETAAIVFAN KWKTNEYCID
ELESEPKHPC DLNPQRRARA EEYCSKIHSN IFSDCHWYVD PQEFYRDCMY DMCACDTDVT
SCLCPMLAAY AKDCATLGVK LLWRAEIDEC KIHCTGGQTY QICGNSCTRS CSDVSFYRDC
KQECVEGCNC PEDQTLNANG ECIPIVQCPC FFAGREYKPN HREVRPGNKG QECCSCIGGV
WECRLATPDE IREYPPVTDL FCPASKHLEV TDCQPVEQRT CSNMHIPIEQ TPSVCTSGCI
CKSGYVLDVT NGICVKKEDC PCLHGGKSYK EGSVMQDGCN TCTCKNTKWK CTDRTCAGIC
SVWGDSHYKT FDGKMYSFQG ICDYVLAKST LSKEECFDIS IQNVPCGTNG VACSKSIKLL
IGSGEQQEEL ILTKGKELPK ETYKRMTIRD AGLFVFVDVP DLGLVLQWDK GTRVYVRLNP
EWKSRTMGLC GDYNDNAEDD FKTPSGGISE VSVNLFGDSW KKNAFCLEPK DMQDDVCERH
PERKLWSLRQ CNVLKSPLFS SCHSEVEVEP YLRDCIFDTC SCDAGGDCEC LCTALAAYAH
ECNVRGVPVK WRTQELCPIQ CDEKCSTYSA CVSTCPRETC DNLMIVKHSS HLCTEDTCVE
GCQFKPCPDD HVYQNSSYTE CIPKSMCAKP FCIEMNGTTY YEGDRVSGDD CHSCFCSRGK
VTCNGEACTS TTMANNATIP STELQKCVDG WSLWINKDPE VKGKKFLDVE TLPNLMDFPD
VNGFPICDKE HMVDIRCRSV KEHLSPKESG LDVECSLERG LYCQSHLPDL PCVDFEISVL
CRCFEPTTHG VENTTTEIGK ECDVAHPNSP HPTNCQLFYH CIITPTGHEL VEKSCGPGTL
YNSKTQVCDW PAQVIRIRPE CFEPERTTQT SSGTEWSTNY ENTTTKTVST INVCKDGEMW
NECAIQCVRT CQYYRHILMT QGHCNEDTDC VAGCVLIDQP MCHFPKFWRD GITCVEANHC
PCKSHDGNSV APGAIKKESD CETCQCINNY YTCDTTFCYN VSSHEETVGT GKVPEQTETV
TTQSSFSTIS GSTWKTSPIT SSPSIEHTIF IQSTVTPPEE CDDANYVPLI RNLGKKVTIR
ASSSKNPVLQ FEDLLIYTEG NFPSSSEKFW EPEITNTDQW LDVEFDRPEP VYGVILQGAV
TKDEFVTSYK VLFSEDGQSF SYTLDHEKQP RVFRGPADRI QSVQQRFYQP IEARIIRINP
LTWHNGIAVK MEVLGCQDHI ISVMMTSTTE QSIIKTTMSE KIVRPVCEDS MGLNNGLMTI
EQISVSSSPQ LIQNLSLSSE GVWRAALDNP HQYIQFDFLE TRNLTGIITK GGDNAWTTVY
KVFYSNDGHH WNPVIDKNGN EKEFLGNFDA ESQQTNFFEK PLHARLLRVQ PIKWHDHVAL
KIEILGCYLA YPSMKTSEIT STTTSSSFER ECNICDGMDR TILNDETRCK CEDPYWWDGE
SCVSKWECPC IIGHVSYAVG SIYETEDCQQ CTCVLGGTPT CSPKKCETCL EPGLQSIVSK
LCTCLCKPCP AGTRHCPTSD VCVNETSWCD GIQDCPDDEK DCPEIISTTP IAVEISEITN
TTGLQIITTS SPIQIPLPCE KPFCSLGYRV VFKQSRLHHH KTNIKSNNRK GFTKTKGHKK
YMFHEHPMKN QENQPIVEDI QCPEFICIPK PPVLSDDKKP QTCPEAACPP QYEVVFERTS
MYKKHKCPKY ICRPLKPQEA VCNVIGRTFN TFDNMEYKYD ICNHILARDM YGNEWYITLE
KLCLDSHGQQ RCTRILVVTL NERTIVLYPN LQVDIDGYTF TAKQIARFGN RFPGFELLRT
GDRIIFLSHH YGFWVIWDSS TNVKIGVVAK LVGRVDGLCG YFDGNVANDR QTPEGTQARS
IIQFGNSWAM EGAKECDLHV CPHDVQEQAW TICSSVKSPM LLGACSAIVD LDRFVSRCVE
SVCSCLHSSN TSYEDCRCRL LTSFVSECEA AADYDTNLLT DWRTVYDCPA NCPSPFVHRD
CFRSKCEITC DNLHEVEPCP PMRDICFPGC FCPDGLVRRN DNECVPPARC LDCVCDGLGD
AKFIDFNRKN FRFTDNCTYL LSGNTMENAK NQRNETRAYQ LLITNGYCAT GICTEVITLL
YDEHVVQIKR AELSKDLQVS IDDSRVERFP TDHTWIVLNQ MSTGDVTLLV PFIQLEFVAF
RQNFAFTLKL PSHIFSDVTE GLCGNCNADT EDGFEKRGGE ITQDVEEFGK SWLIKDLPMQ
LGLSDRTCSS NRQSPCTPPP AEEDICKKLL DLPQFMQCHS IVDPKPYMDC CYDALCTGGS
YCDSLEMYAR KCLEAGLCPA WRTDEICPYE CSKDLIYQPC GSSCKETCDT LNKSDNPKCA
SGPVEGCFCP ENYVFHNDSC VLKQDCFICD QEGHVQGDIW YPDKCTECNC NNGVVNCQKT
ECPVLDTICD ENMTPILVNG TEERCCAKYL CAPKPTAPTI CIEPQEPECG FGQIMKAIID
ADGCHKFICQ CLPVNECPTF DELSNEVEQL QSGFVQVMNT SGCCPRPAKI CDPKTCPSAA
KCPDYFNTTV IVLANSCCPT YECVPPKDVC LYTNENQSSQ PVIVKQIGEE WKDGKCKTCL
CENSHDGPKA NCLIMECPSM DAQSDMELYV LEEIQLDDKC CPIFERTACR WEDKIYNVGE
NWKPNIKNAC LSMYCDKKLD SVQILTKVQE CNTVCDYGYT YQAPNDTSVN CCGTCIQVAC
IADGILKEVG EHWYSDDHCV TYFCESTNGS VYIHANTETC PEIDPQLESE YEVEERKIPE
KCCPEYVKTA CRSDGQIYKS GEKWRSLTDN CVIETCVGPN ITKRKEIEVC STQCSPLSLA
VSPAACPDIT DCPAESIYHD HCCKRCNLTI LNIEKEINKT CNTVFVDANN TVGMLVVNHL
LHGKCKNLDV VEGIKQCSGT CQSSTFFDSG SWSQVSNCYC CQAEKYSGLI QVNLTCEDGE
ILKKQLAIPS SCACQSCASS DVKSENRKTK SKS
//