GenomeNet

Database: UniProt
Entry: E2AVY8_CAMFO
LinkDB: E2AVY8_CAMFO
Original site: E2AVY8_CAMFO 
ID   E2AVY8_CAMFO            Unreviewed;      3873 AA.
AC   E2AVY8;
DT   30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT   30-NOV-2010, sequence version 1.
DT   27-MAR-2024, entry version 62.
DE   SubName: Full=Hemocytin {ECO:0000313|EMBL:EFN62410.1};
GN   ORFNames=EAG_01600 {ECO:0000313|EMBL:EFN62410.1};
OS   Camponotus floridanus (Florida carpenter ant).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC   Formicidae; Formicinae; Camponotus.
OX   NCBI_TaxID=104421 {ECO:0000313|Proteomes:UP000000311};
RN   [1] {ECO:0000313|EMBL:EFN62410.1, ECO:0000313|Proteomes:UP000000311}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=C129 {ECO:0000313|Proteomes:UP000000311};
RX   PubMed=20798317; DOI=10.1126/science.1192428;
RA   Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G.,
RA   Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D.,
RA   Wang J., Liebig J.;
RT   "Genomic comparison of the ants Camponotus floridanus and Harpegnathos
RT   saltator.";
RL   Science 329:1068-1071(2010).
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GL443213; EFN62410.1; -; Genomic_DNA.
DR   STRING; 104421.E2AVY8; -.
DR   InParanoid; E2AVY8; -.
DR   OMA; PQYICEC; -.
DR   Proteomes; UP000000311; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR   GO; GO:0008061; F:chitin binding; IEA:InterPro.
DR   CDD; cd00057; FA58C; 2.
DR   CDD; cd19941; TIL; 6.
DR   Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR   Gene3D; 2.10.25.10; Laminin; 7.
DR   InterPro; IPR002557; Chitin-bd_dom.
DR   InterPro; IPR036508; Chitin-bd_dom_sf.
DR   InterPro; IPR006207; Cys_knot_C.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000421; FA58C.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF373; HEMOLECTIN, ISOFORM A; 1.
DR   Pfam; PF08742; C8; 5.
DR   Pfam; PF00754; F5_F8_type_C; 2.
DR   Pfam; PF01826; TIL; 5.
DR   Pfam; PF00094; VWD; 5.
DR   SMART; SM00832; C8; 5.
DR   SMART; SM00494; ChtBD2; 1.
DR   SMART; SM00041; CT; 1.
DR   SMART; SM00181; EGF; 2.
DR   SMART; SM00231; FA58C; 2.
DR   SMART; SM00214; VWC; 8.
DR   SMART; SM00215; VWC_out; 4.
DR   SMART; SM00216; VWD; 5.
DR   SUPFAM; SSF57196; EGF/Laminin; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR   SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 1.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 5.
DR   PROSITE; PS50940; CHIT_BIND_II; 1.
DR   PROSITE; PS01225; CTCK_2; 1.
DR   PROSITE; PS00022; EGF_1; 2.
DR   PROSITE; PS50026; EGF_3; 2.
DR   PROSITE; PS50022; FA58C_3; 2.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
DR   PROSITE; PS51233; VWFD; 5.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000311};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          90..121
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          186..217
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          353..532
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          720..899
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1198..1367
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1699..1763
FT                   /note="Chitin-binding type-2"
FT                   /evidence="ECO:0000259|PROSITE:PS50940"
FT   DOMAIN          1961..2116
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          2146..2287
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          2600..2777
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          2933..3120
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          3264..3332
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          3761..3858
FT                   /note="CTCK"
FT                   /evidence="ECO:0000259|PROSITE:PS01225"
FT   DISULFID        93..103
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        111..120
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        189..199
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        207..216
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   3873 AA;  433814 MW;  EC5709FB72611AEE CRC64;
     MLNDAPDIAI EDSYSVKNRK GGRRIFAGGC ARRPDTPING KLKCSLNSGC TASCAPDYQF
     PNGALHLIIT CVDKEWHIEG TEWNSIPHCE PICMPECQNK GICIAPHHCN CPEHFSGPQC
     QFENKPCLNY PPPVLNSYKK CNSKTCTVSC MEHFTFPDSS SVANLICKDG NWKPTRSDWV
     SIPDCEPVCE PPCQNGGNCL PTNLCQCPQD YRGPQCQYSA NTCDTEKLRF NGGYYCTGDS
     ESYSCTLNCP TGVEFEFSPA TAYICTYDKG VFEPQPIPQC KIDNNIKIIS LGTSYNTYVR
     ESNHSWSMHD IFGTKHSQGI HDGYYGIHGS DASLYPIQSN GVMIFEMNKP KPKTCFTWSG
     AHYKTFDDRI YSFDSDCAHT LLQETRDGIC TIVVLNSPGC KSGSSSRCVK IIKLFVHDKE
     YTLTSNEMGM SVFSNSKRSL PIPVYLPGLR VDKSAHFTIV SLDSLGMKLK WDGALLLQIE
     ASESMWNKTT GLCGTMNDDQ NDEFLTKSGS YASSIPTLAN SWRVDDLGEI CDDYPSTRHA
     CESKNELTRD AFEFCNELLS NHKFKTCANT INFSELTESC LWDYCACEHD DKRKCACDTM
     DVYIRQCAHK GIAQLTAWRN NDTCPIFCDG GRVYLSCGPK AEASCFSGIE AKQEISSECE
     EGCFCPAGTL EHEGKCISPE ECPCRLRGKL FQPGTSVQKK CNTCTCISGK WVCTQIRCGA
     RCAIVGDPHY TTFDGKHYDF MGKCKYYLMK GENYTIESEN VPCSGAISEN LGFTFVGSPS
     CTKSVTINFK DTIIKLKQNR QITINGDEIT KFPILFNGAR IRIASSIFVV IRLPNALKVW
     WDGVSRVYIN APAEFHDRTK GLCGTFTENQ KDDFITPDGD TETAAIVFAN KWKTNEYCID
     ELESEPKHPC DLNPQRRARA EEYCSKIHSN IFSDCHWYVD PQEFYRDCMY DMCACDTDVT
     SCLCPMLAAY AKDCATLGVK LLWRAEIDEC KIHCTGGQTY QICGNSCTRS CSDVSFYRDC
     KQECVEGCNC PEDQTLNANG ECIPIVQCPC FFAGREYKPN HREVRPGNKG QECCSCIGGV
     WECRLATPDE IREYPPVTDL FCPASKHLEV TDCQPVEQRT CSNMHIPIEQ TPSVCTSGCI
     CKSGYVLDVT NGICVKKEDC PCLHGGKSYK EGSVMQDGCN TCTCKNTKWK CTDRTCAGIC
     SVWGDSHYKT FDGKMYSFQG ICDYVLAKST LSKEECFDIS IQNVPCGTNG VACSKSIKLL
     IGSGEQQEEL ILTKGKELPK ETYKRMTIRD AGLFVFVDVP DLGLVLQWDK GTRVYVRLNP
     EWKSRTMGLC GDYNDNAEDD FKTPSGGISE VSVNLFGDSW KKNAFCLEPK DMQDDVCERH
     PERKLWSLRQ CNVLKSPLFS SCHSEVEVEP YLRDCIFDTC SCDAGGDCEC LCTALAAYAH
     ECNVRGVPVK WRTQELCPIQ CDEKCSTYSA CVSTCPRETC DNLMIVKHSS HLCTEDTCVE
     GCQFKPCPDD HVYQNSSYTE CIPKSMCAKP FCIEMNGTTY YEGDRVSGDD CHSCFCSRGK
     VTCNGEACTS TTMANNATIP STELQKCVDG WSLWINKDPE VKGKKFLDVE TLPNLMDFPD
     VNGFPICDKE HMVDIRCRSV KEHLSPKESG LDVECSLERG LYCQSHLPDL PCVDFEISVL
     CRCFEPTTHG VENTTTEIGK ECDVAHPNSP HPTNCQLFYH CIITPTGHEL VEKSCGPGTL
     YNSKTQVCDW PAQVIRIRPE CFEPERTTQT SSGTEWSTNY ENTTTKTVST INVCKDGEMW
     NECAIQCVRT CQYYRHILMT QGHCNEDTDC VAGCVLIDQP MCHFPKFWRD GITCVEANHC
     PCKSHDGNSV APGAIKKESD CETCQCINNY YTCDTTFCYN VSSHEETVGT GKVPEQTETV
     TTQSSFSTIS GSTWKTSPIT SSPSIEHTIF IQSTVTPPEE CDDANYVPLI RNLGKKVTIR
     ASSSKNPVLQ FEDLLIYTEG NFPSSSEKFW EPEITNTDQW LDVEFDRPEP VYGVILQGAV
     TKDEFVTSYK VLFSEDGQSF SYTLDHEKQP RVFRGPADRI QSVQQRFYQP IEARIIRINP
     LTWHNGIAVK MEVLGCQDHI ISVMMTSTTE QSIIKTTMSE KIVRPVCEDS MGLNNGLMTI
     EQISVSSSPQ LIQNLSLSSE GVWRAALDNP HQYIQFDFLE TRNLTGIITK GGDNAWTTVY
     KVFYSNDGHH WNPVIDKNGN EKEFLGNFDA ESQQTNFFEK PLHARLLRVQ PIKWHDHVAL
     KIEILGCYLA YPSMKTSEIT STTTSSSFER ECNICDGMDR TILNDETRCK CEDPYWWDGE
     SCVSKWECPC IIGHVSYAVG SIYETEDCQQ CTCVLGGTPT CSPKKCETCL EPGLQSIVSK
     LCTCLCKPCP AGTRHCPTSD VCVNETSWCD GIQDCPDDEK DCPEIISTTP IAVEISEITN
     TTGLQIITTS SPIQIPLPCE KPFCSLGYRV VFKQSRLHHH KTNIKSNNRK GFTKTKGHKK
     YMFHEHPMKN QENQPIVEDI QCPEFICIPK PPVLSDDKKP QTCPEAACPP QYEVVFERTS
     MYKKHKCPKY ICRPLKPQEA VCNVIGRTFN TFDNMEYKYD ICNHILARDM YGNEWYITLE
     KLCLDSHGQQ RCTRILVVTL NERTIVLYPN LQVDIDGYTF TAKQIARFGN RFPGFELLRT
     GDRIIFLSHH YGFWVIWDSS TNVKIGVVAK LVGRVDGLCG YFDGNVANDR QTPEGTQARS
     IIQFGNSWAM EGAKECDLHV CPHDVQEQAW TICSSVKSPM LLGACSAIVD LDRFVSRCVE
     SVCSCLHSSN TSYEDCRCRL LTSFVSECEA AADYDTNLLT DWRTVYDCPA NCPSPFVHRD
     CFRSKCEITC DNLHEVEPCP PMRDICFPGC FCPDGLVRRN DNECVPPARC LDCVCDGLGD
     AKFIDFNRKN FRFTDNCTYL LSGNTMENAK NQRNETRAYQ LLITNGYCAT GICTEVITLL
     YDEHVVQIKR AELSKDLQVS IDDSRVERFP TDHTWIVLNQ MSTGDVTLLV PFIQLEFVAF
     RQNFAFTLKL PSHIFSDVTE GLCGNCNADT EDGFEKRGGE ITQDVEEFGK SWLIKDLPMQ
     LGLSDRTCSS NRQSPCTPPP AEEDICKKLL DLPQFMQCHS IVDPKPYMDC CYDALCTGGS
     YCDSLEMYAR KCLEAGLCPA WRTDEICPYE CSKDLIYQPC GSSCKETCDT LNKSDNPKCA
     SGPVEGCFCP ENYVFHNDSC VLKQDCFICD QEGHVQGDIW YPDKCTECNC NNGVVNCQKT
     ECPVLDTICD ENMTPILVNG TEERCCAKYL CAPKPTAPTI CIEPQEPECG FGQIMKAIID
     ADGCHKFICQ CLPVNECPTF DELSNEVEQL QSGFVQVMNT SGCCPRPAKI CDPKTCPSAA
     KCPDYFNTTV IVLANSCCPT YECVPPKDVC LYTNENQSSQ PVIVKQIGEE WKDGKCKTCL
     CENSHDGPKA NCLIMECPSM DAQSDMELYV LEEIQLDDKC CPIFERTACR WEDKIYNVGE
     NWKPNIKNAC LSMYCDKKLD SVQILTKVQE CNTVCDYGYT YQAPNDTSVN CCGTCIQVAC
     IADGILKEVG EHWYSDDHCV TYFCESTNGS VYIHANTETC PEIDPQLESE YEVEERKIPE
     KCCPEYVKTA CRSDGQIYKS GEKWRSLTDN CVIETCVGPN ITKRKEIEVC STQCSPLSLA
     VSPAACPDIT DCPAESIYHD HCCKRCNLTI LNIEKEINKT CNTVFVDANN TVGMLVVNHL
     LHGKCKNLDV VEGIKQCSGT CQSSTFFDSG SWSQVSNCYC CQAEKYSGLI QVNLTCEDGE
     ILKKQLAIPS SCACQSCASS DVKSENRKTK SKS
//
DBGET integrated database retrieval system