ID A0A0T6AYV5_9SCAR Unreviewed; 2262 AA.
AC A0A0T6AYV5;
DT 17-FEB-2016, integrated into UniProtKB/TrEMBL.
DT 17-FEB-2016, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE RecName: Full=Hemocytin {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=AMK59_6836 {ECO:0000313|EMBL:KRT80045.1};
OS Oryctes borbonicus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Scarabaeiformia;
OC Scarabaeidae; Dynastinae; Oryctes.
OX NCBI_TaxID=1629725 {ECO:0000313|EMBL:KRT80045.1, ECO:0000313|Proteomes:UP000051574};
RN [1] {ECO:0000313|EMBL:KRT80045.1, ECO:0000313|Proteomes:UP000051574}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=OB123 {ECO:0000313|EMBL:KRT80045.1};
RC TISSUE=Whole animal {ECO:0000313|EMBL:KRT80045.1};
RA Meyer J.M., Markov G.V., Baskaran P., Herrmann M., Sommer R.J.,
RA Roedelsperger C.;
RT "Draft genome of the scarab beetle Oryctes borbonicus.";
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRT80045.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIG01022538; KRT80045.1; -; Genomic_DNA.
DR OrthoDB; 5398470at2759; -.
DR Proteomes; UP000051574; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0008061; F:chitin binding; IEA:InterPro.
DR CDD; cd00057; FA58C; 1.
DR CDD; cd00112; LDLa; 1.
DR CDD; cd19941; TIL; 4.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR002557; Chitin-bd_dom.
DR InterPro; IPR036508; Chitin-bd_dom_sf.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF386; HEMOLECTIN, ISOFORM A; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR Pfam; PF01826; TIL; 3.
DR Pfam; PF00094; VWD; 3.
DR SMART; SM00832; C8; 3.
DR SMART; SM00494; ChtBD2; 1.
DR SMART; SM00231; FA58C; 2.
DR SMART; SM00192; LDLa; 1.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 2.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 3.
DR PROSITE; PS50940; CHIT_BIND_II; 1.
DR PROSITE; PS01285; FA58C_1; 1.
DR PROSITE; PS01286; FA58C_2; 1.
DR PROSITE; PS50022; FA58C_3; 2.
DR PROSITE; PS50068; LDLRA_2; 1.
DR PROSITE; PS51233; VWFD; 4.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000051574};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..38
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 342..511
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 855..919
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 1127..1281
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1367..1510
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1816..1991
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2146..2262
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KRT80045.1"
FT NON_TER 2262
FT /evidence="ECO:0000313|EMBL:KRT80045.1"
SQ SEQUENCE 2262 AA; 251813 MW; 5D707E0453DCC90E CRC64;
LCGTFNENQK DDFLTPENDV EKSVIAFANK WKLDEKCNDV PDAIKSHPCD TNVINRAIAE
KYCKRLKAER DIFQKCHAFI DPETFYQDCL FDMCSCEGKL ATCLCPILGA YVDHCSLQSV
RDINWRNEIK ECGIHCPGGQ IYQTCGNSCT RSCYDISSRL DCKEQCVEGC NCPPGEALNE
HGECIPTGQC GCQRDGFDFR PGYNETRPGP NGPEICTCGN GRWNCVAAPP GTPLEYNKLY
DEQGKCSTKD NFEFTTCESP EPVTCKNMHN LAMVSAAVCH PGCQCKDGYV LDQQRNICVK
PSECPCHHGG KSYGENAIVQ VDCNTCTCKG GHWLCTERQC AGECSAWGDS HYKTFDGKHY
DYQGQCDYVL AKGALNSDES FVVTTQNVPC GTTGVSCSKS IKIEIGGNRE DSIVLTQGKK
FSGTQKYKHF SVRSTDLFII IEATNLGLIV QWDRGTRVYI KLQPRWKNRV KGLCGNFNEN
EADDFQTPSG GVVEASAYIF GNSWKTQATC IDPEEIIDTC EMRPDRRTWA LKKCGVLKTS
PFLACHSEVP VDSYYENCLF DTCACDQGGD CECLCTALAA YAQECNTKGV PIKWRSQELC
PIQCDERCAQ YTPCISSCPQ ETCDNLLKHN EHISRLCKED SCIEGCQPKP CPPGQVYLNT
SLSECVPRNI CKVPCMKVGD TVFYEGDVIS EDDCQTCHCL NGRESCVGVP CTTIRVETET
PPSLMGGEQV KCKSGWTEWI NKNKGPTFSE TNQGGKLVDV EPLPTSLVLN TLKGEKVFCN
QTEMAAIECR TVLGHIPAKE TGLDVECSLE NGLLCTSGTK SCVDFEIRVL CRCEKVEITT
EKVPTPTATP STVHPPECDL VTPFRPIPNN CSAYYHCVST TDGPKEILEI CEGKLLFNPV
LNRCDLADEV YKINKDCLNA SLIPCADGFI HDDCAIQCDK LCSYYQHTVV VEQKQCKKGK
KCEKGCRPIN RPDTCPPGYL WRNQYHCVSV ADCLCASQNG TAVKPGDIVQ EDECTKCQCV
NDYYGCDNTL CLTTYRQTTI TTTEKKKITQ GTTIGYTEEQ TTGVKIPLTT EGVPIGFTEE
ETTYTTGKEV ERVTTGVTET TMGTTYGEVE TSTEFLASTS VTPPAECNPA QFINLIQADR
PLPDNAFSAS SILNDDFLPH YARIEKEPIG GGSWKPSPTD EYPYLQVSLN RLTPIYGVII
KGNPVTDEYI TSYKVSYIDN VHQTFSFITS DGKTPQIFRG PINSQNPTRE IFKIPFEAKA
IRIHPVTYEH DRAMQLDIIG CSEYPLTTEQ IIRSSTEHAI ISTTTTAIGI VSIAVGTTAG
IEKETSEEET YPPEVTVPGS TERVTTPKVI IEVTTEITII PTVPVVCDDA MGLGPGHMSP
RLITQSSYLK PSTKVRYLDI HRKGAWQPYL NSPTEWVMFN FTGPRNITGM ITRGGPNGFV
QSYKLLYSNN LADWNSILDE NGEEKIFPAN VDNETPVGNY FPAPIRTTYL KLVPQTWHDN
IQLRAEPRGC YEPYKYPEVE EELPLEVCPF CPTVPVIHDM ECLCEPELYW SGEDCVKRNE
CPCVEKDGTK HKPKEPYIKY KDCLVCTCRM GGREDCEPMK CDIDCPGQVV ERTATCNCTC
KSCAPNEVYC PTSKICIDAQ KWCDGVIDCQ DDEEDCVPVT TPETLKTSEV QTTVTSPTPP
VTVPQCLKKT CPPNYELRQR TQGPLKGHYV WTKMSPNKYA PKTKSFVGYK TKVKGSGHIK
SALPYAPISD NVKQALENKC PEYYCVPPPP PLPKGNETVS CPPIVCPPGH KVEYNLLTSF
DQECPEYGCV PPPPDTTCIV DGKNVNTFDN TNYQYDICNH VLARDSIKNA WNVTLIKCDK
RGSCSQRLEI RQFEHLFVFY PDLTVGYNSY NYTPEQIEVI GSYSPLFSIS RIGNCLVFTS
EFYGFWVKWS LSSTTTIGVS EPNKGLVDGL CGYYDQKPTN DKRKPNGDVV ISTVDFGDSW
SLVEKPWEIC PPETCPAELY KEATELCSKV KDEVFSACHN VLDMDSFISL CRDKTCTCLR
SVTNNETATE NCRCEALQKF VVQCMQLDST VNVENWRGSY NCRTTCPPPL IQQDCYRRTC
ELTCDTVMNP SACPKLDDTC FPGCYCPPGF VRDGEGCVEI PTCKDCECNL KPDLQYITYD
ESNFTLNGNC VYVMSRDTLP SKESGHNFQV LITNAPCEKN SAKICVNKVT IFFAGKRIHI
FNSPYGNKLK VTVDGAYLAD FVDVAEWLGV TETKARDLIF TL
//