Database: UniProt
Entry: A0A0T6AYV5_9SCAR
ID   A0A0T6AYV5_9SCAR        Unreviewed;      2262 AA.
AC   A0A0T6AYV5;
DT   17-FEB-2016, integrated into UniProtKB/TrEMBL.
DT   17-FEB-2016, sequence version 1.
DT   24-JAN-2024, entry version 29.
DE   RecName: Full=Hemocytin {ECO:0008006|Google:ProtNLM};
DE   Flags: Fragment;
GN   ORFNames=AMK59_6836 {ECO:0000313|EMBL:KRT80045.1};
OS   Oryctes borbonicus.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Coleoptera; Polyphaga; Scarabaeiformia;
OC   Scarabaeidae; Dynastinae; Oryctes.
OX   NCBI_TaxID=1629725 {ECO:0000313|EMBL:KRT80045.1, ECO:0000313|Proteomes:UP000051574};
RN   [1] {ECO:0000313|EMBL:KRT80045.1, ECO:0000313|Proteomes:UP000051574}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=OB123 {ECO:0000313|EMBL:KRT80045.1};
RC   TISSUE=Whole animal {ECO:0000313|EMBL:KRT80045.1};
RA   Meyer J.M., Markov G.V., Baskaran P., Herrmann M., Sommer R.J.,
RA   Roedelsperger C.;
RT   "Draft genome of the scarab beetle Oryctes borbonicus.";
RL   Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the thrombospondin family.
CC       {ECO:0000256|ARBA:ARBA00009456}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KRT80045.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LJIG01022538; KRT80045.1; -; Genomic_DNA.
DR   OrthoDB; 5398470at2759; -.
DR   Proteomes; UP000051574; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR   GO; GO:0008061; F:chitin binding; IEA:InterPro.
DR   CDD; cd00057; FA58C; 1.
DR   CDD; cd00112; LDLa; 1.
DR   CDD; cd19941; TIL; 4.
DR   Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR   Gene3D; 2.10.25.10; Laminin; 3.
DR   InterPro; IPR002557; Chitin-bd_dom.
DR   InterPro; IPR036508; Chitin-bd_dom_sf.
DR   InterPro; IPR000421; FA58C.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR036055; LDL_receptor-like_sf.
DR   InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF386; HEMOLECTIN, ISOFORM A; 1.
DR   Pfam; PF08742; C8; 3.
DR   Pfam; PF00754; F5_F8_type_C; 2.
DR   Pfam; PF01826; TIL; 3.
DR   Pfam; PF00094; VWD; 3.
DR   SMART; SM00832; C8; 3.
DR   SMART; SM00494; ChtBD2; 1.
DR   SMART; SM00231; FA58C; 2.
DR   SMART; SM00192; LDLa; 1.
DR   SMART; SM00215; VWC_out; 2.
DR   SMART; SM00216; VWD; 2.
DR   SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR   SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 1.
DR   SUPFAM; SSF57424; LDL receptor-like module; 1.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 3.
DR   PROSITE; PS50940; CHIT_BIND_II; 1.
DR   PROSITE; PS01285; FA58C_1; 1.
DR   PROSITE; PS01286; FA58C_2; 1.
DR   PROSITE; PS50022; FA58C_3; 2.
DR   PROSITE; PS50068; LDLRA_2; 1.
DR   PROSITE; PS51233; VWFD; 4.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000051574};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          1..38
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          342..511
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          855..919
FT                   /note="Chitin-binding type-2"
FT                   /evidence="ECO:0000259|PROSITE:PS50940"
FT   DOMAIN          1127..1281
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          1367..1510
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          1816..1991
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          2146..2262
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KRT80045.1"
FT   NON_TER         2262
FT                   /evidence="ECO:0000313|EMBL:KRT80045.1"
SQ   SEQUENCE   2262 AA;  251813 MW;  5D707E0453DCC90E CRC64;
     LCGTFNENQK DDFLTPENDV EKSVIAFANK WKLDEKCNDV PDAIKSHPCD TNVINRAIAE
     KYCKRLKAER DIFQKCHAFI DPETFYQDCL FDMCSCEGKL ATCLCPILGA YVDHCSLQSV
     RDINWRNEIK ECGIHCPGGQ IYQTCGNSCT RSCYDISSRL DCKEQCVEGC NCPPGEALNE
     HGECIPTGQC GCQRDGFDFR PGYNETRPGP NGPEICTCGN GRWNCVAAPP GTPLEYNKLY
     DEQGKCSTKD NFEFTTCESP EPVTCKNMHN LAMVSAAVCH PGCQCKDGYV LDQQRNICVK
     PSECPCHHGG KSYGENAIVQ VDCNTCTCKG GHWLCTERQC AGECSAWGDS HYKTFDGKHY
     DYQGQCDYVL AKGALNSDES FVVTTQNVPC GTTGVSCSKS IKIEIGGNRE DSIVLTQGKK
     FSGTQKYKHF SVRSTDLFII IEATNLGLIV QWDRGTRVYI KLQPRWKNRV KGLCGNFNEN
     EADDFQTPSG GVVEASAYIF GNSWKTQATC IDPEEIIDTC EMRPDRRTWA LKKCGVLKTS
     PFLACHSEVP VDSYYENCLF DTCACDQGGD CECLCTALAA YAQECNTKGV PIKWRSQELC
     PIQCDERCAQ YTPCISSCPQ ETCDNLLKHN EHISRLCKED SCIEGCQPKP CPPGQVYLNT
     SLSECVPRNI CKVPCMKVGD TVFYEGDVIS EDDCQTCHCL NGRESCVGVP CTTIRVETET
     PPSLMGGEQV KCKSGWTEWI NKNKGPTFSE TNQGGKLVDV EPLPTSLVLN TLKGEKVFCN
     QTEMAAIECR TVLGHIPAKE TGLDVECSLE NGLLCTSGTK SCVDFEIRVL CRCEKVEITT
     EKVPTPTATP STVHPPECDL VTPFRPIPNN CSAYYHCVST TDGPKEILEI CEGKLLFNPV
     LNRCDLADEV YKINKDCLNA SLIPCADGFI HDDCAIQCDK LCSYYQHTVV VEQKQCKKGK
     KCEKGCRPIN RPDTCPPGYL WRNQYHCVSV ADCLCASQNG TAVKPGDIVQ EDECTKCQCV
     NDYYGCDNTL CLTTYRQTTI TTTEKKKITQ GTTIGYTEEQ TTGVKIPLTT EGVPIGFTEE
     ETTYTTGKEV ERVTTGVTET TMGTTYGEVE TSTEFLASTS VTPPAECNPA QFINLIQADR
     PLPDNAFSAS SILNDDFLPH YARIEKEPIG GGSWKPSPTD EYPYLQVSLN RLTPIYGVII
     KGNPVTDEYI TSYKVSYIDN VHQTFSFITS DGKTPQIFRG PINSQNPTRE IFKIPFEAKA
     IRIHPVTYEH DRAMQLDIIG CSEYPLTTEQ IIRSSTEHAI ISTTTTAIGI VSIAVGTTAG
     IEKETSEEET YPPEVTVPGS TERVTTPKVI IEVTTEITII PTVPVVCDDA MGLGPGHMSP
     RLITQSSYLK PSTKVRYLDI HRKGAWQPYL NSPTEWVMFN FTGPRNITGM ITRGGPNGFV
     QSYKLLYSNN LADWNSILDE NGEEKIFPAN VDNETPVGNY FPAPIRTTYL KLVPQTWHDN
     IQLRAEPRGC YEPYKYPEVE EELPLEVCPF CPTVPVIHDM ECLCEPELYW SGEDCVKRNE
     CPCVEKDGTK HKPKEPYIKY KDCLVCTCRM GGREDCEPMK CDIDCPGQVV ERTATCNCTC
     KSCAPNEVYC PTSKICIDAQ KWCDGVIDCQ DDEEDCVPVT TPETLKTSEV QTTVTSPTPP
     VTVPQCLKKT CPPNYELRQR TQGPLKGHYV WTKMSPNKYA PKTKSFVGYK TKVKGSGHIK
     SALPYAPISD NVKQALENKC PEYYCVPPPP PLPKGNETVS CPPIVCPPGH KVEYNLLTSF
     DQECPEYGCV PPPPDTTCIV DGKNVNTFDN TNYQYDICNH VLARDSIKNA WNVTLIKCDK
     RGSCSQRLEI RQFEHLFVFY PDLTVGYNSY NYTPEQIEVI GSYSPLFSIS RIGNCLVFTS
     EFYGFWVKWS LSSTTTIGVS EPNKGLVDGL CGYYDQKPTN DKRKPNGDVV ISTVDFGDSW
     SLVEKPWEIC PPETCPAELY KEATELCSKV KDEVFSACHN VLDMDSFISL CRDKTCTCLR
     SVTNNETATE NCRCEALQKF VVQCMQLDST VNVENWRGSY NCRTTCPPPL IQQDCYRRTC
     ELTCDTVMNP SACPKLDDTC FPGCYCPPGF VRDGEGCVEI PTCKDCECNL KPDLQYITYD
     ESNFTLNGNC VYVMSRDTLP SKESGHNFQV LITNAPCEKN SAKICVNKVT IFFAGKRIHI
     FNSPYGNKLK VTVDGAYLAD FVDVAEWLGV TETKARDLIF TL
//