ID T1IGE7_RHOPR Unreviewed; 3404 AA.
AC T1IGE7;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 60.
DE RecName: Full=Hemocytin {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
OS Rhodnius prolixus (Triatomid bug).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Hemiptera; Heteroptera; Panheteroptera;
OC Cimicomorpha; Reduviidae; Triatominae; Rhodnius.
OX NCBI_TaxID=13249 {ECO:0000313|EnsemblMetazoa:RPRC015366-PA, ECO:0000313|Proteomes:UP000015103};
RN [1] {ECO:0000313|Proteomes:UP000015103}
RP NUCLEOTIDE SEQUENCE.
RA Wilson R.K., Warren W., Dotson E., Oliveira P.L.;
RL Submitted (APR-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:RPRC015366-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2015) to UniProtKB.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACPB03019359; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 13249.T1IGE7; -.
DR EnsemblMetazoa; RPRC015366-RA; RPRC015366-PA; RPRC015366.
DR VEuPathDB; VectorBase:RPRC015366; -.
DR eggNOG; KOG1216; Eukaryota.
DR HOGENOM; CLU_000204_0_0_1; -.
DR InParanoid; T1IGE7; -.
DR OMA; PQYICEC; -.
DR OrthoDB; 5398470at2759; -.
DR Proteomes; UP000015103; Unassembled WGS sequence.
DR CDD; cd00057; FA58C; 2.
DR CDD; cd00112; LDLa; 1.
DR CDD; cd19941; TIL; 6.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF401; INTESTINAL MUCIN-LIKE PROTEIN ISOFORM X1; 1.
DR Pfam; PF08742; C8; 5.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR Pfam; PF01826; TIL; 4.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 5.
DR SMART; SM00181; EGF; 4.
DR SMART; SM00231; FA58C; 2.
DR SMART; SM00192; LDLa; 1.
DR SMART; SM00214; VWC; 3.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF57603; FnI-like domain; 2.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF57424; LDL receptor-like module; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 5.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS50022; FA58C_3; 2.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
DR PROSITE; PS51233; VWFD; 4.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000015103};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DISULFID 44..54
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 62..71
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 3404 AA; 378414 MW; 9E30E4FB51E8AAFA CRC64;
CTATCISNYQ FPSGVSQISI ICHESIWKMK GAEDNVIPAC SPTCLPACQN GGTCASPNVC
RCPDDFFGIY CQYESKKCVD YPPTPMNSKK MCTEQVCSIT CANNHQFPDG STTVELYCKE
GEWVPYRNDW TSVPDCQASC DPPCQNGGCL SFNTCQCPQE FRGPQCQYSK ESCDIRRLVF
NGNYNCSGDS QALTCTLKCP SGMNFQTEPA AHYVCLYETG VFQPSDVPKC IFIKQQKVKS
LEINNQLVLD VVEIKLSTPL RNKTRGLCGP INGNGNAIYE AGASLQAFAD LARIDIIGET
CEEKIEENQA CVDISTETNA HNFCNKMLEE IRFGACLKVV NAKEFYDACK WDYCACELVG
QKNHTCGCQS IEMFVKECSN KGVQLKWWRD SSLCPMQCTG GRVYNKCGSE LSCSAPAEEV
TCEEGCFCPP GTYAHNNTCI PAAQCPCVLR QKYFLPGETV PNECNTCTCV AGTWSCTELQ
CTARCEALGD PHYITFDRKY YDFMGKCSYY LLQHSNYSVA AENVPCDGSI SENMGFSVFA
LSTTPSCTKS VTVTIGDVVI KLKQGREVTV NNKDEPLPIL LKQAYIREAS SIFVQVELFD
GVEIWWDGIT RVYIDVPPEL KGKTKGLCGT FNGNQKDDFL TPEGDVEQDA VAFANKWKTN
ENCESVSEEQ IDPCQKNHQK RDSAKRLCSW LHSSLFLDCH FYVDPSVYYK NCLYDMCSCN
KKTSDCLCPL IAAYAKECSR QGVPINWRTS VRECGVQCPR GQQFHTCVDS CARSCQDLSR
EKKCSSHCVE SCTCPAGLIF DHKGHCVPIS SCPCVWNGLQ YPAGHKEVRR SSKGTQFCTC
ENARWKCNQA SADDMKKYPN NTAEELACSA SKHQVYTHCE PSQPVTCRNM HKSHSSREQS
EAICYGGCVC KKGYVLDSIS GACVRPDECP CHHGGKSYND GDTIQDDCNT CTCTSGKWSC
SEHICPSVCS TWGDSHFITF DNHIYDFQGT CEFVMAKGSL SSAEEDCFSV VLEMVSCGSS
GISCPKSLTF TVGSGDKKEH VVFSDGEATS SKARLERTSG LFMFAEVADL GLVLQWDRGT
RVSLRADPKW KNKLRGLCGN YNDNSNDDFQ TPSGGISEVS SQVFADSWKI FDYCPESQPI
TDTCKLYPNR KPWAVEKCSI LKSHLFKPCH AEVSVDPFMA RCIFDSCGCD QGGDCECLCS
AIAAYVQQCN AHGVFIKWRS QQLCPMQCEE SCSEYQACVP TCPPETCDNL LDHSRISELC
KQDVCVEGCK YKNCAEGEVY LNSSLKQCVP KSECKPECLK IGNQTYYEGD FIEGDACHSC
YCSRHQKTCT GQPCASTTLL PETTFTTMAT TTTTTEKPTT IATKPWRDME ETCVPGWSDW
LNKHHPVPGK TTLIFSLGTY SIYALIYLFC QFNLQMQTVR CPIDMIKDIK CRTVDTLQSA
KETGEQAECS LERGLYCEGG CHDYEISVYC QCATTETSTV QQTLSEAPTT LPTTTTVSEK
GCLDGEEWDD CAINCDQLCS YYEHIAYKNG ICLFGQKCTP GCRREDRPHC PPGHRWRDLN
TCVKQEDCTC RSLTGLMIKP GTVVNESECE SCQCLNNHYT CDSSLCVTTL PSVENITKIQ
QVHTDIAIMP TEVTTPPQKC PDSNNFYDKH SGYPSKSLVM QQSYLRTAVA AMVDRPTTSN
FTFTVVSKKF NGKLTKLIDS RFIPLLKNVP DDSFTASSTG TNSAPEKARL TSTGAWRPHV
DDKEPFLQID LGTVDIVYGI TISGSPEDNE FVKSFYILYS VNNATYNYVA FMGMPELFKG
PLTNTDRETI TFQTPVEARY IRINPTSWTG APAVRIEILG CDLTLIETTT LAPTMSTLPG
KKLCFCLDDM SVSMRDSQIS VSSISQGTIT AAKISLSGDS GWTPILSDKN QWLQFDFLGE
RLLTGIITSG GGPASVSSAN ESPAWVISYT IKYSADHKEW NPLTDDNGGL HKFPGNVNNF
DKVTHYFKHP IKARFLRILP ADWHNKILMR IQILGCYENY PELTTEIPEV ETLLPTDCNV
CPGVAVTSDI CHCPPTEWWD GEICTQRTQC SCFVGFIRYP VGTVYDNEFC ERCTCGLNGL
AQCAIKECPP CSEGQRGELT PNCNCICKPC PTDTRLCPTS QVCIPLSSWC DGIYDCPDDE
VNCPTTPITT LAPLTTLTPL TTLAPPTKCP EIECPEGFIL NMTAVEIPEK PKFGDTTWSP
KTSFSKYPGH GTKTYSKTLP EKLVNEKSKC PEYRCVPIEK ETIKNCTKPK CLPGYDLVTK
ASESPDECPR YTCSLKEIPL PDGRCNVTGR TFTTFDGTEY KYDICDHILA RDRIHKTWSV
RQAKKCPYIG PCKRFLLINF GNHSLQFNTD LTVMYNGYTY TIPQVPGKSQ GKVDGLCGFF
DNNMANDKTK ENGQLAKTTV EFGDSWMQPG AFCETVVCPT NVQKEAWKMC KAAIMEEPLS
KCGLEVNLDK LLSQCVESIC LCMQSSNSSE DCRCQALLEI VTECQVKMPR LDLSIWRVEH
DCPVQCPPNL VYKECFKRIC EPCCAELMVS DACPETEECF PGCYCPDGYI RSGEQCIKPT
ECRDCQCDGY GGSSRFVTFD RMDFTFKGNC TYTLAQTIEK TSGPKFSALI TSTGCRDNKE
EICLRMITLL YKEHSVTVKI DGEIVTERPY SNKWMKVVEK HGKCVTVSLT KINVDLEFFN
QGNGFNLRLP SHLYANRTEG ICGKCNLNGT DDFKMKNGTI TDDTKAFGES WLVNDMPEVI
GKEESCHVEK EVECLPPPEH QDPCLKVLNE EVFGKCHPVV DPQHYVDNCH NALCNGASIG
CRELEAYARD CQNEGICLDW RSTSLCPYKC PDGLEYKACG LGCTETCDNY EQFRTNPELC
TSPRGDTCVC PEGKVLKNDT CVLETRCVPC DTEGHYPGDK WNPDTCTECT CSKNNVNCHR
LQCTKESGSI CERGFKSVVM VGTEDQCCPQ YICVPEPTAG PVCPELQQPE CGYGQVMKLE
KTASGCQEFI CQCMPPSECP SVEEEMNKSR PAGMVASVDK SGCCPKVTVD CKPETCSEPL
PCPPFHEHVK KETEHCCPEY KCAIPERKCL YGFEYIEDSE KGGERLRKPS ERFTELKSVG
DRWSDGPCRL CECKESSTSA SCISKVCSTP PQSEDYVYVP EVVNGKCCPI YKRRACKQGQ
QEYEVGSIWP SSDGNPCINL TCTLGPNGEI TKQESVETCK KNCKEGWKYV EPVEMSNVCC
GECKPQGCLV DGIVHEKDST WMSEDNCTTY NCFIDNDDGM QVVASTEKCP FINDCPAERI
YQDKCCKKCN STLIEDKKVC TIEAVPLNET VGIVKLYHEK HGGCINHGHI LGFNECQGTC
ESYTQYDPTT GRHESKCLCC KVKEVDTVAV ALTCDDGFLL EKEV
//