ID A0A0L7RFV0_9HYME Unreviewed; 2283 AA.
AC A0A0L7RFV0;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 {ECO:0000313|EMBL:KOC69725.1};
GN ORFNames=WH47_07912 {ECO:0000313|EMBL:KOC69725.1};
OS Habropoda laboriosa.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea;
OC Anthophila; Apidae; Habropoda.
OX NCBI_TaxID=597456 {ECO:0000313|EMBL:KOC69725.1, ECO:0000313|Proteomes:UP000053825};
RN [1] {ECO:0000313|EMBL:KOC69725.1, ECO:0000313|Proteomes:UP000053825}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=0110345459 {ECO:0000313|EMBL:KOC69725.1};
RA Pan H., Kapheim K.;
RT "The genome of Habropoda laboriosa.";
RL Submitted (JUL-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ414601; KOC69725.1; -; Genomic_DNA.
DR STRING; 597456.A0A0L7RFV0; -.
DR Proteomes; UP000053825; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00033; CCP; 9.
DR CDD; cd00054; EGF_CA; 6.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 11.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.10.50.10; Tumor Necrosis Factor Receptor, subunit A, domain 2; 2.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR003410; HYR_dom.
DR InterPro; IPR001759; Pentraxin-related.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR011641; Tyr-kin_ephrin_A/B_rcpt-like.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR19325; COMPLEMENT COMPONENT-RELATED SUSHI DOMAIN-CONTAINING; 1.
DR PANTHER; PTHR19325:SF567; LOCOMOTION-RELATED PROTEIN HIKARU GENKI; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF07699; Ephrin_rec_like; 2.
DR Pfam; PF02494; HYR; 2.
DR Pfam; PF00354; Pentaxin; 1.
DR Pfam; PF00084; Sushi; 8.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00895; PENTAXIN.
DR SMART; SM00032; CCP; 11.
DR SMART; SM00181; EGF; 7.
DR SMART; SM00179; EGF_CA; 6.
DR SMART; SM01411; Ephrin_rec_like; 3.
DR SMART; SM00159; PTX; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 11.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 6.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 6.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS50026; EGF_3; 6.
DR PROSITE; PS50825; HYR; 2.
DR PROSITE; PS51828; PTX_2; 1.
DR PROSITE; PS50923; SUSHI; 11.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000053825};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..2283
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005575315"
FT DOMAIN 99..276
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 401..461
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 475..532
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 533..599
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 598..680
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 681..765
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1168..1204
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1206..1247
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1249..1285
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1287..1323
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1324..1365
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1367..1403
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1612..1674
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1675..1732
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1733..1800
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1801..1887
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1926..1985
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2107..2165
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2166..2228
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2229..2282
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DISULFID 403..446
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 432..459
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 503..530
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1194..1203
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1237..1246
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1275..1284
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1313..1322
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1355..1364
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1393..1402
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1703..1730
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1771..1798
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1956..1983
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 2199..2226
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
SQ SEQUENCE 2283 AA; 255759 MW; 5F1B9AD5F80719BD CRC64;
MIGRMARIRV LTLLFSCLLI IVSLDAITRF DEQLNFRRID ETAASSLLDF KDKRSTTSQR
GLNASLRNVE KTFKKKAATL SRLLKIHIDY LRNDTDRVEL VFLVDASGSV GLKNFQSELN
FVKNLLSDFS VEPSATRVAI VTFGGKRNIR RNVDQISRVG ENDNKCYLLN RQLNNINYTG
GGTYTRGALL EALMILEKGR SDAKKAVFLI TDGFSNGGDP RPAANLLKDA GATIFTFGIR
TGNVDELHNI ATFPGYTHSY FLDSFAEFEA FVRRALHRDL KTGKYMPVTF PDNCNLLCRN
ISEVGNEWNC CDNFATCACG TATGHYACIC PTGYFGSGFN GSCHPCPNGT YASGEISGDF
TSMCISCPDV NHVTIKVPAT SVEHCVCASG FTTDRNKCEV ITCPRLRVPD NGYLVKASAC
SNVVHAACGI RCRIGFYLTG DSIRLCGKDG NWSGNEPRCL LKTCPPLRAP AHGRMRCQNE
EDEYLITENS TAYPIDARCQ FKCETGYQLR GSKFRNCLPL SQWDGLKATC KAIRCEPLKK
VSHGEVSPEI CSGPEKVPFA TNCTIRCRNG FVLEGPRTKF CGGRSGVWSQ RRTINRCIDE
TPPLVTCPSD IITENLPGKN YAYVNWTTPV ASDNADELPT LWSKPYVNFP WRVKIGTRTV
VYVAQDSSGN KARCKFKVKV LDTESPMIEN CINPPTLFTD NGLGLSNVSW SEPGFYDNSK
TPVRVEQTHR PGENIFPIGL TKVVYNAIDK YDNKATCVLN VTIKDVCEEV TNVLHGYSKC
SSTLENGTNE CTVACEEGFG FAVEEEPNVH IVEDILLLKC NGNSSNWTED NYIPDCSEST
FPKSVSQEGS IVLEGNGTDV CDNETTLREL SEYITADLRT TLLDICGNDI ECNLITFDPE
CEQSPLQLGT VYSNSLRKRR ELRSGDTSAI ELFVNDQVRC ARLKRHAKVN DQSVDKTKVK
MKKKKEKIGI KFKFLAKIIE ENIENPREGI QKLRQKIESL SQSGKLNLLN NKTNQLIAQL
ALNLHLVFKN FQELCDPGSV LKKHTCVKCP LGTFFNSSIK RCQPCPIGEY EDTTGSLKCK
RCPEHTSTRK VHSKFSRDCI HLCKPGYYSQ KKRHQSTRFA LEPCLTCDIG FYQPEFGQTQ
CFSCPANSTT SNRGTKGVND CLLLNNIERN MCNAKSCLND GQCVEEEDSF SCECLDYYVG
SKCESFQNPC DSSPCLNEGT CNMQSFANST ISYVCTCKNG YSGSNCEVYI DECTINPCQN
NGTCVSTESD YTCECKDGFE GEFCESTINH CEPSPCMEGS TCRNINETWQ CFCKPGFLGR
YCNLLPCDWL PCSGNSYCIN IEEENATIMS YKCECIDGYT GANCTIKVDY CESQPCLNKG
KCLNSVSNYT CTCPVPFTGR DCETGKYKLS SDYVIHFSKS GTTDYVSMKG AIDNLSQLTA
CLWLQSKDTF NYGTVLSYAT RYYDNAFTLT DYNGLVIYVN GHKVVTDIKV NDGHWHFICF
TWEAENGFWN IFIDGILRDN GTQLAKRSTI QGNGTLVIGQ EQDRMGGGFS ESESFLGKLT
LLDIWSTVLA GTDVNHLFST CEKYHGDVIA WSQVQEHVHG DTVVIIVSPF CHGCPLPVVP
FKGNINVSED ASKITYYCEN GYVVRYHNKE YRSLKVKCLK QGQWEGYYTP VCTRRRCGFP
GYFPRGRVQG RSYLFGDKIH YSCLPGYELR GNPRRVCNAD GKWSGLPPIC IGKTCKNLLA
PEHGDIEYVI EEYERDDLSI LQVGQQIEFM CELGFRLIGE KYLTCMETGL WDHERPTCVA
YGCPPPKQIE HGYIISASSN NLHETSDTNF SVETSNDTSE KSYDFNNVVG YSCYSGYKFR
GNNFLKEFKL QCTENGTWIG FVPDCIPLKC SWPTFVNNGM LLFKTQDNDT IELFERKSSS
NDTNIGSTSE ESDYDFKENQ DLENQFVAGS HISIKCNVGY KLVGDGVRIC TDNEEWSSPL
SSCEPQECFI LNHPFFQILN KKPSLDNHTT IWGNNQNKEL DGKEQYNGFY KNLEYFVEGH
TYKKKIVLSC KNSDEIKLNN ENIHKSVSNL TWFCNKHSQW EIIDIQLNNT IMAALFNNEI
DNICQKSMCA EITVPEQSYI VEKNYSRSIN STIMFKCNQG YILKGNEKSV CLPNNTWSAI
PSCKPVTCGK PPKLANAVIK EDSVETVIFT FGNTITYECI PGYVMFGQSN VKCLANGKWS
RTYSRCSRLS CNKPKLPPGA TIRGRSYLYQ DQLMYICPGG KKQGLITCKA DGHWSDSPKC
NGY
//