ID A0A0L7RB41_9HYME Unreviewed; 2325 AA.
AC A0A0L7RB41;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE SubName: Full=Chondroitin sulfate proteoglycan 4 {ECO:0000313|EMBL:KOC68137.1};
DE Flags: Fragment;
GN ORFNames=WH47_03295 {ECO:0000313|EMBL:KOC68137.1};
OS Habropoda laboriosa.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea;
OC Anthophila; Apidae; Habropoda.
OX NCBI_TaxID=597456 {ECO:0000313|EMBL:KOC68137.1, ECO:0000313|Proteomes:UP000053825};
RN [1] {ECO:0000313|EMBL:KOC68137.1, ECO:0000313|Proteomes:UP000053825}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=0110345459 {ECO:0000313|EMBL:KOC68137.1};
RA Pan H., Kapheim K.;
RT "The genome of Habropoda laboriosa.";
RL Submitted (JUL-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00122}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ414617; KOC68137.1; -; Genomic_DNA.
DR STRING; 597456.A0A0L7RB41; -.
DR Proteomes; UP000053825; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45739:SF8; TNFR-CYS DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF16184; Cadherin_3; 12.
DR Pfam; PF02210; Laminin_G_2; 2.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR PROSITE; PS51854; CSPG; 9.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000053825};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..2325
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005575256"
FT TRANSMEM 2176..2197
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 31..203
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 204..389
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REPEAT 432..527
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 560..652
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 671..766
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 786..881
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1019..1111
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1127..1216
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1238..1340
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1470..1560
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1801..1899
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KOC68137.1"
SQ SEQUENCE 2325 AA; 261091 MW; 6A37C4A85CDE3606 CRC64;
RRSLMFLNDM GLLMFLAVTA SLLGYCQTDE KVSFYGASYI HLPVQEAKGA TDISFRFRTH
LADAMLLLAV GKTDYCLIKL EAGRLKVHIN LGAGESEMAS ARGLTLNDLS WHEVNLTRRQ
ANITLQIDVI HTTRSLLPGR FFELNIHYGV YIGGQGDFNE LFLGHTDYLR GCMADIIYNS
ARVIEYAKSR KGQSEATAVT WGCSPEFDAT RSTEVSFVED GAFTAIPRPI PRSGSKWKFE
LKTGAETGLL LYNTGQSSYA DYLGIELFEG KIRLLINKGN GATELIHGTP VADGKWHRVL
VDFNPSGIGI TVDHQEKTMT LPSGGNRYLD LADTLYIGGT ELNKRARALT KGLKSGDVSY
KGCLRNMLMD NKELGLPDVK ISQGIVVGCV WGFPCIEADP CVSEAACSQL GVNSFSCDCD
QPVYSKVNLP VNLEILSLSP LLVSEGEHVP VTSDNIAMVL DIAKYGVEED GVIFTLVTPP
TYGNLALDLL TTRTEHSFTL QDVNRDKIQY MHDGSETTED SMILEVTLVA GAGYTLPGYL
QGRLRFPLHV NVTPVNDPPL LEISTAKVLR LAQGTRKILT KELIWAIDAD TPSDMLVYTV
LRTDADAGYV ERVTDPSQPI DTFTQAELMQ GLIAYVHRGN AKPNAKLDLQ VSDGIENSQQ
ASLRVAAHPL EMKLMHNTGL VVVHRSYSYL TPANLSFTTN SDDSSIDIRY DIVSQPQFGT
IQKLKDVSSS WMNMDHFTSK DVEMHTIRYL HNEGSPNQDE FKFQASVREV KTQHTYDFRI
TFIDLELKET KRVPINFTNV AEVVVSGQNL RYQTNPLVTA FNKIVFTVTT GPRYGNLFLS
SRKMEIGDTF TQEDVDSGKL RYRLFKRAYS TILDEFGFKV SAPQCIDLHS ILKFRHYLSK
NMKPLDSVET LRVDEGSRIS VRILRTNPRD YGVTSLTYNL TIEPHHGWLT VTNNSRSPSR
NNTSYFTSEE LSSQRVYYVH DDSETKEDSF QFVAIANDAV DFMYVGLFRV EVTMKNDNAP
ERVIHKVYHV VSRGERLLTS KDLAYIDKDI DTKPSELIYT RRDTQKNGIY RVTNPSMQVH
EFSQQDIDDG QILFKHQGED HGKFEFGVTD GHFYTAGVLE IQASPPYVRL RESNGSVVQF
NRSVALRPSE LDIETNVYTS DKDIKYTVLE KPKHGVLLKH GRETNTFTEE NLRYGILLYK
HLGGSLAKDD FKFKVSTKGA ETEGVFFIKI YPESYWQPLI VQNNKTVFVE EATSILLSRK
SLEIMHPKIP ATEIVYFLKE WPQNGYLELQ IHDEHSDETR EDYIGNAVKS FDQSMINEGR
VFYVQSVINQ TNDKFVVDVT NGITLLRDQS VNFVIVPEKL YVEAKGLVVV EGKSTILEET
NFTILTPYYS GKVTDYRITE KPKHGVIIES TKNSQVKKFS QKHLTTGTIL YKHNGDEFSK
DSFKMVLIAG DKTSEPFNVW VTVQPVNDEV PVLVNRTKLI VWQGGSTTLT PESLAAVDND
TTAHDVTFNV TGVKNGFISL KSSPEVDIYN FTQEQIDQSK VIFTHTNGSD AEFNFVLYDG
VHTTESYNIM VATKPVRLTT ERNAALNVFP LTRKMISSKL LLTKCLDETR EIKYIVRNGP
HLGKIIMETN EGAWLNVERF TQRDVNNSKV FYEHTKQFMD LAANDSFTFD VFQIDISVSS
GGLDRYISVR NVRVEEGGSA QVIMNISGIV SFLQTHAGIE NPAVLSRLVS QPSHGHVMIL
PDLNVTTFSQ PQIEGGKIAY YHDHSDTLED RINFSLYLTP GHILLCNTSI PVIIEPVNDE
PFKLITNAPS ITVVQNQNQT ITRENLLTID PDTPPEEIMY DVISVPTYGR LLLLPFNENI
SEVRQVNKFT QYDVDSNRLV YEHNGPLQAA SFYFRVWDGR FNPTYIVFNV YVLPIRLNVT
VPGPVSLQQG SNVALISESN VKLDTNARQD LVIYEVTITP KYGVLYVRDG AAASFKQTDL
LSKSVMFMQT DMTVSNDSLE LTAKLSGFEQ RHIRIEIKVV PLMIMNPMIA LAGEKTRITL
QYLDATPLAK LTTSNPVYTI IRKPKFGKIK RIIRSSSSSG EKRGTREKEV IRFSHQEVMS
GVIYMVCRKI PTMEYEGVPD SFAFVLAASI FQPAIGEFEF RVKLDIDEYN ITLGGPMDPV
GHEGEMAIAP NMSNDYLLIL GMLLGVFLLG VLVIITIRCR HNRYKHAEEE KPEATPAVGV
MPLPRPPDHL MPVTPHVKRF ANDHNSVAAS TPLPTMTSTL PQCKVIPLSP LESITGSEVD
VSAKYPYGVA DGDEWSSFDT SDLPCQSATT QRTNNPLLRR NQYWV
//