ID A0A0V1NLY5_9BILA Unreviewed; 987 AA.
AC A0A0V1NLY5;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE SubName: Full=Chondroitin proteoglycan 1 {ECO:0000313|EMBL:KRZ85033.1};
GN Name=cpg-1 {ECO:0000313|EMBL:KRZ85033.1};
GN ORFNames=T08_12443 {ECO:0000313|EMBL:KRZ85033.1};
OS Trichinella sp. T8.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=92180 {ECO:0000313|EMBL:KRZ85033.1, ECO:0000313|Proteomes:UP000054924};
RN [1] {ECO:0000313|EMBL:KRZ85033.1, ECO:0000313|Proteomes:UP000054924}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS272 {ECO:0000313|EMBL:KRZ85033.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRZ85033.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDM01000153; KRZ85033.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0V1NLY5; -.
DR STRING; 92180.A0A0V1NLY5; -.
DR Proteomes; UP000054924; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0008061; F:chitin binding; IEA:InterPro.
DR Gene3D; 2.170.140.10; Chitin binding domain; 8.
DR InterPro; IPR002557; Chitin-bd_dom.
DR InterPro; IPR036508; Chitin-bd_dom_sf.
DR PANTHER; PTHR23301; CHITIN BINDING PERITROPHIN-A; 1.
DR PANTHER; PTHR23301:SF95; LD43683P; 1.
DR Pfam; PF01607; CBM_14; 8.
DR SMART; SM00494; ChtBD2; 9.
DR SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 8.
DR PROSITE; PS50940; CHIT_BIND_II; 9.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000054924};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..987
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5006883484"
FT DOMAIN 29..88
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 96..156
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 160..223
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 244..304
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 320..384
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 403..462
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 475..534
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 559..616
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 671..731
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
SQ SEQUENCE 987 AA; 109853 MW; 64CB3D68256AFAEB CRC64;
MIDATAAFAL LLSLLASLSC CCCSASQTEF NCSYISDGKY SHPDQPCSPI FYHCFQQVTT
IMKCPDLLYF DVLQETCLPR KEVQACFNES IPEESALDCK ILADGQYPDV AHNCSRKYYK
CLAGVATSFT CPGEYLYYDA EEKQCVPYTE IEDCSSLTIS VDCTKLLSGD AKRHSIADNA
CSEFFINCEN NNFGKIAKCQ DGLFFNPTKK SCDLNSNIGV CNISSLHSKT DHVHSTAISN
FNSYSTCSNM TEGMNPIARN VCSKFYIYCQ QNSTPILMKC SYEFYFDSDQ NRCVHKKVSK
NCNIWSGYKN ETEQSFINFP VKCSDQMVVG KHAISDDGCS QFYIECENDS DFPIATLKKC
KDTLLFDPKN KNCTREEKIT SCQLELPTST TEQNFYSGNQ TDEFDCHGIG DGNYSISGKG
CFNFYYQCID EVAFKLQCSS GLYFDPVSRI CSPYENIAYC NQESNLPSKI SMKETNFCSP
LVNENYPDPM QNCSSKYYTC FNGYLVQRHC EFGKYYDVQS DKCDLFRMVP ACSRFGRSNN
LRILSTEATP TTASLPTLSF NCENLPDGNW AASPCKPYYF ACVGGFSFMQ PCPPGTYYDP
DTDQCNYKSA IPLCGEIVEW ISVSPLSTTD QPAATATPAP TIITATTTTT TDATTAAATS
PATNYIYSTL SVDCKNLPNG YYADPRNSCS KIYFLCYNGN DYMFYCVGAR LFYDPDNGIC
DDEAHVVACG GSARGIAASA SPFHQRSTVE TTQQPTRAPI RTLKTTHLPN VIRTRKLTNR
IINPTQSATG EGRRTSSSST ATTKSINIPT RAMIRTTQRP RVVRTRKTVH PWRIITTIKP
THLSSVDSFS TMAMTETTPS PLITASAHAN TSPRAVVKPF VVSVPEIGRV TKAPFGSNQQ
TKQTNSVQSQ HSFSFKTKTS FQNSVFTGHN NTVISRGNNW ISIFQRELSA DGSVENSRYF
LRTRQRAKPV RLGTRKPKNR ATSVTQI
//