ID A0A0V1AST2_TRISP Unreviewed; 992 AA.
AC A0A0V1AST2;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE SubName: Full=Chondroitin proteoglycan 1 {ECO:0000313|EMBL:KRY27857.1};
GN Name=cpg-1 {ECO:0000313|EMBL:KRY27857.1};
GN ORFNames=T01_5466 {ECO:0000313|EMBL:KRY27857.1};
OS Trichinella spiralis (Trichina worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6334 {ECO:0000313|EMBL:KRY27857.1, ECO:0000313|Proteomes:UP000054776};
RN [1] {ECO:0000313|EMBL:KRY27857.1, ECO:0000313|Proteomes:UP000054776}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS3 {ECO:0000313|EMBL:KRY27857.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRY27857.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDH01000231; KRY27857.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0V1AST2; -.
DR STRING; 6334.A0A0V1AST2; -.
DR InParanoid; A0A0V1AST2; -.
DR Proteomes; UP000054776; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0008061; F:chitin binding; IEA:InterPro.
DR Gene3D; 2.170.140.10; Chitin binding domain; 7.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR002557; Chitin-bd_dom.
DR InterPro; IPR036508; Chitin-bd_dom_sf.
DR PANTHER; PTHR23301; CHITIN BINDING PERITROPHIN-A; 1.
DR PANTHER; PTHR23301:SF0; LP10853P; 1.
DR Pfam; PF01607; CBM_14; 7.
DR SMART; SM00494; ChtBD2; 9.
DR SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 8.
DR PROSITE; PS50940; CHIT_BIND_II; 9.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000054776};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..992
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5006874642"
FT DOMAIN 29..88
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 96..156
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 160..223
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 244..304
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 320..384
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 403..462
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 475..534
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 562..619
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 676..736
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
SQ SEQUENCE 992 AA; 110149 MW; 4AE3CAD48B279043 CRC64;
MIDETAAFAL LLSLLASLSC CCCSASQTEF NCSYVSDGKY SHPDHPCSPV FYHCFQQVTT
VMKCPDPLYF DVLQETCLPR KEVQACFNES IPKESALDCK ILADGQYPDV ALNCSRKYYK
CLAGVATSFT CPGEYLYYDA EKKQCVPYTE IEECSSLSIS VDCTKLLSSD AKRHSIADNA
CSQFFINCEN NNFGKIAKCQ DGLFFNPTKK SCDLNSNIAV CNVSSLHSKT DYAHSTAISN
FNSYSTCSNM TKGMNPIALN VCSKFYIYCQ QNSTPILMKC SYEFYFDSDQ NRCVHKKVSK
ICNIWSGYKN ETEQSFINFP VKCSDQMVVG KHAISDDSCS QFYIECENAS DFPIATLKKC
KDTLLFDPKN KNCAREEKIT SCQLELRSST TEQSFYSENQ TDEFDCHGIG DGNYSISGKG
CFNFYYQCID EVAFKLFCSS GLYFDPVSRI CSPYENIAYC NQESNLPSKI SMKETNFCSP
LVNENYPDPM QNCSSKYYTC FNGYLVQRHC EFGKYYDVQS DKCDLFRMVP ACSRFGRSNN
LRILSTEATP TTASSNSLPT LSFNCENLPD GNWAASACKP YYFACVGGFS FMQPCPPGTY
YDPDTDQCNY KSAIPLCGEI VEWISISPLH TTNQPAATAT PAPTIITATT STTTTTTTAA
AAATSAATNY IYSTLSVDCK NLPNGYYADP RNSCSKIYFL CHNGNDFMFY CVGARLFYDP
DNGICDDKAH VVACGGSSRG IAASTSPFHQ GSTVETTQQP TRAPITTLKT THLPNVIRTR
KLTNRIINPT QSATGEGRRT SSSSTATTKS INIPTRAMIR TTQRPPVVRT RKTVHPWRRI
TTMKPTHLSS VDSFSTMAMT KTTPSPLITA SAHANTSPRA VAKPLVVSVP EIGRVTKAPF
GSNQQTKQTN SVQSQHSFSF RTKTSFQNSV FTGHNNTVIS RGNNWISIFQ RELSADGSVE
NSRYFLRTRQ RAKPVRLGTR KPKYQATSVT QV
//