ID A0A183K145_9TREM Unreviewed; 1377 AA.
AC A0A183K145;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000313|WBParaSite:SCUD_0000870801-mRNA-1};
GN ORFNames=SCUD_LOCUS8708 {ECO:0000313|EMBL:VDP32095.1};
OS Schistosoma curassoni.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX NCBI_TaxID=6186 {ECO:0000313|Proteomes:UP000050789, ECO:0000313|WBParaSite:SCUD_0000870801-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:SCUD_0000870801-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (JUN-2016) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDP32095.1, ECO:0000313|Proteomes:UP000279833}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Dakar {ECO:0000313|EMBL:VDP32095.1}, and Dakar, Senegal
RC {ECO:0000313|Proteomes:UP000279833};
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UZAK01032873; VDP32095.1; -; Genomic_DNA.
DR STRING; 6186.A0A183K145; -.
DR WBParaSite; SCUD_0000870801-mRNA-1; SCUD_0000870801-mRNA-1; SCUD_0000870801.
DR Proteomes; UP000050789; Unplaced.
DR Proteomes; UP000279833; Unassembled WGS sequence.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 3.30.750.130; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 2.
DR SMART; SM00038; COLFI; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000279833}.
FT DOMAIN 1118..1376
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 1..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 83..714
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 728..1096
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 606..620
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1377 AA; 134604 MW; 68633BE59C2C9B19 CRC64;
GQPGEKGEVG PKGEKGQPGR DGRNGIPGAP GLPGVPSHIL MMPLPSFGKN ELSRMASFRT
TVQQAVMMLK GSRGQQGMTG LPGIQGPPGP RGPKGDQGLF GEPGFIGPTG LRGPPGPVGP
RGRPGVDGEI GASGLAGPKG LPGLPGLPGL PGHRGHKGEI GPNGIKGDVG LRGTPGPYGR
DGEVGPPGEP GAVGLQGPMG PVGPAGLRGL PGRIGPNGLK GEFGDSGEMG PPGPQGPVGP
PGVPGPQGIR GLTGKPGARG PPGVTGLPGS PGMTGNPGPQ GNTGPKGEPG LTGEPGDTGP
VGASGYKGDR GPRGLPGAKG TQGAAGLTGV RGERGEKGSK GERGLPGPRG SEGPVGLKGE
PGIVGELGPR GLEGQKGNEG PRGSPGYPGG PGPKGSVGPA GGIGSPGEKG DPGHMGPLGL
NGPVGPRGAP GPRGAPGQSG PVGFKGEEGP TGPTGPPGAA GPQGPRGSPG FVGVPGPPGL
PGRPGPVGLL GDRGEPGEPG IPGEIGPIGM VGVPGQIGEQ GAPGERGPPG PRGPQGVAGK
PGVEGIEGVQ GEKGTKGAAG PPGPSGPVGL RGTRGPPGPS GDPGSKGDEG RIGPQGSLGD
KGDRGPPGPH GPPGPIGPVG PQGLPGSQGE PGDKGARGPP GPIGETGEKG VQGLIGKPGL
RGPTGADGEQ GDQGSPGPQG TKGDRGEAGE QGSAGQQGAA GSPGPQGMVG EPGEIGIAGV
AGIKGVEGKV GPPGAPGPAG VQGSPGRSGP RGEPGIKGVI GAEGPPGLPG PVGLAGAPGK
MGIPGVPGPP GSPGLKGEQG KPGARGEQGA QGTTGPPGPK GMKGMLGRPG RQGQPGSAGP
PGIVGEPGVQ GAQGPMGPIG APGLPGEPGA AGPPGKDGVR GAVGLEGLFG DDGPPGPPGP
VGVPGVQGPP GTAGETGPTG DPGDRGAAGS VGLPGERGPP GIMGAEGQVG PQGPEGEQGL
QGPPGETGPK GNTGKTGSPG PNGPKGESGP KGVSGNPGTV GPPGPKGEQG SIGDPGPTGK
QGEMGDHGDQ GPPGPRGPNG DEGESGPPGP TGPPGPPGLQ GKVTSPRGEQ GHQGDPGPMG
KQGPQGQVKI TNNGENLRMR YKRSAAIQSE ETLEQINQNI FGRVDALNKK VRSRRYATGL
SPQNPARTCK DVRLTNKFAQ NDTYWIDPNE GSNKDAIRAK CTFFDDGTVE TCVDSSMDQV
SEFTYLKPLP SDSEWQSQLR VMNSTTPLDL HNYGSHSQVN MLRIQHRYAS QELEFLCDGT
EIYGSWNSRT SQSDFNKATA LLAHNERMLD LTSGERLGPG KYHMDKRNYR NNLDRVDTSI
IVEYDGCQYL TKGAATKLLI ETRDLEVLPI IDFKVRNFDK QGTCSLSVNV GAVCYRT
//