GenomeNet

Database: UniProt
Entry: A0A183K145_9TREM
ID   A0A183K145_9TREM        Unreviewed;      1377 AA.
AC   A0A183K145;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   27-MAR-2024, entry version 28.
DE   SubName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000313|WBParaSite:SCUD_0000870801-mRNA-1};
GN   ORFNames=SCUD_LOCUS8708 {ECO:0000313|EMBL:VDP32095.1};
OS   Schistosoma curassoni.
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX   NCBI_TaxID=6186 {ECO:0000313|Proteomes:UP000050789, ECO:0000313|WBParaSite:SCUD_0000870801-mRNA-1};
RN   [1] {ECO:0000313|WBParaSite:SCUD_0000870801-mRNA-1}
RP   IDENTIFICATION.
RG   WormBaseParasite;
RL   Submitted (JUN-2016) to UniProtKB.
RN   [2] {ECO:0000313|EMBL:VDP32095.1, ECO:0000313|Proteomes:UP000279833}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Dakar {ECO:0000313|EMBL:VDP32095.1}, and Dakar, Senegal
RC   {ECO:0000313|Proteomes:UP000279833};
RG   Pathogen Informatics;
RL   Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; UZAK01032873; VDP32095.1; -; Genomic_DNA.
DR   STRING; 6186.A0A183K145; -.
DR   WBParaSite; SCUD_0000870801-mRNA-1; SCUD_0000870801-mRNA-1; SCUD_0000870801.
DR   Proteomes; UP000050789; Unplaced.
DR   Proteomes; UP000279833; Unassembled WGS sequence.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 3.30.750.130; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000279833}.
FT   DOMAIN          1118..1376
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          1..34
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          83..714
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          728..1096
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        606..620
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1377 AA;  134604 MW;  68633BE59C2C9B19 CRC64;
     GQPGEKGEVG PKGEKGQPGR DGRNGIPGAP GLPGVPSHIL MMPLPSFGKN ELSRMASFRT
     TVQQAVMMLK GSRGQQGMTG LPGIQGPPGP RGPKGDQGLF GEPGFIGPTG LRGPPGPVGP
     RGRPGVDGEI GASGLAGPKG LPGLPGLPGL PGHRGHKGEI GPNGIKGDVG LRGTPGPYGR
     DGEVGPPGEP GAVGLQGPMG PVGPAGLRGL PGRIGPNGLK GEFGDSGEMG PPGPQGPVGP
     PGVPGPQGIR GLTGKPGARG PPGVTGLPGS PGMTGNPGPQ GNTGPKGEPG LTGEPGDTGP
     VGASGYKGDR GPRGLPGAKG TQGAAGLTGV RGERGEKGSK GERGLPGPRG SEGPVGLKGE
     PGIVGELGPR GLEGQKGNEG PRGSPGYPGG PGPKGSVGPA GGIGSPGEKG DPGHMGPLGL
     NGPVGPRGAP GPRGAPGQSG PVGFKGEEGP TGPTGPPGAA GPQGPRGSPG FVGVPGPPGL
     PGRPGPVGLL GDRGEPGEPG IPGEIGPIGM VGVPGQIGEQ GAPGERGPPG PRGPQGVAGK
     PGVEGIEGVQ GEKGTKGAAG PPGPSGPVGL RGTRGPPGPS GDPGSKGDEG RIGPQGSLGD
     KGDRGPPGPH GPPGPIGPVG PQGLPGSQGE PGDKGARGPP GPIGETGEKG VQGLIGKPGL
     RGPTGADGEQ GDQGSPGPQG TKGDRGEAGE QGSAGQQGAA GSPGPQGMVG EPGEIGIAGV
     AGIKGVEGKV GPPGAPGPAG VQGSPGRSGP RGEPGIKGVI GAEGPPGLPG PVGLAGAPGK
     MGIPGVPGPP GSPGLKGEQG KPGARGEQGA QGTTGPPGPK GMKGMLGRPG RQGQPGSAGP
     PGIVGEPGVQ GAQGPMGPIG APGLPGEPGA AGPPGKDGVR GAVGLEGLFG DDGPPGPPGP
     VGVPGVQGPP GTAGETGPTG DPGDRGAAGS VGLPGERGPP GIMGAEGQVG PQGPEGEQGL
     QGPPGETGPK GNTGKTGSPG PNGPKGESGP KGVSGNPGTV GPPGPKGEQG SIGDPGPTGK
     QGEMGDHGDQ GPPGPRGPNG DEGESGPPGP TGPPGPPGLQ GKVTSPRGEQ GHQGDPGPMG
     KQGPQGQVKI TNNGENLRMR YKRSAAIQSE ETLEQINQNI FGRVDALNKK VRSRRYATGL
     SPQNPARTCK DVRLTNKFAQ NDTYWIDPNE GSNKDAIRAK CTFFDDGTVE TCVDSSMDQV
     SEFTYLKPLP SDSEWQSQLR VMNSTTPLDL HNYGSHSQVN MLRIQHRYAS QELEFLCDGT
     EIYGSWNSRT SQSDFNKATA LLAHNERMLD LTSGERLGPG KYHMDKRNYR NNLDRVDTSI
     IVEYDGCQYL TKGAATKLLI ETRDLEVLPI IDFKVRNFDK QGTCSLSVNV GAVCYRT
//
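
The record above uses the UniProtKB flat-file layout: each annotation line starts with a two-letter line code (ID, AC, DR, FT, SQ, ...) followed by its content, the residue lines after SQ are indented and split into blocks of ten, and `//` ends the entry. As a minimal sketch of how such a record could be read, the Python below groups lines by code and joins the residues. The function name `parse_flat_record` and the short sample entry are hypothetical and only for illustration; this is not an official UniProt parser and it does no validation beyond what is shown.

```python
from collections import defaultdict


def parse_flat_record(text):
    """Group one flat-file record by line code and collect its sequence.

    Hypothetical helper: returns (fields, sequence), where fields maps each
    two-letter line code to the list of its content strings.
    """
    fields = defaultdict(list)
    residues = []
    for line in text.splitlines():
        if line.startswith("//"):
            break  # end-of-entry terminator
        code = line[:2]
        if code == "  ":
            # Indented lines carry residues in space-separated blocks.
            residues.append(line.replace(" ", ""))
        elif len(line) > 5 and code.isalpha() and code.isupper():
            # Content starts after the code and its padding (column 6).
            fields[code].append(line[5:].rstrip())
        # Anything else (page headers, blank lines) is ignored.
    return dict(fields), "".join(residues)


# Made-up sample entry (not real data), shaped like the record above.
sample = """ID   EXAMPLE_9TREM        Unreviewed;      12 AA.
AC   X00000;
FT   REGION          1..4
FT                   /note="Disordered"
SQ   SEQUENCE   12 AA;  0 MW;  0 CRC64;
     GQPGEKGEVG PK
//
"""

fields, sequence = parse_flat_record(sample)
declared_length = int(fields["SQ"][0].split()[1])  # the "12" in "SEQUENCE 12 AA;"
```

Comparing `declared_length` with `len(sequence)` gives a quick consistency check; for the entry above, the SQ line declares 1377 AA, so the joined residue lines can be checked against that figure in the same way.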