GenomeNet

Database: UniProt
Entry: A0A0N5D9H5_THECL
ID   A0A0N5D9H5_THECL        Unreviewed;       604 AA.
AC   A0A0N5D9H5;
DT   09-DEC-2015, integrated into UniProtKB/TrEMBL.
DT   09-DEC-2015, sequence version 1.
DT   27-MAR-2024, entry version 31.
DE   SubName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000313|WBParaSite:TCLT_0000978501-mRNA-1};
GN   ORFNames=TCLT_LOCUS9774 {ECO:0000313|EMBL:VDN07430.1};
OS   Thelazia callipaeda (Oriental eyeworm) (Parasitic nematode).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Spirurina; Spiruromorpha; Thelazioidea; Thelaziidae; Thelazia.
OX   NCBI_TaxID=103827 {ECO:0000313|Proteomes:UP000046394, ECO:0000313|WBParaSite:TCLT_0000978501-mRNA-1};
RN   [1] {ECO:0000313|WBParaSite:TCLT_0000978501-mRNA-1}
RP   IDENTIFICATION.
RG   WormBaseParasite;
RL   Submitted (FEB-2017) to UniProtKB.
RN   [2] {ECO:0000313|EMBL:VDN07430.1, ECO:0000313|Proteomes:UP000276776}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Pathogen Informatics;
RL   Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; UYYF01004877; VDN07430.1; -; Genomic_DNA.
DR   STRING; 103827.A0A0N5D9H5; -.
DR   WBParaSite; TCLT_0000978501-mRNA-1; TCLT_0000978501-mRNA-1; TCLT_0000978501.
DR   OMA; CSWKPME; -.
DR   Proteomes; UP000046394; Unplaced.
DR   Proteomes; UP000276776; Unassembled WGS sequence.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1105; MULTIPLEXIN, ISOFORM R; 1.
DR   Pfam; PF01391; Collagen; 8.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000276776};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   REGION          30..269
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          281..336
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          354..530
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        95..109
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        112..126
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        202..216
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        322..336
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        413..427
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   604 AA;  59355 MW;  380A5372667DDA6B CRC64;
     MFLKSPLRRQ LRSFGFLYAP NGKAIQLRGI AGPPGPPGPK GTRGYPGFPG PIGLDGPKGV
     AGMPGPKGER GERGPMGPPG YPGPKGDRGY SAALSRLQVG NSDSASPMLI QGPPGNPGPP
     GEVGKPGPAG PKGERGLPGF DGESRIGVKG EPGEPGMQGK NGLIGPPGPP GLKGEPGAPG
     QRGLPGSPSK HVVEHLEPIA GPPGKPGLPG SPGVPGPPGA KGDAGFPGRD GAEGRMGRTG
     SPGQRGAPGQ KGERGEKGDI GVSGAPGLPG IVTAASARAT QIIAGPPGPP GRDGRKGEKG
     EKGDMGSKGM AGIPGQPGAK GDTGRRGKKG KDGLAVNEEK LIQKVLSVVQ HARIGGLPGA
     PGPQGPKGEQ GSRGERGPQG LPGHAGTKGE RGEIGPPGLI GPPGLPGLPG SVIEATASSS
     SSLMVSGPPG PRGSPGLPGP PGLKGEKGDQ GLAGLPGSLG LPGPPGPMGI RGSPGTPGIE
     GRQGKIGPVG PPGPKGDIGL PGARGPPGDR GPQGEQGKHG LPGLQGEKGE QGIPGLDAPC
     PTGPDGLPMP YCAWKPLDQN VRTAYGIDVA NDNNDGNGIG VQYAEINEDK LDWTADGFAK
     RIIH
//