ID A0A0N5D9H5_THECL Unreviewed; 604 AA.
AC A0A0N5D9H5;
DT 09-DEC-2015, integrated into UniProtKB/TrEMBL.
DT 09-DEC-2015, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000313|WBParaSite:TCLT_0000978501-mRNA-1};
GN ORFNames=TCLT_LOCUS9774 {ECO:0000313|EMBL:VDN07430.1};
OS Thelazia callipaeda (Oriental eyeworm) (Parasitic nematode).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Spirurina; Spiruromorpha; Thelazioidea; Thelaziidae; Thelazia.
OX NCBI_TaxID=103827 {ECO:0000313|Proteomes:UP000046394, ECO:0000313|WBParaSite:TCLT_0000978501-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:TCLT_0000978501-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (FEB-2017) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDN07430.1, ECO:0000313|Proteomes:UP000276776}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYYF01004877; VDN07430.1; -; Genomic_DNA.
DR STRING; 103827.A0A0N5D9H5; -.
DR WBParaSite; TCLT_0000978501-mRNA-1; TCLT_0000978501-mRNA-1; TCLT_0000978501.
DR OMA; CSWKPME; -.
DR Proteomes; UP000046394; Unplaced.
DR Proteomes; UP000276776; Unassembled WGS sequence.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1105; MULTIPLEXIN, ISOFORM R; 1.
DR Pfam; PF01391; Collagen; 8.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000276776};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REGION 30..269
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 281..336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 354..530
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..109
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..126
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 202..216
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..336
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 413..427
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 604 AA; 59355 MW; 380A5372667DDA6B CRC64;
MFLKSPLRRQ LRSFGFLYAP NGKAIQLRGI AGPPGPPGPK GTRGYPGFPG PIGLDGPKGV
AGMPGPKGER GERGPMGPPG YPGPKGDRGY SAALSRLQVG NSDSASPMLI QGPPGNPGPP
GEVGKPGPAG PKGERGLPGF DGESRIGVKG EPGEPGMQGK NGLIGPPGPP GLKGEPGAPG
QRGLPGSPSK HVVEHLEPIA GPPGKPGLPG SPGVPGPPGA KGDAGFPGRD GAEGRMGRTG
SPGQRGAPGQ KGERGEKGDI GVSGAPGLPG IVTAASARAT QIIAGPPGPP GRDGRKGEKG
EKGDMGSKGM AGIPGQPGAK GDTGRRGKKG KDGLAVNEEK LIQKVLSVVQ HARIGGLPGA
PGPQGPKGEQ GSRGERGPQG LPGHAGTKGE RGEIGPPGLI GPPGLPGLPG SVIEATASSS
SSLMVSGPPG PRGSPGLPGP PGLKGEKGDQ GLAGLPGSLG LPGPPGPMGI RGSPGTPGIE
GRQGKIGPVG PPGPKGDIGL PGARGPPGDR GPQGEQGKHG LPGLQGEKGE QGIPGLDAPC
PTGPDGLPMP YCAWKPLDQN VRTAYGIDVA NDNNDGNGIG VQYAEINEDK LDWTADGFAK
RIIH
//