ID Q17417_CAEEL Unreviewed; 297 AA.
AC Q17417;
DT 01-NOV-1996, integrated into UniProtKB/TrEMBL.
DT 01-NOV-1996, sequence version 1.
DT 27-MAR-2024, entry version 154.
DE SubName: Full=Nematode cuticle collagen N-terminal domain-containing protein {ECO:0000313|EMBL:CAA94874.1};
GN Name=col-149 {ECO:0000313|EMBL:CAA94874.1,
GN ECO:0000313|WormBase:B0024.1};
GN ORFNames=B0024.1 {ECO:0000313|WormBase:B0024.1}, CELE_B0024.1
GN {ECO:0000313|EMBL:CAA94874.1};
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239 {ECO:0000313|EMBL:CAA94874.1, ECO:0000313|Proteomes:UP000001940};
RN [1] {ECO:0000313|EMBL:CAA94874.1, ECO:0000313|Proteomes:UP000001940}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2 {ECO:0000313|EMBL:CAA94874.1,
RC ECO:0000313|Proteomes:UP000001940};
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RA Sulston J.E., Waterston R.;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC -!- SUBUNIT: Collagen polypeptide chains are complexed within the cuticle
CC by disulfide bonds and other types of covalent cross-links.
CC {ECO:0000256|ARBA:ARBA00011518}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX284605; CAA94874.1; -; Genomic_DNA.
DR PIR; T18637; T18637.
DR RefSeq; NP_505646.1; NM_073245.4.
DR AlphaFoldDB; Q17417; -.
DR DIP; DIP-24917N; -.
DR IntAct; Q17417; 1.
DR STRING; 6239.B0024.1.1; -.
DR EPD; Q17417; -.
DR PaxDb; 6239-B0024-1; -.
DR PeptideAtlas; Q17417; -.
DR EnsemblMetazoa; B0024.1.1; B0024.1.1; WBGene00000722.
DR GeneID; 179431; -.
DR KEGG; cel:CELE_B0024.1; -.
DR UCSC; B0024.1; c. elegans.
DR AGR; WB:WBGene00000722; -.
DR WormBase; B0024.1; CE05146; WBGene00000722; col-149.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00970000196288; -.
DR HOGENOM; CLU_001074_4_3_1; -.
DR InParanoid; Q17417; -.
DR OMA; AYCACPS; -.
DR OrthoDB; 2883277at2759; -.
DR PhylomeDB; Q17417; -.
DR Proteomes; UP000001940; Chromosome V.
DR Bgee; WBGene00000722; Expressed in larva and 1 other cell type or tissue.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0042302; F:structural constituent of cuticle; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR InterPro; IPR002486; Col_cuticle_N.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24637:SF265; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24637; COLLAGEN; 1.
DR Pfam; PF01484; Col_cuticle_N; 1.
DR Pfam; PF01391; Collagen; 1.
DR SMART; SM01088; Col_cuticle_N; 1.
PE 1: Evidence at protein level;
KW Collagen {ECO:0000313|EMBL:CAA94874.1};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Proteomics identification {ECO:0007829|EPD:Q17417,
KW ECO:0007829|PeptideAtlas:Q17417};
KW Reference proteome {ECO:0000313|Proteomes:UP000001940};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 6..30
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 7..58
FT /note="Nematode cuticle collagen N-terminal"
FT /evidence="ECO:0000259|SMART:SM01088"
FT REGION 102..137
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 156..282
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 102..118
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 297 AA; 30561 MW; E8E4E0C90C1453BB CRC64;
MFEEKLLVGI ASLASTIAIL TCVVVIPGLY STINEMHDEV LDGVKMFRAD TDAAWVEMLD
VQVMVSPPSQ PKENPFNSVF RQKRRSTFSG LPAWCQCEPT KPKCPRGPPG PPGHPGQRGI
PGIPGRNGQD NYNTIRAPAC PPRNQDCIKC PAGPPGPSGT CGQVGRPGPD GRPGQPGRRG
NDGRPGQPGP QGNAGQPGRD GNPGQPGHPG KDGRRGHGSP GAPGRAGQPG RQGAPGNPGR
PGERGPSGPC GPAGRSGQPG NRGSDGHPGA PGNPGLQGSD AAYCACPTRS VMFLKRH
//