GenomeNet

Database: UniProt
Entry: U4PM21_CAEEL
ID   U4PM21_CAEEL            Unreviewed;       221 AA.
AC   U4PM21;
DT   11-DEC-2013, integrated into UniProtKB/TrEMBL.
DT   11-DEC-2013, sequence version 1.
DT   27-MAR-2024, entry version 67.
DE   SubName: Full=Collagen triple helix repeat protein {ECO:0000313|EMBL:CDH93114.1};
GN   Name=col-111 {ECO:0000313|EMBL:CDH93114.1,
GN   ECO:0000313|WormBase:F29B9.9b};
GN   ORFNames=CELE_F29B9.9 {ECO:0000313|EMBL:CDH93114.1}, F29B9.9
GN   {ECO:0000313|WormBase:F29B9.9b};
OS   Caenorhabditis elegans.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC   Caenorhabditis.
OX   NCBI_TaxID=6239 {ECO:0000313|EMBL:CDH93114.1, ECO:0000313|Proteomes:UP000001940};
RN   [1] {ECO:0000313|EMBL:CDH93114.1, ECO:0000313|Proteomes:UP000001940}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bristol N2 {ECO:0000313|EMBL:CDH93114.1,
RC   ECO:0000313|Proteomes:UP000001940};
RX   PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG   The C. elegans sequencing consortium;
RA   Sulston J.E., Waterston R.;
RT   "Genome sequence of the nematode C. elegans: a platform for investigating
RT   biology.";
RL   Science 282:2012-2018(1998).
CC   -!- SUBUNIT: Collagen polypeptide chains are complexed within the cuticle
CC       by disulfide bonds and other types of covalent cross-links.
CC       {ECO:0000256|ARBA:ARBA00011518}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BX284604; CDH93114.1; -; Genomic_DNA.
DR   RefSeq; NP_001294383.1; NM_001307454.1.
DR   AlphaFoldDB; U4PM21; -.
DR   EnsemblMetazoa; F29B9.9b.1; F29B9.9b.1; WBGene00000685.
DR   AGR; WB:WBGene00000685; -.
DR   WormBase; F29B9.9b; CE48577; WBGene00000685; col-111.
DR   GeneTree; ENSGT00940000168023; -.
DR   HOGENOM; CLU_001074_4_1_1; -.
DR   OrthoDB; 2882842at2759; -.
DR   Proteomes; UP000001940; Chromosome IV.
DR   Bgee; WBGene00000685; Expressed in material anatomical entity and 4 other cell types or tissues.
DR   ExpressionAtlas; U4PM21; baseline and differential.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24637:SF252; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR24637; COLLAGEN; 1.
DR   Pfam; PF01391; Collagen; 3.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:CDH93114.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001940};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   REGION          1..166
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          179..221
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        110..124
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        133..155
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        192..221
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   221 AA;  21825 MW;  8E8AC6EAFF5B998C CRC64;
     MDGKPGEPGA SGPHGMSGSE LTKMMRNDDT CIKCPAGPPG PRGAPGSVGE PGPQGRDGMG
     GRPGNMGRPG PPGPAGDPGQ AGSKGAPGTH GRQGMPGVRY QLGEPGPAGY PGPRGAPGPV
     GPPGVAEEGP DGLPGPAGPP GKPGMPGPMG TPGGPGEPGI PGSDAAYCPC PKREALVESS
     VISQEVPETG YGDGGDKSKR EKVVETHQEE QTYDSFKFEQ V
//
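
The record above follows the UniProtKB flat-file layout: a two-letter line code, three spaces, then the field content, with the sequence block after the SQ line and a `//` terminator. As a minimal sketch of how such an entry could be read programmatically, the Python below groups lines by code, joins the sequence, and pulls out the FT ranges. The function names `parse_flat_entry`, `declared_length`, and `feature_ranges` are hypothetical helpers made up for this illustration, not part of any UniProt or GenomeNet tool, and the sketch assumes one entry per input and simple `start..end` feature positions.

```python
import re
from collections import defaultdict

# Assumed line shape: two uppercase letters, three spaces, then the content.
LINE_RE = re.compile(r"^([A-Z]{2})   (.*)$")
# Assumed feature shape: a key followed by "start..end" (no fuzzy positions).
FEATURE_RE = re.compile(r"^([A-Z_]+)\s+(\d+)\.\.(\d+)$")


def parse_flat_entry(text):
    """Return (fields, sequence) for a single flat-file entry.

    fields maps each line code (ID, AC, DR, FT, ...) to its content lines;
    sequence is the residue string with spaces and line breaks removed.
    """
    fields = defaultdict(list)
    residues = []
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):  # end-of-entry terminator
            break
        if in_sequence:
            # Sequence lines hold space-separated blocks of residues.
            residues.append("".join(line.split()))
            continue
        match = LINE_RE.match(line)
        if match is None:
            continue  # skip page headers, blank lines, anything unrecognized
        code, rest = match.groups()
        fields[code].append(rest.strip())
        if code == "SQ":
            in_sequence = True
    return dict(fields), "".join(residues)


def declared_length(fields):
    """Read the residue count stated on the SQ header line."""
    match = re.search(r"SEQUENCE\s+(\d+)\s+AA;", fields["SQ"][0])
    return int(match.group(1))


def feature_ranges(fields):
    """Return (key, start, end) tuples for FT lines that carry a range."""
    ranges = []
    for content in fields.get("FT", []):
        match = FEATURE_RE.match(content)
        if match:
            key, start, end = match.groups()
            ranges.append((key, int(start), int(end)))
    return ranges
```

One simple consistency check is to compare `len(sequence)` with `declared_length(fields)`. For this entry both should equal the 221 AA stated on the ID and SQ lines.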