GenomeNet

Database: UniProt
Entry: G1RCL9_NOMLE
LinkDB: G1RCL9_NOMLE
Original site: G1RCL9_NOMLE 
ID   G1RCL9_NOMLE            Unreviewed;      1386 AA.
AC   G1RCL9;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 3.
DT   27-MAR-2024, entry version 66.
DE   SubName: Full=Collagen type I alpha 1 chain {ECO:0000313|Ensembl:ENSNLEP00000010968.3};
GN   Name=COL1A1 {ECO:0000313|Ensembl:ENSNLEP00000010968.3};
OS   Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC   Nomascus.
OX   NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000010968.3, ECO:0000313|Proteomes:UP000001073};
RN   [1] {ECO:0000313|Ensembl:ENSNLEP00000010968.3, ECO:0000313|Proteomes:UP000001073}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Gibbon Genome Sequencing Consortium;
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSNLEP00000010968.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC       forming collagen). {ECO:0000256|ARBA:ARBA00003647}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADFV01128872; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01128873; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01128874; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 61853.ENSNLEP00000010968; -.
DR   Ensembl; ENSNLET00000011502.2; ENSNLEP00000010968.3; ENSNLEG00000008996.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000156584; -.
DR   HOGENOM; CLU_001074_20_0_1; -.
DR   InParanoid; G1RCL9; -.
DR   OMA; YYDRDVW; -.
DR   Proteomes; UP000001073; Chromosome 14.
DR   GO; GO:0005584; C:collagen type I trimer; IEA:Ensembl.
DR   GO; GO:0005737; C:cytoplasm; IEA:Ensembl.
DR   GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:Ensembl.
DR   GO; GO:0042802; F:identical protein binding; IEA:Ensembl.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0048407; F:platelet-derived growth factor binding; IEA:Ensembl.
DR   GO; GO:0002020; F:protease binding; IEA:Ensembl.
DR   GO; GO:0001568; P:blood vessel development; IEA:Ensembl.
DR   GO; GO:0060346; P:bone trabecula formation; IEA:Ensembl.
DR   GO; GO:0060351; P:cartilage development involved in endochondral bone morphogenesis; IEA:Ensembl.
DR   GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR   GO; GO:0071260; P:cellular response to mechanical stimulus; IEA:Ensembl.
DR   GO; GO:0032964; P:collagen biosynthetic process; IEA:Ensembl.
DR   GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR   GO; GO:0048706; P:embryonic skeletal system development; IEA:Ensembl.
DR   GO; GO:0001958; P:endochondral ossification; IEA:Ensembl.
DR   GO; GO:0060325; P:face morphogenesis; IEA:Ensembl.
DR   GO; GO:0001957; P:intramembranous ossification; IEA:Ensembl.
DR   GO; GO:0010812; P:negative regulation of cell-substrate adhesion; IEA:Ensembl.
DR   GO; GO:0001649; P:osteoblast differentiation; IEA:Ensembl.
DR   GO; GO:0090263; P:positive regulation of canonical Wnt signaling pathway; IEA:Ensembl.
DR   GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl.
DR   GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:Ensembl.
DR   GO; GO:0010718; P:positive regulation of epithelial to mesenchymal transition; IEA:Ensembl.
DR   GO; GO:0034504; P:protein localization to nucleus; IEA:Ensembl.
DR   GO; GO:0015031; P:protein transport; IEA:Ensembl.
DR   GO; GO:0007605; P:sensory perception of sound; IEA:Ensembl.
DR   GO; GO:0043589; P:skin morphogenesis; IEA:Ensembl.
DR   GO; GO:0034505; P:tooth mineralization; IEA:Ensembl.
DR   GO; GO:0007601; P:visual perception; IEA:Ensembl.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1108; ENDOSTATIN DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 14.
DR   Pfam; PF00093; VWC; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..1386
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5014191018"
FT   DOMAIN          1156..1386
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          81..138
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          172..200
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          224..1140
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        81..95
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        107..138
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        361..375
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        491..514
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        830..844
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1103..1122
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1386 AA;  133744 MW;  F132E63582DF732A CRC64;
     MFSFVDLRLL LLLAATALLT HGQEEGQVEG QDEDIPPITC VQNGLRYHDR DVWKPEPCRI
     CVCDNGKVLC DDQTLALTRV SSPLESPTDQ ETTGVEGPKG DTGPRGPRGP AGPPGRDGIP
     GQPGLPGPPG PPGPPGPPVS SIIGAVFCLS IHCPDPDFSC FPWAVVGRQG HDGLFSPSQG
     EAGKPGRPGE RGPPGPQVSR RLRLNLGFTA LGLHLGGFSG LDGAKGDAGP AGPKGEPGSP
     GENGAPGQMG PRGLPGERGR PGAPGPAGAR GNDGATGAAG PPVSVACNVG FPRVGEAGPQ
     GPRGSEGPQG VRGEPGPPGP AGAAGPAGNP GADGQPGAKG ANGAPGIAGA PGFPGARGPS
     GPQGPGGPPG PKGNSGEPGA PGSKGDTGAK GEPVSLPATL LAAQPTLPWE PPEGNPERKR
     SSWSPGGVPG GPGSRGFPGA DGVAGPKGPA GERGSPGPAG PKGSPGEAGR PGEAGLPGAK
     GLTGSPGSPG PDGKTGPPGP AGQDGRPGPP GPPGARGQAG VMGFPGPKGA AGEPGKAGER
     GVPGPPGAVG PAGKDGEAGA QGPPGPAGPA GERGEQGPAG SPGFQGLPGP AGPPGEAGKP
     GEQGVPGDLG APGPSGARGE RGFPGERGVQ GPPGPAGPRG ANGAPGNDGA KGDAGAPGAP
     GSQGAPGLQG MPGERGAAGL PGPKGDRGDA GPKGADGSPG KDGVRGLTGP IGPPGPAGAP
     GDKGETGPSG PAGPTGARGA PGDRGEPGPP GPAGFAGPPG ADGQPGAKGE PGDAGAKGDA
     GPPGPAGPAG PPGPIGNVGA PGAKGARGSA GPPGATGFPG AAGRVGPPGP SGNAGPPGPP
     GPAGKEGGKG PRGETGPAGR PGEVGPPGPP GPAGEKGSPG ADGPAGAPGT PGPQGIAGQR
     GVVGLPGQRG ERGFPGLPGE PGKQGPSGAS GERGPPGPMG PPGLAGPPGE SGREGAPGAE
     GSPGRDGLLA PRGGLILQPQ PQPPLAKRSC FPCSPQGPAG PAGPVGPVGA RGPAGPQGPR
     GDKGETGEQG DRGIKGHRGF SGLQGPPGPP GSPGEQGPSG ASGPAGPRGP PGSAGAPGKD
     GLNGLPGPIG PPGPRGRTGD AGPVGPPGPP GPPGPPGPPS AGFDFSFLPQ PPQEKAHDGG
     RYYRADDANV VRDRDLEVDT TLKSLSQQIE NIRSPEGSRK NPARTCRDLK MCHSDWKSGE
     YWIDPNQGCN LDAIKVFCNM ETGETCVYPT QPSVAQKNWY ISKNPKDKRH VWFGESMTDG
     FQFEYGGQGS DPADVAIQLT FLRLMSTEAS QNITYHCKNS VAYMDQQTGN LKKALLLQGS
     NEIEIRAEGN SRFTYSVTVD GCTSHTGAWG KTVIEYKTTK TSRLPIIDVA PLDVEFGFDV
     GPVCFL
//
DBGET integrated database retrieval system