ID G1S4F3_NOMLE Unreviewed; 1787 AA.
AC G1S4F3;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 27-MAR-2024, entry version 75.
DE SubName: Full=Collagen type XXVII alpha 1 chain {ECO:0000313|Ensembl:ENSNLEP00000020391.2};
GN Name=COL27A1 {ECO:0000313|Ensembl:ENSNLEP00000020391.2};
OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC Nomascus.
OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000020391.2, ECO:0000313|Proteomes:UP000001073};
RN [1] {ECO:0000313|Ensembl:ENSNLEP00000020391.2, ECO:0000313|Proteomes:UP000001073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Gibbon Genome Sequencing Consortium;
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSNLEP00000020391.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADFV01019771; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019772; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019773; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019774; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019775; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019776; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019777; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019778; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019779; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01019780; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 61853.ENSNLEP00000020391; -.
DR Ensembl; ENSNLET00000021407.2; ENSNLEP00000020391.2; ENSNLEG00000016786.3.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000163466; -.
DR HOGENOM; CLU_001074_19_1_1; -.
DR InParanoid; G1S4F3; -.
DR OMA; HQHIAVG; -.
DR TreeFam; TF344135; -.
DR Proteomes; UP000001073; Chromosome 8.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005583; C:fibrillar collagen trimer; IEA:Ensembl.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:Ensembl.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:Ensembl.
DR GO; GO:0003431; P:growth plate cartilage chondrocyte development; IEA:Ensembl.
DR Gene3D; 2.60.120.1000; -; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF844; COLLAGEN ALPHA-1(XXVII) CHAIN; 1.
DR Pfam; PF01410; COLFI; 2.
DR Pfam; PF01391; Collagen; 7.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1787
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014182799"
FT DOMAIN 1587..1787
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 256..587
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 601..754
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 828..1552
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..288
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..383
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..422
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 440..500
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 550..587
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 601..644
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1078..1112
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1168..1185
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1532..1548
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1787 AA; 180005 MW; 7585D0996D69B34F CRC64;
GFLFSWILVS FACHLASTQG APEDVDVLQR LGLSWTKAGS PAPLGVIPFQ SGFIFTQRAR
LQAPTGTIIP AALGTELALV LSLCSHRVNH AFLFAVRSRK HKLQLGLQFL PGKTVVHLGS
RRSVAFDLDM HDGRWHHLAL ELRGRTVTLV TACGQRRVPV LLPFHRDPAL DPGGSFLFGK
MNPHAVQFEG ALCQFSIYPV TQVAHNYCTH LRKQCGQADT YQSPLGPLFS QDSGRPFTFQ
SDLALLGLEN LTTATPALGS LPARGPRGTV VPATPTKPQR TSPTNPHQHI AVGGPAQTPL
LPAKLSASNS LDPVLPASVG GSTRTPHPAA AQPSQKITAT KIPKSLPTKP SAPSTSIVPV
KSPRSTQKTA PSSFTKSAPP TQKQVPPTSR PVPAKVSHPA EKPIQRNPGM PRPPPPSTQP
LPPTTSSSKK PIPTLALTEA KITSHASKPA SANTSTHKPP PFTALSSSPA PTPGSTRTTR
PPATMVPPTS GTSTPRTAPA VPTPGSAPTG SKKPTGSEAS KKAGPKSSSR KPVPLRPGKA
ARDVPLSDLT TRPSPRQSQP SQQTTPALVL APAQFLSSSP QPTSSGYSFF HLAGSTPFPL
LMGPPGPKGD CGLPGPPGLP GLPGIPGARG PRGPPGPYGN PGLPGPPGAK GQKGDPGLSP
GKAHDGAKGD MGLPGLSGNP GPPGRKGHKG YPGPAGHPGE QGQPGPEGSP GAKGYPGRQG
LPGPVGDPGP KGSRGYIGLP GLFGLPGSDG ERGLPGVPGK RGKMGMPGFP GVFGERGPPG
LDGNPGELGL PGPPGVPGLI GDLGVLGPIG YPGPKGMKGE QGVPGVSGDP GFQGDKGSQG
LPGFPGARGK PGPLGKVGDK GSIGFPGPPG PEGFPGDIGP PGDNGPEGMK GKPGARGLPG
PRGQLGPEGD EGPMGPPGAP GLEGQPGRKG FPGRPGLDGV KGEPGDPGRP GPVGEQGFMG
FIGLVGEPGI VGEKGDRGMM GPPGVPGPKG SMGHPGMPGG MGTPGEPGPQ GPPGSRGPPG
MRGAKGRRGP RGPDGPAGEQ GSRGLKGPPG PQGRPGQPGQ QGVAGERGHL GSRGFPGIPG
PSGPPGTKGL PGEPGPQGPQ GPIGPPGEMG PKGPPGAVGE PGLPGEAGMK GDLGPLGTPG
EQGLIGQRGE PGLEGDSGPM GPDGLKGDRG DPGPDGEHGE KGQEGLMGED GPPGPPGVTG
VRGPEGKSGK QGEKGRTGAK GAPGRMGAQG EPGLAGYDGH KGIVGPLGPP GPKGEKGEQG
EDGKAEGPPG PPGDRGPVGD RGDRGEPGDP GYPGQEGVQG LRGKPGQQGQ PGHPGPRGRP
GPKGSKGAEG PKGKQGKAGA PGRRGIQGLQ GLPGPRGVVG RQGLEGIPGP DGLPGRDGQA
GQQGEQGDDG DPGPMGPAGK RGNPGVAGLP GAQGPPGFKG ESGLPGQLGP PGKRGTEGRT
GLPGNQGEPG SKGQPGDSGE MGFPGMAGLF GPKGPPGDIG FKGIQGPRGP PGLMGKEGIV
GPLGILGPSG LPGPKGDKGS RGDWGLQGPR GPPGPRGQPG PPGPPGGPIQ LQQDDLGAAF
QTWMDTSGAL RPEGYSYPDR LVLDQGGEIF KTLHYLSNLI QSIKMPLGTK ENPARVCRDL
MDCEQKMVDG TYWVDPNLGC SSDTIEVSCN FTHGGQTCLK PITASKVEFA ISRVQMNFLH
LLSSEVTQHI TIHCLNMTVW QEGTGQTPAK QAVRFRAWNG QIFEAGGQFR PEVSMDGCKV
QDGHWHQTLF TFRTQDPQQL PIIGVDNLPP ASSGKQYRLE VGPACFL
//