GenomeNet

Database: UniProt
Entry: G1S4F3_NOMLE
LinkDB: G1S4F3_NOMLE
Original site: G1S4F3_NOMLE 
ID   G1S4F3_NOMLE            Unreviewed;      1787 AA.
AC   G1S4F3;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 2.
DT   27-MAR-2024, entry version 75.
DE   SubName: Full=Collagen type XXVII alpha 1 chain {ECO:0000313|Ensembl:ENSNLEP00000020391.2};
GN   Name=COL27A1 {ECO:0000313|Ensembl:ENSNLEP00000020391.2};
OS   Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC   Nomascus.
OX   NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000020391.2, ECO:0000313|Proteomes:UP000001073};
RN   [1] {ECO:0000313|Ensembl:ENSNLEP00000020391.2, ECO:0000313|Proteomes:UP000001073}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Gibbon Genome Sequencing Consortium;
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSNLEP00000020391.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADFV01019771; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019772; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019773; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019774; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019775; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019776; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019777; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019778; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019779; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01019780; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 61853.ENSNLEP00000020391; -.
DR   Ensembl; ENSNLET00000021407.2; ENSNLEP00000020391.2; ENSNLEG00000016786.3.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000163466; -.
DR   HOGENOM; CLU_001074_19_1_1; -.
DR   InParanoid; G1S4F3; -.
DR   OMA; HQHIAVG; -.
DR   TreeFam; TF344135; -.
DR   Proteomes; UP000001073; Chromosome 8.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR   GO; GO:0005583; C:fibrillar collagen trimer; IEA:Ensembl.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:Ensembl.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:Ensembl.
DR   GO; GO:0003431; P:growth plate cartilage chondrocyte development; IEA:Ensembl.
DR   Gene3D; 2.60.120.1000; -; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR048287; TSPN-like_N.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF844; COLLAGEN ALPHA-1(XXVII) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 2.
DR   Pfam; PF01391; Collagen; 7.
DR   PRINTS; PR01217; PRICHEXTENSN.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..1787
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5014182799"
FT   DOMAIN          1587..1787
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          256..587
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          601..754
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          828..1552
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        274..288
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        325..383
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        406..422
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        440..500
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        550..587
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        601..644
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1078..1112
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1168..1185
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1532..1548
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1787 AA;  180005 MW;  7585D0996D69B34F CRC64;
     GFLFSWILVS FACHLASTQG APEDVDVLQR LGLSWTKAGS PAPLGVIPFQ SGFIFTQRAR
     LQAPTGTIIP AALGTELALV LSLCSHRVNH AFLFAVRSRK HKLQLGLQFL PGKTVVHLGS
     RRSVAFDLDM HDGRWHHLAL ELRGRTVTLV TACGQRRVPV LLPFHRDPAL DPGGSFLFGK
     MNPHAVQFEG ALCQFSIYPV TQVAHNYCTH LRKQCGQADT YQSPLGPLFS QDSGRPFTFQ
     SDLALLGLEN LTTATPALGS LPARGPRGTV VPATPTKPQR TSPTNPHQHI AVGGPAQTPL
     LPAKLSASNS LDPVLPASVG GSTRTPHPAA AQPSQKITAT KIPKSLPTKP SAPSTSIVPV
     KSPRSTQKTA PSSFTKSAPP TQKQVPPTSR PVPAKVSHPA EKPIQRNPGM PRPPPPSTQP
     LPPTTSSSKK PIPTLALTEA KITSHASKPA SANTSTHKPP PFTALSSSPA PTPGSTRTTR
     PPATMVPPTS GTSTPRTAPA VPTPGSAPTG SKKPTGSEAS KKAGPKSSSR KPVPLRPGKA
     ARDVPLSDLT TRPSPRQSQP SQQTTPALVL APAQFLSSSP QPTSSGYSFF HLAGSTPFPL
     LMGPPGPKGD CGLPGPPGLP GLPGIPGARG PRGPPGPYGN PGLPGPPGAK GQKGDPGLSP
     GKAHDGAKGD MGLPGLSGNP GPPGRKGHKG YPGPAGHPGE QGQPGPEGSP GAKGYPGRQG
     LPGPVGDPGP KGSRGYIGLP GLFGLPGSDG ERGLPGVPGK RGKMGMPGFP GVFGERGPPG
     LDGNPGELGL PGPPGVPGLI GDLGVLGPIG YPGPKGMKGE QGVPGVSGDP GFQGDKGSQG
     LPGFPGARGK PGPLGKVGDK GSIGFPGPPG PEGFPGDIGP PGDNGPEGMK GKPGARGLPG
     PRGQLGPEGD EGPMGPPGAP GLEGQPGRKG FPGRPGLDGV KGEPGDPGRP GPVGEQGFMG
     FIGLVGEPGI VGEKGDRGMM GPPGVPGPKG SMGHPGMPGG MGTPGEPGPQ GPPGSRGPPG
     MRGAKGRRGP RGPDGPAGEQ GSRGLKGPPG PQGRPGQPGQ QGVAGERGHL GSRGFPGIPG
     PSGPPGTKGL PGEPGPQGPQ GPIGPPGEMG PKGPPGAVGE PGLPGEAGMK GDLGPLGTPG
     EQGLIGQRGE PGLEGDSGPM GPDGLKGDRG DPGPDGEHGE KGQEGLMGED GPPGPPGVTG
     VRGPEGKSGK QGEKGRTGAK GAPGRMGAQG EPGLAGYDGH KGIVGPLGPP GPKGEKGEQG
     EDGKAEGPPG PPGDRGPVGD RGDRGEPGDP GYPGQEGVQG LRGKPGQQGQ PGHPGPRGRP
     GPKGSKGAEG PKGKQGKAGA PGRRGIQGLQ GLPGPRGVVG RQGLEGIPGP DGLPGRDGQA
     GQQGEQGDDG DPGPMGPAGK RGNPGVAGLP GAQGPPGFKG ESGLPGQLGP PGKRGTEGRT
     GLPGNQGEPG SKGQPGDSGE MGFPGMAGLF GPKGPPGDIG FKGIQGPRGP PGLMGKEGIV
     GPLGILGPSG LPGPKGDKGS RGDWGLQGPR GPPGPRGQPG PPGPPGGPIQ LQQDDLGAAF
     QTWMDTSGAL RPEGYSYPDR LVLDQGGEIF KTLHYLSNLI QSIKMPLGTK ENPARVCRDL
     MDCEQKMVDG TYWVDPNLGC SSDTIEVSCN FTHGGQTCLK PITASKVEFA ISRVQMNFLH
     LLSSEVTQHI TIHCLNMTVW QEGTGQTPAK QAVRFRAWNG QIFEAGGQFR PEVSMDGCKV
     QDGHWHQTLF TFRTQDPQQL PIIGVDNLPP ASSGKQYRLE VGPACFL
//
DBGET integrated database retrieval system