ID G3QE20_GORGO Unreviewed; 1497 AA.
AC G3QE20;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Collagen type XVII alpha 1 chain {ECO:0000313|Ensembl:ENSGGOP00000000505.2};
GN Name=COL17A1 {ECO:0000313|Ensembl:ENSGGOP00000000505.2};
OS Gorilla gorilla gorilla (Western lowland gorilla).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Gorilla.
OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000000505.2, ECO:0000313|Proteomes:UP000001519};
RN [1] {ECO:0000313|Ensembl:ENSGGOP00000000505.2, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Scally A.;
RT "Insights into the evolution of the great apes provided by the gorilla
RT genome.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGGOP00000000505.2, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22398555; DOI=10.1038/nature10842;
RA Scally A., Dutheil J.Y., Hillier L.W., Jordan G.E., Goodhead I.,
RA Herrero J., Hobolth A., Lappalainen T., Mailund T., Marques-Bonet T.,
RA McCarthy S., Montgomery S.H., Schwalie P.C., Tang Y.A., Ward M.C., Xue Y.,
RA Yngvadottir B., Alkan C., Andersen L.N., Ayub Q., Ball E.V., Beal K.,
RA Bradley B.J., Chen Y., Clee C.M., Fitzgerald S., Graves T.A., Gu Y.,
RA Heath P., Heger A., Karakoc E., Kolb-Kokocinski A., Laird G.K., Lunter G.,
RA Meader S., Mort M., Mullikin J.C., Munch K., O'Connor T.D., Phillips A.D.,
RA Prado-Martinez J., Rogers A.S., Sajjadian S., Schmidt D., Shaw K.,
RA Simpson J.T., Stenson P.D., Turner D.J., Vigilant L., Vilella A.J.,
RA Whitener W., Zhu B., Cooper D.N., de Jong P., Dermitzakis E.T.,
RA Eichler E.E., Flicek P., Goldman N., Mundy N.I., Ning Z., Odom D.T.,
RA Ponting C.P., Quail M.A., Ryder O.A., Searle S.M., Warren W.C.,
RA Wilson R.K., Schierup M.H., Rogers J., Tyler-Smith C., Durbin R.;
RT "Insights into hominid evolution from the gorilla genome sequence.";
RL Nature 483:169-175(2012).
RN [3] {ECO:0000313|Ensembl:ENSGGOP00000000505.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABD030075914; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030075915; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030075916; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030075917; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030075918; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_018890943.1; XM_019035398.1.
DR STRING; 9593.ENSGGOP00000000505; -.
DR Ensembl; ENSGGOT00000000517.3; ENSGGOP00000000505.2; ENSGGOG00000000510.3.
DR GeneID; 101145485; -.
DR KEGG; ggo:101145485; -.
DR CTD; 1308; -.
DR GeneTree; ENSGT00940000161242; -.
DR HOGENOM; CLU_004285_0_0_1; -.
DR InParanoid; G3QE20; -.
DR OMA; YRQTQSP; -.
DR OrthoDB; 5362506at2759; -.
DR Proteomes; UP000001519; Chromosome 10.
DR Bgee; ENSGGOG00000000510; Expressed in testis.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0030056; C:hemidesmosome; IBA:GO_Central.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR GO; GO:0008201; F:heparin binding; IBA:GO_Central.
DR GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR GO; GO:0031581; P:hemidesmosome assembly; IEA:Ensembl.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 3.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000001519};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 464..488
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REGION 1..154
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 167..186
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 562..1009
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1209..1234
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1261..1316
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1434..1497
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..99
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 108..135
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 169..186
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 819..842
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 857..896
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 904..921
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 933..949
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1213..1229
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1280..1316
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1454..1468
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1469..1488
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1497 AA; 150410 MW; 68E0E97C6CC5C418 CRC64;
MDVTKKNKRD GTEVTERIVT ETVTTRLTSL PPKGGTSNGY AKTASLGGGS RLEKQSLTHG
SSGYINSTGS TRGHASTSSY RRAHSPASTL PNSPGSTFER KIHVTRHTYE GSSSGNSSPE
YPRKEFASSS TRGRSQTRES EIRVRLQSAS PSTRWTELDD VKRLLKGSRS ASVSPTRNSS
NTLPIPKKGT VETKIVTASS QSVSGTYDAT ILDANLPSHV WSSTLPAGSS MGTYHNNMTT
QSSSLLNTNA YSAGSVFGVP NNMASCSPTL HPGLSTSSSV FGMQNNLAPS LTTLSHGTTT
TSTAYGVKKN MPQSPAAVNT GVSTSAACTT SVQSDDLLHK DCKFLILEKD NTPAKKEMEL
LIMTKDSGKV FTASPASITA TSFSEDTLKK EKQAAYNADS GLKAEANGDL KTVSTKGKTT
TADIHSYGSG GGGGSGGDGS VGGAGGGPWG AAPAWCPCGS CCSWWKWLLG LLLTWLLLLG
LLFGLIALAE EVRKLKARVD ELERIRRSML PYGDSMDRTE KDRLQGMAPA AGADLDKIGL
HSDSQEELWM FVRKKLMMEQ ENGNLRGSPG PKGDMGSPGP KGDRGFPGTP GIPGPLGHPG
PEGPKGQKGS VGDPGMEGPM GQRGREGPMG PRGEPGPPGS GEKGERGAAG EPGPHGPPGV
PGSVGPKGSS GSPGPQGPPG PVGLQGLRGE VGLPGVKGDK GPMGPPGPKG DQGEKGPRGL
TGEPGMRGLP GAVGEPGAKG ATGPAGPDGH QGPRGEQGLT GMPGIRGPPG PSGDPGKPGL
TGPQGPQGLP GTPGRPGIKG EPGAPGKIVT SEGSSMITVP GPPGPPGAMG PPGPPGAPGP
AGPAGLPGQQ EVLNLQGPPG PPGPRGPPGL SIPGPPGPRG PPGEGLPGPP GPPGSFLPNS
ETFLSGPPGP PGPPGPKGDQ GPPGPRGHQG EQGLPGFSTS GSSSFGLNLQ GPPGPPGPQG
PKGDKGDPGV PGALGIPSGP SEGGSSSTMY VSGPPGPPGP PGPPGSISIS GQEIQQYISE
YMQSDSIRSY LSGVQGPPGP PGPPGPVTTI TGETFDYSEL ASHVVSYLRT SGYGVSLFSS
SISSEDILAV LQRDDVRQYL RQYLMGPRGP PGPPGASGDG SLLSLDYAEL SSRILSYMSS
SGISIGLPGP PGPPGLPGTS YEELLSLLRG SEFRGIVGPP GPPGPPGIPG NVWSSISVED
LSSYLHTAGL SFIPGPPGPP GPPGPRGPPG VSGALATYAA ENSDSFRSEL ISYLTSPDVR
SFIVGPPGPP GPQGPPGDSR LLSTDASHSR GSSSSSHSSS VRRGSSYSSS MSTGGGGAGS
LGAGGAFGAA AGDGGPYGTD IGPGGGYGAA AEGGMYAGNG GLLGADFAGG LDYNELAVRV
SESMQRQGLL QGMAYTVQGP PGRPGPQGPP GISKVFSAYS NVTADLMDFF RTYGAIQGPP
GQKGEMGTPG PKGDRGPAGP PGHPGPPGPR GHKGEKGDKG DQVYAGRRRR RSIAVKP
//