GenomeNet

Database: UniProt
Entry: H2R6B8_PANTR
LinkDB: H2R6B8_PANTR
Original site: H2R6B8_PANTR 
ID   H2R6B8_PANTR            Unreviewed;      1499 AA.
AC   H2R6B8; A0A2J8PRU2;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   20-JUN-2018, entry version 44.
DE   SubName: Full=COL5A2 isoform 1 {ECO:0000313|EMBL:PNI86755.1};
DE   SubName: Full=Collagen type V alpha 2 chain {ECO:0000313|Ensembl:ENSPTRP00000048958};
DE   SubName: Full=Collagen, type V, alpha 2 {ECO:0000313|EMBL:JAA44274.1};
GN   Name=COL5A2 {ECO:0000313|EMBL:JAA44274.1,
GN   ECO:0000313|Ensembl:ENSPTRP00000048958, ECO:0000313|VGNC:VGNC:10725};
GN   ORFNames=CK820_G0001731 {ECO:0000313|EMBL:PNI86755.1};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
OC   Catarrhini; Hominidae; Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000048958, ECO:0000313|Proteomes:UP000002277};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000048958, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the
RT   human genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|Ensembl:ENSPTRP00000048958}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (FEB-2012) to UniProtKB.
RN   [3] {ECO:0000313|EMBL:JAA44274.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Skeletal muscle {ECO:0000313|EMBL:JAA44274.1};
RA   Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.;
RT   "De novo assembly of the reference chimpanzee transcriptome from
RT   NextGen mRNA sequences.";
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [4] {ECO:0000313|EMBL:PNI86755.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Yerkes chimp pedigree #C0471 {ECO:0000313|EMBL:PNI86755.1};
RC   TISSUE=Blood {ECO:0000313|EMBL:PNI86755.1};
RA   Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R.,
RA   Chaisson M., Hoppe E., Hill C., Pang A., Hillier L., Baker C.,
RA   Armstrong J., Shendure J., Paten B., Wilson R., Chao H., Schneider V.,
RA   Ventura M., Kronenberg Z., Murali S., Gordon D., Cantsilieris S.,
RA   Munson K., Nelson B., Raja A., Underwood J., Diekhans M., Fiddes I.,
RA   Haussler D., Eichler E.;
RT   "High-resolution comparative analysis of great ape genomes.";
RL   Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AACZ04058618; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; GABE01000465; JAA44274.1; -; mRNA.
DR   EMBL; NBAG03000211; PNI86755.1; -; Genomic_DNA.
DR   RefSeq; XP_001164152.1; XM_001164152.3.
DR   STRING; 9598.ENSPTRP00000048958; -.
DR   Ensembl; ENSPTRT00000041307; ENSPTRP00000048958; ENSPTRG00000006601.
DR   GeneID; 459817; -.
DR   KEGG; ptr:459817; -.
DR   CTD; 1290; -.
DR   VGNC; VGNC:10725; COL5A2.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; ENOG4110XTV; LUCA.
DR   GeneTree; ENSGT00900000140789; -.
DR   KO; K19721; -.
DR   OMA; IGIRGQP; -.
DR   OrthoDB; EOG091G03LV; -.
DR   TreeFam; TF344135; -.
DR   Proteomes; UP000002277; Chromosome 2B.
DR   Bgee; ENSPTRG00000006601; -.
DR   GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0046332; F:SMAD binding; IEA:Ensembl.
DR   GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR   GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR   GO; GO:0048592; P:eye morphogenesis; IEA:Ensembl.
DR   GO; GO:1903225; P:negative regulation of endodermal cell differentiation; IEA:Ensembl.
DR   GO; GO:0001501; P:skeletal system development; IEA:Ensembl.
DR   GO; GO:0043588; P:skin development; IEA:Ensembl.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF00093; VWC; 1.
DR   ProDom; PD002078; Fib_collagen_C; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   2: Evidence at transcript level;
KW   Collagen {ECO:0000313|EMBL:JAA44274.1};
KW   Complete proteome {ECO:0000313|Proteomes:UP000002277};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     26       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        27   1499       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5015093353.
FT   DOMAIN       39     97       VWFC. {ECO:0000259|PROSITE:PS50184}.
FT   DOMAIN     1266   1499       Fibrillar collagen NC1.
FT                                {ECO:0000259|PROSITE:PS51461}.
SQ   SEQUENCE   1499 AA;  144898 MW;  57D5A79B394DB747 CRC64;
     MMANWAEARP LLILIVLLGQ FVSIKAQEED EDEGYGEEIA CTQNGQMYLN RDIWKPAPCQ
     ICVCDNGAIL CDKIECQDVL DCADPVTPPG ECCPVCSQTP GGGNTNFGRG RKGQKGEPGL
     VPVVTGIRGR PGPAGPPGSQ GPRGERGPKG RPGPRGPQGI DGEPGVPGQP GAPGPPGHPS
     HPGPDGLSRP FSAQMAGLDE KSGLGSQVGL MPGSVGPVGP RGPQGLQGQQ GGAGPTGPPG
     EPGDPGPMGP IGSRGPEGPA GKPGEDGEPG RNGNPGEVGF AGSPGARGFP GAPGLPGLKG
     HRGHKGLEGP KGEVGAPGSK GEAGPTGPMG AMGPLGPRGM PGERGRLGPQ GAPGQRGAHG
     MPGKPGPMGP LGIPGSSGFP GNPGMKGEAG PTGARGPEGP QGQRGETGPP GPVGSPGLPG
     AIGTDGTPGA KGPTGSPGTS GPPGSAGPPG SPGPQGSTGP QGIRGQPGDP GVPGFKGEAG
     PKGEPGPHGI QGPIGPPGEE GKRGPRGDPG TVGPPGPVGE RGAPGNRGFP GSDGLPGPKG
     AQGERGPVGS SGPKGSQGDP GRPGEPGLPG ARGLTGNPGV QGPEGKLGPL GAPGEDGRPG
     PPGSIGIRGQ PGSMGLPGPK GSSGDPGKPG EAGNAGVPGQ RGAPGKDGEV GPSGPVGPPG
     LAGERGEQGP PGPTGFQGLP GPPGPPGEGG KPGDQGVPGD PGAVGPLGPR GERGNPGERG
     EPGITGLPGE KGMAGGHGPD GPKGSPGPSG TPGDTGPPGL QGMPGERGIA GTPGPKGDRG
     GIGEKGAEGT AGNDGARGLP GPLGPPGPAG PTGEKGEPGP RGLVGPPGSR GNPGSRGENG
     PTGAVGFAGP QGPDGQPGVK GEPGEPGQKG DAGSPGPQGL AGSPGPHGPN GVPGLKGGRG
     TQGPPGATGF PGSAGRVGPP GPAGAPGPAG PLGEPGKEGP PGLRGDPGSH GRVGDRGPAG
     PPGGPGDKGD PGEDGQPGPD GPPGPAGTTG QRGIVGMPGQ RGERGMPGLP GPAGTPGKVG
     PTGATGDKGP PGPVGPPGSN GPVGEPGPEG PAGNDGTPGR DGAVGERGDR GDPGPAGLPG
     SQGAPGTPGP VGAPGDAGQR GDPGSRGPIG PPGRAGKRGL PGPQGPRGDK GDHGDRGDRG
     QKGHRGFTGL QGLPGPPGPN GEQGSAGIPG PFGPRGPPGP VGPSGKEGNP GPLGPIGPPG
     VRGSIGEAGP EGPPGEPGPP GPPGPPGHLT AALGDIMGHY DESMPDPLPE FTEDQAAPDD
     KNKTDPGVHA TLKSLSSQIE TMRSPDGSKK HPARTCDDLK LCHSAKQSGE YWIDPNQGSV
     EDAIKVYCNM ETGETCISAN PSSVPRKTWW ASKSPDNKPV WYGLDMNRGS QFAYGDHQSP
     NTAITQMTFL RLLSKEASQN ITYICKNSVG YMDDQAKNLK KAVVLKGAND LDIKAEGNIR
     FRYIVLQDTC SKRNGNVGKT VFEYRTQNVA RLPIIDLAPV DVGGTDQEFG VEIGPVCFV
//
DBGET integrated database retrieval system