GenomeNet

Database: UniProt
Entry: H9GLU4_ANOCA
ID   H9GLU4_ANOCA            Unreviewed;      1501 AA.
AC   H9GLU4;
DT   16-MAY-2012, integrated into UniProtKB/TrEMBL.
DT   26-JUN-2013, sequence version 2.
DT   10-OCT-2018, entry version 37.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000014773};
GN   Name=COL5A2 {ECO:0000313|Ensembl:ENSACAP00000014773};
OS   Anolis carolinensis (Green anole) (American chameleon).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata;
OC   Toxicofera; Iguania; Dactyloidae; Anolis.
OX   NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000014773, ECO:0000313|Proteomes:UP000001646};
RN   [1] {ECO:0000313|Ensembl:ENSACAP00000014773}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000014773};
RG   The Genome Sequencing Platform;
RA   Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA   Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL   Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000001646}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JBL SC #1 {ECO:0000313|Proteomes:UP000001646};
RX   PubMed=21881562; DOI=10.1038/nature10390;
RA   Alfoldi J., Di Palma F., Grabherr M., Williams C., Kong L.,
RA   Mauceli E., Russell P., Lowe C.B., Glor R.E., Jaffe J.D., Ray D.A.,
RA   Boissinot S., Shedlock A.M., Botka C., Castoe T.A., Colbourne J.K.,
RA   Fujita M.K., Moreno R.G., Ten Hallers B.F., Haussler D., Heger A.,
RA   Heiman D., Janes D.E., Johnson J., de Jong P.J., Koriabine M.Y.,
RA   Lara M., Novick P.A., Organ C.L., Peach S.E., Poe S., Pollock D.D.,
RA   de Queiroz K., Sanger T., Searle S., Smith J.D., Smith Z.,
RA   Swofford R., Turner-Maier J., Wade J., Young S., Zadissa A.,
RA   Edwards S.V., Glenn T.C., Schneider C.J., Losos J.B., Lander E.S.,
RA   Breen M., Ponting C.P., Lindblad-Toh K.;
RT   "The genome of the green anole lizard and a comparative analysis with
RT   birds and mammals.";
RL   Nature 477:587-591(2011).
RN   [3] {ECO:0000313|Ensembl:ENSACAP00000014773}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (MAR-2012) to UniProtKB.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   STRING; 28377.ENSACAP00000014773; -.
DR   Ensembl; ENSACAT00000015074; ENSACAP00000014773; ENSACAG00000014738.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; ENOG410XNMM; LUCA.
DR   GeneTree; ENSGT00900000140789; -.
DR   InParanoid; H9GLU4; -.
DR   OrthoDB; EOG091G03LV; -.
DR   TreeFam; TF344135; -.
DR   Proteomes; UP000001646; Unplaced.
DR   Bgee; ENSACAG00000014738; Expressed in 4 organ(s), highest expression level in heart.
DR   GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0046332; F:SMAD binding; IEA:Ensembl.
DR   GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR   GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR   GO; GO:0048592; P:eye morphogenesis; IEA:Ensembl.
DR   GO; GO:1903225; P:negative regulation of endodermal cell differentiation; IEA:Ensembl.
DR   GO; GO:0001501; P:skeletal system development; IEA:Ensembl.
DR   GO; GO:0043588; P:skin development; IEA:Ensembl.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF00093; VWC; 1.
DR   ProDom; PD002078; Fib_collagen_C; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000001646};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     26       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        27   1501       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5003619492.
FT   DOMAIN       41     99       VWFC. {ECO:0000259|PROSITE:PS50184}.
FT   DOMAIN     1268   1501       Fibrillar collagen NC1.
FT                                {ECO:0000259|PROSITE:PS51461}.
SQ   SEQUENCE   1501 AA;  145137 MW;  ADC9EC4A8145942D CRC64;
     MMASWTQRRK LVLLIAFLGH LGTIRTQEEE EFEGELDAEE VACTQNGQVY LNRDIWKPAP
     CQICVCDNGA ILCDEIQCLD VLECESPQVP PGECCPVCPN TARNGFEGAI GRGRKGQKGE
     PGIVPVVTGI RGRPGPSGPP GSQGPRGIRG PKGRPGARGP PGLDGEPGIP GQPGDPGPPG
     QPSTGPDGVG RPFTSQMAGL DEKSGLASQM GFMPGAVGPV GPRGPPGAQG LTGGRGPPGP
     PGEHGDPGPM GPAGVRGPEG PPGKPGEDGE AGRSGQPGEV GFPGSPGARG FPGAPGLPGL
     KGHRGHKGLE GPKGEIGATG SKGEPGPPGQ MGLTGPMGPR GMAGERGRIG PQGAPGGRGS
     HGMPGKPGPM GPLGIPGSAG FPGVPGMKGE AGPTGARGPE GPQGPRGETG QPGPAGLSGL
     PGAPGTDGSM GAKGPTGSPG TSGPHGSPGP LGSTGPQGST GPPGIRGQMG DPGVPGFKGE
     AGPKGEPGPH GPQGPIGPVG EEGKRGPRGD PGSVGPPGPL GERGPPGNRG FPGSDGLPGP
     KGAQGERGLA GASGPKGSQG DPGRTGEPGL PGARGLTGNP GVSGPEGKSG PLGAPGEDGR
     PGPAGPIGIR GQPGSMGLPG PKGISGDPGK AGEAGNAGVP GQRGAPGKDG EVGPSGPVGP
     PGPAGERGEQ GPPGPTGFQG LPGPPGPPGE GGKPGDQGVP GDGGAPGPLG PRGERGNPGE
     RGNPGTSGMP GEKGMAGGQG PDGPKGNPGP SGTSGDQGPP GLQGMPGERG IAGTPGPKGD
     RGSIGEKGSE GTAGNDGARG LPGPLGPGGP AGPAGEKGEP GPRGLVGPAG SRGNPGSRGE
     NGPTGPVGFA GPPGPDGQPG VKGEPGEPGQ KGDAGSPGPQ GLSGSHGPPG PSGVPGLKGG
     RGTKGPPGAT GFPGSAGRVG PPGPTGAPGP AGPIGEPGKE GPPGLRGDPG AHGRVGDRGP
     AGPAGSAGDK GDSGEDGQPG PDGPPGPAGT TGQRGIVGMP GQRGERGMPG LPGPAGTPGK
     QGSTGPPGDK GPSGPIGSPG ATGPVGEAGP EGPAGNDGTP GRDGAVGERG DRGEHGPAGL
     PGSSGSPGTP GPVGPTGDPG QRGEPGSRGP VGPPGRAGKR GLPGPQGPRG DKGDHGDRGD
     RGQKGHRGFT GLQGLPGPPG PVGEQGSTGI PGPFGPRGPP GPVGPSGKEG NAGPLGPVGP
     PGGRGTLGEA GPEGPPGDPG PPGPPGPPGH LTAAIGDIMG HYDDGLADAL PEFTDDEAAP
     DDSNKTDPGV HATLKSLSSQ IETMRSPDGS KKHPARTCDD LRQCHPSKKN GEYWIDPNQG
     CVEDAIKVFC NMETGETCIS ANPSIIPRKT WWTSRSSELK PVWYGLDMNR GSQFVYGDGE
     SPNTAVTQMT FLRLLSKEAS QNITYHCKNS IGYMDDQAKN MKKAVILKGA NDLEIKAEGN
     SRFRYTVLHD SCSKRNGNTG STVFEYKTQN VARLPIVDIA PVDIGSADQE FGIEIGPVCF
     V
//