GenomeNet

Database: UniProt
Entry: K7GFE1_PELSI
LinkDB: K7GFE1_PELSI
Original site: K7GFE1_PELSI 
ID   K7GFE1_PELSI            Unreviewed;      1464 AA.
AC   K7GFE1;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   11-DEC-2019, entry version 46.
DE   SubName: Full=Collagen type V alpha 2 chain {ECO:0000313|Ensembl:ENSPSIP00000019002};
GN   Name=COL5A2 {ECO:0000313|Ensembl:ENSPSIP00000019002};
OS   Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudines; Cryptodira; Trionychia; Trionychidae;
OC   Pelodiscus.
OX   NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000019002, ECO:0000313|Proteomes:UP000007267};
RN   [1] {ECO:0000313|Ensembl:ENSPSIP00000019002}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=17381049; DOI=10.1080/10425170600760091;
RA   Jung S.-O., Lee Y.-M., Kartavtsev Y., Park I.-S., Kim D.S., Lee J.-S.;
RT   "The complete mitochondrial genome of the Korean soft-shelled turtle
RT   Pelodiscus sinensis (Testudines, Trionychidae).";
RL   DNA Seq. 17:471-483(2006).
RN   [2] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG   Soft-shell Turtle Genome Consortium;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|Ensembl:ENSPSIP00000019002}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2012) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCU01034887; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01034888; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01034889; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01034890; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01034891; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01034892; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01034893; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 13735.ENSPSIP00000019002; -.
DR   Ensembl; ENSPSIT00000019088; ENSPSIP00000019002; ENSPSIG00000015890.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; ENOG4110XTV; LUCA.
DR   GeneTree; ENSGT00940000155675; -.
DR   OMA; LCDKIEC; -.
DR   TreeFam; TF344135; -.
DR   Proteomes; UP000007267; Unassembled WGS sequence.
DR   GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR   GO; GO:0048592; P:eye morphogenesis; IEA:Ensembl.
DR   GO; GO:1903225; P:negative regulation of endodermal cell differentiation; IEA:Ensembl.
DR   GO; GO:0043588; P:skin development; IEA:Ensembl.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF00093; VWC; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007267}.
FT   DOMAIN          3..61
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          1231..1464
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          71..1234
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        640..654
FT                   /note="Pro-rich"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        710..724
FT                   /note="Pro-rich"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        886..900
FT                   /note="Pro-rich"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1094..1108
FT                   /note="Polyampholyte"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1136..1150
FT                   /note="Pro-rich"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1177..1192
FT                   /note="Pro-rich"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1464 AA;  141366 MW;  21FA4EDCF62D4C31 CRC64;
     EEKACTQNGQ MYLNRDIWKP SPCQICVCDN GAILCDEIQC QDVLECESPQ VPPGECCPVC
     PNNPRVGFDS NIGRGRKGQK GEPGLVPVVT GIRGRSGPAG PPGSQGPRGE RGPKGRPGPR
     GPQGIDGEPG IPGQPGDPGP PGHPTHSGPD GRSAPFAAQM AGLDEKSGLS SQMGFMPGAV
     GPIGPRGPQG LQGQQGGVGP TGPPGEPGEP GSMGPAGARG PEGPPGKPGE DGESGRSGQP
     GETGFPGSPG ARGFPGAPGL PGLKGHRGHK GLEGPKGEVG ATGSKGEAGP TGPLGQTGPL
     GPRGMSGERG RIGPQGAPGG RGSHGMPGKP GPVGPLGIIG SPGFPGNPGV KGEAGPTGAR
     GPEGPQGQRG ETGQPGPAGS QGLPGTIGTD GSPGAKGPTG SPGTSGPPGL AGPLGSPGPQ
     GSTGPPGIRG QVGDPGVPGF KGEAGPKGEP GPHGPQGPIG PVGEEGKRGP RGDPGSVGPQ
     GPVGERGAPG NRGFPGSDGL PGPKGAQGER GPVGSSGPKG SQGDPGRTGE PGLPGARGLT
     GNPGVQGPEG KLGPLGAPGE DGRPGPAGSI GIRGQPGSMG LPGPKGSSGD LGKPGEAGNA
     GVPGQRGAPG KDGEIGPSGP VGPPGLAGER GEQGPPGPTG FQGLPGPPGP PGEGGKPGDQ
     GVPGDPGAGG PLGPRGERGN PGERGEPGSA GLQGEKGMAG GHGPDGPKGS PGPTGTPGDP
     GPPGLQGMPG ERGIAGTPGP KGDRGGVGEK GSEGTAGNDG ARGLPGPIGP TGPAGPTGEK
     GEPGPRGLVG PPGSRGNPGS RGENGPTGAV GFAGPPGPDG QPGVKGEPGE PGQKGDAGSP
     GPQGLSGSPG PPGPHGVPGL KGGRGTQGPP GATGFPGSAG RVGPPGPTGA PGPAGPIGDP
     GKEGPPGLRG DPGAHGRVGD RGPAGPPGGP GDKGDSGEDG QPGPDGPPGP AGTTGQRGIV
     GMPGQRGERG MPGLPGPGGT PGKQGPTGPP GDKGPSGPVG HSGPTGPVGE PGPEGPAGND
     GTPGRDGAVG ERGDRGEPGP VGLPGAQGGP GTPGPVGPTG EAGQRGEPGS RGPMGPPGRA
     GKRGLPGPQG PRGDKGDNGD RGDRGQKGHR GFTGLQGLPG PPGPIGEQGS SGIPGPFGPR
     GPPGPVGPSG KDGNPGPLGP LGPPGVRGSL GEAGPEGPPG DPGPPGPPGP PGHLTAAIGD
     IMGHFDDSMS DPLPEFTEDE AAPDGNNKTD PGVHATLKSL SSQIETMRSP DGSKKHPART
     CDDLKLCHPS KKSGEYWIDP NQGCVEDAVK VYCNMETGET CISANPSGIP RKTWWTSRSP
     DFKPVWYGLD MNRGSQFAYG DSESPNTAIT QMTFLRLLSK EASQNITYHC KNSIGYMDDQ
     SKNLKKAVIL KGANDLEIKA EGNNRFRYTV LQDSCSKRNG NVGKTVFEYR TQNVARLPII
     DIAPVDIGTT DQEFGVEIGP VCFV
//
DBGET integrated database retrieval system