GenomeNet

Database: UniProt
Entry: W5KPT4_ASTMX
LinkDB: W5KPT4_ASTMX
Original site: W5KPT4_ASTMX 
ID   W5KPT4_ASTMX            Unreviewed;       434 AA.
AC   W5KPT4;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   27-MAR-2024, entry version 48.
DE   SubName: Full=Collagen, type XIX, alpha 1 {ECO:0000313|Ensembl:ENSAMXP00000009596.2};
OS   Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC   Characoidei; Characidae; Astyanax.
OX   NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000009596.2, ECO:0000313|Proteomes:UP000018467};
RN   [1] {ECO:0000313|Proteomes:UP000018467}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA   Jeffery W., Warren W., Wilson R.K.;
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000018467}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX   PubMed=25329095; DOI=10.1038/ncomms6307;
RA   McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA   Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA   Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA   Yoshizawa M., Warren W.C.;
RT   "The cavefish genome reveals candidate genes for eye loss.";
RL   Nat. Commun. 5:5307-5307(2014).
RN   [3] {ECO:0000313|Ensembl:ENSAMXP00000009596.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; W5KPT4; -.
DR   STRING; 7994.ENSAMXP00000009596; -.
DR   Ensembl; ENSAMXT00000009596.2; ENSAMXP00000009596.2; ENSAMXG00000009334.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000158302; -.
DR   HOGENOM; CLU_030770_0_0_1; -.
DR   InParanoid; W5KPT4; -.
DR   Proteomes; UP000018467; Unassembled WGS sequence.
DR   Bgee; ENSAMXG00000009334; Expressed in muscle tissue and 5 other cell types or tissues.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR37456:SF5; -; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 3.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000018467}.
FT   REGION          1..144
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          163..320
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          357..434
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        124..139
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        369..384
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        403..420
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   434 AA;  42219 MW;  5AAE0A3CBC37C3E3 CRC64;
     GRRGPPGAPG IPGPPGEKGV TGVTGEGGAK GEKGDAPGNE GAAGIKGEKG DPGGAPGPQG
     LPGPKGEPGD TAPGPRGSRG PKGERGVPGL PGEPGEKGDP GVGITGPPGI MGPKGEPGIE
     GPPGRDGRPG PPGPPGEPTA LPIVGDMGTL LKNACSVCQT RVPGLPGQKG EKGAFGPPGL
     EGMQGEKGDQ GLRGPIGNPG KEGPKGVKGE RGFPGPIGDK GDEGPIGRPG PSGPAGPKGE
     RGAPYVEGNG MSSLYKLQGE PGPMGLPGLE GLPGAKGKTG PRGESGTPGD PGRPGKDGIP
     GYEGARGRPG DRGSKGERSD AATVEEIKMF IRNEVLRVFE KLSSYASQQK TPAAILSAHG
     RQGPSGPPGT DGSPGPPGEP GPPGPQYRGQ KGERGQLGLG IPGEPGPPGP PPPGIGLYGP
     PGPQGPHGRC NPSD
//
DBGET integrated database retrieval system