ID W5KPT4_ASTMX Unreviewed; 434 AA.
AC W5KPT4;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Collagen, type XIX, alpha 1 {ECO:0000313|Ensembl:ENSAMXP00000009596.2};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000009596.2, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000009596.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; W5KPT4; -.
DR STRING; 7994.ENSAMXP00000009596; -.
DR Ensembl; ENSAMXT00000009596.2; ENSAMXP00000009596.2; ENSAMXG00000009334.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000158302; -.
DR HOGENOM; CLU_030770_0_0_1; -.
DR InParanoid; W5KPT4; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000009334; Expressed in muscle tissue and 5 other cell types or tissues.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR37456:SF5; -; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018467}.
FT REGION 1..144
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 163..320
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 357..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..139
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 369..384
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 403..420
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 434 AA; 42219 MW; 5AAE0A3CBC37C3E3 CRC64;
GRRGPPGAPG IPGPPGEKGV TGVTGEGGAK GEKGDAPGNE GAAGIKGEKG DPGGAPGPQG
LPGPKGEPGD TAPGPRGSRG PKGERGVPGL PGEPGEKGDP GVGITGPPGI MGPKGEPGIE
GPPGRDGRPG PPGPPGEPTA LPIVGDMGTL LKNACSVCQT RVPGLPGQKG EKGAFGPPGL
EGMQGEKGDQ GLRGPIGNPG KEGPKGVKGE RGFPGPIGDK GDEGPIGRPG PSGPAGPKGE
RGAPYVEGNG MSSLYKLQGE PGPMGLPGLE GLPGAKGKTG PRGESGTPGD PGRPGKDGIP
GYEGARGRPG DRGSKGERSD AATVEEIKMF IRNEVLRVFE KLSSYASQQK TPAAILSAHG
RQGPSGPPGT DGSPGPPGEP GPPGPQYRGQ KGERGQLGLG IPGEPGPPGP PPPGIGLYGP
PGPQGPHGRC NPSD
//