ID G1U5C8_RABIT Unreviewed; 333 AA.
AC G1U5C8;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 2.
DT 24-JAN-2024, entry version 59.
DE SubName: Full=Collagen type XXV alpha 1 chain {ECO:0000313|Ensembl:ENSOCUP00000024607.2};
GN Name=COL25A1 {ECO:0000313|Ensembl:ENSOCUP00000024607.2};
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; Oryctolagus.
OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000024607.2, ECO:0000313|Proteomes:UP000001811};
RN [1] {ECO:0000313|Ensembl:ENSOCUP00000024607.2, ECO:0000313|Proteomes:UP000001811}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thorbecke inbred {ECO:0000313|Ensembl:ENSOCUP00000024607.2,
RC ECO:0000313|Proteomes:UP000001811};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSOCUP00000024607.2}
RP IDENTIFICATION.
RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000024607.2};
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAGW02021946; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021947; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021948; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021949; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021950; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021951; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021952; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021953; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021954; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02021955; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G1U5C8; -.
DR Ensembl; ENSOCUT00000028889.3; ENSOCUP00000024607.2; ENSOCUG00000000201.4.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000159823; -.
DR TreeFam; TF338175; -.
DR Proteomes; UP000001811; Chromosome 15.
DR Bgee; ENSOCUG00000000201; Expressed in liver and 11 other cell types or tissues.
DR ExpressionAtlas; G1U5C8; baseline.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001811}.
FT REGION 1..192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 210..261
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..64
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 333 AA; 33841 MW; 7C0CF3F60BF6C37B CRC64;
MGPLGPPGQK GSVGAPGIPG TNGQKGEPGL PGAVGQNGIP GPKGEPGEQG EKGDAGENGP
KGDTGEKGDP GSSAAGIKGE PGESGRPGQK GEPGLPGLPG LPGIKGEPGF IGPQGEPGLP
GLPGTKGERG EAGPPGRGER GEPGAPGPKG KQGESGTRGP KGSKGDHGDK GDSGALGPRG
PPGPKGDQGA TEIIDYNGNL HEALQRITTL TVTGPPGPPG PQGLQGPKGE QGSPGIPGVD
GEQGLKGSKG DMGDPGCQWN ERRKRRLRIA WSSGSFYHRP TRPTRSPWPT RPHGAPWTSW
TEGSIWLRRK ARIPGYRWSC GTPWPCWSQR RKR
//