ID G1Q8X9_MYOLU Unreviewed; 599 AA.
AC G1Q8X9;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 68.
DE RecName: Full=Collagen type IX alpha 3 chain {ECO:0008006|Google:ProtNLM};
OS Myotis lucifugus (Little brown bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000020162.1, ECO:0000313|Proteomes:UP000001074};
RN [1] {ECO:0000313|Ensembl:ENSMLUP00000020162.1, ECO:0000313|Proteomes:UP000001074}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSMLUP00000020162.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAPE02060286; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAPE02060287; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G1Q8X9; -.
DR STRING; 59463.ENSMLUP00000020162; -.
DR Ensembl; ENSMLUT00000003612.2; ENSMLUP00000020162.1; ENSMLUG00000016336.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000161930; -.
DR InParanoid; G1Q8X9; -.
DR OMA; MINEQIA; -.
DR Proteomes; UP000001074; Unassembled WGS sequence.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 8.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001074};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..599
FT /note="Collagen type IX alpha 3 chain"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003418114"
FT REGION 21..205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 224..525
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 548..599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..163
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 176..190
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 304..318
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 554..577
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 599 AA; 56689 MW; 52865343D8652578 CRC64;
MARAPTLALL LLGQLLARTE AQVSTQKVGP RGPPGPQGPP GKPGKDGVDG EAGPPGLPGP
PGPKGAAGKP GKPGEAGLPG LPGVDGLTGR EGPPGSKGAP GERGSLGPPG PPGLGGKGLP
GPPGEAGISG LPGGLGLRGP PGPSGLPGLP GPPGPPGPPG HPGVLPEGAT DLQCPAICPP
GPPGPPGMPG FKGPTGYKGE QGEVGKDGEK VPWWWQLEGR ACPASPALSP QGPRGLRGLP
GPLGPPGDRG PIGFRGPPGI PGAPGKPGDR GERGPEGFRG PKGDLGRPGP KGVPGGAGPV
GEPVSPASPR PVPTPPLDLP QGEAGRNGAP GEKGPSGLPG LPGRAGSKGE KGELGRAGEL
GEAGPLGEPG IPGDAGAPGE RGEAGHRGSA GALGPQGPPG APGIRGFQGW KGSLGDPGLP
GPQGLRGSVG DRGPGGATGP KGDEGIAGSD GLPGDKGELG ASGPVGPKGE PGSRGELGPK
GIQGPNGTSG VEGVPGPPGP VGLQGVQGVP GITGKPGVPG KEASEQRIRE LCGGMVSEQI
AQLAAHLRKP LAPGPIGRPG PAGPPGPPGP PGSIGHPGGR GPPGYRGPTG ELGDPGPRG
//