ID M4AUT5_XIPMA Unreviewed; 530 AA.
AC M4AUT5;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Keratin 222 {ECO:0000313|Ensembl:ENSXMAP00000018230.2};
GN Name=KRT222 {ECO:0000313|Ensembl:ENSXMAP00000018230.2};
OS Xiphophorus maculatus (Southern platyfish) (Platypoecilus maculatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Xiphophorus.
OX NCBI_TaxID=8083 {ECO:0000313|Ensembl:ENSXMAP00000018230.2, ECO:0000313|Proteomes:UP000002852};
RN [1] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RA Walter R., Schartl M., Warren W.;
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RX PubMed=23542700; DOI=10.1038/ng.2604;
RA Schartl M., Walter R.B., Shen Y., Garcia T., Catchen J., Amores A.,
RA Braasch I., Chalopin D., Volff J.N., Lesch K.P., Bisazza A., Minx P.,
RA Hillier L., Wilson R.K., Fuerstenberg S., Boore J., Searle S.,
RA Postlethwait J.H., Warren W.C.;
RT "The genome of the platyfish, Xiphophorus maculatus, provides insights into
RT evolutionary adaptation and several complex traits.";
RL Nat. Genet. 45:567-572(2013).
RN [3] {ECO:0000313|Ensembl:ENSXMAP00000018230.2}
RP IDENTIFICATION.
RC STRAIN=JP 163 A {ECO:0000313|Ensembl:ENSXMAP00000018230.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the intermediate filament family.
CC {ECO:0000256|RuleBase:RU000685}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8083.ENSXMAP00000039744; -.
DR Ensembl; ENSXMAT00000018257.2; ENSXMAP00000018230.2; ENSXMAG00000018192.2.
DR Ensembl; ENSXMAT00000024072.1; ENSXMAP00000039744.1; ENSXMAG00000018192.2.
DR eggNOG; ENOG502QQ07; Eukaryota.
DR GeneTree; ENSGT00940000159655; -.
DR HOGENOM; CLU_892921_0_0_1; -.
DR OMA; QIASWGV; -.
DR Proteomes; UP000002852; Unassembled WGS sequence.
DR GO; GO:0005882; C:intermediate filament; IEA:UniProtKB-KW.
DR Gene3D; 1.20.5.170; -; 1.
DR Gene3D; 1.20.5.1160; Vasodilator-stimulated phosphoprotein; 1.
DR InterPro; IPR018039; IF_conserved.
DR InterPro; IPR039008; IF_rod_dom.
DR PANTHER; PTHR47082; KERATIN-LIKE PROTEIN KRT222; 1.
DR PANTHER; PTHR47082:SF1; KERATIN-LIKE PROTEIN KRT222; 1.
DR Pfam; PF00038; Filament; 2.
DR SMART; SM01391; Filament; 1.
DR SUPFAM; SSF64593; Intermediate filament protein, coiled coil region; 1.
DR PROSITE; PS00226; IF_ROD_1; 1.
DR PROSITE; PS51842; IF_ROD_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Intermediate filament {ECO:0000256|ARBA:ARBA00022754,
KW ECO:0000256|RuleBase:RU000685};
KW Reference proteome {ECO:0000313|Proteomes:UP000002852}.
FT DOMAIN 7..353
FT /note="IF rod"
FT /evidence="ECO:0000259|PROSITE:PS51842"
FT REGION 368..405
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 11..45
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 88..143
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 318..352
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 374..388
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 530 AA; 59717 MW; 5CD9DAF76AC16A6C CRC64;
MDLLQDSSHT MWDLNTRLKS FMEQVNRLQE ANHQLEAEIA EWSFRNASRS QNWSKQEQTV
RDLRSQICNL LMENAQLALQ SDNMRSRAAA IQARCETEER TTRRLEQQVS LFREAKREAD
ESNKALQAEC HRSMTQLQQM DQEFMAAQAL QLQQADSCDA LLAAAGREEE DGTAMELTQL
FDQIRAQCDQ SRPAGLAERH RVLGTASDPA AGLVGPSQSG SGAAAASRTH RGAVTEEEAA
WAQVSLGGAA LKEARAELAE ARKQWRSLQV EIETLHALEK GLECSLQHTQ ELYTSQLHDL
SQVIVGLESE LEQVRSGLAT QRQRHSQLLN TKMRLEREIT IYRQLLEREE GRYVSRRAHP
LVLRPWRSPV TEPKENGLEN SFSDSAVTPD EPKSEPLPDI PSLLPADNGL KKSKLYRQQS
LVILTEPEQD KDLPLSTVKT QEILQGNVVR ESAEGHGTIE TEKIDKVIKQ WEGSFFRGNP
KLRKKSVSLR FDLHMAAADE GCGQTKQDSL PDVEVRLIMK RSRSISTITQ
//