ID A0A3P8RL69_AMPPE Unreviewed; 413 AA.
AC A0A3P8RL69;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE SubName: Full=Iroquois homeobox 5 {ECO:0000313|Ensembl:ENSAPEP00000001228.1};
GN Name=IRX5 {ECO:0000313|Ensembl:ENSAPEP00000001228.1};
OS Amphiprion percula (Orange clownfish) (Lutjanus percula).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Pomacentridae; Amphiprion.
OX NCBI_TaxID=161767 {ECO:0000313|Ensembl:ENSAPEP00000001228.1, ECO:0000313|Proteomes:UP000265080};
RN [1] {ECO:0000313|Ensembl:ENSAPEP00000001228.1, ECO:0000313|Proteomes:UP000265080}
RP NUCLEOTIDE SEQUENCE.
RA Lehmann R.;
RT "Finding Nemo's genes: A chromosome-scale reference assembly of the genome
RT of the orange clownfish Amphiprion percula.";
RL Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSAPEP00000001228.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC {ECO:0000256|ARBA:ARBA00008446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P8RL69; -.
DR STRING; 161767.ENSAPEP00000001228; -.
DR Ensembl; ENSAPET00000001259.1; ENSAPEP00000001228.1; ENSAPEG00000000940.1.
DR GeneTree; ENSGT00940000159483; -.
DR OMA; CRLRSQN; -.
DR Proteomes; UP000265080; Chromosome 4.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR PANTHER; PTHR11211:SF17; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX-5; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000265080};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 111..168
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 113..169
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 171..341
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 194..216
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 230..248
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 284..299
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 413 AA; 44385 MW; 4394CB99640DC2DF CRC64;
MAYPQGFLFQ PSVSLALHSA CPSFGSGVIL GPRTEELGRS SSGSAFAPYS GSASSPGFNS
HLPYGGEPRA AATLNSFVSP GYDPSSGISG SLDYHPFGAL GPYPYGDPTY RKNATRDATA
TLKAWLNEHR KNPYPTKGEK IMLAIITKMT LTQVSTWFAN ARRRLKKENK MTWTPRNRSE
DEEDEDNIDL ERNDEDDEPM KPSGDETTEQ KSEAVGRRSS DSCGLMFRDD SGSDTDRGFT
DPDFKDSGDQ RLGLLPGPTS TPGAPPPGAA QGPLRASEPD LTSSKEPCGA TQNLNPTPKP
KLWSLAEIAT SSDKTRGCSD SAPGAGSGPQ QGPVPRTSFP HSPALPRHLY YGAPFLPGYS
GYGPLGPLSG SGPHLNRLQQ TVLQRAEAAV RDCRLRSQNQ LELHELKRGM TNV
//