ID A0A0Q3QKK8_AMAAE Unreviewed; 389 AA.
AC A0A0Q3QKK8;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 22-FEB-2023, entry version 27.
DE SubName: Full=Iroquois-class homeodomain protein IRX-4 {ECO:0000313|EMBL:KQK73633.1};
GN ORFNames=AAES_257742 {ECO:0000313|EMBL:KQK73633.1};
OS Amazona aestiva (Blue-fronted Amazon parrot).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Psittaciformes; Psittacidae; Amazona.
OX NCBI_TaxID=12930 {ECO:0000313|EMBL:KQK73633.1, ECO:0000313|Proteomes:UP000051836};
RN [1] {ECO:0000313|EMBL:KQK73633.1, ECO:0000313|Proteomes:UP000051836}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FVVF132 {ECO:0000313|EMBL:KQK73633.1};
RA Gilbert D.G.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC {ECO:0000256|ARBA:ARBA00008446}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KQK73633.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMAW01003177; KQK73633.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0Q3QKK8; -.
DR STRING; 12930.A0A0Q3QKK8; -.
DR Proteomes; UP000051836; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR PANTHER; PTHR11211:SF16; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX-4; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000051836}.
FT DOMAIN 111..174
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 113..175
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 86..124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 175..315
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 175..191
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..225
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..275
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 285..310
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 389 AA; 42546 MW; 9F31DF59C69411C3 CRC64;
MSTNSLTTCC ESSGRTLAES GAAAPAQAPV YCPVYESRLL ATARHELSSA AALGVYGSPY
AGPQGYGNYV TYGTEAPAFY SLVSRPGGAA DRQSSSPDPR FPGSGGRYGT MDGGTRRKNA
TRETTSTLKA WLQEHRKNPY PTKGEKIMLA IITKMTLTQV STWFANARRR LKKENKMTWP
PRNKCSDEKR PYEEEEEEEE EGSQEEAMKS GKAEEPTGKE EKELELSDLE DLDAAESESS
ECDLRRPFPH PLPHPLPGGS HPPRAAEPPA KMPPAPAASR EEEEEEEAAA GRARSCLKRA
AEERGPDPLG ARQRSCESKM CFQQGQQLLE AKPRIWSLAH TATSLNQAEH STLNQALNNT
TGQLSNLAHH DSNKEFLAFP KSGSKMFCS
//