ID A0A158N979_ATTCE Unreviewed; 670 AA.
AC A0A158N979;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN Name=105617120 {ECO:0000313|EnsemblMetazoa:XP_012054087.1};
OS Atta cephalotes (Leafcutter ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=12957 {ECO:0000313|EnsemblMetazoa:XP_012054087.1, ECO:0000313|Proteomes:UP000005205};
RN [1] {ECO:0000313|Proteomes:UP000005205}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21347285; DOI=10.1371/journal.pgen.1002007;
RA Suen G., Teiling C., Li L., Holt C., Abouheif E., Bornberg-Bauer E.,
RA Bouffard P., Caldera E.J., Cash E., Cavanaugh A., Denas O., Elhaik E.,
RA Fave M.J., Gadau J., Gibson J.D., Graur D., Grubbs K.J., Hagen D.E.,
RA Harkins T.T., Helmkampf M., Hu H., Johnson B.R., Kim J., Marsh S.E.,
RA Moeller J.A., Munoz-Torres M.C., Murphy M.C., Naughton M.C., Nigam S.,
RA Overson R., Rajakumar R., Reese J.T., Scott J.J., Smith C.R., Tao S.,
RA Tsutsui N.D., Viljakainen L., Wissler L., Yandell M.D., Zimmer F.,
RA Taylor J., Slater S.C., Clifton S.W., Warren W.C., Elsik C.G., Smith C.D.,
RA Weinstock G.M., Gerardo N.M., Currie C.R.;
RT "The genome sequence of the leaf-cutter ant Atta cephalotes reveals
RT insights into its obligate symbiotic lifestyle.";
RL PLoS Genet. 7:e1002007-e1002007(2011).
RN [2] {ECO:0000313|EnsemblMetazoa:XP_012054087.1}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (APR-2016) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC {ECO:0000256|ARBA:ARBA00008446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADTU01008881; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008882; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008883; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008884; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008885; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008886; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008887; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008888; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01008889; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_012054087.1; XM_012198697.1.
DR AlphaFoldDB; A0A158N979; -.
DR STRING; 12957.A0A158N979; -.
DR EnsemblMetazoa; XM_012198697.1; XP_012054087.1; LOC105617120.
DR GeneID; 105617120; -.
DR KEGG; acep:105617120; -.
DR eggNOG; KOG0773; Eukaryota.
DR InParanoid; A0A158N979; -.
DR OrthoDB; 2915644at2759; -.
DR Proteomes; UP000005205; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0048646; P:anatomical structure formation involved in morphogenesis; IEA:UniProt.
DR GO; GO:0001654; P:eye development; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR PANTHER; PTHR11211:SF40; HOMEOBOX PROTEIN ARAUCAN-RELATED; 1.
DR PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000005205}.
FT DOMAIN 207..270
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 209..271
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 42..87
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 271..324
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 405..475
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 518..646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 42..79
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 518..535
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 576..646
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 670 AA; 70414 MW; 82B5C3681D22A1EE CRC64;
MDMSEDCWDH VGLLQEVSAA EPPPPLPASD FLIHHQELLV SGGQPTTATS PAMSGGALSP
GALSPSSTAT TTTGVGPAGT GGATTPVGGA ATGPGCCENG RPMMTDPVTG QTVCSCQYDN
AARFALSTYP RLPTATSYSS YPTPTPSTTD QGPYPSIGMD SSAFYSPLDA PRMILHRQKD
DKNVSEKFMI ADSAIHSQSI SPYGAGYDLA ARRKNATRES TATLKAWLNE HKKNPYPTKG
EKIMLAIITK MTLTQVSTWF ANARRRLKKE NKMTWEPKNK TDDDDDAVLT DSEDNKEKDD
LASDNRGDRV GEDARRGLDD TEPMRHVKAE LLQHEKDLDD EDELDLEDDR RRSEHPFHHT
MQHHHHHHQG YGEDHLKEEG IKSDCSGGGV PIPATKPKIW SLADTAACKT PPPPSHPHHQ
QYHHLHHQQH YPQQQQQHHV SSQGHHHHSQ QPWLGPGGAG GGAGGGGGGG GGGNLGSSFA
LPSSAGMSPS AAATAPYSGA AARYGGFLSS SSGGQLHYNP NSSSGSSSSA SSSAAAAAGF
PEVGTDTPPQ TPPNMKVATP NGVIQAPPGG YCPGGNTASA ATNSNPNIPY GANHPGNGGY
LSSSSGSASS ASSFNSRLQS SPHKDFSVGQ NSILHQHQTT ASLPPTEATT AFKPFYKGSQ
SMGSGFVSPV
//