GenomeNet

Database: UniProt
Entry: A0A3P9AWS9_9CICH
LinkDB: A0A3P9AWS9_9CICH
Original site: A0A3P9AWS9_9CICH 
ID   A0A3P9AWS9_9CICH        Unreviewed;       449 AA.
AC   A0A3P9AWS9;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   24-JAN-2024, entry version 23.
DE   SubName: Full=Iroquois homeobox 2 {ECO:0000313|Ensembl:ENSMZEP00005002113.1};
OS   Maylandia zebra (zebra mbuna).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX   NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005002113.1, ECO:0000313|Proteomes:UP000265160};
RN   [1] {ECO:0000313|Ensembl:ENSMZEP00005002113.1, ECO:0000313|Proteomes:UP000265160}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=25186727; DOI=10.1038/nature13726;
RA   Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA   Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA   Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA   Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA   Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA   Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA   Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA   Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA   Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA   Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA   Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA   Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA   Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA   Di Palma F.;
RT   "The genomic substrate for adaptive radiation in African cichlid fish.";
RL   Nature 513:375-381(2014).
RN   [2] {ECO:0000313|Ensembl:ENSMZEP00005002113.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108}.
CC   -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC       {ECO:0000256|ARBA:ARBA00008446}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A3P9AWS9; -.
DR   Ensembl; ENSMZET00005002213.1; ENSMZEP00005002113.1; ENSMZEG00005001683.1.
DR   GeneTree; ENSGT00940000161198; -.
DR   Proteomes; UP000265160; LG11.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR008422; Homeobox_KN_domain.
DR   InterPro; IPR003893; Iroquois_homeo.
DR   PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR   PANTHER; PTHR11211:SF15; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX-2; 1.
DR   Pfam; PF05920; Homeobox_KN; 1.
DR   SMART; SM00389; HOX; 1.
DR   SMART; SM00548; IRO; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   3: Inferred from homology;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   DOMAIN          109..172
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        111..173
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          174..216
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          248..352
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          404..426
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        194..216
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        248..264
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        266..312
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        329..352
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   449 AA;  48580 MW;  177FD81545B8D049 CRC64;
     MSYPQGYLYQ PPGSLALYSC PAYGASALAA PRNEDLARSS SGSAFSPYPG SAAFSASAGA
     GFSTPLSYST DPTTGFPSYM SSPYDAHTTG MAGALSYPPY GSPGYPFQLN DPAYRKNATR
     DATATLKAWL QEHRKNPYPT KGEKIMLAII TKMTLTQVST WFANARRRLK KENKMTWAPR
     NKSEDEDEED GDGERKEVER SEKTLDNSEA SAEDEGISLH VDTLTDHSCS AESDGEKVCV
     GELGSEQAGD KCEEDGEDPN RDQRAQLSPK PVTSSPLTGV EAPVLTRSLV SSNNINTNKS
     SSCLDNRPSS GAPPNPTVKP KLWSLAEIAT SDQKHQHQQQ QLGQSTGSPS LYPPPSILGR
     PIYYTSPFYS NYTNYGNFSP LQGQGILRYT NSSGVSLAGL SSSQQALEAS TNPKHRPDSP
     LVKNNPNQIV VEQQQQHFRT ANLEAKKGT
//
DBGET integrated database retrieval system