GenomeNet

Database: UniProt
Entry: A0A3P9D6A2_9CICH
ID   A0A3P9D6A2_9CICH        Unreviewed;       491 AA.
AC   A0A3P9D6A2;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   27-MAR-2024, entry version 25.
DE   SubName: Full=UNC homeobox {ECO:0000313|Ensembl:ENSMZEP00005030205.1};
GN   Name=UNCX {ECO:0000313|Ensembl:ENSMZEP00005030205.1};
OS   Maylandia zebra (zebra mbuna).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX   NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005030205.1, ECO:0000313|Proteomes:UP000265160};
RN   [1] {ECO:0000313|Ensembl:ENSMZEP00005030205.1, ECO:0000313|Proteomes:UP000265160}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=25186727; DOI=10.1038/nature13726;
RA   Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA   Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA   Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA   Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA   Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA   Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA   Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA   Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA   Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA   Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA   Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA   Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA   Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA   Di Palma F.;
RT   "The genomic substrate for adaptive radiation in African cichlid fish.";
RL   Nature 513:375-381(2014).
RN   [2] {ECO:0000313|Ensembl:ENSMZEP00005030205.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC       ECO:0000256|RuleBase:RU000682}.
CC   -!- SIMILARITY: Belongs to the paired homeobox family. Unc-4 subfamily.
CC       {ECO:0000256|ARBA:ARBA00038351}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_004552281.1; XM_004552224.1.
DR   AlphaFoldDB; A0A3P9D6A2; -.
DR   STRING; 106582.ENSMZEP00005030205; -.
DR   Ensembl; ENSMZET00005031154.1; ENSMZEP00005030205.1; ENSMZEG00005022512.1.
DR   GeneID; 101464031; -.
DR   KEGG; mze:101464031; -.
DR   CTD; 340260; -.
DR   GeneTree; ENSGT00940000161420; -.
DR   OrthoDB; 2902937at2759; -.
DR   Proteomes; UP000265160; LG4.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   PANTHER; PTHR46799; HOMEOBOX PROTEIN UNC-4 HOMOLOG; 1.
DR   PANTHER; PTHR46799:SF1; HOMEOBOX PROTEIN UNC-4 HOMOLOG; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   3: Inferred from homology;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   DOMAIN          98..158
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        100..159
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          156..303
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          384..429
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          444..491
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        183..198
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        214..267
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        268..283
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        449..468
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   491 AA;  53533 MW;  C4E2197D4150C6AF CRC64;
     MMDSRILDPP HAQFGGSLGG MVGFPYHLSH HHVYELAGHQ LQSASAVPFS IDGLLNGSCA
     ASVGNSNPLL SSGCGMNGDN QQYKLTDSGD PDKDSPGCKR RRTRTNFTGW QLEELEKAFN
     ESHYPDVFMR EALALRLDLI ESRVQVWFQN RRAKWRKKEN TKKGPGRPAH NAHPTSCSGE
     PMDPEEIARR ELTRLEKKKR KQERRLLKSQ NKLVSGDLFH TPGSDSDSGV SHVTDSDHNT
     GPPFDSVGGN QTQSSCDQTP HSLQSQNHNQ RHLDRDAGDS ELDSPDPSHG PSLCPNNSRA
     SSMQKLNPFS VESLLSDHRP RRNPAALPSS RPLIGKGHFL LYPITQPLGF IVPQTALKTT
     TAVNSDCDTP AERTQAHVRC SVSASSAECT SSSPDSAETT QRAQVGSEAK TNSPHTSPPR
     AAFSTGKSTQ TSVICSDSTF AHEHGSQLVE SVDAGKKEHL ENKDHPSSSE SVLSDCPADT
     KTDSKEHVDM E
//
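
The SQ line above declares a length (491 AA) that the residue lines should add up to. A minimal sketch of that consistency check follows, assuming the record is available as plain text in the flat-file layout shown here. The function name `parse_sq_block` and the toy record are illustrative and not part of UniProt or DBGET; the MW and CRC64 values in the toy record are placeholders.

```python
import re


def parse_sq_block(record: str):
    """Return (declared_length, sequence) from flat-file text.

    Hypothetical helper: it reads only the SQ header and the residue
    lines that follow it, and stops at the '//' terminator.
    """
    declared = None
    residues = []
    in_seq = False
    for line in record.splitlines():
        if line.startswith("SQ "):
            # Header looks like: SQ   SEQUENCE   491 AA;  53533 MW;  ... CRC64;
            match = re.search(r"SEQUENCE\s+(\d+)\s+AA;", line)
            declared = int(match.group(1)) if match else None
            in_seq = True
        elif line.startswith("//"):
            in_seq = False
        elif in_seq:
            # Residue lines are grouped in blocks of 10; drop all whitespace.
            residues.append("".join(line.split()))
    return declared, "".join(residues)


# Toy record (not the full entry): only the last residue line is reused,
# and the MW / CRC64 fields are placeholder values.
toy_record = """\
SQ   SEQUENCE   11 AA;  0 MW;  0000000000000000 CRC64;
     KTDSKEHVDM E
//
"""

declared, seq = parse_sq_block(toy_record)
print(declared, len(seq), declared == len(seq))
```

Run against the full entry text, the same function should report whether the concatenated residue lines match the declared 491 AA.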