ID A0A3B4GKZ9_9CICH Unreviewed; 311 AA.
AC A0A3B4GKZ9;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Homeobox protein orthopedia B-like {ECO:0000313|Ensembl:ENSPNYP00000023670.1, ECO:0000313|RefSeq:XP_005741859.1};
GN Name=LOC102195419 {ECO:0000313|RefSeq:XP_005741859.1};
OS Pundamilia nyererei.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Pundamilia.
OX NCBI_TaxID=303518 {ECO:0000313|Ensembl:ENSPNYP00000023670.1, ECO:0000313|Proteomes:UP000261460};
RN [1] {ECO:0000313|Ensembl:ENSPNYP00000023670.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
RN [2] {ECO:0000313|RefSeq:XP_005741859.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005741859.1; XM_005741802.1.
DR AlphaFoldDB; A0A3B4GKZ9; -.
DR STRING; 303518.ENSPNYP00000023670; -.
DR Ensembl; ENSPNYT00000024253.1; ENSPNYP00000023670.1; ENSPNYG00000017891.1.
DR GeneID; 102195419; -.
DR GeneTree; ENSGT00940000159952; -.
DR OrthoDB; 5398847at2759; -.
DR Proteomes; UP000261460; Unplaced.
DR Proteomes; UP000695023; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000047; HTH_motif.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR46770; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR PANTHER; PTHR46770:SF1; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000695023};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 97..157
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 292..305
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 99..158
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 32..105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 215..254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 267..287
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..97
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 239..254
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 311 AA; 33469 MW; 1A66E21391D33D83 CRC64;
MLSHADLLDA RLGMKDAADL LGHREALKCR LGGGVPDQGH PGDMPPGSDS VEGTTLLPGE
DIATVGSNSA GMPVNGKEQE KQQQQPQQNS GQTTSQQKQK RHRTRFTPAQ LNELERSFAK
THYPDIFMRE ELALRIGLTE SRVQVWFQNR RAKWKKRKKT TNVFRAPGTL LPTPGLPQFS
AAAAMENSLC SFHANDSRWA TGMPGVSQLQ LPPALGRQPG MAQSLSQCSL GPGPPPNSMG
LSNGLSSNGS GLQSHLYQTP FPGMSASLSG PTNVSGSPQL CSSPDSDMWR GTSIASLRRK
ALEHTVSMSF T
//