ID A0A3P8RDQ2_ASTCA Unreviewed; 324 AA.
AC A0A3P8RDQ2;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE SubName: Full=Homeobox protein CDX-1-like {ECO:0000313|Ensembl:ENSACLP00000039202.1};
OS Astatotilapia calliptera (Eastern happy) (Chromis callipterus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Astatotilapia.
OX NCBI_TaxID=8154 {ECO:0000313|Ensembl:ENSACLP00000039202.1, ECO:0000313|Proteomes:UP000265100};
RN [1] {ECO:0000313|Ensembl:ENSACLP00000039202.1, ECO:0000313|Proteomes:UP000265100}
RP NUCLEOTIDE SEQUENCE.
RA Datahose.;
RL Submitted (MAY-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACLP00000039202.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the Caudal homeobox family.
CC {ECO:0000256|ARBA:ARBA00010341}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P8RDQ2; -.
DR Ensembl; ENSACLT00000040131.1; ENSACLP00000039202.1; ENSACLG00000026454.1.
DR GeneTree; ENSGT00940000164078; -.
DR OMA; VDKDTNM; -.
DR OrthoDB; 728401at2759; -.
DR Proteomes; UP000265100; Chromosome 10.
DR Bgee; ENSACLG00000026454; Expressed in ovary.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR006820; Caudal_activation_dom.
DR InterPro; IPR047152; Caudal_homeobox.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR000047; HTH_motif.
DR PANTHER; PTHR24332; HOMEOBOX PROTEIN CDX; 1.
DR PANTHER; PTHR24332:SF27; HOMEOBOX PROTEIN CDX-2; 1.
DR Pfam; PF04731; Caudal_act; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 198..258
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 200..259
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 12..142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 256..313
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..87
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 263..284
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 291..313
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 324 AA; 35111 MW; 1A83EBB89E3F9BD8 CRC64;
MYVSYLQLDK DPAMYPHQNP VTRHPGLSLS PQNFSVPAPP QYPDFASYHH HHHGIGNEQH
PAQQAPPAAG GWSPAYPPPP PARDDWSSHP YAPPAASSVT GAVGTTLGFG PTEFPGQPPA
LIPASLNASA GPLSPGSPRR RTPYEWIRTS APPSNPTFTV THELRDHKGI KADRALFYAN
AAEAGSQFEC RSHDGKTRTK DKYRVVYTDH QRLELEKEFH YSKYITIRRK AELATALSLS
ERQVKIWFQN RRAKERKINK KKLQQPASST TTPTPPTGSN GSGGGGGGNG LHANGSSNVA
MVTSSSGSNG LVSPSLALNI KEEY
//