ID A0A452EMS7_CAPHI Unreviewed; 283 AA.
AC A0A452EMS7;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE SubName: Full=Caudal type homeobox 4 {ECO:0000313|Ensembl:ENSCHIP00000013361.1};
GN Name=CDX4 {ECO:0000313|Ensembl:ENSCHIP00000013361.1};
OS Capra hircus (Goat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Capra.
OX NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000013361.1, ECO:0000313|Proteomes:UP000291000};
RN [1] {ECO:0000313|Proteomes:UP000291000}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT "Polished mammalian reference genomes with single-molecule sequencing and
RT chromosome conformation capture applied to the Capra hircus genome.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCHIP00000013361.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the Caudal homeobox family.
CC {ECO:0000256|ARBA:ARBA00010341}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005700695.1; XM_005700638.1.
DR AlphaFoldDB; A0A452EMS7; -.
DR STRING; 9925.ENSCHIP00000013361; -.
DR Ensembl; ENSCHIT00000021153.1; ENSCHIP00000013361.1; ENSCHIG00000014825.1.
DR GeneID; 102182770; -.
DR KEGG; chx:102182770; -.
DR CTD; 1046; -.
DR GeneTree; ENSGT00940000162554; -.
DR OMA; YPHMPGM; -.
DR OrthoDB; 728401at2759; -.
DR Proteomes; UP000291000; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IEA:Ensembl.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:Ensembl.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR006820; Caudal_activation_dom.
DR InterPro; IPR047152; Caudal_homeobox.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR000047; HTH_motif.
DR PANTHER; PTHR24332; HOMEOBOX PROTEIN CDX; 1.
DR PANTHER; PTHR24332:SF15; HOMEOBOX PROTEIN CDX-4; 1.
DR Pfam; PF04731; Caudal_act; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000291000}.
FT DOMAIN 170..230
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 172..231
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 98..133
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..131
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 283 AA; 30579 MW; 5FE2BCEF36908FC5 CRC64;
MYRSCLLEKE ADMYSSTLRS PAGGGTAGAG ATGDGGSPLP ASNFAAAPSY AHYMGYPHMP
GMDTHGPPLG AWGSPYSPPR EDWSVYPGPS STMGTVPMND MSSSPAAFSS PEYSNLGPAG
GGNSGSSLPT PAGGSLFPID AGIADESSSR SRHSPYAWMR KTVQVTGKTR TKEKYRVVYT
DHQRLELEKE FHCNRYITIR RKSELAVNLG LSERQVKIWF QNRRAKERKM IKKKISQFEN
SGGSVQSDSG SISPGELPNI FFTTPSAVRG FQPIEIQQVI VSE
//