ID A0A3L8SXH9_CHLGU Unreviewed; 771 AA.
AC A0A3L8SXH9;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 22-FEB-2023, entry version 18.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN ORFNames=DV515_00001855 {ECO:0000313|EMBL:RLW10463.1};
OS Chloebia gouldiae (Gouldian finch) (Erythrura gouldiae).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Passeridae;
OC Chloebia.
OX NCBI_TaxID=44316 {ECO:0000313|EMBL:RLW10463.1, ECO:0000313|Proteomes:UP000276834};
RN [1] {ECO:0000313|EMBL:RLW10463.1, ECO:0000313|Proteomes:UP000276834}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Red01 {ECO:0000313|EMBL:RLW10463.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:RLW10463.1};
RX PubMed=30282656;
RA Toomey M.B., Marques C.I., Andrade P., Araujo P.M., Sabatino S.,
RA Gazda M.A., Afonso S., Lopes R.J., Corbo J.C., Carneiro M.;
RT "A non-coding region near Follistatin controls head colour polymorphism in
RT the Gouldian finch.";
RL Proc. R. Soc. B 285:0-0(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RLW10463.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QUSF01000004; RLW10463.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3L8SXH9; -.
DR STRING; 44316.ENSEGOP00005003698; -.
DR Ensembl; ENSEGOT00005004181; ENSEGOP00005003698; ENSEGOG00005002911.
DR OMA; MCGEMEA; -.
DR Proteomes; UP000276834; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR031701; SIX1_SD.
DR PANTHER; PTHR10390:SF44; HOMEOBOX DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR10390; HOMEOBOX PROTEIN SIX; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF16878; SIX1_SD; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000276834}.
FT DOMAIN 219..270
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 221..271
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 22..58
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 74..94
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 259..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 281..295
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 296..311
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 771 AA; 82232 MW; D190F4628C078294 CRC64;
MSSASPATDD IVIAVEIKEE NVMEMLSEAP DGPAPPPPPA AAQFPMEHAG SAAAGEEGAA
EQVLLHTELL ARNHHAASSP SSSSSSSSSS SQTPLAFSPD HVACVCEALQ QGGNLDRLAR
FLWSLPPSDL LRGNESLMKA RALVAFHQGI YAELYSILES HNFDSSNHPL LQELWYKARY
TEAERARGRP LGAVDKYRLR RKYPLPRTIW DGEETVYCFK EKSRNALKEL YKQNRYPSPA
EKRNLAKITG LSLTQVSNWF KNRRQRDRNP SETQSKSESD GNPSTEDESS KGREDLSPHP
LSSSSDGVTS LSLPGHMEPV YMQQLGNTKI ALSSSGVLLN GNLMPASTSP VFLNGSSFLQ
GPNSVILNGL SVGTSQTVTL NSPKTATSVV SNGVSITDIL SSSSSEDVKD FKLLQASVPN
ATAAFSPSNI PVTFPGLIPS SEVKREGVET AASQDGGSVV TFTAPVQINQ YGIVQIPNSG
TNGQLLNGSI GFSSLQLPPV SVAASQGNVS ANPSTSDGGT FTTESSTVQQ GKVFFSPLTP
SAVVYTVPNS GQAVGSVKQE GLERSLVFSQ LMPVSQNTQL NVNMSSENIS SAGLQSLASS
LVNVTPSHNF SLTPPTLLNA AELSSGISES QSMSSPVTST STVISISNTN YATLQNCPLI
TSHDLLSIST AQPVLGEIVS TSGDRVSHPP AQVHQDFGRE HRLVLQAVPD VKENFLPNSE
SKSTGNLMML DSKSKYVMST MVDTVCEELE TDKKELAKLQ TVQMDEVMQD L
//