ID A0A1S3XY68_TOBAC Unreviewed; 722 AA.
AC A0A1S3XY68;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=BEL1-like homeodomain protein 4 isoform X2 {ECO:0000313|RefSeq:XP_016444840.1};
GN Name=LOC107770090 {ECO:0000313|RefSeq:XP_016444840.1};
OS Nicotiana tabacum (Common tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016444840.1};
RN [1] {ECO:0000313|Proteomes:UP000084051}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX PubMed=24807620; DOI=10.1038/ncomms4833;
RA Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA Goepfert S., Peitsch M.C., Ivanov N.V.;
RT "The tobacco genome sequence and its comparison with those of tomato and
RT potato.";
RL Nat. Commun. 5:3833-3833(2014).
RN [2] {ECO:0000313|RefSeq:XP_016444840.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/BELL homeobox family.
CC {ECO:0000256|ARBA:ARBA00006454}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016444840.1; XM_016589354.1.
DR AlphaFoldDB; A0A1S3XY68; -.
DR GeneID; 107770090; -.
DR OMA; HNIGTVH; -.
DR OrthoDB; 3180467at2759; -.
DR Proteomes; UP000084051; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR006563; POX_dom.
DR PANTHER; PTHR11850:SF246; BEL1-LIKE HOMEODOMAIN PROTEIN 4; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF07526; POX; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00574; POX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000313|RefSeq:XP_016444840.1};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000084051}.
FT DOMAIN 483..546
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 485..547
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 179..199
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 322..365
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 555..593
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..359
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 568..593
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 722 AA; 79986 MW; 0515C0EB86054872 CRC64;
MVIHFNKSKA VHNPIQLPKK SISLDNSMSQ DYHHQGLFSF SSVFEKPQNE QQQHIAHQIR
QEKLRVEGFN PPPPLVAIEE EQSSGLQVYG TAGMLSEMFN FPPGTTTATA TELLQNQLTQ
CYRHPNQRPQ QQLPWTTNLG SEWFSNRQGM VVGGSLQRQH NQISSINAAE SAMNLFAMNP
QPRSPSPSSS HPTTTTLQGF PNAGGGHFGQ FICGGASNTG NNFTNEIGGV NVIEGQGLSL
SLSSSLQHLE AAKVEDQLRM SGEEMLFFNQ GTSENHHHHQ VHVGFGSSLG LVNVLRNSKY
AKAAQELLEE FCSVGRSQLF KKNKVSRNNN TTTSSNPNSN NPSRSNNHNT NSSSSKDLPP
LSAADRIGHQ RRKIKLLSMF DEVDKRYNHY CEQMQMVVNS FDLVMGFGAA APYTALAQKA
MSRHFKCLKD AIAAQLKHSC ELLGEKDTTT SGLTKGETPR LKLLEKSLRQ QQTAFHQMGM
MDAEAWRPQR GLPERSVTIL RAWLFEHFLH PYPSDADKHL LARQTGLSRN QVSNWFINAR
VRLWKPMVED MYQKEAKEEE DDEMEKSQNS SNNIAQTPTP NSSTNTITTG TGTETKTAAT
VAATILTVAS DKRSEINVLE NDPSIVAMNR LCFSENQAQH HESSSMATHE MAHNNFPAIQ
DSDDMSRREA VSGVEYGTTN IMANSDNGTR VIRFGTSVAG DVSLTLGLHH AGNLPENSHF
FG
//