ID V4L2Q5_EUTSA Unreviewed; 1380 AA.
AC V4L2Q5;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EUTSA_v10003131mg {ECO:0000313|EMBL:ESQ44585.1};
OS Eutrema salsugineum (Saltwater cress) (Sisymbrium salsugineum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema.
OX NCBI_TaxID=72664 {ECO:0000313|EMBL:ESQ44585.1, ECO:0000313|Proteomes:UP000030689};
RN [1] {ECO:0000313|EMBL:ESQ44585.1, ECO:0000313|Proteomes:UP000030689}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23518688; DOI=10.3389/fpls.2013.00046;
RA Yang R., Jarvis D.E., Chen H., Beilstein M.A., Grimwood J., Jenkins J.,
RA Shu S., Prochnik S., Xin M., Ma C., Schmutz J., Wing R.A.,
RA Mitchell-Olds T., Schumaker K.S., Wang X.;
RT "The Reference Genome of the Halophytic Plant Eutrema salsugineum.";
RL Front. Plant Sci. 4:46-46(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI517441; ESQ44585.1; -; Genomic_DNA.
DR RefSeq; XP_006403132.1; XM_006403069.1.
DR STRING; 72664.V4L2Q5; -.
DR EnsemblPlants; ESQ44585; ESQ44585; EUTSA_v10003131mg.
DR GeneID; 18020679; -.
DR Gramene; ESQ44585; ESQ44585; EUTSA_v10003131mg.
DR KEGG; eus:EUTSA_v10003131mg; -.
DR eggNOG; ENOG502QQYM; Eukaryota.
DR OMA; MSSAECR; -.
DR OrthoDB; 394285at2759; -.
DR Proteomes; UP000030689; Unassembled WGS sequence.
DR GO; GO:0031010; C:ISWI-type complex; IEA:EnsemblPlants.
DR GO; GO:0000976; F:transcription cis-regulatory region binding; IEA:EnsemblPlants.
DR GO; GO:0045892; P:negative regulation of DNA-templated transcription; IEA:EnsemblPlants.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR GO; GO:0010228; P:vegetative to reproductive phase transition of meristem; IEA:EnsemblPlants.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR007759; Asxl_HARE-HTH.
DR InterPro; IPR018501; DDT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR044977; RLT1-3.
DR InterPro; IPR028942; WHIM1_dom.
DR InterPro; IPR028941; WHIM2_dom.
DR PANTHER; PTHR36968; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR PANTHER; PTHR36968:SF5; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR Pfam; PF02791; DDT; 1.
DR Pfam; PF05066; HARE-HTH; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF15612; WHIM1; 1.
DR Pfam; PF15613; WSD; 1.
DR SMART; SM00571; DDT; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50827; DDT; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51913; HTH_HARE; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000030689};
KW Transcription {ECO:0000256|ARBA:ARBA00023163}.
FT DOMAIN 17..77
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 524..583
FT /note="DDT"
FT /evidence="ECO:0000259|PROSITE:PS50827"
FT DOMAIN 706..775
FT /note="HTH HARE-type"
FT /evidence="ECO:0000259|PROSITE:PS51913"
FT DNA_BIND 19..78
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 74..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 110..132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 223..245
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 316..345
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 367..390
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 805..824
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 988..1034
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 84..98
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 331..345
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 806..823
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1380 AA; 155436 MW; 02ECD249E5622745 CRC64;
MEGVCEEAEK NKTPEGAESK SKRKMKTAAQ LEVLENTYAA EPYPSEAIRA DLSVKLNLSD
RQLQMWFCHR RLKDRKSTTP SKRQRKEPPA SSTAVESSIP AVNAGDLVAG NEHNSFGHEL
DSRRTPRGGG GGGLTVVRRF SEPSSAEVRA VGYVEAQLGE RLRDNGPILG MEFDPLPPGA
FGMPIELPSH RKAARQAFET NIYVRSDVKP VKESVRTIRE YQFLPEQPSS KTDHSERASP
SHHFGVPLDA SVLRASSVSA GHRDDYKVSP RIPNLNLSTH QGKHVFSPNL GEYDSPYQKP
YVDTGNLNHE DPFIQSEREV GNDDDDGEDD VMQLERKRKS EEARISREVE AHEKRIRKEL
EKQDMLRRKR EEQMRKEMER QDRERRKEEE RLLRERQREE ERYLKEQLRE RQRREKFLKK
ETIRAEKMRQ KEEMRRVKEV ARLKAANERA IARKIAKESM ELIEDERLEL MEVAALTKGL
PSMLALDFET LQNLEAYKDK KVIFPPTSVK LKKPFAVKPW NGSDENVANL LMVWRFLITF
ADVLGLWPFT LDEFTQSFHD YDPRLMGEIH IVLLKTIIKD IEGVARTISM GVGANQNAAA
NPGGGHPHVV EGAYAWGFDI RNWRRNLNVF TWPEILRQLA LSAGLGPQLK KPDIKTMSVH
DDNEANNSEN VIFNLRVGVA AENAFAKMQE RGLSNPRRSR HRLTPGTVKF AAFHVLSIEG
EKGLTILDVA DKIQKSGLRD LTTSRTPEAS VAAALSRDSK LFERVAPSTY CVRASYRKDV
GDAETIFAEA RERIRMFKSG VTDVEDVDDA ERDEDSESDV ADDPEVDLNL KKVDPDAVEI
ENLAGVEPVL ENGKLETVTM KTEQVLPLKD EKRDDSLANE ALEDPVANDE DNACFDESKL
GEQWVQGLVE GDYSNLSVQE RLNALVALIG IAIEGNTIRI ALEERLEVAS ALKKQMWGEV
QLDKRWKEES LIRANYLSYP TPKPGLLNNA TAASGNQESS SADVTPISSQ DPLSLPQIDV
NTGPSLPSQE NVSGMESLQY QQGYTADRER LRAQLKAYVG YKAEELYVYR SLPLGQDRRR
NRYWRFSASA SRNDPGCGRI FVELQDGRWR LIDSEEGFDY LVKSLDVRGV RESHLHFMLL
KIEASFKDAV RRNVDCSISS SLDSDTEEIS TTFKMELGDH HNAMARFRSF EKWMWDNTLH
PGALSAFKYG AKTSGPLLRI CRICAELNFV EDVCCPSCGQ MHGGSNVGEL CFAEQVAQLG
DNSKGGDPLF ILRGSVSSPL RIRLLKIQLA LIEASLPPEG LQAFWTENLR KSWGLKLLSS
SSPEELHQVL TALEVALKRD FLSSNFETAS ELLGLPEESL ASDLSCSGNV LPWVPKTTGV
//