ID A0A2P5B6M5_PARAD Unreviewed; 1723 AA.
AC A0A2P5B6M5;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 02-APR-2025, entry version 21.
DE SubName: Full=Octamer-binding transcription factor {ECO:0000313|EMBL:PON44449.1};
GN ORFNames=PanWU01x14_266840 {ECO:0000313|EMBL:PON44449.1};
OS Parasponia andersonii (Sponia andersonii).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Rosales; Cannabaceae; Parasponia.
OX NCBI_TaxID=3476 {ECO:0000313|EMBL:PON44449.1, ECO:0000313|Proteomes:UP000237105};
RN [1] {ECO:0000313|Proteomes:UP000237105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. WU1-14 {ECO:0000313|Proteomes:UP000237105};
RA Van Velzen R., Holmer R., Bu F., Rutten L., Van Zeijl A., Liu W.,
RA Santuari L., Cao Q., Sharma T., Shen D., Roswanjaya Y., Wardhani T.,
RA Kalhor M.S., Jansen J., Van den Hoogen J., Gungor B., Hartog M.,
RA Hontelez J., Verver J., Yang W.-C., Schijlen E., Repin R., Schilthuizen M.,
RA Schranz E., Heidstra R., Miyata K., Fedorova E., Kohlen W., Bisseling T.,
RA Smit S., Geurts R.;
RT "Parallel loss of symbiosis genes in relatives of nitrogen-fixing non-
RT legume Parasponia.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PON44449.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JXTB01000350; PON44449.1; -; Genomic_DNA.
DR STRING; 3476.A0A2P5B6M5; -.
DR OrthoDB; 6159439at2759; -.
DR Proteomes; UP000237105; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR007759; Asxl_HARE-HTH.
DR InterPro; IPR018501; DDT_dom.
DR InterPro; IPR001356; HD.
DR InterPro; IPR009057; Homeodomain-like_sf.
DR InterPro; IPR044977; RLT1-3.
DR InterPro; IPR028942; WHIM1_dom.
DR InterPro; IPR028941; WHIM2_dom.
DR PANTHER; PTHR36968; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR PANTHER; PTHR36968:SF5; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR Pfam; PF02791; DDT; 1.
DR Pfam; PF05066; HARE-HTH; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF15612; WHIM1; 1.
DR Pfam; PF15613; WSD; 1.
DR SMART; SM00571; DDT; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50827; DDT; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51913; HTH_HARE; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000237105};
KW Transcription {ECO:0000256|ARBA:ARBA00023163}.
FT DOMAIN 18..78
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 495..554
FT /note="DDT"
FT /evidence="ECO:0000259|PROSITE:PS50827"
FT DOMAIN 677..746
FT /note="HTH HARE-type"
FT /evidence="ECO:0000259|PROSITE:PS51913"
FT DNA_BIND 20..79
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 76..96
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 121..140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 773..813
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1491..1588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1601..1723
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 320..422
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 9..21
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 85..96
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 774..801
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1504..1516
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1540..1551
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1552..1568
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1620..1637
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1658..1674
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1675..1688
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1695..1709
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1714..1723
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1723 AA; 193342 MW; C5ADB716D2097726 CRC64;
MEVSGSEGGE MKKKPPEGEN KTKRKMKTAS QLEILEKTYA AESYPSESLR AELSVKLGLS
DRQLQMWFCH RRLKDRKATP VQRQQRRDSP AGRGEEMAVG ELGNEHASGS GSGQIPFGHG
IESRRGVPRH SVAGSRTGAG GDMPVMMKRY YESQQTIAEL RAIAFVEAQL GEQLREDGPI
LGMEFDTLPP DAFGAPIAMA GQRKQSGRSF DAKYDRSETK SIKGTGRALQ EYQFIPEKPT
VRTEAYERLA PPSYHYGSPA DGPNARSSLL STGHAYLHGD EYLSSGYVFQ ASEKRFTNEE
DVLLIGRKRK SEETRVARDI EAHEKRIRKE LEKQDILRRK REEQMRKEME RHDRERRKEE
ERLLREKQRE EERYQREQRR ELERREKFLQ KESIRAEKMR QKEELRREKE AARLRAANER
AIARKIAKES MELIEDERLE LMELAASSKG LSSIVSLDYE TLQNLELYRV VYNDLSVGLS
SSCNIMYLPV TVISCHLYIR AWKVWRFLIT FADVLGLWPF TLDEFIQAFH DYDPRLLGEI
HVSLLKSVIK DIEDVARTPS TGLGANQTSA ANPGGGHPLI VEGAYAWGFD IRSWQRHLNP
LTWPEILRQL SLSAGFGPQL KKRNVEPSYL RDDNEGNDGA DIVSNLRSGA AVENAFAKMQ
ERGFSNPRRS RHRLTPGTVK FAAFHVLSLE GDKGLTILEV ADRIQKSGLR DLTTSKTPEA
SIAAALSRDT KLFERTAPST YCVRAAYRKD PADAEAILSA ARERIRVFKN GFLDGEDADD
GERDEDSESD VAEDPDIDDL GTEINPEKET PGCQEVSQLS AVSLLGNGNE SVQVIETPKK
DLQNIGGGLS AIHSESYVKM NDDDSSLPQS IDVVGVYNDA SNFDDIDPDI DESNPGEPWV
QGLMEGEYSD LSVEERLNAL VALIGVAIEG NSIRHVLEER LEAANALKKQ MWAEVQLEKR
RMKEDFVMRT PYTSFTGNKF ELNPGISSAE GRQSPFVHVD VKSNETKVDL AVHEERISDP
PNENPCVSSF PSEGNLQMQE VCAGPDNHLF QQPGHVADRS RSQVKSYIGH KAEEMYVYRS
LPLGQDRARN RYWQFITSAS QNDPGCGRIF VELHDGRWRL IESEEGFDAL LASLDVRGVR
ESQLHTMLLK IEISFKKAVR KKMLRPNMGR QSEDTAKVAA VELTPHADCS GSTDSPSSTL
CLADSDLSES STFVIELGRN ENEKNGAFKR YQDLERWIWK ECFSSSMLFA TKHEKKRCNQ
LLDVCDSCHG IYSSEVDHCL SCHRTFTTSD SGTSFAEHVT QCEEKLNLNR NWTLHGSDAF
PLRIRLLKVV LALIEVSVPS EALQLLWTDS HRKSWGTKLK ASSSAEDLLQ VLTLLESAIK
REHLFSEFET TYELLDSWST RRYNATSSTP LVTITVLPWV PLTTAAVALR VMEFDAALFY
VLQQKLESQK DRGSGNVIKL SSKYAFMKHS LDDGLTGITC QADNRNEDSW ADLGDGLANS
DHRKGSRGRA RGRSRTRGGQ PQRSAIGSRG ESAKQNDGTL GHGLGWKGRG QGCKRGRRTI
RSRQRPAKRM FEIGVVEGTP EENIGDKSPR ILVQDWNAED ATGFQLEDAE PASSSGRSEY
DEENGQGSGD EFDDMAMDDY ASGFNGRPGD LDGSDYNGIE DEVGDDNDDD QEEYDIGHDE
QGDFDVDRYI NGNSNEEENG GGEEEENMVL DEPSGSTSSD YSD
//