ID A0A1S4C388_TOBAC Unreviewed; 1687 AA.
AC A0A1S4C388;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Uncharacterized protein LOC107814661 {ECO:0000313|RefSeq:XP_016495590.1};
GN Name=LOC107814661 {ECO:0000313|RefSeq:XP_016495590.1};
OS Nicotiana tabacum (Common tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016495590.1};
RN [1] {ECO:0000313|Proteomes:UP000084051}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX PubMed=24807620; DOI=10.1038/ncomms4833;
RA Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA Goepfert S., Peitsch M.C., Ivanov N.V.;
RT "The tobacco genome sequence and its comparison with those of tomato and
RT potato.";
RL Nat. Commun. 5:3833-3833(2014).
RN [2] {ECO:0000313|RefSeq:XP_016495590.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016495590.1; XM_016640104.1.
DR STRING; 4097.A0A1S4C388; -.
DR PaxDb; 4097-A0A1S4C388; -.
DR GeneID; 107814661; -.
DR KEGG; nta:107814661; -.
DR OMA; SHTTHNE; -.
DR OrthoDB; 5482374at2759; -.
DR Proteomes; UP000084051; Unplaced.
DR GO; GO:0000976; F:transcription cis-regulatory region binding; IEA:UniProt.
DR GO; GO:0010597; P:green leaf volatile biosynthetic process; IEA:UniProt.
DR CDD; cd00167; SANT; 1.
DR Gene3D; 1.20.58.1880; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR PANTHER; PTHR47340; DUPLICATED HOMEODOMAIN-LIKE SUPERFAMILY PROTEIN; 1.
DR PANTHER; PTHR47340:SF1; DUPLICATED HOMEODOMAIN-LIKE SUPERFAMILY PROTEIN; 1.
DR Pfam; PF00249; Myb_DNA-binding; 2.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51293; SANT; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000084051}.
FT DOMAIN 810..861
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT DOMAIN 1031..1079
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 1..279
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 462..490
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1332..1356
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 429..456
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 8..40
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..97
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 177..196
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..268
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1334..1356
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1687 AA; 185403 MW; 7A200508BFB6E526 CRC64;
MPSEPLPWDR KDFFKERRQH DRSELLRGGP RWREPPPRHH YGSSRWVPAD FRPTRGAPPG
HGKQGSWHMY PEESGHGFMP SRSNEKIVED ESCRQSRGDG GGKYGSRSSS RENRSFGGQR
DWRRGGGLSW EAAASPSGPV RQHDTATNDQ RPADVMVPHN SEHVNNTWEQ SHSRDQHNKS
GSANGTASTG QRFERGNSLG SIEWRPLKWA RSGSLSSRGS LSHSGSSKSM GVDSNETKPE
LQPGNSKALQ SPTGDATACV TSAAPSEETF SRKKPRLGWG EGLAKYEKKK VPEDSAAKVG
ACISGDSVEP GHPHPLNTAD KSPRVAVSLD CPSPATPSSV ACSSSPGLED KQPVKATNID
QDVGNLCGSP SIISQYHSEE FAFNLENFDL SQISNLNSSI NELLQSEDSS SVDSGFMRST
AVNKLLIWKN DISKVLEKTE VEIDSLENEL KTMISEPEYT QLVPSGSCSP RKECNSNSHE
DRGTTDIASR PAPLQVVIPE DVIGQEGTNI QEKEHTEVKV EDIDSPGSAT SKFVELPSEK
DTAPVDAMKH VGGMLISDDS KSLSNNVKVC SSTEDKAKSR SSDVKVCSFS EDMARDTLAC
GESSQLTARC SRPVSDGSLN CGKDALYNLI LAANKDTAYR AFDVFKNLLP AGKCSFDFSS
VSSLQIDHAV KERFARRKQF KQFKEKIIAL KFRVHQHLWK EDMRMLSARK FRAKSQKKFD
FSLRPVQIGH QKHRSTVRSR FLTTVGKSNL VPSSEVLNFA SRLLSDLRTK VYRNTLRMPA
LVLDQERTMS RFISKNSLVE DPCAVEKERS VINPWTSEER EIFIDKLATF GKDFRKIASF
LDHKTTADCI EFYYKNHKSD CFERTKRKSD YSKQAKVCSA NTYLVASSGK RWNRESNSVS
LDILGAASAI AANVEDSIEI QQKCTSKYSV RMVTEHKTSR HNELERSNSL DVCHSERERV
AADVLTGICG SLSSEAMSSC ITSSIDPAEG NQEWKHQKVG SLTRLPLTPE VTQSVDDETC
SDESCGEMDP TDWTDEEKSI FIQAVSAYGK DFVMVSRCVR TRSREQCKIF FSKARKCLGL
DKILPGPGNL VRQDVNGGND PDACVMETEL FCNEKSSLKL KELSDLCVSA GISKPDMTSF
DDKDGAGELD SVDTELVSKN SVQVNCHVDK QRVEFNRHCE IHIGACTENG RGDENMVTVS
QEGGVQIDGD VSENGPADIL CANKVSGEHL GEEIKEVVPE RDFKNRKADS AEVSRSNFFL
EDTASRSNSR LAAVRGGELC PLNGSQNTTL LESDSECKPD VNYSESNISV QRKKMPRASN
AVYLSELELE NVGDQQRENA TQSAEQPLPS TSQIAHVDSR QILGSYSLGE SATKESGDGC
STSAALQEIQ KVGKNLRSDT SSTTGFFFQR CNGTNREQTV GGSSSNVDKP CRNGDVKLFG
QILSKPCPQA NTSSNAQQSD SSNQQLKVCS NMSSATHSLD GNSATAKFER NNFLGSENHQ
VRSFGFWDGN RIQTGFSSLP DSAILLAKYP AAFGNYAIAS SKVEQQPPLH GVVKTATERS
LNGVPVFPXV FPTRDVSSNN GVAAADYQVY RSLDVQPFTI EMKQRQDAVF SEMQRRNGFD
VVSSMQQQAR GVVVGRGGIL VGGQCTGVSD PVAAIKMHYA KAEQFSGQAT SIIREDDYWL
SKGDISR
//