ID A0A2I4F343_JUGRE Unreviewed; 1169 AA.
AC A0A2I4F343;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Protein ALWAYS EARLY 3 isoform X3 {ECO:0000313|RefSeq:XP_018826075.1};
GN Name=LOC108995051 {ECO:0000313|RefSeq:XP_018826075.1};
GN ORFNames=F2P56_015513 {ECO:0000313|EMBL:KAF5465516.1};
OS Juglans regia (English walnut).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fagales; Juglandaceae; Juglans.
OX NCBI_TaxID=51240 {ECO:0000313|Proteomes:UP000235220, ECO:0000313|RefSeq:XP_018826075.1};
RN [1] {ECO:0000313|EMBL:KAF5465516.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5465516.1};
RA Martinez-Garcia P.J., Crepeau M.W., Puiu D., Gonzalez-Ibeas D., Whalen J.,
RA Stevens K., Paul R., Butterfield T., Britton M., Reagan R., Chakraborty S.,
RA Walawage S.L., Vasquez-Gross H.A., Cardeno C., Famula R., Pratt K.,
RA Kuruganti S., Aradhya M.K., Leslie C.A., Dandekar A.M., Salzberg S.L.,
RA Wegrzyn J.L., Langley C.H., Neale D.B.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KAF5465516.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5465516.1};
RA Marrano A., Britton M., Zimin A.V., Zaini P.A., Workman R., Puiu D.,
RA Bianco L., Allen B.J., Troggio M., Leslie C.A., Timp W., Dendekar A.,
RA Salzberg S.L., Neale D.B.;
RT "Walnut 2.0.";
RL Submitted (MAR-2020) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|RefSeq:XP_018826075.1}
RP IDENTIFICATION.
RC TISSUE=Leaves {ECO:0000313|RefSeq:XP_018826075.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LIHL02000007; KAF5465516.1; -; Genomic_DNA.
DR RefSeq; XP_018826075.1; XM_018970530.2.
DR EnsemblPlants; Jr07_19020_p1; cds.Jr07_19020_p1; Jr07_19020.
DR GeneID; 108995051; -.
DR Gramene; Jr07_19020_p1; cds.Jr07_19020_p1; Jr07_19020.
DR OrthoDB; 180996at2759; -.
DR Proteomes; UP000235220; Chromosome 7.
DR Proteomes; UP000619265; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0017053; C:transcription repressor complex; IEA:InterPro.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR CDD; cd00167; SANT; 1.
DR Gene3D; 1.20.58.1880; -; 1.
DR InterPro; IPR033471; DIRP.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR010561; LIN-9/ALY1.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR PANTHER; PTHR21689; LIN-9; 1.
DR PANTHER; PTHR21689:SF2; PROTEIN LIN-9 HOMOLOG; 1.
DR Pfam; PF06584; DIRP; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM01135; DIRP; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000235220}.
FT DOMAIN 49..86
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 123..154
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 188..229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 241..279
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 320..344
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 460..483
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 515..586
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1052..1090
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..229
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 244..279
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 324..344
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 462..480
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 523..573
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1052..1083
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1169 AA; 129395 MW; A4A6BF4472E6D287 CRC64;
MAPSRKSRSV NKRFSYINEV ASSKDGESAN KRQRVSPGYV QKKRKLSDML GPQWSKDELE
RFYAAYRKYG KDWKKVAAVV RNRSVDMVEA LYTMNKAYLS LPEGTASVVG LIAMMTDHYS
MLEGSDSEEE SNEGAGASRK PQKRARGKIQ NSTSKELIGH FPELSRSHSI ASSYGCLSLL
KKRRSGIKPH AVGKRTPRVP VSYTFDKDSR EKKFSPARQG SKQKVDAKDD DVAHEIALVL
TEASQRGGSP QLSQTPNRKS NGAAPSPVQN GERMHTESEM TCANLRGSEV DEGGCELSLG
STEADNGDYA LDKGYLRGRE GVGTVEGQPK RKRYSGKKPE VEESINNDLD DIKEACSGTE
EGQKPGAVKG KLESEVVARS SSKGLRKRSK KVLFGGDEGF SFDALQTLAD LSLMMPDTEL
SGQVKEESLD VVDKSKVKEN HSIPGVKVSA LRTPKLGKGF AHHIGDTPES KEEAHQSNTG
MRKRKQKFLP FKIFETEAHT DSHLSEPKKF EATAEVKVSL SKGKRSSHNT TQSKSGKMVK
PMEHTSSSTD LGRERNDSAL STLQVSSTDQ VKPPTKVRSR RKMDVQKPVI QNDSKSSGNI
FIEHPNIPIP SLHDRALALK EKLSNSLSLY QAQRWCTFEW FYSAIDYPWF AKREFMEYLD
HVGLGHVPRL TRVEWGVIRS SLGRPRRFSV QFLKEEREKL YQYRDSVRKH YAELRAGTRE
GLPTDLARPL SVGQRVIALH PRTREIHDGS VLTVDHSRCR IQFDRPELGV EFVMDIDCMP
SNPLENMPAS LRRHNTAVNK LFENCHEFKT KFAPSENLES IDGSYTSPSC HHHISKFLKQ
AGSPSSSVQV KVRPGEITNT QQAANSQLSL LAQIQAKEAD VQALSVLTSS LDKKKAVVSE
LRCMNDAVFE NQKDGDNSLK DSEHFKKQYA AVLLQLHEVN EQVSSALFCL RQRNTYQESS
PLMLLKPGSG LGDASGQSNS FDCSTCHIQD SGPHVVEIVE SSRTKAQTMV DVAMRAVSSF
KNGGATIEMI EEAIDFVNNQ LSVDDSSMLA RSASAPADST HITTPVSQDQ STACTSATTH
APDPKSDRLT DQNEAKIPSE LIAHCVATLL MIQKCTERQF PPADVAQVLD SAVTSLQPCC
SQNLPIYAEI QKCMGIIKNQ ILALIPTST
//