ID A0A061F544_THECC Unreviewed; 709 AA.
AC A0A061F544;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 22-FEB-2023, entry version 36.
DE SubName: Full=Phytochrome interacting factor 3, putative isoform 3 {ECO:0000313|EMBL:EOY11782.1};
GN ORFNames=TCM_026851 {ECO:0000313|EMBL:EOY11782.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY11782.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY11782.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Livingstone D. III,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY11782.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F544; -.
DR EnsemblPlants; EOY11782; EOY11782; TCM_026851.
DR Gramene; EOY11782; EOY11782; TCM_026851.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR044273; PIF3-like.
DR PANTHER; PTHR46807; TRANSCRIPTION FACTOR PIF3; 1.
DR PANTHER; PTHR46807:SF1; TRANSCRIPTION FACTOR PIF3; 1.
DR Pfam; PF00010; HLH; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR PROSITE; PS50888; BHLH; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 466..518
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 66..92
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 181..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 349..479
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 655..709
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..389
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 421..436
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 655..695
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 709 AA; 75372 MW; 57C6A778F9B3DB81 CRC64;
MPLSELYRMA RGKLDSSQDK NPSCSTDLSF VPENDFVELV LENGQILMQG QSSKARRIPA
CNSLPSHCLP SHTPKTRDKD TGNGGTNTKM GKFGTIDSVL SEIPMSVPSA EMSLNQDDEV
VPWLNYPVDQ SLQSEYSDFL PELSGVAVNE TSTHSNFASF DRRSQSIRDS CTVSLNNGAV
FEQGNPSKVP TPADGEARPR SGTSQLSTLP SQLCQTSSPF LRSRILENIG NSLGHTSTHH
AIGGDSIGVQ ASDGGLPGIK MQKQDQVAPC NNTVLMNFSH FSRPAALVKA SLQNISAIAS
IERIGSKEKG SAASISDPAD TTFIDSSIDL QKEKFSQCQP TIVLMKTDRK ESKAKSLDEP
VTAEPIDAIC EENTPKNVKN PSQVTGESAS KGLPDGDKTV EPVLAASSVC SGNSVERASD
DPVYNLKRKS RDNEESECPS EDAEEESVGV KKAVPARGGS GSKRSRAAEV HNLSERRRRD
RINEKMRALQ ELIPNCNKVQ IMSMGAGLYM PPMMLPTGMQ HMHAAHMAHF SPMGVGLGMG
MGFGMPLPDM NAGSSARPMV QVPPIHGAPF SGPGPTALQG MAGSNLQLFG LPGQGLPMSM
PHTPLIPISG GHLMKPAMGL SACGLVGPMD NMGSATASSS KDPVQNINSQ VAQNTNVNSS
MNQTPSQCPT TNQSFEQPAA VQENGQASEI TGSVPFRSAD GNEKVPDRS
//