ID A0A1R3IBC6_COCAP Unreviewed; 777 AA.
AC A0A1R3IBC6;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 03-MAY-2023, entry version 24.
DE RecName: Full=BHLH domain-containing protein {ECO:0000259|PROSITE:PS50888};
GN ORFNames=CCACVL1_13376 {ECO:0000313|EMBL:OMO79855.1};
OS Corchorus capsularis (Jute).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=210143 {ECO:0000313|EMBL:OMO79855.1, ECO:0000313|Proteomes:UP000188268};
RN [1] {ECO:0000313|EMBL:OMO79855.1, ECO:0000313|Proteomes:UP000188268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC TISSUE=Whole seedling {ECO:0000313|EMBL:OMO79855.1};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT "Corchorus capsularis genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO79855.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWWV01010350; OMO79855.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3IBC6; -.
DR STRING; 210143.A0A1R3IBC6; -.
DR EnsemblPlants; OMO79855; OMO79855; CCACVL1_13376.
DR Gramene; OMO79855; OMO79855; CCACVL1_13376.
DR OMA; DDCCSLN; -.
DR OrthoDB; 347824at2759; -.
DR Proteomes; UP000188268; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR043561; LHW-like.
DR InterPro; IPR025610; MYC/MYB_N.
DR PANTHER; PTHR46196; TRANSCRIPTION FACTOR BHLH155-LIKE ISOFORM X1-RELATED; 1.
DR PANTHER; PTHR46196:SF2; TRANSCRIPTION FACTOR BHLH157; 1.
DR Pfam; PF14215; bHLH-MYC_N; 2.
DR PROSITE; PS50888; BHLH; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000188268};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 560..610
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 476..495
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 547..572
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 557..572
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 777 AA; 85461 MW; 02279C5D6CAF7878 CRC64;
MVEAEMSSVL KQTLKNLCCS NGWSYGVFWR FDQRNSMMLT MEDAYYEEQM GPLIDNMLLK
FHILGEGIIG QAALTGKHQW LFSDFHGTVL DSSANQDIIQ DESEFQNQFS SGIKTIAIIS
VSTRGVVEFG STQKILERVE FLDETKKLFN AMESFHGLIP LENDTCNLDG HFASLALSGN
FYSENLTSSK ISLANCMVAD TPCMSAWSSD GSILTSFETS LQSERAMWGS SNVHPEKENG
LLSGNLEQHS QGGSTFTSFY NPGELVDADL PILDGFRKTS ENQYSFGANG VLLDSVISLQ
RIPEEFNPAD FTTDLSNSFT LDDLSQWFVS SPQHNINGEG ATLTTDLPCS IGVSSVSSTR
DTNIPVRQTA NSLQSSITGT CVSNLEKSIN GDGNDLFDGV GLDFQFGKTG ECLEDIIMPL
CDGNKSAVSS GMSSVSELDN PSMTGKRKGL FSELGLEKLL EGVSSSSHII RSSIEDQFST
SKRRKAENPS SFHQGQVVSC SGRSMILVHH SHNWDKINSS IYGKEVNQKS QVGLWIDDSY
SVNAGQAAVA TSKKPTRKRA KPGESTRPRP KDRQLIQDRI KELRGIIPHS GKQLSIDLLL
ERTIKHLLFL QGVTKYADKI KQADEPKLIG QENGTLPKHN KTGGGATWAF EMGAQTIPIV
VKDLNTPGQM LIEMLCEDRG FFLEIADVIR GFGLNILKGV MELQEDKIWA RFMVEANEQV
ERTDIIWSLL LLLQQTGNSG TDSANHPNRA MDGGISLPNN FQQPLLMPPV SMAETLQ
//