Database: UniProt
Entry: A0A2G5EFI7_AQUCA
ID   A0A2G5EFI7_AQUCA        Unreviewed;       937 AA.
AC   A0A2G5EFI7;
DT   31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT   31-JAN-2018, sequence version 1.
DT   22-FEB-2023, entry version 16.
DE   RecName: Full=Transcription factor MYC/MYB N-terminal domain-containing protein {ECO:0000259|Pfam:PF14215};
GN   ORFNames=AQUCO_00900698v1 {ECO:0000313|EMBL:PIA54337.1};
OS   Aquilegia coerulea (Rocky mountain columbine).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; Ranunculales; Ranunculaceae; Thalictroideae;
OC   Aquilegia.
OX   NCBI_TaxID=218851 {ECO:0000313|EMBL:PIA54337.1, ECO:0000313|Proteomes:UP000230069};
RN   [1] {ECO:0000313|EMBL:PIA54337.1, ECO:0000313|Proteomes:UP000230069}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Goldsmith {ECO:0000313|Proteomes:UP000230069};
RA   Hodges S., Kramer E., Nordborg M., Tomkins J., Borevitz J., Derieg N.,
RA   Yan J., Mihaltcheva S., Hayes R.D., Rokhsar D.;
RT   "WGS assembly of Aquilegia coerulea Goldsmith.";
RL   Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KZ305026; PIA54337.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2G5EFI7; -.
DR   Proteomes; UP000230069; Unassembled WGS sequence.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   InterPro; IPR043561; LHW-like.
DR   InterPro; IPR025610; MYC/MYB_N.
DR   PANTHER; PTHR46196; TRANSCRIPTION FACTOR BHLH155-LIKE ISOFORM X1-RELATED; 1.
DR   PANTHER; PTHR46196:SF4; TRANSCRIPTION FACTOR LHW; 1.
DR   Pfam; PF14215; bHLH-MYC_N; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000230069}.
FT   DOMAIN          5..182
FT                   /note="Transcription factor MYC/MYB N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF14215"
FT   REGION          722..759
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          907..937
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        724..759
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        907..929
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   937 AA;  102541 MW;  05A8BFCC49D1874E CRC64;
     MGTLLKEALK CLCGENRWSY AIFWKIGYQN PTLLVWEDFH YEPTRNSSLS SIAGVDSTDL
     LLKELEGRWS YHENRFPQFG GQGEDRVSSL LNRMMINNHV HVVGEGIVGR AAFTDAHVWI
     LKENIIRDQF PSETLVEVHH QFSAGMQTVA AIPVLPHGVI QLGSTLSIME NMGFINDVKS
     LFQQVGCAPI SLSNDYYTKT NPDQKTASPP LFGIPISADS ARNSCSQMMK LSPITNTYCN
     QEMPTSQASG FASQPPHSFI TQNQAYMQSK ASALRTTPII SKSASDFYQV KSPSVTKPHH
     PFSGQLGTRT VGAQVILCSP NAQLHQRDSQ YQPSSRLNDQ LAQPGSSFNS LAFMKQQTVP
     SVGLQGPATL TNTSTSQLRS YGDKIPNSVK DSVIVSLLSG NRPPNASSDV PILTAVPNPA
     TSSSTRHVGN FKFTSSDKSV VPISNQGNTA NNNYVPSVVS HRSHPSRIDN LTSSKFTKGK
     QVMENDLFEA LNNPLDQSND QIPQSGPMLD YLPECSASSS RQERESLKFL NISEGACVQL
     PSGDDLYDIL GLDFKSKQLY GNWNSFLIHG EDAKPENSSI DVSTCTTQTD AGREVNDGIS
     ESGIFNGTGS DHLLDAVISK VHSSAKQNSD DDMSCWTTLT QTSNSSVLTD SLKGARNNVP
     DQKQVKMFGF PTSVAKSQLT GSSSFRTGCS LDNAGESSQI NSAYGSQISL LCEDGLSMKR
     DNSISTAHSK RPDETGKPNR KRLRPGENPR PRPKDRQMIQ DRVKELREIV PNGAKSVTKH
     AEKLKHSGES KIISKDGGLL MEDNNFDGGA TWAFEVGSQS MVCPIIVEDL NTPRQMLVEM
     LCEEQGFFLE IADIIRGLGL TILKGVMEAR NDKVWVRFTV EANRDVTRME IFLSLVHLLE
     QAMKSSASAR GPDNSNLIAQ NSFHPSSIPV TGVQDAR
//