ID A0A2V3J129_9FLOR Unreviewed; 929 AA.
AC A0A2V3J129;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=CCAAT/enhancer-binding protein zeta {ECO:0000313|EMBL:PXF48122.1};
GN ORFNames=BWQ96_02074 {ECO:0000313|EMBL:PXF48122.1};
OS Gracilariopsis chorda.
OC Eukaryota; Rhodophyta; Florideophyceae; Rhodymeniophycidae; Gracilariales;
OC Gracilariaceae; Gracilariopsis.
OX NCBI_TaxID=448386 {ECO:0000313|EMBL:PXF48122.1, ECO:0000313|Proteomes:UP000247409};
RN [1] {ECO:0000313|EMBL:PXF48122.1, ECO:0000313|Proteomes:UP000247409}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SKKU-2015 {ECO:0000313|EMBL:PXF48122.1,
RC ECO:0000313|Proteomes:UP000247409};
RC TISSUE=Whole body {ECO:0000313|EMBL:PXF48122.1};
RX PubMed=29688518; DOI=10.1093/molbev/msy081;
RA Lee J., Yang E.C., Graf L., Yang J.H., Qiu H., Zel Zion U., Chan C.X.,
RA Stephens T.G., Weber A.P.M., Boo G.H., Boo S.M., Kim K.M., Shin Y.,
RA Jung M., Lee S.J., Yim H.S., Lee J.H., Bhattacharya D., Yoon H.S.;
RT "Analysis of the draft genome of the red seaweed Gracilariopsis chorda
RT provides insights into genome size evolution in Rhodophyta.";
RL Mol. Biol. Evol. 0:0-0(2018).
CC -!- SIMILARITY: Belongs to the CBF/MAK21 family.
CC {ECO:0000256|ARBA:ARBA00007797}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PXF48122.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NBIV01000016; PXF48122.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2V3J129; -.
DR STRING; 448386.A0A2V3J129; -.
DR Proteomes; UP000247409; Unassembled WGS sequence.
DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IEA:UniProt.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR005612; CCAAT-binding_factor.
DR InterPro; IPR040155; CEBPZ/Mak21-like.
DR PANTHER; PTHR12048; CCAAT-BINDING FACTOR-RELATED; 1.
DR PANTHER; PTHR12048:SF0; CCAAT_ENHANCER-BINDING PROTEIN ZETA; 1.
DR Pfam; PF03914; CBF; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000247409}.
FT DOMAIN 407..615
FT /note="CCAAT-binding factor"
FT /evidence="ECO:0000259|Pfam:PF03914"
FT REGION 528..554
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 703..729
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 749..816
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 839..929
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 528..551
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 794..816
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 862..888
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 929 AA; 104927 MW; 0993A984C5F66471 CRC64;
MGRPSKAPIS ERVLGLHQLP GQVLADEDVN GEYWYECAPK WPEEGLGGPT ATLSLWKEVA
TKALSSASEM FEKKVNNASS AALKRALDKE KTVGDKIAAE TLLVQESPVH RLDELRNLLG
FASKKKRRER NLAIDALKDL FIHNLLPNDR RLVFFEDRVF SCGKGELTKR HLIYALYETE
LKSIYREFLQ ILEECARDPL TFMKEIAVKT MTELLIEKPE NESALLAMLV NKLGDPERKV
SSLASQQLIS LIYRHHPQMR LVVIKEVERL VLRQNVTRKT QYYAINFLNQ IRFSNEDVEL
ARRFVRLYMD LFTRIISEDD KSKQEKPAKS EKTKIKRVMK RGRISKKKVK TKEKVHDSGD
SRLIGALLIG VNRAFPYTRP EEETKTYEAY YDALFRVAHA KSMASATQAL AFLLQVSQAN
STQSDRLYNA LYSRIFDLPW SAEEKQAAFM NLVYKAMKAD TSTKRMKAFM KRLLQASLFG
SSGFAGACVM VISESLSGKD VGLLRSYVSM GENEDEEEVF DDVEVVHDRD EKESNPTVTD
ESKEDMPTLK DPTKLRMNGA SKEIRSAPIQ KGLASNTTTY DPLKRDPRFA GAEKSSLWEI
LGLSSHFHPS VSMFARSICK NLQALQYNGD PLKDFAEIAF LDKFSYKKAK NRVAKSLHGK
RSGQYRDDPI PNSQQFQNFI QNGTMGEDNE FFARFFRTNP HRVVSTSEEQ KAGVRDADDE
IPGYDDASDI DSEEEAFEKA MHAEMRRLGG GKGLLDDSKG VEVNDPDEDD DDELKAFDLA
FGNEGEESDD LEDSQNTNGQ QDSGTKSRQQ QKLSGGLKSS GVFMSLEDFE QAVEAGALAG
SASVQDGMPL SKTKSRRRRV RSGKESRTES KDPLLSSQRQ HHAKSFTGKQ DLDGNISLAK
LTGSTKRKSP QAEDHPVERK RRRKRKADQ
//