GenomeNet

Database: UniProt
Entry: A0A1D2MK68_ORCCI
LinkDB: A0A1D2MK68_ORCCI
Original site: A0A1D2MK68_ORCCI 
ID   A0A1D2MK68_ORCCI        Unreviewed;       796 AA.
AC   A0A1D2MK68;
DT   30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT   30-NOV-2016, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   SubName: Full=CCAAT/enhancer-binding protein zeta {ECO:0000313|EMBL:ODM93370.1};
GN   ORFNames=Ocin01_13312 {ECO:0000313|EMBL:ODM93370.1};
OS   Orchesella cincta (Springtail) (Podura cincta).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC   Entomobryomorpha; Entomobryoidea; Orchesellidae; Orchesellinae; Orchesella.
OX   NCBI_TaxID=48709 {ECO:0000313|EMBL:ODM93370.1, ECO:0000313|Proteomes:UP000094527};
RN   [1] {ECO:0000313|EMBL:ODM93370.1, ECO:0000313|Proteomes:UP000094527}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   TISSUE=Mixed pool {ECO:0000313|EMBL:ODM93370.1};
RX   PubMed=27289101; DOI=10.1093/gbe/evw134;
RA   Faddeeva-Vakhrusheva A., Derks M.F., Anvar S.Y., Agamennone V., Suring W.,
RA   Smit S., van Straalen N.M., Roelofs D.;
RT   "Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors
RT   in the Genome of the Collembolan Orchesella cincta.";
RL   Genome Biol. Evol. 8:2106-2117(2016).
CC   -!- SIMILARITY: Belongs to the CBF/MAK21 family.
CC       {ECO:0000256|ARBA:ARBA00007797}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ODM93370.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LJIJ01001009; ODM93370.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1D2MK68; -.
DR   STRING; 48709.A0A1D2MK68; -.
DR   OMA; MHYHPSV; -.
DR   OrthoDB; 1214522at2759; -.
DR   Proteomes; UP000094527; Unassembled WGS sequence.
DR   GO; GO:0043231; C:intracellular membrane-bounded organelle; IEA:UniProt.
DR   Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR005612; CCAAT-binding_factor.
DR   InterPro; IPR040155; CEBPZ/Mak21-like.
DR   PANTHER; PTHR12048; CCAAT-BINDING FACTOR-RELATED; 1.
DR   PANTHER; PTHR12048:SF0; CCAAT_ENHANCER-BINDING PROTEIN ZETA; 1.
DR   Pfam; PF03914; CBF; 1.
DR   SUPFAM; SSF48371; ARM repeat; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000094527}.
FT   DOMAIN          231..478
FT                   /note="CCAAT-binding factor"
FT                   /evidence="ECO:0000259|Pfam:PF03914"
FT   REGION          347..370
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          566..585
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          675..714
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        356..370
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        570..584
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   796 AA;  91746 MW;  74D871A457D95BEF CRC64;
     MNTLKDLFLE DLLPPSRKLL GFEKRRDSLS QASKHQLIMW IYEDRLKMCY SSFIDALRRM
     SLDNVDKVRA KAVVLMYTLL TTHPEHEAEL LQMLVNKLGD PIKKVGSRAL FYLNSLLASH
     PRMKIVVIDE VERLLFRNNI AKRAQYYGVC FLNTINLEDE HPEVACKLIN VYVAFFKGCI
     KTGDIDSKMM SGLLTGLNKA YPLAKEYGFK FTENTDTLYK IVHMSSFNIG IQALSILFQL
     TDPQNRPNEA DRYYRALYSE LLRPEIAVTS QNTHFLNLVF KSMKHDNNIG RVMGFCKRLM
     QICQSVQPNL ACAILFLLSE VSKTHKNIIR VGTGAVNSEE EVVACEEEIN EPDDDDADSE
     KKSRNDENMK NLIKRSNVKV TEESSTTEKP KTLLGLRTVP LVKNNHILSS ETDQNKRDAE
     SGIKAENENA SVKVEYKNQL SDYDPTARNP IYCGAEFSLS YEIVTLVDHF HPSVALFARK
     LLQKGHIEYE GNPLKDFTIT RFLERFVYKN PKKPKVTPGV PRTYQPKGLR NIPVNSNDYL
     KNEESKIPVE EKFFYRFFTE KKAEKLDASD EDSETESVGD DEFDDLMDNY FKSSKGKNAD
     EDEDDFDKKK IKKLKGGDMF APAEQFAEML ESNADFSLSA PSTLINKDNA DPKQLQWEME
     RDRWMKGYKG SSKTVFKNKR KPTRCDNDDI EDDDTEDQRA SVSPPSRNKE TDNFDKTIKS
     EDIQNELKRW AEAYSIRNKV DFEAKEPKQI VLFLTSSLEL AKSNQQLWSW HQIVRFNNGI
     LRFRKLGFRA GRWYGI
//
DBGET integrated database retrieval system