ID I0YJJ7_COCSC Unreviewed; 659 AA.
AC I0YJJ7;
DT 13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT 13-JUN-2012, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=COCSUDRAFT_68267 {ECO:0000313|EMBL:EIE18566.1};
OS Coccomyxa subellipsoidea (strain C-169) (Green microalga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Trebouxiophyceae incertae sedis; Elliptochloris clade; Coccomyxa;
OC Coccomyxa subellipsoidea.
OX NCBI_TaxID=574566 {ECO:0000313|EMBL:EIE18566.1, ECO:0000313|Proteomes:UP000007264};
RN [1] {ECO:0000313|EMBL:EIE18566.1, ECO:0000313|Proteomes:UP000007264}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C-169 {ECO:0000313|EMBL:EIE18566.1,
RC ECO:0000313|Proteomes:UP000007264};
RX PubMed=22630137; DOI=10.1186/gb-2012-13-5-r39;
RA Blanc G., Agarkova I., Grimwood J., Kuo A., Brueggeman A., Dunigan D.,
RA Gurnon J., Ladunga I., Lindquist E., Lucas S., Pangilinan J., Proschold T.,
RA Salamov A., Schmutz J., Weeks D., Yamada T., Claverie J.M., Grigoriev I.,
RA Van Etten J., Lomsadze A., Borodovsky M.;
RT "The genome of the polar eukaryotic microalga Coccomyxa subellipsoidea
RT reveals traits of cold adaptation.";
RL Genome Biol. 13:R39-R39(2012).
CC -!- SIMILARITY: Belongs to the ARS2 family.
CC {ECO:0000256|ARBA:ARBA00005407}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EIE18566.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGSI01000023; EIE18566.1; -; Genomic_DNA.
DR RefSeq; XP_005643110.1; XM_005643053.1.
DR AlphaFoldDB; I0YJJ7; -.
DR SMR; I0YJJ7; -.
DR STRING; 574566.I0YJJ7; -.
DR GeneID; 17036451; -.
DR KEGG; csl:COCSUDRAFT_68267; -.
DR eggNOG; KOG2295; Eukaryota.
DR OrthoDB; 24436at2759; -.
DR Proteomes; UP000007264; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR039727; SE/Ars2.
DR InterPro; IPR007042; SERRATE/Ars2_C.
DR InterPro; IPR021933; SERRATE/Ars2_N.
DR PANTHER; PTHR13165; ARSENITE-RESISTANCE PROTEIN 2; 1.
DR PANTHER; PTHR13165:SF0; SERRATE RNA EFFECTOR MOLECULE HOMOLOG; 1.
DR Pfam; PF04959; ARS2; 1.
DR Pfam; PF12066; SERRATE_Ars2_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007264}.
FT DOMAIN 118..202
FT /note="SERRATE/Ars2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12066"
FT DOMAIN 371..544
FT /note="SERRATE/Ars2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF04959"
FT REGION 1..114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 186..252
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 586..659
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 37..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 79..97
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 222..246
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 659 AA; 71539 MW; 4819222346AFD078 CRC64;
MRGGKRRRSP PPPGRRGEDR RMGGGAPYPP QGYRDRRPRS TSPEPRYRRS PPPYNKRYRR
EDDAYDRFPP GARGPDQFSD RRGPYRERFE EEPYGGRPPY GRRFPSPPPA KTGPMSYKEF
MMRLSDDVTP EDAQVEYQKY LAQYWGSETR AEFQQKQNLD WMRKKYDPRQ LMVALEQRNT
DAQAAAEAFG EALAEGKLDP ASPSFHQGTG EGAAADVQNG HADGSAEKES AADGAKEGDD
KELDAETSSS AVPGAGWKAA RVEADLALSK ELMLKLDAEK GIESNPLEAQ ATAAEPAKEG
EVGEEAATKG YEEQVGQLDL QLTYLWRVHS LDYYGGAELQ DPADVSAAKR TLRGPRPEEG
EQPDEAEEGK ELKALEEQVD AVWKKRLTAG DPVEAYLQRD KVQKLVEEWV EDQVLQIEEN
KYGCKLSQKL FVGKEYVLKH IRLKHTAVLE AHREKIYDQI YYENFESDKK EQEERARAEA
ERAQAAAWAA GGPHSGPPGF EGAGWEGEEG FGGWGGRGGP MGRGGRGAPG LPFPPNGNGF
ASPDMMGAPV MGGPAIGPGQ MLVLAPGAGP LGPFITVPIP EGGGPVAMGG PSMGPPSGGG
RGPGRGGRGP PRGGPMMRGR GYGGGPSGYG GPMGGPPQRG YYDLDAPENQ RSVLDYGDL
//