ID A0A1E7FHR8_9STRA Unreviewed; 547 AA.
AC A0A1E7FHR8;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 13-SEP-2023, entry version 27.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=FRACYDRAFT_268783 {ECO:0000313|EMBL:OEU17726.1};
OS Fragilariopsis cylindrus CCMP1102.
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Bacillariophyceae; Bacillariophycidae; Bacillariales; Bacillariaceae;
OC Fragilariopsis.
OX NCBI_TaxID=635003 {ECO:0000313|EMBL:OEU17726.1, ECO:0000313|Proteomes:UP000095751};
RN [1] {ECO:0000313|EMBL:OEU17726.1, ECO:0000313|Proteomes:UP000095751}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1102 {ECO:0000313|EMBL:OEU17726.1,
RC ECO:0000313|Proteomes:UP000095751};
RG DOE Joint Genome Institute;
RA Mock T., Otillar R.P., Strauss J., Dupont C., Frickenhaus S., Maumus F.,
RA Mcmullan M., Sanges R., Schmutz J., Toseland A., Valas R., Veluchamy A.,
RA Ward B.J., Allen A., Barry K., Falciatore A., Ferrante M., Fortunato A.E.,
RA Gloeckner G., Gruber A., Hipkin R., Janech M., Kroth P., Leese F.,
RA Lindquist E., Lyon B.R., Martin J., Mayer C., Parker M., Quesneville H.,
RA Raymond J., Uhlig C., Valentin K.U., Worden A.Z., Armbrust E.V., Bowler C.,
RA Green B., Moulton V., Van Oosterhout C., Grigoriev I.;
RT "Extensive genetic diversity and differential bi-allelic expression allows
RT diatom success in the polar Southern Ocean.";
RL Submitted (SEP-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KV784357; OEU17726.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1E7FHR8; -.
DR EnsemblProtists; OEU17726; OEU17726; FRACYDRAFT_268783.
DR KEGG; fcy:FRACYDRAFT_268783; -.
DR InParanoid; A0A1E7FHR8; -.
DR OrthoDB; 2881978at2759; -.
DR Proteomes; UP000095751; Unassembled WGS sequence.
DR CDD; cd09212; PUB; 1.
DR CDD; cd02947; TRX_family; 1.
DR Gene3D; 1.20.58.2190; -; 1.
DR Gene3D; 1.10.8.10; DNA helicase RuvA subunit, C-terminal domain; 1.
DR Gene3D; 3.40.30.10; Glutaredoxin; 1.
DR InterPro; IPR036339; PUB-like_dom_sf.
DR InterPro; IPR018997; PUB_domain.
DR InterPro; IPR036249; Thioredoxin-like_sf.
DR InterPro; IPR017937; Thioredoxin_CS.
DR InterPro; IPR013766; Thioredoxin_domain.
DR InterPro; IPR009060; UBA-like_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR46340; UBX DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR46340:SF1; UBX DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF09409; PUB; 1.
DR Pfam; PF00085; Thioredoxin; 1.
DR SMART; SM00580; PUG; 1.
DR SUPFAM; SSF143503; PUG domain-like; 1.
DR SUPFAM; SSF52833; Thioredoxin-like; 1.
DR SUPFAM; SSF46934; UBA-like; 1.
DR PROSITE; PS00194; THIOREDOXIN_1; 1.
DR PROSITE; PS51352; THIOREDOXIN_2; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Reference proteome {ECO:0000313|Proteomes:UP000095751};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 1..104
FT /note="Thioredoxin"
FT /evidence="ECO:0000259|PROSITE:PS51352"
FT DOMAIN 264..293
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 140..200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 355..374
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 412..439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 173..200
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 547 AA; 59849 MW; 7C728E90FD9BEBE5 CRC64;
MTLIEISSAE ELKTFIQKNT VCVVTFSAHW CGPCKQSKPQ LEELAKASPV PISIVHESDI
GDYLHTFKVT AFPSYVLFVK ETEVQRIKGV NLKGVKDMIE AHADRADPAI PKAGGNTLGG
SHSAAEARAL RLAKLEAGAP KAAAAATTTS TEKAEDDDDD NKATPMETEE SKPAASDTGK
TDDTKMEDAT TEEDKKNPID DLDKEAIKTL TESMGFSLLR AQKGLLFSTG GTVESAVEWL
MEHQDDTDID EDIPEGSLGS ARSYKCNDCG KILSNMANLE LHANKTGHSD FEESTCIIVP
LTPEEKAAKI LEIKSLLASK RSEREEAEKV DQTEQEKRRR FMGKEMVKTK EVMEREARKR
EATARKREKT EIKRERDRIR AELAKDKAER IANKGKLLGK LGVDGYAPSA IQYNTSGGKD
DGDEEEPAAQ RPKTAGSTAS AANIDDYITK VSSYRAGGDG GKSLKILKLL VGNAADNPNE
DKYKKINMET NAYKNKVRPF VGAKKILLAV GFAPDEKDKT HLVLKEDADA ELLLSTKVKL
EQALAKF
//