ID K0SH30_THAOC Unreviewed; 806 AA.
AC K0SH30;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=MYND-type domain-containing protein {ECO:0000259|PROSITE:PS50865};
DE Flags: Fragment;
GN ORFNames=THAOC_15003 {ECO:0000313|EMBL:EJK64279.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK64279.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK64279.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK64279.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- SIMILARITY: Belongs to the sel-1 family.
CC {ECO:0000256|ARBA:ARBA00038101}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK64279.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01017450; EJK64279.1; -; Genomic_DNA.
DR AlphaFoldDB; K0SH30; -.
DR EnsemblProtists; EJK64279; EJK64279; THAOC_15003.
DR eggNOG; ENOG502RZ2E; Eukaryota.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 6.10.140.2220; -; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR006597; Sel1-like.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR002893; Znf_MYND.
DR PANTHER; PTHR11102:SF147; FIBRONECTIN TYPE-II DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR11102; SEL-1-LIKE PROTEIN; 1.
DR Pfam; PF08238; Sel1; 3.
DR Pfam; PF01753; zf-MYND; 1.
DR SMART; SM00671; SEL1; 3.
DR SUPFAM; SSF81901; HCP-like; 1.
DR SUPFAM; SSF144232; HIT/MYND zinc finger-like; 1.
DR PROSITE; PS01360; ZF_MYND_1; 1.
DR PROSITE; PS50865; ZF_MYND_2; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00134}.
FT DOMAIN 518..559
FT /note="MYND-type"
FT /evidence="ECO:0000259|PROSITE:PS50865"
FT REGION 1..87
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 153..172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..59
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EJK64279.1"
SQ SEQUENCE 806 AA; 88797 MW; C68AE187F3CD371D CRC64;
VNEEDATVET STVDETESVP GEAETTAEEA SVEEATAEEI EDEVDGEGVD EEEATVETST
VDDAETIPGE AETTAEESVN AMQTTADDIS FEEYPYIAEE SSMSMSMPIA SSGFTANEID
AENNPELVEE LEEEEEVVED EEEELVMDEE KEIVEDEEKS DTLPQGGDPD TLPQISAADR
LTLQALTCDM IGKAVASSLD LVNEAGAVPW PVVAIPLGTS KNKRLRQTRG GKPLPGKARH
FHVQGEADLA LYDRHVRGGR GPGRGSRALA VEKGLVELTF LSDLEMSLFR RGRARTNARS
KQQPYVSFDP PKIEHLEEKT KASIKSYKIS CICVAVNAKN DALIRMIPFY VVFHCHFGEN
GIYQAVVVRF GRFQVCHPCG DTPGDVFSRF HPSGGLIPAN LAPMIGSLWT KGLRAAMVTP
KIRIGTATSA RPVRLSKARR GEQTHGHTVT RKKAMLKCPS SRAPPCSNRW QRSSGAVTRA
HSFSPSLTPT FFVHSKIIVI LDLTMSCVPV AGDDDGVCAN CGKQGSDTVK LKNCTACRLV
KYCGVDCQRA HRKQHKKACK QRADELKDEQ LYCQGYERPE VDFCPICTLP IPIPMDEHSG
LNVCCMKRIC HGCDYAAQKR GMHDCPFCRS PYPDNDDDLS MVQARVAKKD PEAINFLGEK
YFNGTLGLQK DVQKAVELWT EAAELGSIGA LYSLGTLHVF GDRAQEDKAK GIQLWKKAAM
RGHVLARHNL GVYEGHHKGN KDRAVRHLLI SAKMGDMSSL EHIKNFAMNG KATKNQYAQA
LKGYQDAVEA MKSHDRDEAK RFRYLK
//