ID R7T4C3_CAPTE Unreviewed; 596 AA.
AC R7T4C3;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=SET and MYND domain-containing protein 4 {ECO:0000256|ARBA:ARBA00026173};
GN ORFNames=CAPTEDRAFT_118237 {ECO:0000313|EMBL:ELT87827.1};
OS Capitella teleta (Polychaete worm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC Sedentaria; Scolecida; Capitellidae; Capitella.
OX NCBI_TaxID=283909 {ECO:0000313|EMBL:ELT87827.1};
RN [1] {ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ELT87827.1, ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELT87827.1,
RC ECO:0000313|Proteomes:UP000014760};
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:CapteP118237}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQN01015657; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB312122; ELT87827.1; -; Genomic_DNA.
DR AlphaFoldDB; R7T4C3; -.
DR STRING; 283909.R7T4C3; -.
DR EnsemblMetazoa; CapteT118237; CapteP118237; CapteG118237.
DR HOGENOM; CLU_021727_3_0_1; -.
DR OMA; HERKGRC; -.
DR Proteomes; UP000014760; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd10536; SET_SMYD4; 1.
DR Gene3D; 1.10.220.160; -; 1.
DR Gene3D; 6.10.140.2220; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR044421; SMYD4_SET.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR002893; Znf_MYND.
DR PANTHER; PTHR46165:SF8; RE32936P; 1.
DR PANTHER; PTHR46165; SET AND MYND DOMAIN-CONTAINING PROTEIN 4; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF01753; zf-MYND; 1.
DR SUPFAM; SSF144232; HIT/MYND zinc finger-like; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS50865; ZF_MYND_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000014760};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00134}.
FT DOMAIN 178..449
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 223..262
FT /note="MYND-type"
FT /evidence="ECO:0000259|PROSITE:PS50865"
SQ SEQUENCE 596 AA; 67691 MW; F0FE9ADCE510145A CRC64;
MFNKVTSKLK SESKIDGISK EFGALSSDQD RASYALSLAA VEEFIRPGPL VQLKSVEHSL
SFKTTGNTAY RASEYRDAVQ LYTDSIAWAP MLTDGGEDAL SLAYGNRSAA LYQLQQYEAC
IRDIDRALTE QYPPRLLHKV FHRKAMCRLN LEQFEAAEES FLQPNQPDSH DQYVSLSSAC
SVASTENEGR FITAVRNIAP GEIVLIEKPF ASVLLRANYS NHCHHCLKHT LEGIPCRTCP
DARFCSEACR DTAMQTYHQY ECSVLNTLHH SQINKFGCLA FRAITKQSYQ SLKDIRAQDL
PLNGCHSDGL YRPQDYNTII QLVTHAKDRP VQDLFHRTVM AVYLLKLLQQ TSYFNGEEDV
EMQAYIAGLF LSHLQSFPCN AHEVPELYLD PNAIDLSMPN ELGAGIYSTL SLFNHSCDPG
VNRNFYGDTC VVRAIKTIRK GHQVSDNYGA LYATNTLKER HDKLQPQYFF SCRCEPCSND
WPLYQKINID SPRYKCTQCQ KEMTRDDITN CCSSNIEEIR AKFNKSEVEF RSAFEDLLAC
RVEEALPVFL RHLALIQEMI VLPWRQFNDC QEALKQCYSL MGNSNTSCAL HNGNIF
//