GenomeNet

Database: UniProt
Entry: A0A226EW66_FOLCA
LinkDB: A0A226EW66_FOLCA
Original site: A0A226EW66_FOLCA 
ID   A0A226EW66_FOLCA        Unreviewed;       844 AA.
AC   A0A226EW66;
DT   25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT   25-OCT-2017, sequence version 1.
DT   24-JAN-2024, entry version 15.
DE   SubName: Full=Protein msta, isoform A {ECO:0000313|EMBL:OXA61430.1};
GN   ORFNames=Fcan01_03055 {ECO:0000313|EMBL:OXA61430.1};
OS   Folsomia candida (Springtail).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC   Entomobryomorpha; Isotomoidea; Isotomidae; Proisotominae; Folsomia.
OX   NCBI_TaxID=158441 {ECO:0000313|EMBL:OXA61430.1, ECO:0000313|Proteomes:UP000198287};
RN   [1] {ECO:0000313|EMBL:OXA61430.1, ECO:0000313|Proteomes:UP000198287}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=VU population {ECO:0000313|EMBL:OXA61430.1,
RC   ECO:0000313|Proteomes:UP000198287};
RC   TISSUE=Whole body {ECO:0000313|EMBL:OXA61430.1};
RA   Faddeeva A., Derks M.F., Anvar Y., Smit S., Van Straalen N., Roelofs D.;
RT   "The genome of Folsomia candida.";
RL   Submitted (DEC-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OXA61430.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LNIX01000001; OXA61430.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A226EW66; -.
DR   STRING; 158441.A0A226EW66; -.
DR   Proteomes; UP000198287; Unassembled WGS sequence.
DR   GO; GO:0043229; C:intracellular organelle; IEA:UniProt.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   Gene3D; 6.10.140.2220; -; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR   Gene3D; 1.20.58.320; TPR-like; 1.
DR   InterPro; IPR010323; DUF924.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   InterPro; IPR002893; Znf_MYND.
DR   PANTHER; PTHR46455:SF1; SET AND MYND DOMAIN CONTAINING, ARTHROPOD-SPECIFIC, MEMBER 2; 1.
DR   PANTHER; PTHR46455; SET AND MYND DOMAIN CONTAINING, ARTHROPOD-SPECIFIC, MEMBER 4, ISOFORM A; 1.
DR   Pfam; PF06041; DUF924; 1.
DR   Pfam; PF00856; SET; 1.
DR   Pfam; PF01753; zf-MYND; 1.
DR   SUPFAM; SSF144232; HIT/MYND zinc finger-like; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   SUPFAM; SSF48452; TPR-like; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Reference proteome {ECO:0000313|Proteomes:UP000198287};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT   DOMAIN          224..511
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
SQ   SEQUENCE   844 AA;  95685 MW;  6DDF9742164E71A5 CRC64;
     MTSESSPEEI LAFWFGPHGE ERPFWFSSTP SWDAKIRQRF LQQHEKAASG VYAATWTLEP
     RSTLALILLL DQVPRNIFRG SARQYATGAE ALEVAKIALA RGFVNDLKLD AEKKFCIMPF
     CHSEMLEDQD QAVELFKVFK KSQPGWYEYA KGYHEIIRSK ITVTVLGQCI VCEEVTTLRC
     QSCLKGGKEA DEIPFYCSRD HQSKHWKAGH KANCGQLPKK PLGWKLEHGP CTLSQNVPEG
     ISYFVATEDL EPGHVLFQRK PLLAFPAIPS PQDFNNFLEK MPPSVKTKML NSEDRLVWTC
     VGCYEVLVYG EVTQTIATGL MVNKCVQCGW PLHSNIHPKY GPTKCQAVHE EECKILMERG
     LTWETLQKEK LYRDPEFYYH LGLLRAVMLP PQLKEELMNL PVFIPAYFRF FNLKLAIPFV
     RERCGMGDKV GEEEAQKILE VLFGNILGPP IQHSTSKNKL AFIVYAGGPT GRVMTHDCTP
     NCYPYIGKDL TATFKTIRKV AAGELLTINY IMISNHCKTV DERLTLFTQL LLPPCRCARC
     QSSTEDGTYF SAIKCPADKK TGDGKRECQY LLPMPNPNHT QGFKDIFIWK CACTACTGAT
     RQSAVEQLVS TIKEKIQTAT RSPNAYDELK SIVARNSGRT VHPDHAVILY ALLNIAATAF
     RHITNVQANK DPAFSFNLHK KYMTDFNIVL EKMQVIQPGR SDHYGLLLFG QTLLRMSELA
     ENVLESQSKP SLEDLDTKLT LILSALKESW TIINANQIAE PYDTARESRL RKFYANHLIS
     SIRKGMGEDC LDKVSSVKKL KEKYFSEVPV NLDMSEKELE YRELLVIDET VPNLIKKMEE
     HGIV
//
DBGET integrated database retrieval system