ID A0A2B4T1A4_STYPI Unreviewed; 959 AA.
AC A0A2B4T1A4;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 22-FEB-2023, entry version 11.
DE RecName: Full=THAP-type domain-containing protein {ECO:0000259|PROSITE:PS50950};
GN ORFNames=AWC38_SpisGene565 {ECO:0000313|EMBL:PFX34437.1};
OS Stylophora pistillata (Smooth cauliflower coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Pocilloporidae; Stylophora.
OX NCBI_TaxID=50429 {ECO:0000313|EMBL:PFX34437.1, ECO:0000313|Proteomes:UP000225706};
RN [1] {ECO:0000313|Proteomes:UP000225706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Voolstra C.R., Li Y., Liew Y.J., Baumgarten S., Zoccola D., Flot J.-F.,
RA Tambutte S., Allemand D., Aranda M.;
RT "Comparative analysis of the genomes of Stylophora pistillata and Acropora
RT digitifera provides evidence for extensive differences between species of
RT corals.";
RL bioRxiv 0:0-0(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PFX34437.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSMT01000004; PFX34437.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2B4T1A4; -.
DR STRING; 50429.A0A2B4T1A4; -.
DR Proteomes; UP000225706; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR006775; GH116_catalytic.
DR InterPro; IPR024462; GH116_N.
DR InterPro; IPR006612; THAP_Znf.
DR PANTHER; PTHR12654; BILE ACID BETA-GLUCOSIDASE-RELATED; 1.
DR PANTHER; PTHR12654:SF0; NON-LYSOSOMAL GLUCOSYLCERAMIDASE; 1.
DR Pfam; PF04685; DUF608; 1.
DR Pfam; PF12215; Glyco_hydr_116N; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Reference proteome {ECO:0000313|Proteomes:UP000225706};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00309}.
FT DOMAIN 1..64
FT /note="THAP-type"
FT /evidence="ECO:0000259|PROSITE:PS50950"
SQ SEQUENCE 959 AA; 108172 MW; E92D30F483B8DE50 CRC64;
MRFFKFTTAK SKRKIWIENV SAGLENFSPG SGTYGCANHF VDGKTDDLVP HLDALQFRNG
PSFVYANQSF GLQQFCVLYT EPENVSTIRF GWDTLGLRNY LFQNPGDPKM QRSPPMGMRS
AVPLGGLGTG SFELRADGSI HEWTIENQTP AGSAKLNQEA LDLAVFGVRV QTGTSSNVAL
LRTHPPDGYP GVASMGYSGS FPVSKLTVRD EQFGGISLDL YAYSALKPRD SKTSATPAVA
FTFRINNPTE KTVNVSLMFN LPLGIQTATA RLGESYKDIN MSTVSSTICS EACTKDPKCF
SWQIEIKNKT CLFFNETFPH YWKPGFTSGQ KNTWNAHDSM LTLNRPGNYP QSGNTTILTE
KSNNPSFMVS DSFGQIWKQF STHGYLLSAA KSFGGGFHGA AAINVTMEPG DENTLTMVLG
WYYPNRDFTE EIVGNYYSNI FKSSEEAALI VSRDPASTLR SISDWHSSMI LDSTKNSVQT
LPEWLQDVLV NGLSFWRTGL YLRDGRWRQF EALDCIDIDS VHNDFQREIP YVIFYPDLVK
KVMHAWAKYQ SEDGHIVETL VMGCYSPTRK MDSGPPNQRI MGDVTTVFIV ETYHIYQWTN
DTEFLRDLWP HVVHALDWLI YKGTNGTGLP YKQQCTYDIE ALNLYDHNVF NSFMYVLAMR
AAQELGTIMQ DRNVWREATI AMEFAKHVIS EELWSEEDGY YHAWWDKKLG SPPWLMSDSL
YAQVWAYTLG LGHLDDPRRL KSHLNKELEI NDTPYGLRVM FTGHPTNKSV GSCPQNISAG
ELNKLVAVRE SIWVGGSPDW TTLQIHLGLD PQNASHQAQK ALDHVRSELN DQWDFHGLYS
GPGYGLDGLP WCTSHYTFHM VLWHIPFAIS GQYFSAPNST LTFSPKFLCP YKIPFYTPFA
IGTLQCSVID KETMKFEMLS TSGDIYLQKL VVSGFQYPKS VELKQGEVIT WLSHDDILL
//