ID A0A267EQ00_9PLAT Unreviewed; 631 AA.
AC A0A267EQ00;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 13-SEP-2023, entry version 22.
DE RecName: Full=PHD-type domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=BOX15_Mlig012286g2 {ECO:0000313|EMBL:PAA63625.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA63625.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA63625.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA63625.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA63625.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA63625.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01001827; PAA63625.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A267EQ00; -.
DR STRING; 282301.A0A267EQ00; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd15553; PHD_Cfp1; 1.
DR Gene3D; 2.60.120.650; Cupin; 1.
DR InterPro; IPR037869; Spp1/CFP1.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR002857; Znf_CXXC.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR PANTHER; PTHR46174:SF1; AT26187P-RELATED; 1.
DR PANTHER; PTHR46174; CXXC-TYPE ZINC FINGER PROTEIN 1; 1.
DR Pfam; PF00628; PHD; 1.
DR Pfam; PF02008; zf-CXXC; 1.
DR SMART; SM00249; PHD; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR PROSITE; PS51058; ZF_CXXC; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00509}.
FT DOMAIN 47..97
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 367..416
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT REGION 1..45
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 105..235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 325..353
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 459..495
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 166..203
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 470..495
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:PAA63625.1"
SQ SEQUENCE 631 AA; 68254 MW; 61CBBE0ABA0CE965 CRC64;
ARSGVGSRRS PTGSPRLKVE TKAMQKAVHM SSAAPRTSSR PAKPPKPVYC LCRSTDSIRF
MIACDSCEEW FHGDCVSVTA SLAERIVAYY CEACRKRNRR LAVQYKPAAS PTANSKPDAK
HRRRGKQSAT RKRDASEASK SSNSSPAESA TRRKARNVKK QQDGSKADGP ASTLQNGRGG
SPSGRQQRRQ RGGQQSRRAE PAQPDSSPDT ARRRGRRRAR QSPIRQISND ANDEAIVDEA
LDDSIDDVSA ELVPDGGGSG GVGGNNGIVV VPDPPSLLSS PSAAAAAAAA AALGAVVSDV
DVDEDEAEAV VVDADDIAVE EEMLLPAEDE SATDDELETS GFRPPQMESD VEQDQNSLLD
EDDFASTSVF RFPKRCGYCE ACRLKADCGR CEVCQAKKRY PSFKLDSVDC LARQCRTMAK
LGGRLGSLPF PIQKRGRGRP SKASLSNEYF LYLSAGARPR RRDDYSGGHQ KQQRLSPAPA
ALLQQQQQQQ QRPTAAVAAP IADLGRKRRH HQQHQRLPVD EGLLVVQPAS RPQFCTPYGL
SYSDHSYCKH HASGGWQRQS PQPVSQLVSP VSSAAAVAAA LSAAASVSAS VAQPFSQQQQ
LIQQHQQQQA FYTLNDDTDN LKVTWPEEAA A
//