ID A0A0L0CLG2_LUCCU Unreviewed; 1242 AA.
AC A0A0L0CLG2;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE RecName: Full=PHD finger protein 20-like protein 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=FF38_12723 {ECO:0000313|EMBL:KNC33208.1};
OS Lucilia cuprina (Green bottle fly) (Australian sheep blowfly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Oestroidea;
OC Calliphoridae; Luciliinae; Lucilia.
OX NCBI_TaxID=7375 {ECO:0000313|EMBL:KNC33208.1, ECO:0000313|Proteomes:UP000037069};
RN [1] {ECO:0000313|EMBL:KNC33208.1, ECO:0000313|Proteomes:UP000037069}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=LS {ECO:0000313|EMBL:KNC33208.1,
RC ECO:0000313|Proteomes:UP000037069};
RC TISSUE=Full body {ECO:0000313|EMBL:KNC33208.1};
RX PubMed=26108605; DOI=10.1038/ncomms8344;
RA Anstead C.A., Korhonen P.K., Young N.D., Hall R.S., Jex A.R., Murali S.C.,
RA Hughes D.S., Lee S.F., Perry T., Stroehlein A.J., Ansell B.R.,
RA Breugelmans B., Hofmann A., Qu J., Dugan S., Lee S.L., Chao H., Dinh H.,
RA Han Y., Doddapaneni H.V., Worley K.C., Muzny D.M., Ioannidis P.,
RA Waterhouse R.M., Zdobnov E.M., James P.J., Bagnall N.H., Kotze A.C.,
RA Gibbs R.A., Richards S., Batterham P., Gasser R.B.;
RT "Lucilia cuprina genome unlocks parasitic fly biology to underpin future
RT interventions.";
RL Nat. Commun. 6:7344-7344(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KNC33208.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JRES01000223; KNC33208.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0L0CLG2; -.
DR STRING; 7375.A0A0L0CLG2; -.
DR EnsemblMetazoa; KNC33208; KNC33208; FF38_12723.
DR OMA; IRAIWIK; -.
DR Proteomes; UP000037069; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd20104; MBT_PHF20L1-like; 1.
DR CDD; cd01396; MeCP2_MBD; 1.
DR CDD; cd20386; Tudor_PHF20-like; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 6.20.210.20; THAP domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR043449; PHF20-like.
DR InterPro; IPR006612; THAP_Znf.
DR InterPro; IPR038441; THAP_Znf_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR15856:SF51; MBD-R2; 1.
DR PANTHER; PTHR15856; PHD FINGER PROTEIN 20-RELATED; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF20826; PHD_5; 1.
DR Pfam; PF05485; THAP; 1.
DR SMART; SM00692; DM3; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00980; THAP; 1.
DR SMART; SM00355; ZnF_C2H2; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000037069};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 1..84
FT /note="THAP-type"
FT /evidence="ECO:0000259|PROSITE:PS50950"
FT DOMAIN 421..490
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 660..685
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 336..374
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 738..784
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 810..843
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 901..944
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1114..1149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..364
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 906..943
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1119..1133
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1242 AA; 137624 MW; C5A7A820DF808ADC CRC64;
MGRRCCITGC TSTSRLPEHH GVTYHSFPAD QNARSIWIRN TKISSDRHIT KSVLVCSRHF
RRADFQPQRS GKYLLKQKVF PTVFPWGKID AAQIEEDLEF LNSTGSSAPS NLLSTSSTVD
EDTKATVAAT VAQIMAQTAE LNAAAGIKLE KPESGLEESF NETAVVAANP STPPNVKFEP
VTSFNPGARL EAQDFDGVWH AARIVEVDND DREVLIKFEK TGKNKSTMAG TEEWIPMNST
RLRQRISTKP ILNFELEEKC LARWSGPRKF PGTVKKILPN DVYEILFDDG YIKNVRAIHM
NKVPTTTTSL EAAVVETIPA EVPADVPAAA AVLADTEEPV PVTPTTQAAK RPSSGQNSGS
QAKKRPANGG RKDWPLLDLS KLDLSTLNLP EIPKDGEWTC HWVNDQPIGR EGYLVVGEHR
KPTVIVEDWR LPPGWIKHMY QRSNVLGKWD VILVSPNGKR FRSKADLKSY LEDLGQYYNP
DIYDFSIHRR RAKDINCYVH TPDYVPQQPI KSKSSLNTSL DTSLETKTST IVGTLPTAVS
SPYMETPIAE PLPPIELLTT SSSSMAAEDK SVTQVEAEVP KQLLEAAEVS STPTPTSTPT
VGLKEEVVAS SDCATPTPVI TAAAVSSASV AVPAALESQK ADDGYALIGG LKVQIIDNLF
RCPQEGCSKN FRKENHLQIH VKHYHRNLIK LLGTCPKMLE LAEKRTHPVD NEASEPVPKN
QIPNQQFFAK LHQQDLEQTR AHRRSTGALK NSVDAVPSEI KTEPTEETAI KVEENMDTSQ
NDTSINTASS TVLESSFAST SAADETVATE SSIAAEAPPS KRSRFSPSKR TPGSRKSNRQ
RTTRKYLTAA QATAGVVATT TASSAPVPVP VADTSFSGVA EFEETRHSFN ATPDINKEVK
KRKTVLPSNT PLSSVDSPIT ADSGSNSFQP QNSTENDAIN QPPPQYIKEN GELIKIVHMR
QEEIINCLCG YCEEDGLMIQ CELCLCWQHG LCNGIDKVSQ VPDKYVCYIC RNPQRCRESL
RFKHDQDWLY EGKLPVANYH TSNNSVLNKR AEYLKRSHTL TGNLLELKNY MHSLRVKINI
ANNKCHPKLY LWAKKWDEDE VADVKNEIKL EAEDENKTNN PAMPLTPSKK IKSEPTTPAR
LVPNIPQPEA AIDPNECQQR LMEHIKIQQE LVMRRLNDIE AAIDVLDSED DLPDLREDDL
GTTTDVLAAF IKELDTVKQI AKLNSLEHTK LAYKNPIPTA IK
//