ID A0A016RX06_9BILA Unreviewed; 1422 AA.
AC A0A016RX06;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=SET domain protein {ECO:0008006|Google:ProtNLM};
GN Name=Acey_s0347.g3160 {ECO:0000313|EMBL:EYB82920.1};
GN ORFNames=Y032_0347g3160 {ECO:0000313|EMBL:EYB82920.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB82920.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYB82920.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001683; EYB82920.1; -; Genomic_DNA.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043933; P:protein-containing complex organization; IEA:UniProt.
DR CDD; cd15666; ePHD2_KMT2C_like; 1.
DR Gene3D; 3.30.160.360; -; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR003889; FYrich_C.
DR InterPro; IPR003888; FYrich_N.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR45888; HL01030P-RELATED; 1.
DR PANTHER; PTHR45888:SF6; HL01030P-RELATED; 1.
DR Pfam; PF05965; FYRC; 1.
DR Pfam; PF05964; FYRN; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF13771; zf-HC5HC2H; 1.
DR SMART; SM00542; FYRC; 1.
DR SMART; SM00541; FYRN; 1.
DR SMART; SM00249; PHD; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51805; EPHD; 1.
DR PROSITE; PS51543; FYRC; 1.
DR PROSITE; PS51542; FYRN; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 898..1007
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51805"
FT DOMAIN 1302..1421
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 282..309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 421..455
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 675..707
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1220..1248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 177..204
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 289..306
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1222..1248
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1422 AA; 158882 MW; 918687B21AF53151 CRC64;
MAKKKKDQKF AGIISLNVEI GNKDVGAYVL LNASLEMGVK ISTPMVNFYQ YENMLEKYPD
WNDRVKHIQR LWRVLDTENR QDFVSRAREN RANRGKQPRV KRVVQQQMQA NVDERFKVPN
VPGGAQMRPE YLQSDMGGQP NQEVPMQQIR STAHLTQPVL EQYELMRTRT LDLQKHQQVI
ESDLNRMRKQ KKNLAAKRRQ MMKSAGTDAE GKQIPVDLNE QDRMALQTLL DQIPARQKDL
ESCKRDLKSH LATVYEFEHK WNIIRNVEPG DAMRIAMAQR AAAGGAPQPG GPVVPPPGPP
LSSPTAMPGA APQFHSQLPG NSPQFQQMRV PQQVPYGAVC GPQQEMLQRT PVQVPRAPMW
TRVPAPVLGG IFYERLQTSF EKEVYECLDD IVSRVSMHFD GRDPAREGSG MLKRLLEPAM
GQQMPPMGAP GPQGPASHLM DQLEPPRPKK KRTQQKKFET LGMGNEYDLM VERVNTQLRL
CEQLPKRALE PAPRNPGAAF ATMGISDLPD RRDKRALVGN EFGNLSLSFV DDYYTGPGRG
IECLEMSSLS MSDLAPQANL PALLGMPPPP IYEMVVDAED AAYGTFDQWV VFAHALDRQP
SADHMRVTSK IPEFTIPLEP RRKVFDMKPE TVEEVEVSLV VKEESNVPPG TGVAALFEQL
RSILGVEQHI DYQLDTPPLS PEPSTKTEIT ENVKREPAEP AAPVSGGRCR ACDRSMEAVL
IQQTMSQLGL TPSDDEKDDT VCFCSMKCYY QFVAASKVAL SPDQLTAAES HVDEETLAKL
RQISAESFAK CINQGKMRMD LPAAPAVPPD AFLTSPRDTR YVMDEGRRDN VQIIRVADLA
SIHDVSQRKD PSRNPGDDWK TYSRDLLQSF FKIQQTKHEL ALSPKMGVGL GTSFELDRRV
CVLCGGIGDG DPALCGRLLN LSANLWVHVN CAMWSTEVFE SSSGALLHVD RAIVRAAGVI
CHLCGRVGAS VQCHKVDCNV NFHLPCAAKI QTAKFIKDKT FFCTKHPEIS PDVMVSSLEA
LRRIYIERDE NALLSRLFEV SDSNTRFCLR LGALCFFQVG QLLPEQLKTF HNATHIFPNG
YHASRWFWSP NDPRTRQLYD CQIKEADHRP KFVVSCEGRT FEGDSASAAW SEVVTAVERL
RAQTDALRFF AKGIQGETLF GLNESAITKI TESLPGVDGL FTYTFRHPGS PLLDLPLAVN
PSGCARCEPR FRTLIKHKQR ALASAPSTSR STQDAGSSSS GRTRGRVGSS FTDELATAQM
RAMLQASGIG AEWALALGGR EPFSSNSQSY TLYQKMRKDW RQTVYLARSK IQGLGLYAKR
DIHMGDMIIE YKGEVIRSEV GEMREKRYVA QNRGVYMFRI DEDLLVDATM AGGPARYINH
SCDPNCSTRI LAAGPYPEDK KIIITANRPI KALEEVLKYL CC
//