ID A0A016U7W9_9BILA Unreviewed; 1033 AA.
AC A0A016U7W9;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=DZF domain-containing protein {ECO:0000259|PROSITE:PS51703};
GN Name=Acey_s0053.g2423 {ECO:0000313|EMBL:EYC11006.1};
GN Synonyms=Acey-Y95B8A.8 {ECO:0000313|EMBL:EYC11006.1};
GN ORFNames=Y032_0053g2423 {ECO:0000313|EMBL:EYC11006.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC11006.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC11006.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001389; EYC11006.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016U7W9; -.
DR STRING; 53326.A0A016U7W9; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.10.1410.40; -; 1.
DR Gene3D; 3.30.460.10; Beta Polymerase, domain 2; 1.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 3.
DR InterPro; IPR006561; DZF_dom.
DR InterPro; IPR049402; DZF_dom_C.
DR InterPro; IPR049401; DZF_dom_N.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR043519; NT_sf.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45762; ZINC FINGER RNA-BINDING PROTEIN; 1.
DR PANTHER; PTHR45762:SF19; ZINC-FINGER PROTEIN AT 72D, ISOFORM B; 1.
DR Pfam; PF20965; DZF_C; 1.
DR Pfam; PF07528; DZF_N; 1.
DR Pfam; PF12874; zf-met; 3.
DR SMART; SM00572; DZF; 1.
DR SMART; SM00355; ZnF_C2H2; 3.
DR SMART; SM00451; ZnF_U1; 3.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 3.
DR PROSITE; PS51703; DZF; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT DOMAIN 600..1014
FT /note="DZF"
FT /evidence="ECO:0000259|PROSITE:PS51703"
FT REGION 985..1033
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 985..1016
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1033 AA; 111237 MW; 98262B0452BC03A1 CRC64;
MGGVGRVCAF RIRRTIDFLA TRRPLISARG FRLYSSFFSF VLSETVVNLL SNGYHVNYWV
QSAMYGYSQG NYGNYGSAYT AAAAAAAVYG NLGVGQSTQA QQQSNSVFGS STTNPYATYS
NSVAASYGYG NQSTNAASVY AAQAAQAQVQ ASQVSRLSAD AAYAAAAGVA PSQYSAFGAG
QLAGFGSTQA NAGATVGYGR PSASATNQAA KSLAALTNSS NKSALAVNSS STYSNYDAAV
YAAASSYLHS KATGTTNMWM SKKPGGVGRG GGFANKRFGA GAGTQKEAQQ FYCEVCKISC
AGQLTYKEHL EGQRHKKKEA LAKGENNPSL PKSKVSFRCD LCNVTCTGQD TYNAHVRGAK
HQKTLTLCKK LGKPIPSTEP TIIPPSEMGG VIPPPPATTS AAATAGANNS AKRVVGISTV
NFVAGAKLSS TAGQLEAKKQ QVMQAVGSAG TKAPEDSTVQ AIKGQTEELQ ALIAAEQNLK
PVGEEFVEAQ RDATGKLLQY LCKLCDCKFS DPNAKEIHLK GRRHRLQYKM KVDPNLEVEV
KQAANRRGAK YERGRGPMGV IPRGPPSTAL FSMAIARGPF GPAAPGMRGP WFGNGPIDGR
RFETTDDRHV QAKHTSIYPD DEQLTVIERL VTETEKALKK VSDYFNERDH LETKPQIKAA
AAGESAKDTA VQEKDRLLKG VMRVGMLSKG LLLKDDTEVH LVVLCSHIPG LSLLKEVATL
IPKYYESPEG SSINITVEES TSSMILQQSS VPLRCRISLT SREFREDAAG DAAPAPPPGD
ALNKEACLKA LAQLRHAKWF QRDGIWYSVS ETTSPPERSG IQRRRGCDVC QVAHVTSKSL
VDIIKITMRR DTQIRDEDAR CIYLQSCQVT LRILRDIRAR IESWRPLNDW MCELLVEKVL
SSCLAPLSVG DALRRFFEAV ASGVLLKTGP GLPDPCEKES VDVLAPLTGQ EREAITASAQ
HALRLIAFNQ IYKVLCLERL PDVRPPLTDR KRPMDTSNAS DEVKKDKKEG ENGHSAEEEN
SSGVAVKMEK MEV
//