ID A0A182NSK7_9DIPT Unreviewed; 1411 AA.
AC A0A182NSK7;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 08-NOV-2023, entry version 31.
DE RecName: Full=Fe2OG dioxygenase domain-containing protein {ECO:0000259|PROSITE:PS51471};
OS Anopheles dirus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7168 {ECO:0000313|EnsemblMetazoa:ADIR010647-PA, ECO:0000313|Proteomes:UP000075884};
RN [1] {ECO:0000313|Proteomes:UP000075884}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WRAIR2 {ECO:0000313|Proteomes:UP000075884};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles dirus WRAIR2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ADIR010647-PA}
RP IDENTIFICATION.
RC STRAIN=WRAIR2 {ECO:0000313|EnsemblMetazoa:ADIR010647-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- COFACTOR:
CC Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC Evidence={ECO:0000256|ARBA:ARBA00001961};
CC -!- SIMILARITY: Belongs to the WD repeat TAF5 family.
CC {ECO:0000256|ARBA:ARBA00009435}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 7168.A0A182NSK7; -.
DR EnsemblMetazoa; ADIR010647-RA; ADIR010647-PA; ADIR010647.
DR VEuPathDB; VectorBase:ADIR010647; -.
DR OrthoDB; 3138699at2759; -.
DR Proteomes; UP000075884; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0051213; F:dioxygenase activity; IEA:UniProtKB-KW.
DR GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR GO; GO:0016705; F:oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen; IEA:InterPro.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR Gene3D; 1.25.40.500; TFIID subunit TAF5, NTD2 domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR InterPro; IPR006620; Pro_4_hyd_alph.
DR InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR InterPro; IPR037264; TFIID_NTD2_sf.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19879; TRANSCRIPTION INITIATION FACTOR TFIID; 1.
DR PANTHER; PTHR19879:SF5; WD REPEAT-CONTAINING PROTEIN 55 HOMOLOG; 1.
DR Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR Pfam; PF00400; WD40; 6.
DR SMART; SM00702; P4Hc; 1.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF160897; Taf5 N-terminal domain-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS51471; FE2OG_OXY; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 5.
DR PROSITE; PS50294; WD_REPEATS_REGION; 4.
PE 3: Inferred from homology;
KW Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW Iron {ECO:0000256|ARBA:ARBA00023004};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW WD repeat {ECO:0000256|PROSITE-ProRule:PRU00221}.
FT REPEAT 636..667
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 678..710
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 720..761
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 762..803
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 804..836
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 1291..1390
FT /note="Fe2OG dioxygenase"
FT /evidence="ECO:0000259|PROSITE:PS51471"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 301..323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 843..912
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1046..1076
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1119..1174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..27
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 306..320
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 846..892
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1130..1146
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1411 AA; 154963 MW; D54F8FDE6A5A25AF CRC64;
MNNSGSGGGS SSNNSNNNNP GSTSKSRKSK NDLIRSIVGS YLKARNYSVT DRYRKSDLIL
TQSTEQIVMN TAIKNDTAMV NSFLYSNICL NTNLTQVDQH FAKFAKFIRS QPETIRVELI
EIVPPLLCHL YIEMLKGRDW RPAIEFLRKH APMVGKLEPT PGPAGSSPLL LLSQQKINGT
IDGSPSEAGA FLPAATVAAV QSSVIVFAAG RTPDSARTDT YKRLIHKLSQ IARFQDCESE
PLVVQFRSGQ TQLRLHSASI DAMSQYLGKY GHSLILQTLR TWFFFDTSDD KNFADFHPDV
NGRPGQAPRT SATNGQLNGG GTPMELDDGA ESVGGRSLHD GYDFAEHTER ENERYLLLNG
FNAQYVRHRL VEAENGLAGS DRVIGTITLE DDDEDEYDSC MKDAQASQGG VLEQQQQQEE
EQLNAAAAAA AAAVGVDVGM DGMFPRMSSQ ERLRRLQESA ERLSLYQRPL CVYSLENVGH
QLTSLAIDPG CCHVASGFED STIMLWSTNR STQLGRKPYA CFRDRQCCWN VTSCDSRFSE
SDESGDDECA DDGAAVGGAE AEYRIAGRGD GYRDGTSQLD DGVDSMFDRK TPADIARMNK
LLPSHKRRLT KRERWKQFLE KRCLENTFSE TGGLALRGHG NAVTDLLFSE HAPLLVSVSR
DCTMRAWTGR DYACRAVYRG HNHPIWCVTE SPTGLYLATG SRDTTARLWS TDRRFPLQMY
VGHTQDVDTV AFHPNGNYLA TGSTDLSVRL WCVTSGKLLR IFTDCRQPVQ RICFSPDGKY
LAAGGEENRV RIFDLTAGSQ LTELKDHTAA ITCVAWSTDS SQFVSACADG TIRIWDAKRM
LLPPPSSSSA GGTSTSTASS TVAASSPGQP TGRATATLNN GTAPTSVPNP KMHITASGGG
PGGATISSDG SDSGDEFLNE LAVAKLFPFD ALASVEELAD LDQLDVDHIL NENNFLNNIN
NLNVFDGTDQ NHQEQQPEQL QEQFIEGGFV ATVDNNNRNV ATGDNCDTDV FFNNNNFVTS
PVLLGSGDSA GDQVVLFNQS EPTMAQSGSV MLLQQPQHQQ QKPQQPQNEQ RTSKVQLQQQ
QLAALLRQAG GVDGSLLSKV DPSDVAGLRL GGIKRNARKG RDLRTLSERQ RHQGAATPNG
TPASVDVSPV TMPGAMPPMT MTTSPAEGTE GGALDLDESE SLDEACRSLI RDMNEYGVCV
LDSFLGQERG RQVLDEVTGM YSSGVFRDGQ LVSNRGGNNL RHIRGDKITW IDGKEPGCSS
IGYLINRVDA VITNCKRMEN NGKLGLYNIK ERTKAMVACY PGSGSHYVKH VDNPNRDGRC
ITAIYYLNLD WDVRESGGLL RIFPEGCNDR VADIEPVFDR ILFFWSDRRN PHEVQPAHRT
RYAITLWYLD AEERESARLR YQKDCENRFT A
//