ID A0A182NTN5_9DIPT Unreviewed; 488 AA.
AC A0A182NTN5;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE RecName: Full=PWWP domain-containing protein {ECO:0000259|PROSITE:PS50812};
OS Anopheles dirus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7168 {ECO:0000313|EnsemblMetazoa:ADIR011026-PA, ECO:0000313|Proteomes:UP000075884};
RN [1] {ECO:0000313|Proteomes:UP000075884}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WRAIR2 {ECO:0000313|Proteomes:UP000075884};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles dirus WRAIR2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ADIR011026-PA}
RP IDENTIFICATION.
RC STRAIN=WRAIR2 {ECO:0000313|EnsemblMetazoa:ADIR011026-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the HDGF family.
CC {ECO:0000256|ARBA:ARBA00005309}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182NTN5; -.
DR STRING; 7168.A0A182NTN5; -.
DR EnsemblMetazoa; ADIR011026-RA; ADIR011026-PA; ADIR011026.
DR VEuPathDB; VectorBase:ADIR011026; -.
DR OrthoDB; 4271850at2759; -.
DR Proteomes; UP000075884; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd05834; PWWP_HRP; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR036218; HIVI-bd_sf.
DR InterPro; IPR021567; LEDGF_IBD.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR PANTHER; PTHR12550; HEPATOMA-DERIVED GROWTH FACTOR-RELATED; 1.
DR PANTHER; PTHR12550:SF49; JIL-1 ANCHORING AND STABILIZING PROTEIN, ISOFORM A; 1.
DR Pfam; PF11467; LEDGF; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF140576; HIV integrase-binding domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils}.
FT DOMAIN 10..60
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 146..266
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 465..488
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 287..314
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 146..176
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 194..209
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 218..249
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 488 AA; 54422 MW; B0E9B13EDE55143D CRC64;
MVASKKAFTI GDLVFAKVKG YPPWPAKITR IDKNKYNVYF YGTGETANIK KEDLFPYETS
KEKFATEKIM KRKGFKEAIL QIESAMSGDD PSPVSDQSLA YDVVAVQSGI DDFGEQKMSI
STAEASKATF NESLVSANST MMDTSKVKLE PEEKRVEDEN NSSSRPAVHM KRDNKPTPNV
KSNVAARKSA AMAATPVTVT NNGAQKKTAG NHEGNGVEAD AVDPKEVVSR SGRKIKMKRF
MDRDEEEGNS PAMAGPSAKK RAIRSPVKDK TVSTLATAKK LNAFDKIENE RLYVLKLERE
LVELNLEIKS SVKLNSADPE RCVKLMEQYE KLAVTPTILK KNPNCVETMK RLRKYVGNAK
AWNMGDKEKL KFDFQAQQIR QKAELIYEQF KTILGMSENS VPFWEAFREE VTKFEEATKH
LTQEELYLLI DEADVMVKEG HNALTNDAGA ADHDDDAEDR TFELKQLTDG TEEPEPTAPA
ESTTNAGD
//