ID G3N5A5_GASAC Unreviewed; 578 AA.
AC G3N5A5;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE RecName: Full=PWWP domain-containing protein {ECO:0000259|PROSITE:PS50812};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000000475.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000000475.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000000475.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the HDGF family.
CC {ECO:0000256|ARBA:ARBA00005309}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3N5A5; -.
DR STRING; 69293.ENSGACP00000000475; -.
DR Ensembl; ENSGACT00000000475.1; ENSGACP00000000475.1; ENSGACG00000000370.1.
DR eggNOG; KOG1904; Eukaryota.
DR GeneTree; ENSGT00940000153942; -.
DR InParanoid; G3N5A5; -.
DR OMA; WISLKND; -.
DR TreeFam; TF105385; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000000370; Expressed in muscle tissue and 12 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd05834; PWWP_HRP; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR036218; HIVI-bd_sf.
DR InterPro; IPR021567; LEDGF_IBD.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR PANTHER; PTHR12550; HEPATOMA-DERIVED GROWTH FACTOR-RELATED; 1.
DR PANTHER; PTHR12550:SF78; HEPATOMA-DERIVED GROWTH FACTOR-RELATED PROTEIN 3; 1.
DR Pfam; PF11467; LEDGF; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF140576; HIV integrase-binding domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635}.
FT DOMAIN 7..60
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 80..456
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 557..578
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 90..109
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 110..124
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..166
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..181
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 230..286
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 310..325
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 333..417
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 578 AA; 63802 MW; 82628392A78EF7BB CRC64;
SRDQYKQGDL VFAKMKGFPH WPARIFKPDN GNKKRVLVFF FGTHQIGQVL LKNIVPFVGN
KMKYGCGVRS KGFSEGMWEI QNTPGVGSKP KPPAKAPPAK APPTKAPPAT SSDKPSQDQL
KSTASSKRLT RRQEAARVSL RSAPQKTLES KPSSGETSAS TRSRRSAAGG RSEEEKEMAS
STRKTTTCCD LDRLSEKPQA APTLMSKNKG GPREEEESSQ SQASQAQDHR AKESPSERRG
VKRKSDYGES AKEEEEKPKK TKPDEGGVDG HKETMKTPEG RVMKTRRAGK TQTAPPGGQE
RMSAEVSEGP PEGRQRREEV PSRGLKEVSH GLFLNTRQEE EQRRNHREDR EGNSEEEPRA
TAKSQEKKKK KEFEAGKAAP TEHEMEKKSG KSAEEDAPLT RKLCAEYETK PQKKSDNPVA
EESGGAAAAA SEEAQSGSSG GAEAERRSPA MTLTDSTLHR IHGDIRISLK TDNPDIRKCL
TALDQLSMVY VTCKHVQRHS ELVSTLRKLR SYRANRAVMD KAAMLYSRFK NAFLVGEGEE
VVSAAFLRSL LEEKEREEAQ RAERGRGKGG RPRPGGED
//