ID H9ETF8_MACMU Unreviewed; 530 AA.
AC H9ETF8;
DT 16-MAY-2012, integrated into UniProtKB/TrEMBL.
DT 16-MAY-2012, sequence version 1.
DT 27-MAR-2024, entry version 56.
DE RecName: Full=PC4 and SFRS1-interacting protein {ECO:0000256|ARBA:ARBA00039324};
DE AltName: Full=Lens epithelium-derived growth factor {ECO:0000256|ARBA:ARBA00041831};
GN Name=PSIP1 {ECO:0000313|EMBL:AFE65667.1};
OS Macaca mulatta (Rhesus macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE65667.1};
RN [1] {ECO:0000313|EMBL:AFE65667.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Caudate {ECO:0000313|EMBL:AFE65667.1}, Testis
RC {ECO:0000313|EMBL:AFI34384.1}, and Thymus
RC {ECO:0000313|EMBL:AFH27908.1};
RX PubMed=25319552; DOI=10.1186/1745-6150-9-20;
RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., Pandey S.,
RA Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., Tharp G.K.,
RA Marcais G., Roberts M., Ferguson B., Fox H.S., Treangen T., Salzberg S.L.,
RA Yorke J.A., Norgren R.B.Jr.;
RT "A new rhesus macaque assembly and annotation for next-generation
RT sequencing analyses.";
RL Biol. Direct 9:20-20(2014).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the HDGF family.
CC {ECO:0000256|ARBA:ARBA00005309}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JU321911; AFE65667.1; -; mRNA.
DR EMBL; JU471104; AFH27908.1; -; mRNA.
DR EMBL; JV044313; AFI34384.1; -; mRNA.
DR RefSeq; XP_014973138.1; XM_015117652.1.
DR AlphaFoldDB; H9ETF8; -.
DR GeneID; 664733; -.
DR KEGG; mcc:664733; -.
DR CTD; 11168; -.
DR eggNOG; KOG1904; Eukaryota.
DR HOGENOM; CLU_034054_1_0_1; -.
DR OrthoDB; 4271850at2759; -.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR CDD; cd20151; PWWP_PSIP; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR036218; HIVI-bd_sf.
DR InterPro; IPR021567; LEDGF_IBD.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR PANTHER; PTHR12550; HEPATOMA-DERIVED GROWTH FACTOR-RELATED; 1.
DR PANTHER; PTHR12550:SF42; PC4 AND SFRS1-INTERACTING PROTEIN; 1.
DR Pfam; PF11467; LEDGF; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF140576; HIV integrase-binding domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 2: Evidence at transcript level;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125}.
FT DOMAIN 7..64
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 87..350
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 446..530
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 106..133
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 208..263
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 301..350
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 446..473
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 474..494
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 495..530
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 530 AA; 60205 MW; 39BD49C33299B86E CRC64;
MTRDFKPGDL IFAKMKGYPH WPARVDEVPD GAVKPPTNKL PIFFFGTHET AFLGPKDIFP
YSENKEKYGK PNKRKGFNEG LWEIDNNPKV KFSSQQAATK QSNASSDVEV EEKETSVSKE
DTDHEEKTSN EDVTKAVDIT TPKAARRGRK RKAEKQVETE EAGVVTTATA SVNLKVSPKR
GRPAATEVKI PKPRGRPKMV KQPCPSESDI ITEEDKSKKK GQEEKQPKKQ LKKDEEGQKE
EDKPRKEPDK KEGKKEVESK RKNLAKTGVT STSDSEEEGD DQEGEKKRKG GRNFQTAHRR
NMLKGQHEKE AADRKRKQEE QMETEQQNKD EVKKPEVKKV EKKRETSMDS RLQRIHAEIK
NSLKIDNLDV NRCIEALDEL ASLQVTMQQA QKHTEMITTL KKIRRFKVSQ VIMEKSTMLY
NKFKNMFLVG EGDSVITQVL NKSLAEQRQH EEANKTKDQG KKGPNKKLEK EQTGSKTLNG
GSDAQDGNQP QHNGESNEES KDNHEASTKK KPSSEERETE ISLKDSTLDN
//