ID G3X0Q8_SARHA Unreviewed; 532 AA.
AC G3X0Q8;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 54.
DE RecName: Full=PC4 and SFRS1-interacting protein {ECO:0000256|ARBA:ARBA00039324};
DE AltName: Full=Lens epithelium-derived growth factor {ECO:0000256|ARBA:ARBA00041831};
GN Name=PSIP1 {ECO:0000313|Ensembl:ENSSHAP00000021263.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021263.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021263.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021263.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the HDGF family.
CC {ECO:0000256|ARBA:ARBA00005309}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003762195.1; XM_003762147.2.
DR AlphaFoldDB; G3X0Q8; -.
DR STRING; 9305.ENSSHAP00000021263; -.
DR Ensembl; ENSSHAT00000021435.2; ENSSHAP00000021263.2; ENSSHAG00000018023.2.
DR eggNOG; KOG1904; Eukaryota.
DR GeneTree; ENSGT00940000154706; -.
DR HOGENOM; CLU_034054_1_0_1; -.
DR InParanoid; G3X0Q8; -.
DR TreeFam; TF105385; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000395; P:mRNA 5'-splice site recognition; IEA:Ensembl.
DR GO; GO:0009408; P:response to heat; IEA:Ensembl.
DR GO; GO:0006979; P:response to oxidative stress; IEA:Ensembl.
DR CDD; cd20151; PWWP_PSIP; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR036218; HIVI-bd_sf.
DR InterPro; IPR021567; LEDGF_IBD.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR PANTHER; PTHR12550; HEPATOMA-DERIVED GROWTH FACTOR-RELATED; 1.
DR PANTHER; PTHR12550:SF42; PC4 AND SFRS1-INTERACTING PROTEIN; 1.
DR Pfam; PF11467; LEDGF; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF140576; HIV integrase-binding domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 7..64
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 61..351
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 448..532
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..80
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 106..133
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..179
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 210..265
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 303..351
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..475
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 476..496
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 497..532
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 532 AA; 60637 MW; B79E2F2A919D4697 CRC64;
MTRDFKPGDL IFAKMKGYPH WPARVDEVPD GAVKPPTNKL PIFFFGTHET AFLGPKDIFP
YSENKDKYGK PNKRKGFNEG LWEIDNNPKV KFSSQQASTK QPNASSDVET EEKEISTSKE
DTDHEEKTSN EDVTKTMDIT TPKAARRGRK RKAEKQVETE EVGMVTAATT ATSVSPKVSP
KRGRPSATEV KVPKPRGRPK MVKPPCPSDN DNVTEEDKNK KKGQDEKQPK KQLKKEEEVQ
KEEDKPRKEP DKKEGKKEVE PKRKTTTKTG FVSTSDSEEE GDDQEGEKKR KGGRNFQAAH
RRNIIKGQHE KEAADRKRKQ EEQMETESQN KDESKKPEVK KVEKKRETSM DSRLQRIHAE
IKNSLKIDNL DVNRCIEALD ELASLQVTMQ QAQKHTEMIT TLKKIRRFKV SQVIMEKSTM
LYNKFKNMFL VGEGDSVITQ VLNKSLAEQR QHEEANKTKE QGKKGPNKKL DKEQTGSKTL
NGGSDAQDNN QPQHNGESNE DSKDRHEAIM KKKTSSEDRE PEKIPKDSTV EN
//