ID F7E4G6_HORSE Unreviewed; 804 AA.
AC F7E4G6;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 4.
DT 27-MAR-2024, entry version 73.
DE SubName: Full=Heteroous nuclear ribonucleoprotein U {ECO:0000313|Ensembl:ENSECAP00000005965.4};
GN Name=HNRNPU {ECO:0000313|Ensembl:ENSECAP00000005965.4,
GN ECO:0000313|VGNC:VGNC:49463};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000005965.4, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000005965.4, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000005965.4,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000005965.4}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000005965.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F7E4G6; -.
DR STRING; 9796.ENSECAP00000005965; -.
DR PaxDb; 9796-ENSECAP00000005965; -.
DR Ensembl; ENSECAT00000008027.4; ENSECAP00000005965.4; ENSECAG00000007955.4.
DR VGNC; VGNC:49463; HNRNPU.
DR GeneTree; ENSGT00940000156546; -.
DR HOGENOM; CLU_012140_0_0_1; -.
DR InParanoid; F7E4G6; -.
DR OMA; PHQFQQR; -.
DR OrthoDB; 5402316at2759; -.
DR TreeFam; TF317301; -.
DR Proteomes; UP000002281; Chromosome 30.
DR Bgee; ENSECAG00000007955; Expressed in inner cell mass and 23 other cell types or tissues.
DR GO; GO:0071013; C:catalytic step 2 spliceosome; IBA:GO_Central.
DR GO; GO:0005813; C:centrosome; IBA:GO_Central.
DR GO; GO:0070937; C:CRD-mediated mRNA stability complex; IBA:GO_Central.
DR GO; GO:0036464; C:cytoplasmic ribonucleoprotein granule; IBA:GO_Central.
DR GO; GO:0098577; C:inactive sex chromosome; IBA:GO_Central.
DR GO; GO:0000776; C:kinetochore; IBA:GO_Central.
DR GO; GO:1990498; C:mitotic spindle microtubule; IBA:GO_Central.
DR GO; GO:1990023; C:mitotic spindle midzone; IBA:GO_Central.
DR GO; GO:0000228; C:nuclear chromosome; IBA:GO_Central.
DR GO; GO:0016363; C:nuclear matrix; IBA:GO_Central.
DR GO; GO:0016607; C:nuclear speck; IBA:GO_Central.
DR GO; GO:0005697; C:telomerase holoenzyme complex; IBA:GO_Central.
DR GO; GO:0017130; F:poly(C) RNA binding; IBA:GO_Central.
DR GO; GO:0034046; F:poly(G) binding; IBA:GO_Central.
DR GO; GO:0036002; F:pre-mRNA binding; IBA:GO_Central.
DR GO; GO:1990841; F:promoter-specific chromatin binding; IBA:GO_Central.
DR GO; GO:0043021; F:ribonucleoprotein complex binding; IBA:GO_Central.
DR GO; GO:0099122; F:RNA polymerase II C-terminal domain binding; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0017069; F:snRNA binding; IBA:GO_Central.
DR GO; GO:0001097; F:TFIIH-class transcription factor complex binding; IBA:GO_Central.
DR GO; GO:0003714; F:transcription corepressor activity; IBA:GO_Central.
DR GO; GO:0070934; P:CRD-mediated mRNA stabilization; IBA:GO_Central.
DR GO; GO:0032211; P:negative regulation of telomere maintenance via telomerase; IBA:GO_Central.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:1902425; P:positive regulation of attachment of mitotic spindle microtubules to kinetochore; IBA:GO_Central.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0000381; P:regulation of alternative mRNA splicing, via spliceosome; IBA:GO_Central.
DR GO; GO:1902275; P:regulation of chromatin organization; IBA:GO_Central.
DR GO; GO:1901673; P:regulation of mitotic spindle assembly; IBA:GO_Central.
DR GO; GO:1990280; P:RNA localization to chromatin; IBA:GO_Central.
DR CDD; cd12884; SPRY_hnRNP; 1.
DR Gene3D; 2.60.120.920; -; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 1.10.720.30; SAP domain; 1.
DR InterPro; IPR001870; B30.2/SPRY.
DR InterPro; IPR043136; B30.2/SPRY_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR036361; SAP_dom_sf.
DR InterPro; IPR003877; SPRY_dom.
DR InterPro; IPR035778; SPRY_hnRNP_U.
DR PANTHER; PTHR12381:SF11; HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN U; 1.
DR PANTHER; PTHR12381; HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN U FAMILY MEMBER; 1.
DR Pfam; PF13671; AAA_33; 1.
DR Pfam; PF02037; SAP; 1.
DR Pfam; PF00622; SPRY; 1.
DR SMART; SM00513; SAP; 1.
DR SMART; SM00449; SPRY; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF68906; SAP domain; 1.
DR PROSITE; PS50188; B302_SPRY; 1.
DR PROSITE; PS50800; SAP; 1.
PE 1: Evidence at protein level;
KW Methylation {ECO:0000256|ARBA:ARBA00022481};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Proteomics identification {ECO:0007829|PeptideAtlas:F7E4G6};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 8..42
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT DOMAIN 249..445
FT /note="B30.2/SPRY"
FT /evidence="ECO:0000259|PROSITE:PS50188"
FT REGION 41..236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 652..736
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 750..780
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 79..99
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 122..154
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 216..236
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 652..668
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 804 AA; 87989 MW; A87AC0373CA04E4E CRC64;
MSSSPVNVKK LKVSELKEEL KKRRLSDKGL KAELMERLQA ALDDEEAGGR PAMEPGNGSL
DLGGDSAGRS GAGLEQEAAA GGDDDDEEDE EEEEGIAALD GDQMELGEEN GAAGAADAGP
MEEEEAASED ENGDDQGFQE GEDELGDEEE GAGDENGHGE QQPQPPAAPQ PQPPQQRGAA
KEAGGKSGGP TSLFAVTVAP PGARQGQQQA GGDGKAEQKG GDKKRGVKRP REDHGRGYFE
YIEENKYSRA KSPQPPVEEE DEHFDDTVVC LDTYNCDLHF KISRDRLSAS SLTMESFAFL
WAGGRASYGV SKGKVCFEMK VTEKIPVRHL YTKDIDIHEV RIGWSLTASG MLLGEEEFSY
GYSLKGIKTC NCETEDYGEK FDENDVITCF ANFESDEVEL SYAKNGQDLG IAFKISKEVL
AGRPLFPHVL CHNCAVEFNF GQKEKPYFPI PEDYTFIQNV PLEDRVRGPK GPEEKKDCEV
VMMIGLPGAG KTTWVTKHAA ENPGKYNILG TNTIMDKMMV AGFKKQMADT GKLNTLLQRA
PQCLGKFIEI AARKKRNFIL DQTNVSAAAQ RRKMCLFAGF QRKAVVVCPK DEDYKQRTQK
KAEVEGKDLP EHAVLKMKGN FTLPEVAECF DEITYVELQK EEAQKLLEQY KEESKKALPP
EKKQNTGSKK SNKNKSGKNQ FNRGGGHRGR GGFNMRGGNF RGGAPGNRGG YNRRGNMPQR
GGGGGGSGGI GYPYPRGPVF PSRGGYSNRG NYNRGGMPNR GNYNQNFRGR GNNRGYKNQS
QGYNQWQQGS VHVNVLCGRV KGPN
//