ID F7AZI4_HORSE Unreviewed; 1279 AA.
AC F7AZI4;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=UPF2 regulator of nonsense mediated mRNA decay {ECO:0000313|Ensembl:ENSECAP00000014909.3};
GN Name=UPF2 {ECO:0000313|Ensembl:ENSECAP00000014909.3,
GN ECO:0000313|VGNC:VGNC:24805};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000014909.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000014909.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000014909.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000014909.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000014909.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F7AZI4; -.
DR STRING; 9796.ENSECAP00000014909; -.
DR PaxDb; 9796-ENSECAP00000014909; -.
DR Ensembl; ENSECAT00000018288.4; ENSECAP00000014909.3; ENSECAG00000017022.4.
DR VGNC; VGNC:24805; UPF2.
DR GeneTree; ENSGT00530000064318; -.
DR HOGENOM; CLU_002633_2_1_1; -.
DR InParanoid; F7AZI4; -.
DR OMA; DFQHHQI; -.
DR OrthoDB; 276824at2759; -.
DR TreeFam; TF300543; -.
DR Proteomes; UP000002281; Chromosome 29.
DR Bgee; ENSECAG00000017022; Expressed in oviduct epithelium and 23 other cell types or tissues.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0036464; C:cytoplasmic ribonucleoprotein granule; IEA:Ensembl.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0035145; C:exon-exon junction complex; IBA:GO_Central.
DR GO; GO:0005844; C:polysome; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0042162; F:telomeric DNA binding; IEA:Ensembl.
DR GO; GO:0031100; P:animal organ regeneration; IEA:Ensembl.
DR GO; GO:0001889; P:liver development; IEA:Ensembl.
DR GO; GO:0000184; P:nuclear-transcribed mRNA catabolic process, nonsense-mediated decay; IBA:GO_Central.
DR Gene3D; 1.25.40.180; -; 3.
DR Gene3D; 4.10.80.160; -; 1.
DR Gene3D; 6.10.250.770; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR InterPro; IPR039762; Nmd2/UPF2.
DR InterPro; IPR007193; Upf2/Nmd2_C.
DR PANTHER; PTHR12839; NONSENSE-MEDIATED MRNA DECAY PROTEIN 2 UP-FRAMESHIFT SUPPRESSOR 2; 1.
DR PANTHER; PTHR12839:SF7; REGULATOR OF NONSENSE TRANSCRIPTS 2; 1.
DR Pfam; PF02854; MIF4G; 3.
DR Pfam; PF04050; Upf2; 1.
DR SMART; SM00543; MIF4G; 3.
DR SUPFAM; SSF48371; ARM repeat; 3.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 166..362
FT /note="MIF4G"
FT /evidence="ECO:0000259|SMART:SM00543"
FT DOMAIN 576..765
FT /note="MIF4G"
FT /evidence="ECO:0000259|SMART:SM00543"
FT DOMAIN 780..993
FT /note="MIF4G"
FT /evidence="ECO:0000259|SMART:SM00543"
FT REGION 1..124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 423..443
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1025..1105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1227..1279
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1032..1085
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1086..1104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1279 AA; 148693 MW; E97CAB6DCFA63C5D CRC64;
MPAERKKPAS MEEKESLLNN KEKDCGERRP VSSREKPKDE IKLTAKKEVI KVPEDKKKKL
EEDKRKKEDK ERKKKEEEKV KAEEELKKKE EEKKKHEEEE RKKQEEQAKR QQEEEAAQLK
EKEESLQLHQ EAWERHQLRK ELRSKNQNAP DSRPEENFFS RLDSSLKKNT AFVKKLKTIT
EQQRDSLSHD FNGLNLSKYI AEAVASIVEA KLKISDVNCA VHLCSLFHQR YADFAPSLLQ
VWKKHFEARK EEKTPNITKL RTDLRFIAEL TIVGIFTDKE GLSLIYEQLK NIINADRESH
THVSVVISFC RHCGDDIAGL APRKVKSAAE KFNLGFPPSE IISPEKQQPF QNLLKEYFTS
LTKHLKRDHR ELQNTERQNR RILHSKGELS EDRHKQYEEF AMSYQKLLAN SQSLADLLDE
NMPDLPQDKP TPEEHGPGID IFTPGKPGEY DLEGGIWEDE DARNFYENLI DLKAFVPAIL
FKDNEKSCQN KESNKDDSKE ILLLYFRTEA KEPKDSKEVS SPDDLELELE NLEISDDTLE
LEGGDEAEDL TKKLLDEQEQ EDEEASTGSH LKLIVDAFLQ QLPNCVNRDL IDKAAMDFCM
NMNTKANRKK LVRALFIVPR QRLDLLPFYA RLVATLHPCM SDVAEDLCSM LRGDFRFHVR
KKDQINIETK NKTVRFIGEL TKFKMFTKND TLHCLKMLLS DFSHHHIEMA CTLLETCGRF
LFRSPESHLR TSVLLEQMMR KKQAMHLDAR YVTMVENAYY YCNPPPAEKT VKKKRPPLQE
YVRKLLYKDL SKVTTEKVLR QMRKLPWQDQ EVKDYVICCM INIWNVKYNS IHCVANLLAG
LVLYQEDVGI HVVDGVLEDI RLGMEVNQPK FNQRRISSAK FLGELYNYRM VESAVIFRTL
YSFTSFGVNP DGSPSSLDPP EHLFRIRLVC TILDTCGQYF DRGSSKRKLD CFLVYFQRYV
WWKKSLEVWT KDHPFPIDID YMISDTLELL RPKIKLCNSL EESIRQVQDL EREFLIKLGL
VNDKDSKDSM TEGENLEEDE EEEEGGAETE EQSGNESEVN EPEEEEGSDN DDDEGEEEEE
ENTDYLTDSN KENETDEENT EVMIKGGGLK HVPCVEDEDF IQALDKMMLE NLQQRSGESV
KVHQLDVAIP LHLKSQLRKG PPLGGGEGEA ESADTMPFVM LTRKGNKQQF KILNVPMSSQ
LAANHWNQQQ AEQEERMRMK KLTLDINERQ EQEDYQEMLQ SLAQRPAPAN TNRERRPRYQ
HPKGAPNADL IFKTGGRRR
//