ID A0A0B2PUP5_GLYSO Unreviewed; 1116 AA.
AC A0A0B2PUP5;
DT 04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT 04-MAR-2015, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Pre-mRNA 3'-end-processing factor FIP1 {ECO:0000313|EMBL:KHN12870.1};
GN ORFNames=glysoja_029404 {ECO:0000313|EMBL:KHN12870.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:KHN12870.1};
RN [1] {ECO:0000313|EMBL:KHN12870.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Root {ECO:0000313|EMBL:KHN12870.1};
RA Lam H.-M., Qi X., Li M.-W., Liu X., Xie M., Ni M., Xu X.;
RT "Identification of a novel salt tolerance gene in wild soybean by whole-
RT genome sequencing.";
RL Submitted (JUL-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the FIP1 family.
CC {ECO:0000256|ARBA:ARBA00007459}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN662943; KHN12870.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0B2PUP5; -.
DR Proteomes; UP000053555; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR007854; Fip1_dom.
DR InterPro; IPR044976; FIPS5-like.
DR PANTHER; PTHR36884; FIP1[III]-LIKE PROTEIN; 1.
DR PANTHER; PTHR36884:SF4; FIP1[III]-LIKE PROTEIN; 1.
DR Pfam; PF05182; Fip1; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242}.
FT DOMAIN 181..218
FT /note="Pre-mRNA polyadenylation factor Fip1"
FT /evidence="ECO:0000259|Pfam:PF05182"
FT REGION 1..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 328..349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 371..422
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 966..1015
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1073..1116
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..39
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 65..95
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..343
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 371..419
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 980..1014
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1073..1104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1116 AA; 127153 MW; C977746046F77EA3 CRC64;
MEGLDDDDFG ELYTVDIEVP NAIPEEEEEE EEEDEIDVNS NIVNLDKDMD SETVNDDCAA
ESDSSDDDDG LKIVLNDEDI PVGADARDEG NGDNGDDNNG SRFFHPKTGR SRGLAILNNM
KANASMGMAS YISSLNKGRR NGDACIQNLA LSSSRVCLAA NPMAVQCGYG SALPWYWGIF
DVNTDTLTEK LWKVPGVDIT DYFNFGFNES TWKLYCSSLV DAGIIYLSIP LASCTGISAT
EQLWRTSLQT GISVDDAANW NQEVMREQTD QVVSGNAFFP SSDCGLPKGR AIQVEDSMVE
RQPSIDVRRP RNRDFNVIEI KLLDSSDDCS GSGNSTVMNA SLEGESMAGN KRSVLNSSGE
LNEMLSEDQL EDVKKAEDSS LHRRSGPIPG VDGDEHRDQA DQHSEDTAEV PEGETKAEEG
GGIDACSSYP CWIESELSLG DQEHSLTSYT DSDSEAMDNS VQVDNDKSFS PLKRKSLNCV
TDMKESLPLC WKNSKNNSIN KKAVSAAYNS RTRGQFRKEW RHRSGGYEPS SYDMNKHTEN
DNDVSILKSS ARNLSLLARR PVDYGRHKDR LQVFGSHKIR DLSCNRETKQ SYYYGDEKVV
DELVACRSKY YHEDQESLRE NTNRHDRKNG DVEDYFFEPG PRFADSEDRE RDWYHLGCEY
SSDNLSPCSY RESRKFPPKH SSFPDEERYT QRKRMDGKSH FIDRNCIDDF DECEFKFLNK
SYRMSTIAER ELEFLDNYRE EQFPHIDRDW RRSVCRGRHY DSPPLVLNNL CSGIMEVEDN
CQKYTHCQTS SFKYRRQSYT DSAKNYAYGE RVNGNFGGLG RDKHARDNRG SNWLCGYTDT
AEDEDFPIYP VKKYQFYRSP SKFLNWTEDE IIYRHHETHA TSLFAKVQSD DLPLQRHQLS
MPIRDSEKYF KGSSKIMCRS KGGQALLRCR KSVDLIHGEG KSQVRSSRVL CNGRLENANQ
RIAKKRRRAA VGFDESNKNA SKFDTPKHKS NQESKKWVQD LQDQAQKESS EIEEGQFVAE
EPYMEEASEG PAVTDGVNKK RMSQNENSSE QCIGGYDSQR ILDSLAKMEK RRERFKQPMT
MKKEAEESLK LNDDSIVDKG EMKQHRPARK RRWVGN
//