ID H2PDF0_PONAB Unreviewed; 588 AA.
AC H2PDF0; A0A2J8SGI9;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 3.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=Pre-mRNA 3'-end-processing factor FIP1 {ECO:0000256|ARBA:ARBA00017456};
DE AltName: Full=FIP1-like 1 protein {ECO:0000256|ARBA:ARBA00031816};
GN Name=FIP1L1 {ECO:0000313|Ensembl:ENSPPYP00000016499.3};
GN ORFNames=CR201_G0043173 {ECO:0000313|EMBL:PNJ19893.1};
OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pongo.
OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000016499.3, ECO:0000313|Proteomes:UP000001595};
RN [1] {ECO:0000313|Ensembl:ENSPPYP00000016499.3, ECO:0000313|Proteomes:UP000001595}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Wilson R.K., Mardis E.;
RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:PNJ19893.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Susie {ECO:0000313|EMBL:PNJ19893.1};
RA Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R., Chaisson M.,
RA Hoppe E., Hill C., Pang A., Hillier L., Baker C., Armstrong J.,
RA Shendure J., Paten B., Wilson R., Chao H., Schneider V., Ventura M.,
RA Kronenberg Z., Murali S., Gordon D., Cantsilieris S., Munson K., Nelson B.,
RA Raja A., Underwood J., Diekhans M., Fiddes I., Haussler D., Eichler E.;
RT "High-resolution comparative analysis of great ape genomes.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSPPYP00000016499.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the FIP1 family.
CC {ECO:0000256|ARBA:ARBA00007459}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NDHI03003571; PNJ19893.1; -; Genomic_DNA.
DR Ensembl; ENSPPYT00000017173.3; ENSPPYP00000016499.3; ENSPPYG00000014773.3.
DR GeneTree; ENSGT01080000257472; -.
DR HOGENOM; CLU_035577_1_0_1; -.
DR OrthoDB; 449619at2759; -.
DR TreeFam; TF318610; -.
DR Proteomes; UP000001595; Chromosome 4.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR InterPro; IPR007854; Fip1_dom.
DR PANTHER; PTHR13484; FIP1-LIKE 1 PROTEIN; 1.
DR PANTHER; PTHR13484:SF9; PRE-MRNA 3'-END-PROCESSING FACTOR FIP1; 1.
DR Pfam; PF05182; Fip1; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000001595}.
FT DOMAIN 139..181
FT /note="Pre-mRNA polyadenylation factor Fip1"
FT /evidence="ECO:0000259|Pfam:PF05182"
FT REGION 1..81
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 223..291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 328..588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..36
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..277
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..398
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 445..555
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 588 AA; 65698 MW; E8903E5A369FA96B CRC64;
MSAGEVERLV SELSGGTGGD EEEEWLYGDE NEVERPEEEN ASANPPSGIE DETAENGVPK
PKVTETEDDS DSDSDDDEDD VHVTIGDIKT GAPQYGSYGT APVNLNIKTG GRVYGTTGTK
VKGVDLDAPG SINGVPLLEV DLDSFEDKPW RKPGADLSDY FNYGFNEDTW KAYCEKQKRI
RMGLEVIPVT STTNKITAED CTMEVTPGAE IQDGRFNLFK VQQGRTGNSE KETALPSAKA
EFTSPPSLFK TGLPPSRNST SSQSQTSTAS RKANSSVGKW QDRYGRAESP DLRRLPGAID
VIGQTITISR VEGRRRANEN SNIQVLSERS ATEVDNNFSK PPPFFPPGAP PTHLPPPPFL
PPPPTVSTAP PLIPPPGIPI TVPPPGFPPP PGAPPPSLIP TIESGHSSGY DSRSARAFPY
GNVAFPHLPG SAPSWPSLVD TSKQWDYYAR REKDRDRERD RDRERDRDRD RERERTRERE
RERDHSPTPS VFNSDEERYR YREYAERGYE RHRASREKEE RHRERRHREK EETRHKSSRS
NSRRRHESEE GDSHRRHKHK KSKRSKEGKE AGSEPAPEQE STEATPAE
//