ID A0A3Q1MKB1_BOVIN Unreviewed; 1469 AA.
AC A0A3Q1MKB1;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Cleavage and polyadenylation specific factor 1 {ECO:0000313|Ensembl:ENSBTAP00000071935.1};
GN Name=CPSF1 {ECO:0000313|Ensembl:ENSBTAP00000071935.1,
GN ECO:0000313|VGNC:VGNC:27670};
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=9913 {ECO:0000313|Ensembl:ENSBTAP00000071935.1, ECO:0000313|Proteomes:UP000009136};
RN [1] {ECO:0000313|Ensembl:ENSBTAP00000071935.1, ECO:0000313|Proteomes:UP000009136}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000071935.1,
RC ECO:0000313|Proteomes:UP000009136};
RA Rosen B.D., Bickhart D.M., Koren S., Schnabel R.D., Hall R., Zimin A.,
RA Dreischer C., Schultheiss S., Schroeder S.G., Elsik C.G., Couldrey C.,
RA Liu G.E., Van Tassell C.P., Phillippy A.M., Smith T.P.L., Medrano J.F.;
RT "ARS-UCD1.2.";
RL Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSBTAP00000071935.1}
RP IDENTIFICATION.
RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000071935.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the CPSF1 family.
CC {ECO:0000256|ARBA:ARBA00038446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSBTAT00000011004.6; ENSBTAP00000011004.6; ENSBTAG00000008355.6.
DR Ensembl; ENSBTAT00000069645.1; ENSBTAP00000071935.1; ENSBTAG00000008355.6.
DR VEuPathDB; HostDB:ENSBTAG00000008355; -.
DR VGNC; VGNC:27670; CPSF1.
DR GeneTree; ENSGT00950000183151; -.
DR OMA; PMTKFKL; -.
DR Reactome; R-BTA-159231; Transport of Mature mRNA Derived from an Intronless Transcript.
DR Reactome; R-BTA-72187; mRNA 3'-end processing.
DR Reactome; R-BTA-72203; Processing of Capped Intron-Containing Pre-mRNA.
DR Reactome; R-BTA-73856; RNA Polymerase II Transcription Termination.
DR Reactome; R-BTA-77595; Processing of Intronless Pre-mRNAs.
DR Proteomes; UP000009136; Chromosome 14.
DR Bgee; ENSBTAG00000008355; Expressed in retina and 106 other cell types or tissues.
DR GO; GO:0005847; C:mRNA cleavage and polyadenylation specificity factor complex; IBA:GO_Central.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0019899; F:enzyme binding; IEA:Ensembl.
DR GO; GO:0035925; F:mRNA 3'-UTR AU-rich region binding; IEA:Ensembl.
DR GO; GO:0006378; P:mRNA polyadenylation; IBA:GO_Central.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF2; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000009136}.
FT DOMAIN 95..671
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 1074..1409
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
FT REGION 406..439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 549..571
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 716..778
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 902..924
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1426..1449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 411..426
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 902..919
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1434..1449
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1469 AA; 163942 MW; 742E85032B291585 CRC64;
MYAVYKQAHP PTGLEFSMYC NFFNNSERNL VVAGTSQLYV YRLNRDSEAP TKNDRSTDGK
AHREHREKLE LVASFSFFGN VMSMASVQLA GAKRDALLLS FKDAKLSVVE YDPGTHDLKT
LSLHYFEEPE LRDGFVQNVH TPRVRVDPDG RCAAMLIYGT RLVVLPFRRE SLAEEHEGLV
GEGQRSSFLP SYIIDVRALD EKLLNIVDLQ FLHGYYEPTL LILFEPNQTW PGRVAVRQDT
CSIVAISLNI TQKVHPVIWS LTSLPFDCTQ ALAVPKPIGG VVIFAVNSLL YLNQSVPPYG
VALNSLTTGT TAFPLRTQEG VRITLDCAQA AFISYDKMVI SLKGGEIYVL TLITDGMRSV
RAFHFDKAAA SVLTTSMVTM EPGYLFLGSR LGNSLLLKYT EKLQEPPAST AREAADKEEP
PSKKKRVDAT TGWSGSKSVP QDEVDEIEVY GSEAQSGTQL ATYSFEVCDS ILNIGPCANA
AMGEPAFLSE EFQNSPEPDL EIVVCSGYGK NGALSVLQKS IRPQVVTTFE LPGCYDMWTV
IAPVRKEQEE TLKGEGTEPE PGAPEAEDDG RRHGFLILSR EDSTMILQTG QEIMELDASG
FATQGPTVFA GNIGDNRYIV QVSPLGIRLL EGVNQLHFIP VDLGSPIVQC AVADPYVVIM
SAEGHVTMFL LKNDSYGGRH HRLALHKPPL HHQSKVITLC VYRDVSGMFT TESRLGGVRD
ELGGRGGPEA EGQGAETSPT VDDEEEMLYG DSGSLFSPSK EEARRSSQPP ADRDPAPFRA
EPTHWCLLVR ENGAMEIYQL PDWRLVFLVK NFPVGQRVLV DSSFGQPTTQ GEARKEEATR
QGELPLVKEV LLVALGSRQR RPYLLVHVDQ ELLIYEAFPH DSQLGQGNLK VRFKKVPHNI
NFREKKPKPS KKKAEGGSTE EGTGPRGRVA RFRYFEDIYG YSGVFICGPS PHWLLVTGRG
ALRLHPMGID GPIDSFAPFH NINCPRGFLY FNRQGELRIS VLPAYLSYDA PWPVRKIPLR
CTAHYVAYHV ESKVYAVATS TSTPCTRVPR MTGEEKEFET IERDERYVHP QQEAFCIQLI
SPVSWEAIPN ARIELEEWEH VTCMKTVSLR SEETVSGLKG YVAAGTCLMQ GEEVTCRGRI
LIMDVIEVVP EPGQPLTKNK FKVLYEKEQK GPVTALCHCN GHLVSAIGQK IFLWSLRASE
LTGMAFIDTQ LYIHQMISVK NFILAADVMK SISLLRYQEE SKTLSLVSRD AKPLEVYSVD
FMVDNAQLGF LVSDRDRNLM VYMYLPEAKE SFGGMRLLRR ADFHVGAHVN TFWRTPCRGA
AEGPSKKSVV WENKHITWFA TLDGGIGLLL PMQEKTYRRL LMLQNALTTM LPHHAGLNPR
AFRMLHVDRR VLQNAVRNVL DGELLNRYLY LSTMERGELA KKIGTTPDIV SRPHRPLPAR
PPPAPRPHPA LSLQILDDLL ETDRVTAHF
//