ID A0A2K6N625_RHIBE Unreviewed; 1386 AA.
AC A0A2K6N625;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 24-JAN-2024, entry version 22.
DE SubName: Full=Cleavage and polyadenylation specific factor 1 {ECO:0000313|Ensembl:ENSRBIP00000043489.1};
GN Name=CPSF1 {ECO:0000313|Ensembl:ENSRBIP00000043489.1};
OS Rhinopithecus bieti (Black snub-nosed monkey) (Pygathrix bieti).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Colobinae; Rhinopithecus.
OX NCBI_TaxID=61621 {ECO:0000313|Ensembl:ENSRBIP00000043489.1, ECO:0000313|Proteomes:UP000233180};
RN [1] {ECO:0000313|Ensembl:ENSRBIP00000043489.1, ECO:0000313|Proteomes:UP000233180}
RP NUCLEOTIDE SEQUENCE.
RA Wu, C.-I. and Zhang, Y.;
RT "Genome of Rhinopithecus bieti.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSRBIP00000043489.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the CPSF1 family.
CC {ECO:0000256|ARBA:ARBA00038446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSRBIT00000067538.1; ENSRBIP00000043489.1; ENSRBIG00000044907.1.
DR GeneTree; ENSGT00950000183151; -.
DR Proteomes; UP000233180; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF2; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000233180}.
FT DOMAIN 92..664
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 1134..1351
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
FT REGION 404..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 539..565
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 713..771
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 895..916
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 408..427
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 895..912
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1386 AA; 154786 MW; B7D6C7D6E4B643EC CRC64;
MYAVYKQAHP PTGLEFAMYC NFFNNSERNL VVAGTSQLYV YRLNRDAEAL TKNDRSTEGK
AHREKLELAA SFSFFGNVMS MASVQLAGAK RDALLLSFKD AKLSVVEYDP GTHDLKTLSL
HYFEEPELRD GFVQNVHTPR VRVDPDGRCA AMLVYGTRLV VLPFRRESLA EEHEGLVGEG
QRSSFLPSYI IDVRALDEKL LNIIDLQFLH GYYEPTLLIL FEPNQTWPGR VAVRQDTCSI
VAISLNITQK VHPVIWSLTS LPFDCTQALA VPKPIGGVVV FAVNSLLYLN QSVPPYGVAL
NSLTTGTTAF PLRTQEGVRI TLDCAQATFI SYDKMVISLK GGEIYVLTLI TDGMRSVRAF
HFDKAAASVL TTSMVTMEPG YLFLGSRLGN SLLLKYTEKL QEPPASAVRE AADKEEPPSK
KKRVDATTTG GKSVPQDEVD EIEVYGSEAQ SGTQLATYSF EVCDSILNIG PCANAAMGEP
AFLSEENSPE PDLEIVVCSG HGKNGALSVL QKSIRPQVVT TFELPGCYDM WTVIAPVRKE
EEDNPKGEVT EQEPRSPEAD DDGRRHGFLI LSREDSTMIL QTGQEIMELD TSGFATQGPT
VFAGNIGDNR YIVQVSPLGI RLLEGVNQLH FIPVDLGAPI VQCAVADPYV VIMSAEGHVT
MFLLKSDSYG GRHHRLALHK PPLHHQSKVI TLCLYRDLSG MFTTESRLGG ARDELGGRIG
SEAEGLGSET SPTVDDEEEM LYGDSGSLFS PSKEEARRSS QPPADRDPAP FRAEPTHWCL
LVRENGTMEI YQLPDWRLVF LVKNFPVGQR VLVDSSFGQP TTQGEARREE ATRQGELPLV
KEVLLVALGS RQSRPYLLVH VDQELLIYEA FPHDSQLGQG NLKVRFKKVP HNINFREKKP
KPSKKKAEGG GTEEGAGARG RVARFRYFED IYGYSGVFIC GPSPHWLLVT GRGALRLHPM
AIDGPVDSFA PFHNVNCPRG FLYFNRQGEL RISVLPAYLS YDAPWPVRKI PLRCTAHYVA
YHVESKVYAV ATSTNTPCAR IPRMTGEEKE FETIERDERY IHPQQEAFSI QLISPVSWEA
IPNARIELQE WEHVTCMKTV SLRSEETVSG LKGYVAAGTC LMQGEEVTCR GRIFLWSLRA
SELTGMAFID TQLYIHQMIS VKNFILAADV MKSISLLRYQ EESKTLSLVS RDAKPLEVYS
VDFMVDNAQL GFLVSDRDRN LMVYMYLPEA KESFGGMRLL RRADFHVGAH VNTFWRTPCR
GATEGLSKKS VVWENKHITW FATLDGGIGL LLPMQEKTYR RLLMLQNALT TMLPHHAGLN
PRAFRMLHVD RRTLQNAVRN VLDGELLNRY LYLSTMERSE LAKKIGTTPD IILDDLLETD
RVTAHF
//