ID A0A060X5I6_ONCMY Unreviewed; 1471 AA.
AC A0A060X5I6;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE RecName: Full=Cleavage and polyadenylation specific factor 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=GSONMT00023775001 {ECO:0000313|EMBL:CDQ74853.1};
OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Oncorhynchus.
OX NCBI_TaxID=8022 {ECO:0000313|EMBL:CDQ74853.1, ECO:0000313|Proteomes:UP000193380};
RN [1] {ECO:0000313|EMBL:CDQ74853.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24755649; DOI=10.1038/ncomms4657;
RA Berthelot C., Brunet F., Chalopin D., Juanchich A., Bernard M., Noel B.,
RA Bento P., Da Silva C., Labadie K., Alberti A., Aury J.M., Louis A.,
RA Dehais P., Bardou P., Montfort J., Klopp C., Cabau C., Gaspin C.,
RA Thorgaard G.H., Boussaha M., Quillet E., Guyomard R., Galiana D., Bobe J.,
RA Volff J.N., Genet C., Wincker P., Jaillon O., Roest Crollius H.,
RA Guiguen Y.;
RT "The rainbow trout genome provides novel insights into evolution after
RT whole-genome duplication in vertebrates.";
RL Nat. Commun. 5:3657-3657(2014).
RN [2] {ECO:0000313|EMBL:CDQ74853.1}
RP NUCLEOTIDE SEQUENCE.
RA Genoscope - CEA;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the CPSF1 family.
CC {ECO:0000256|ARBA:ARBA00038446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FR905004; CDQ74853.1; -; Genomic_DNA.
DR STRING; 8022.A0A060X5I6; -.
DR PaxDb; 8022-A0A060X5I6; -.
DR Proteomes; UP000193380; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF2; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 2.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000193380}.
FT DOMAIN 92..546
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 587..687
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 1102..1436
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
FT REGION 399..448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 550..588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 776..802
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 927..948
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 550..564
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 927..942
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1471 AA; 164971 MW; 83C3FF9B15C95B8E CRC64;
MYAVYRQAHS PTAIEFSVYC NFISNEEKNL VVAGTSQLYV YRIIHDVENA SKTDKSPDVK
SRKEKLEQVA CFSLFGNVMS MASVQLVGAN RDALLLSFKD AKLSVVEYDP GTHDLKTLSL
HFFEEPELRD GFVQNLHIPM VRVDPENRCA VMLVYGTQLV VLPFRKDTLT DEQEGVVEGP
KSSFLPSYII DVRELDEKLL NIVDMKFLHG YYEPTLLILY EPNQTWPGRV AVRQDTCSIV
AISLNIMQKV HPVIWSLSNL PFDCTQVMAV PKPIGGVVVF AVNSLLYLNQ SVPPYGVSLN
TQTTGTTAFP LRIQEEVKIT LDCSQSTFIG SDKMVISLKG GEIYVLTLIT DGMRSVRAFH
FDKAAASVLT TCMMTMEPGY LFLGSRLGNS LLLKYTEKYQ ENPEKPPVEE DKGKEEETDE
EKQEEPPSKK KRIDSSTNLT GKNQLPDEVD EIEVYGSEVA SGTQLATYSF EVCDSILNIG
PCAGASMGEP AFLSEEFQTN PEPDLEIVVC SGFGKNGGLS VLQRSIRPQV VTTFELPGCH
DMWTVIPNEM KEKGKKKDKP PATEEEGETP EEEEAEEKPA PVEGDKKKHG FLILSREDST
MILQTGQEIM ELDTSGFATQ GPTVFAGNIG DNKYIIQVSP MGIRLLEGVT QLHFIPVDLG
SPIVQCSVAD PYVVIMTADG VVTMFVLKTD SYMGKTHRLA LQKPQIAAQP RVITLCTFRD
VSGMFTTENK QTSHTKDQDT MFNSQSETET IMQDLSNTVD DEEEMLYGDC NPAGITPTKD
ESHPGYRPSG ALGGGSEGRS GRAEPTHWCM MVRENGVMEI YQLPDWRLVF LVKNFPVGQR
VLVDSSAGQS AAQGEGKKEE VTRQGEIPLV KEVALVSLGN NHCRPYLLVH VEQELLIYEA
FPYDQQQAQS NLKVRFKKMP HNINFREKKS KVKKDKKAEG QAGPSEEGPA VVKGRVARFR
YFEDISGYSG VFICGPSPHW MLVTSRGALR LHPMTIDGPI ESFSPFHNIN CPKGFLYFNK
QGELRISVLP TYLSYDAPWP VRKIPLRCTV HYVSYHVESK VYAVCTSVKD PCTRIPRMTG
EEKEFETIDR DERYIPPQQE NFSIQLISPV SWEAIPNTRI DLEEWEHVTC MKTVALRSQE
TVSGLKGYIA AGTCLMQGEE VTCRGRILIL DVIEVVPEPG QPLTKNKFKV LYEKEQKGPV
TALCHCSGYL VSAIGQKIFL WSLKDNDLTG MAFIDTQLYI HQMFSIKNFI VAADVMKSIS
LLRYQPESKT LSLISRDAKP LEVYSVEFMV DNNQLGFLVS DRDQNISVYM YLPEAKESFG
GMRLLRRADF NVGAHVNAFW RMPCRGALDT ATKKTLAWDN KHITWFATLD GGIGLLLPMQ
EKTYRRLLML QNALTTMLPH HAGLNPKAFR MLHADRRQLQ NAVRNILDGE LLNKYLYLST
MERSELAKKI GTTSDIILDD LLEVDRVTAH F
//