ID A0A1Y2FSJ9_PROLT Unreviewed; 1186 AA.
AC A0A1Y2FSJ9;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=CPSF A subunit region-domain-containing protein {ECO:0000313|EMBL:ORY86166.1};
GN ORFNames=BCR37DRAFT_376713 {ECO:0000313|EMBL:ORY86166.1};
OS Protomyces lactucae-debilis.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; Taphrinomycetes;
OC Taphrinales; Protomycetaceae; Protomyces.
OX NCBI_TaxID=2754530 {ECO:0000313|EMBL:ORY86166.1, ECO:0000313|Proteomes:UP000193685};
RN [1] {ECO:0000313|EMBL:ORY86166.1, ECO:0000313|Proteomes:UP000193685}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=12-1054 {ECO:0000313|EMBL:ORY86166.1,
RC ECO:0000313|Proteomes:UP000193685};
RG DOE Joint Genome Institute;
RA Mondo S.J., Dannebaum R.O., Kuo R.C., Labutti K., Haridas S., Kuo A.,
RA Salamov A., Ahrendt S.R., Lipzen A., Sullivan W., Andreopoulos W.B.,
RA Clum A., Lindquist E., Daum C., Ramamoorthy G.K., Gryganskyi A., Culley D.,
RA Magnuson J.K., James T.Y., O'Malley M.A., Stajich J.E., Spatafora J.W.,
RA Visel A., Grigoriev I.V.;
RT "Pervasive Adenine N6-methylation of Active Genes in Fungi.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RSE1 family.
CC {ECO:0000256|ARBA:ARBA00038266}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY86166.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFI01000003; ORY86166.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2FSJ9; -.
DR STRING; 56484.A0A1Y2FSJ9; -.
DR OMA; PRATGHW; -.
DR OrthoDB; 101343at2759; -.
DR Proteomes; UP000193685; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR PANTHER; PTHR10644:SF1; SPLICING FACTOR 3B SUBUNIT 3; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF63829; Calcium-dependent phosphotriesterase; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022728};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00022728};
KW Reference proteome {ECO:0000313|Proteomes:UP000193685};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 83..592
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 833..1153
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1186 AA; 130588 MW; 912FE4CB865DD679 CRC64;
MSTHGSSAFY AISLLPPSQQ ACPPIIGHFS GLKQQEVIVA TGATLTVYKV DNNTGKLVKL
ASHEAFGILR TMAAFRLTGS SKDYVLLCSD SGRLSVLEYR PDKACFTKIH QETYGKSGVR
RVVPGEFMAV DPKGRAVFIA SVEKNKLVYV LNRDAAANVT ISSPLEALSP GSLLFHSVAL
DVGYDNPVFA TIEVNYSDSD QDPTGRAYEE VQKTLTYYEL DLGLNHVVRE WTEIINRHAN
MLIAVPGGYD GPSGVLVCAP DEIVYMHKGR PSRRVPIPHR RGLLEDPDRQ RRIVAHVTQL
LKGSFFLLVQ TEDGDLFKLS IDHFEGDVTS MRIKYFDTVP VATGLTFFKS GYLLVASEFG
NSQLYQIERL GDDDDEPEFT ETSKHASFQP TALTNFTLVD EIECFNPILD AKVLNLTQED
APQIYAACGR GARSSFKTIR NGLEVSELVA SELPGKPTAI WTTQLAREDA HDAYIVLSFT
DGTLVLSIGE TVEELTETGF LTSAPTLAVQ QLGEDALIQV HPKGIRHVQR DLRVNEWHAP
ARRTIVQATT NNRQVVVALS SGELVYFELD EDGQLNEYQE KKEMSGGVTA LAIGAVPEGR
QRNPVLAVAC DDSTVRIISL DPDNTLESLS VQALTAPASA LCMVSMDDGH ATTVYLHIGL
QNGIYLRTSL DSTGQLTDTR TRFLGSKPVK LFNVKVQEQA AVLALSSRPW LGYIWNSAMQ
LTPLQYENID FAAPFSSDQC PEGLVGIKGP SLRIFTIDNL GNKMRAESHP LSYTPRKFVQ
HPNQALFYIL EADHHAMSAD KRSKAISEKQ NGSEHQSFLP EHGLPWAANG CWASCIRIFD
PSKQEDVSLL ELEDDIAAFC ACIVTFASRG NELFLAVGAA QGAELMPKSC KQAYLLIYRI
VDDGRALELV HKTETSDIPL ALCAFQGKLL VGLGGVLRLY DIGLKRCLRK SETEVTSNSI
IGLETQGDRI VISDNVEAVT LAVYKHNDNK IICFANDTLP RWTTASTMVD YDTIAGGDRF
GNLWIVRVPK NVSELSDNDT TGNTLIHERP YLQGAAHRLD TLAHYHIGDI PTSISKTQLV
AGGRSVLVIT GIMGSISLLI PFVAKEDAEF FAALETNLRT EDEPLAGRDH MMYRGYYAPP
KSVIDGDLCE RYALLSITKQ RMIAGELDRD VAEVKSKIEN QRIRFS
//