ID A0A1Y2A228_9PLEO Unreviewed; 1124 AA.
AC A0A1Y2A228;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=CPSF A subunit region-domain-containing protein {ECO:0000313|EMBL:ORY16460.1};
GN ORFNames=BCR34DRAFT_106578 {ECO:0000313|EMBL:ORY16460.1};
OS Clohesyomyces aquaticus.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Pleosporomycetidae; Pleosporales; Lindgomycetaceae; Clohesyomyces.
OX NCBI_TaxID=1231657 {ECO:0000313|EMBL:ORY16460.1, ECO:0000313|Proteomes:UP000193144};
RN [1] {ECO:0000313|EMBL:ORY16460.1, ECO:0000313|Proteomes:UP000193144}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 115471 {ECO:0000313|EMBL:ORY16460.1,
RC ECO:0000313|Proteomes:UP000193144};
RG DOE Joint Genome Institute;
RA Mondo S.J., Dannebaum R.O., Kuo R.C., Labutti K., Haridas S., Kuo A.,
RA Salamov A., Ahrendt S.R., Lipzen A., Sullivan W., Andreopoulos W.B.,
RA Clum A., Lindquist E., Daum C., Ramamoorthy G.K., Gryganskyi A., Culley D.,
RA Magnuson J.K., James T.Y., O'Malley M.A., Stajich J.E., Spatafora J.W.,
RA Visel A., Grigoriev I.V.;
RT "Pervasive Adenine N6-methylation of Active Genes in Fungi.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the RSE1 family.
CC {ECO:0000256|ARBA:ARBA00038266}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY16460.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFA01000018; ORY16460.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2A228; -.
DR STRING; 1231657.A0A1Y2A228; -.
DR OrthoDB; 101343at2759; -.
DR Proteomes; UP000193144; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR PANTHER; PTHR10644:SF1; SPLICING FACTOR 3B SUBUNIT 3; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022728};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00022728};
KW Reference proteome {ECO:0000313|Proteomes:UP000193144};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 1..480
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 764..1088
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
FT REGION 702..744
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 702..734
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1124 AA; 125656 MW; 563DEEA6610F7E59 CRC64;
MMAAIEKDKL VWTLSRTGQT DVSISISSPL GVHRPQTLVY YALGMDVGYE NPIFALLEVD
YSESDADPSG EAFHEVKKEL SYYELDLGLN HVVHRGRTTV DRTANMLFRV PGGNDLPSGV
LCCGEDNISY YHVYEKEVFR LAIPRREGAA ENPNRKRYIV AGTLYTLKGG NFFYLLQTED
GDVFKVTFDI KSGKVERLNI WYFDTIPVAS SICLLRAGFV YCASEAGDRL LYELETLGDE
HNEHVFSSDQ FPTDPTETYN PPYFQVRPLR NLNPVEKVAN MSPVMDMEVA NLSMEDAPQI
YTVSGTGARS TFRTTRNALD VLELVDSQLP QRATSVWTAK LRANDEHDTY IVLSLVNHTL
VLRIGDDVEE AQHSGLLAET TTLGVQQFGE DCIIQIHPKG IRHIRTVPYN EEDPTQKMYG
EITDWKTPAH RSIVACAANN RQVCIALSSG EIYYFACDYD SSLAQAEDEA QLDHTIHCLA
MPDVPEGRQG SDFLAVGCGD KTFRIFNLNP RDQDHRILGQ TGLMGLSALP HAIAFHAMKD
QSPAGYSLYA HVGLHSGIYV RALVDEFSGT LSGFRRRFLG PAPVKFARVS VGGEPAILAL
TTRPWLAYTH PVNNTLALTP LNYMSIEAAW SFESASFKGI ICVRGEDLRI VTLDDVDLSQ
NLSYEQISLQ YTPRKLVGNH EQQVYYVIES DNNTLDAATR DQLKREQDEK VKEEVKSEEE
SEDKPMKTAE EEDLTNGELE NDEETNGELL PVDFGLPKVE GRWASCIQVV DPVTEKAVIH
TIELRNNQCA VSAALIAFES KDNDLFFAVG VAQDLKFTPY SFSKAFIQLY KVSPDGRKLE
FYHDTEVSAP PLALLAFKGK LIAGLGNDLV LFDCGLKHLL RKAQASNCTG TRITDLKTQG
SRIVVADQSQ SVTYVVHKDM VHPNRLIPFA DDTVPRWTTC AEMADYDTTV GGDKFGNIWI
VRCPEKVSQA SDESEDGQHL IQDKAYLGGA PNRLDLCAHY FTNDIPTAIQ KTNLIAGGDR
IIFWAGLQGT LGVFVPFESR RDHKMFQQLE LLLRNEEKPI AGRDHLAYRS YYTPIKNVID
GDLVERFLTA SNDEKASWAA QLDGAWDASS VEDKIWTMRL NFAF
//