ID A0A316U635_9BASI Unreviewed; 1537 AA.
AC A0A316U635;
DT 10-OCT-2018, integrated into UniProtKB/TrEMBL.
DT 10-OCT-2018, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=Cleavage/polyadenylation specificity factor A subunit C-terminal domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BCV69DRAFT_299186 {ECO:0000313|EMBL:PWN20706.1};
OS Pseudomicrostroma glucosiphilum.
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC Exobasidiomycetes; Microstromatales; Microstromatales incertae sedis;
OC Pseudomicrostroma.
OX NCBI_TaxID=1684307 {ECO:0000313|EMBL:PWN20706.1, ECO:0000313|Proteomes:UP000245942};
RN [1] {ECO:0000313|EMBL:PWN20706.1, ECO:0000313|Proteomes:UP000245942}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MCA 4718 {ECO:0000313|EMBL:PWN20706.1,
RC ECO:0000313|Proteomes:UP000245942};
RX PubMed=29771364; DOI=10.1093/molbev/msy072;
RA Kijpornyongpan T., Mondo S.J., Barry K., Sandor L., Lee J., Lipzen A.,
RA Pangilinan J., LaButti K., Hainaut M., Henrissat B., Grigoriev I.V.,
RA Spatafora J.W., Aime M.C.;
RT "Broad genomic sampling reveals a smut pathogenic ancestry of the fungal
RT clade Ustilaginomycotina.";
RL Mol. Biol. Evol. 0:0-0(2018).
CC -!- SIMILARITY: Belongs to the DDB1 family.
CC {ECO:0000256|ARBA:ARBA00007453}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KZ819327; PWN20706.1; -; Genomic_DNA.
DR STRING; 1684307.A0A316U635; -.
DR OrthoDB; 226997at2759; -.
DR Proteomes; UP000245942; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF3; DNA DAMAGE-BINDING PROTEIN 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000245942}.
FT DOMAIN 145..840
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 1071..1480
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
FT REGION 258..286
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 342..452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1302..1337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 260..281
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..416
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1302..1321
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1537 AA; 160774 MW; FF09F5E49000230C CRC64;
MQVVTHLHKP STQLQSAVLP TFLDAEHPSL CIVKHSSLDF LLLPSAEGSR RATQEPLGKV
AHVDVNARIL AVSPFTAHGR ERLAILTDHH QPRLLVLEAH PHASAGQDAA FNIVCNATIL
LDEMARLPAE LGLGVWTVSN GGRQAVVAHT HSGILRVLPT AEVEAEQQSE YASKVFSARL
PHPTILSLAP IDRPQSEDLT IALLSISSTP SSLPGLGSQC LPVLSFHSID EAKHDLVPLP
WGGPDRKVPP LIEEFKTGTS GAEASGSGSQ SAKNNGPQAK GKTFTPEEIR AREQKLAKSA
LGRSHVPLPF ADALGAHLIT SLPASAGGGV IVWSETSVLI VPPPAPQGAS EGQEAGQESP
STARGKRRKA SVTSPGERRA SQSSAKGIPP LPSSGSGSAL STSPDTAIVA HSPSSQANEH
GKRRRSSAKK SSISIAAGAQ PEPLDTLSNA PSDTRSKMLR LVLPRPVQIT SAEIVKEGDE
AVVPNQTGGR ATLSVLFATM AGSLNMLEIS LSRQDKSESW RPRSMKATKI GVTPRTSGPG
SLTYLGEGYV HAASASGDSS IVQLLSESSG AQNDASNDTV DDVEMVSPPV SPIKSSSQRK
LSTSAGPSIS TALAHSQALA QNKFDADLSS STTVAMRSIM TFPSLAPLLD FFIDGDANAP
SGGTQARIIA ACGSGPDGAI AVVKNGVSME ELGSLGVSDI RKLWSVDLGT CVGIVCGFAE
HTQVLGLGAE GVIDVSDSTG LDLSVKTIEA ASLGSGIWAR VTEDAVSVVS PSGVTSTWSP
SQMEGSVVGN TKIIVASFDP ASSQVLLALK GGHVALLTIQ GEGIALTASA SLAGEVSAVS
LAHGIAAVGQ WKTNSVRLLS APGLEDLTPS SMEEEEGDFG SLPRSLLVHS FDEDDGDASQ
KQLYLLIGLA SGAVVACALA LPTADSISKS IGVFDRKSSA FGLQPIRLLP FTTTQGRRAV
LAINGETPTV IWAEKRRLIF SAVSYPSVMA AVPISIPGAE PTLALALPDR IQLTSIKEIG
KLEISKTHLG LDNPTAIAGL GRDDRGYHRY FAVVTTPFRP EGNSTRESRP SKVILFDSVT
CEPLSSIDLE PRERAGCIGM MQVDGEEYLV AGTGFVDPYE SEVYSGRLIG FALKDPSGGA
DDKDEAVNMQ LVQAFEKQVS GNVYAVTSLL GNRLVAAINS EVISYSLEDL QASSGGGAME
VDSREPQMRL RQHSRWGCAF IACTLSPIAD EPQRFLVGDG LRSLCVLRVD AATGALSEIA
RDCDPFWTTA CSSLDSKSQT FIGADISFNL FTAQRATLAP ETKRRIEREA EKRRERAERG
EGSAALGSSN DPPVLNVDDE VEGEWSHIME RRGAWHYGDL INRFREGSLV AKATTGTATG
SGSAASTTTT AAGAAAPPID PRLVFCTSSG ALGIISTLDE SASQVLSQVE RNLQEYLSAP
SSSASSSLAP LGEISAREYR TLRTDHRTAE PVGFIDGDVL VGAYALGLSE TQKREVLDDV
IGVAGGGMGE GVGVGVGKVE ASKEVVDELV AALSRVC
//