ID A0A2S6BQN2_9PEZI Unreviewed; 1223 AA.
AC A0A2S6BQN2;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=DNA damage-binding protein 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBER1_03325 {ECO:0000313|EMBL:PPJ49797.1};
OS Cercospora berteroae.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetidae; Mycosphaerellales; Mycosphaerellaceae; Cercospora.
OX NCBI_TaxID=357750 {ECO:0000313|EMBL:PPJ49797.1, ECO:0000313|Proteomes:UP000237631};
RN [1] {ECO:0000313|Proteomes:UP000237631}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS538.71 {ECO:0000313|Proteomes:UP000237631};
RA de Jonge R., Ebert M.K., Huitt-Roehl C.R., Pal P., Suttle J.C.,
RA Spanner R.E., Neubauer J.D., Jurick W.M.II., Stott K.A., Secor G.A.,
RA Thomma B.P.H.J., Van de Peer Y., Townsend C.A., Bolton M.D.;
RT "Conservation of a gene cluster reveals novel cercosporin biosynthetic
RT mechanisms and extends production to the genus Colletotrichum.";
RL bioRxiv 0:0-0(2017).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RSE1 family.
CC {ECO:0000256|ARBA:ARBA00038266}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PPJ49797.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PNEN01001798; PPJ49797.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2S6BQN2; -.
DR STRING; 357750.A0A2S6BQN2; -.
DR OrthoDB; 101343at2759; -.
DR Proteomes; UP000237631; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR PANTHER; PTHR10644:SF1; SPLICING FACTOR 3B SUBUNIT 3; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022728};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00022728};
KW Reference proteome {ECO:0000313|Proteomes:UP000237631};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 90..623
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 868..1189
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1223 AA; 136215 MW; A39EADC2BD6AA6D7 CRC64;
MANVQQTQAF YALTLEAPSA PTAAVLCNVI PGLKTGDQQI FEARGQRVLL HRITESADRT
ERKITTVCDQ DAFGIIRGVA AFRIPATATD QLVISSDSGR VAMLQYDHEK NRFKRLHLET
YGKSGVRRTI PGQYLASDPR GRCIMMASVE KNKVVYMLNR HADGNILISS PHEANQWGSL
CFALCALDTG WEPPIFATLE VEYTEAESDP TGEAYQRREK QLVYYTVDMG LNHVVKTWSE
PVDYTANMLF GVPGGQDGPS GVLVCCEDRI YYKHDKAANL SIAIPRREGA TEDKERKRQI
VAGCLHLAKT RHEFFFFLQT EDGDVFKLNI NMATDEQGRQ TADPEEMVLK YYDTFPVAKQ
MLLHKKGFLY IAAEDGNTQL FHIDDLADDP EFEPHNTFTS DGVSTDPSEP IEPTYFKPRE
LTMTHLAVDV PGLHPLMKTR VDNLTHEDAP QIYGIQGKGN RSQFKTIRHG LDVEILINNS
MGNVPYDNIW TFKHRATDEH HRYLLLSSNY GDLTIACSIG DSVEQIENSN FLENRATVHA
EQMGDAVLVQ VHARGIRSIY QDGKLNEWDV PPHRTCVVAS ANQYQLLVGL SSAELCFFFM
GEDGVLVQLD EMPEMSGKIT AMSVGQTPKG RQQSKYAVVG CDDCTIRVLS IELDTPLEAR
SVQALSAVPT SLEIVEMLDP ASGTTINVVH IGLQSGLYLR AVIDETTGEL GDVRTKFLGT
KAPRLCPVQV EDEDCVLACS SRPWLGYNHP QSHLYTVTPL IAEQMEAARA FISPDLSGLC
AIQGSSLLIF QLPSVEGRLS HSSIPLNNTP RGMTRNPYFP IWYTVQADGN TLSKATRDQL
RGKIIDDDEE ATALERHLGL PRGTSHWASC IQAIDPLNRQ AVVSTVELGE NEAALCCTCV
AFESRNYELY LAVGTGQHMS PGIAQQAAGY VHIYKLEEDG TKMTFVHKTK FAQPIYALLP
FDGRLALGVG NELFIYDMGM KALLRKARGT ATPNQIVSLE AHGNRIICGD VSESVTYLVY
KPGFNRLIPF VDDVIQRWTT GTTMIDYETV AGGDKFGNLW VVRCPEQPSQ EADEEGAGGF
IMNERSYLNG APYRLDLRAH YYCQDIPMSL QRTALVAGGQ EVLFWSGLQG TLGMLVPFVT
REDVEFFTSL EQQLRAEDPP LAGRDHLMYR SYYVPVKGVI DGDLCERFMA LSYDSKQKVA
AEVDRSVKEI EKKVQEMRTR VAF
//