ID A0A250WT64_9CHLO Unreviewed; 1214 AA.
AC A0A250WT64;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE RecName: Full=DNA damage-binding protein 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=CEUSTIGMA_g1455.t1 {ECO:0000313|EMBL:GAX74005.1};
OS Chlamydomonas eustigma.
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Chlorophyceae;
OC CS clade; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas.
OX NCBI_TaxID=1157962 {ECO:0000313|EMBL:GAX74005.1, ECO:0000313|Proteomes:UP000232323};
RN [1] {ECO:0000313|EMBL:GAX74005.1, ECO:0000313|Proteomes:UP000232323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NIES-2499 {ECO:0000313|EMBL:GAX74005.1,
RC ECO:0000313|Proteomes:UP000232323};
RA Hirooka S., Hirose Y., Kanesaki Y., Higuchi S., Fujiwara T., Onuma R.,
RA Era A., Ohbayashi R., Uzuka A., Nozaki H., Yoshikawa H., Miyagishima S.Y.;
RT "Acidophilic green algal genome provides insights into adaptation to an
RT acidic environment.";
RL Submitted (AUG-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RSE1 family.
CC {ECO:0000256|ARBA:ARBA00038266}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAX74005.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BEGY01000005; GAX74005.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A250WT64; -.
DR STRING; 1157962.A0A250WT64; -.
DR OrthoDB; 101343at2759; -.
DR Proteomes; UP000232323; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR PANTHER; PTHR10644:SF1; SPLICING FACTOR 3B SUBUNIT 3; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 2.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000232323}.
FT DOMAIN 75..590
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 852..1180
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1214 AA; 132882 MW; CF32422FF609DFFF CRC64;
MHLYNLTLSR ASGIQCAIYG NFSGPKAQEL VVSRGKVMEL LRPNESGKLQ TVVATEIFGC
IRSLSAFRMI GASTDYIVVG SDSGRIVILK FNKEKNSFQK VHQETYGRSG CRRIVPGQYI
ACDPKGRACM IAAVEKQKFV YVLNRDNAAN LTISSPLEAH KSHTIVFSIV GMDMGFDNPV
FAAIELDYGD VDQDPTGEAA TEVQKHLTLY ELDLGVNNVI RKWSDPVDNG ANLLVAVPGG
GDGPGGVLVC SENFIIYKNQ DHEDVRAVIP RRSDLLADRG VLIVCYATHK KKAYSFFLVQ
SEYGDIYKVT LAYEGDTVTE VKIKYFDTIP VCTSICVLKT GFLYAASEAG NHALYQFIGT
GEDEEDVESS SLNLQETEDG FQPVFFDPRP LKNLLLIDET SSLMPITDMK VVNLLKEEIP
QIYAICGHGP RSTLAVLRPG LAVTELAISS LPSAPTAVWT IKRSTTDEFD AYIIVSFSNV
TLVFSIGEEV KETNDSGFLG TVSTIHTQLL SDSSMLQVHA GGLRHIKNDR RINEWKVPGR
RSITKAASNE KQVAIALSGG EVIYFELDQM GQLLEERKRD MDEDVTCMDI APVPEGLIRS
RYLAVGCANG AVKILGLDHE DGLKDLALQA MQSTPYSALM LYSAAVDEGG PVGGVSEAGG
LFLHVGLDNG VLTRTEVDRV TGQLSDTRSR FLGTRPPRLF ATTVRGSRSM LALSSRPWLG
YSDQGRFNIS PLSYEALDYA SGFASDQCPE GFVSVLKDQL RILSVENVGE SFNQQVTRLR
YTPRKMLVHP QHNTLIIAEA DHGAIPLAQR ADLQQRAEQS GQALQGVEFD EEAAALEEQF
GAPRGEAGQW ACCLRVVDPA ALVTTFVLEL DNNEAVTSMA IVAFNPAAQI AAGAASHEPL
LVVGTAKGLK YLPTDCDAAY IRTYKILDGG KRIDLLHKTQ LEIAAVPGAL TGFKGRLLAG
VGNVLRIYEL GKKKLLRKCE YKKLPHHIMY LNVQGGRIYV GDAQESIHLM KYKNNDNQLY
CFADEASPRY LTTLLPLDFD TVAGADKFGN VFVARLPADV SSQVEEDPSG GKLAAMMNSL
NGAPHKLKAL VNFHVGDTVT ALQRASLQPG GQEVILYATA MGSIGALYPF TSKEDIDFFV
HLEMHLRQEN QPLCGREHMA FRSAYFPVKD VVDGDMCAQY PTLPVAKQRS IAEELERSPG
EVLKKLEDIR NKIL
//