ID F7W299_SORMK Unreviewed; 1149 AA.
AC F7W299;
DT 21-SEP-2011, integrated into UniProtKB/TrEMBL.
DT 21-SEP-2011, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE RecName: Full=DNA damage-binding protein 1 {ECO:0000256|ARBA:ARBA00014577};
GN ORFNames=SMAC_04731 {ECO:0000313|EMBL:CCC11749.1};
OS Sordaria macrospora (strain ATCC MYA-333 / DSM 997 / K(L3346) / K-hell).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Sordariaceae; Sordaria.
OX NCBI_TaxID=771870 {ECO:0000313|EMBL:CCC11749.1, ECO:0000313|Proteomes:UP000001881};
RN [1] {ECO:0000313|EMBL:CCC11749.1, ECO:0000313|Proteomes:UP000001881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-333 / DSM 997 / K(L3346) / K-hell
RC {ECO:0000313|Proteomes:UP000001881};
RC TISSUE=Mycelium {ECO:0000313|EMBL:CCC11749.1};
RX PubMed=20386741; DOI=10.1371/journal.pgen.1000891;
RA Nowrousian M., Stajich J., Chu M., Engh I., Espagne E., Halliday K.,
RA Kamerewerd J., Kempken F., Knab B., Kuo H.C., Osiewacz H.D., Poeggeler S.,
RA Read N., Seiler S., Smith K., Zickler D., Kueck U., Freitag M.;
RT "De novo assembly of a 40 Mb eukaryotic genome from short sequence reads:
RT Sordaria macrospora, a model organism for fungal morphogenesis.";
RL PLoS Genet. 6:E1000891-E1000891(2010).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the DDB1 family.
CC {ECO:0000256|ARBA:ARBA00007453}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCC11749.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABT02000021; CCC11749.1; -; Genomic_DNA.
DR RefSeq; XP_003346558.1; XM_003346510.1.
DR AlphaFoldDB; F7W299; -.
DR STRING; 771870.F7W299; -.
DR GeneID; 10803948; -.
DR KEGG; smp:SMAC_04731; -.
DR VEuPathDB; FungiDB:SMAC_04731; -.
DR eggNOG; KOG1897; Eukaryota.
DR HOGENOM; CLU_002893_1_1_1; -.
DR InParanoid; F7W299; -.
DR OMA; HQDFLMR; -.
DR OrthoDB; 226997at2759; -.
DR Proteomes; UP000001881; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR011047; Quinoprotein_ADH-like_supfam.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF3; DNA DAMAGE-BINDING PROTEIN 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50998; Quinoprotein alcohol dehydrogenase-like; 1.
DR SUPFAM; SSF69322; Tricorn protease domain 2; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001881}.
FT DOMAIN 73..564
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 773..1108
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1149 AA; 127768 MW; 6A7E306ED105D89B CRC64;
MAYVAPIHRP SSVRHALRIN LLSPEEESLI IAKTNRIEIW KLADGHLSMI HSKVINGTIT
ILQKLQPKDH PTDLLFVGTD QFEYFTAEWD HETQQLKTLN RFSDPGERHM RDSQSQDKCI
VDPSGRFMAM HLWEGVLSVW RLGNRKSTAT TLDILVQVRL SELFIKGSTF LYTETGIPKV
AFLYRNQANS NETKLATDRE IDADVEDPGA GILIPVKKVE EEVKRHHFRN TEQAKPHVGG
LIVIGETRLL YIDEVTKTQV ESALKEPSIF VAWAEYDPTH YFLADDYGNL HLLTILTEGA
VVTGMDVSNI GRTARAHVLT YLGDDMLFVG SHYGNSQLYR LNLLNEDLNE ILQLVQVLEN
IGPITDFTIM DMGNRENDSQ LGNEYSSGQA RIVTASGIFK DGTLRSVRSG VGLQDIAILG
ELQHTRALFS LQSYNSSRAD TLVASFLTDT RIFRFDPHGE IEEVADYCGM DLQHQTLLTT
NLDNGQLLQV TTAAATLLDA ESGVTIASWA PEGDRQIINA SANKHWLLLS VQGTTLVSIN
IDNDLTVVQE KDVSEQDQIA CIHVAPQLSD VGVVGFWTSG TVSIIDMSTL EPIHGESLRR
SADDASIPRD IVLAKVLPNT PGMTLFIAME DGNVVTFNIG EDLTFSGRKS VILGTREARF
HLLPQQDGIY SIFATTEHPS LIYGSEGRII YSAVTAEDAT CVCPFDSEAF PGAVVLSTET
EIKISEIDTA RRTHVRSLEL GEMVRRIAYS PSEKGFGLGC IRREMVNGEE IIQSSFKLVD
EILFARAGRE FRLGTSSYSE LVEDVIRAEL PDSYGNLLER FIVGTSFLED PDRGAGTDKR
GRILVFGIDS NRDPYLVLKH ELRGACRALA VMGSKIVAAL HKTVVISQYE ETSSTEARLV
KLASYRCTTY PIDIAVHGNI IAVADMMKSA TLVEYVQAKT EEEKYEPAKL VECARHRHSA
WATAVAHVEG ESWLEADANG NLVVLQRNVE GVTAEDQRQL RITSELNLGE QVNKIRPIKV
ETSPNTIIIP RAFLATAEGG IYLFGTIARE QDLLLRFQDK LAAVIKTVGE LDFNSYRAFR
NAERGPETDG TTGPVRFLDG ELLERFLDVD ETTQKEICEG LGPSVEQMRN MVEELRRSAL
SKRSRNLLR
//