ID I4Y6J1_WALMC Unreviewed; 986 AA.
AC I4Y6J1;
DT 05-SEP-2012, integrated into UniProtKB/TrEMBL.
DT 05-SEP-2012, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE RecName: Full=DNA damage-binding protein 1 {ECO:0000256|ARBA:ARBA00014577};
GN ORFNames=WALSEDRAFT_33953 {ECO:0000313|EMBL:EIM19583.1};
OS Wallemia mellicola (strain ATCC MYA-4683 / CBS 633.66) (Wallemia sebi (CBS
OS 633.66)).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Wallemiomycotina;
OC Wallemiomycetes; Wallemiales; Wallemiaceae; Wallemia.
OX NCBI_TaxID=671144 {ECO:0000313|EMBL:EIM19583.1, ECO:0000313|Proteomes:UP000005242};
RN [1] {ECO:0000313|EMBL:EIM19583.1, ECO:0000313|Proteomes:UP000005242}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-4683 / CBS 633.66 {ECO:0000313|Proteomes:UP000005242};
RX PubMed=22326418; DOI=10.1016/j.fgb.2012.01.007;
RA Padamsee M., Kumar T.K.A., Riley R., Binder M., Boyd A., Calvo A.M.,
RA Furukawa K., Hesse C., Hohmann S., James T.Y., LaButti K., Lapidus A.,
RA Lindquist E., Lucas S., Miller K., Shantappa S., Grigoriev I.V.,
RA Hibbett D.S., McLaughlin D.J., Spatafora J.W., Aime M.C.;
RT "The genome of the xerotolerant mold Wallemia sebi reveals adaptations to
RT osmotic stress and suggests cryptic sexual reproduction.";
RL Fungal Genet. Biol. 49:217-226(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the DDB1 family.
CC {ECO:0000256|ARBA:ARBA00007453}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH668248; EIM19583.1; -; Genomic_DNA.
DR RefSeq; XP_006960381.1; XM_006960319.1.
DR AlphaFoldDB; I4Y6J1; -.
DR STRING; 671144.I4Y6J1; -.
DR GeneID; 18471782; -.
DR KEGG; wse:WALSEDRAFT_33953; -.
DR eggNOG; KOG1897; Eukaryota.
DR HOGENOM; CLU_302513_0_0_1; -.
DR InParanoid; I4Y6J1; -.
DR OMA; IVDFCLF; -.
DR OrthoDB; 226997at2759; -.
DR Proteomes; UP000005242; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR011047; Quinoprotein_ADH-like_supfam.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF3; DNA DAMAGE-BINDING PROTEIN 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50998; Quinoprotein alcohol dehydrogenase-like; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000005242}.
FT DOMAIN 39..434
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 695..944
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 986 AA; 108939 MW; 7895A951707244AF CRC64;
MTDGISEIGR VDVNGRVIGI DKIIFQTHES LLLTLDHPQA QLVILSLSYE NGVIKHIVEC
TKMLVETTGE PSYEYCNSIV DKSTQIGVSH LWQGQLHAFK LSYDDRRKTH IIDGRNSQID
HSVVLSMAFL ATDKDEKPTL CRLVQSADVD NPLLVFEHLI CKEDYVDISQ TVLRIETQCP
SAQKIVAVEG KKRAVLVIGA FGCEYYEIPK KDLKGVRRKS STTVTSPQVV DMEHLSVKSP
MAAVRGYTAV NDSCTAWLIG DEKGDIYYIS ISNFLEITLV GNSSVASTLQ HLGSGFLFLG
SQNEDSKLLV VQTNPVRIVE LENYTNLAPV SDMALTHPDG IQGQLVVCSG SNKTGKLRVV
TTGIGLCDIY QVGLGDSISN VFILGSHLLV SYLSTTKIYD IPNGLYGTFD ESVAYQDVRR
DMPTLSVVSS GGRHGIYQSN LLTIFDDSGN LVERKETEID FVSVSEDLES VLVVGVEGLV
LHQKGQRRNL HKPEHAATYA ILDDNHIVYT TWTDYSVNIV DSRSNELIYS CKSRDDTPIL
SLLVENKVVL AGSADGSLYA FRFDEHLKSV DIQTVAIGST PVCLTRSNDG LIFALCDVPS
IVTLDNTRLR YSSININYIN GLTSYKTNDM VNYVFVQNDQ LKFSRILSTE NRVHIHSIEM
GADVPRQVAY KEDRYAVGCV RNAYRSDRTL YESSSCVKLL DNNYEQLAQM EMEKDEIVSV
VESLSIANME VFVVGTYYNN ETEGTEEATK GRFIILLVKD DKFIIASSFL VPGCVYAVCG
IDQKLAVAVN YQVRVYDIES IRDDTYKMRF IASYGNAFVV VSLTSVGKIL VVADFLKSAI
YLQLDTERGA LTQVGYDTAQ RWSSLVVALD DGNEETFTTL GADIRFHLFA LDRTSSGITS
RTLAQLPDNV SAIERAKTRS GDKLQPLAIY GTSAGAICAL ASADSSYEGL NGGRVKVRQE
VGTRQDQNVV DGDILSAHDD ELQRLL
//