ID A0A2P6TY21_CHLSO Unreviewed; 1036 AA.
AC A0A2P6TY21;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=DNA excision repair ERCC-8-like isoform A {ECO:0000313|EMBL:PRW58966.1};
DE SubName: Full=DNA excision repair ERCC-8-like isoform B {ECO:0000313|EMBL:PRW58967.1};
GN ORFNames=C2E21_2502 {ECO:0000313|EMBL:PRW58967.1};
OS Chlorella sorokiniana (Freshwater green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW58967.1, ECO:0000313|Proteomes:UP000239899};
RN [1] {ECO:0000313|EMBL:PRW58967.1, ECO:0000313|Proteomes:UP000239899}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=1602 {ECO:0000313|EMBL:PRW58967.1}, and UTEX 1602
RC {ECO:0000313|Proteomes:UP000239899};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
RN [2] {ECO:0000313|EMBL:PRW58967.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=1602 {ECO:0000313|EMBL:PRW58967.1};
RA Cohen D.B., Kent A.D.;
RL Submitted (FEB-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PRW58967.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPG02000004; PRW58966.1; -; Genomic_DNA.
DR EMBL; LHPG02000004; PRW58967.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2P6TY21; -.
DR STRING; 3076.A0A2P6TY21; -.
DR OrthoDB; 1364175at2759; -.
DR Proteomes; UP000239899; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProt.
DR GO; GO:0006283; P:transcription-coupled nucleotide-excision repair; IEA:InterPro.
DR CDD; cd16514; RING-HC_LONFs_rpt2; 1.
DR Gene3D; 2.30.130.40; LON domain-like; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR003111; Lon_prtase_N.
DR InterPro; IPR046336; Lon_prtase_N_sf.
DR InterPro; IPR015947; PUA-like_sf.
DR InterPro; IPR042238; Rad28/ERCC8/Ckn1/ATCSA-1.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR46202; DNA EXCISION REPAIR PROTEIN ERCC-8; 1.
DR PANTHER; PTHR46202:SF1; DNA EXCISION REPAIR PROTEIN ERCC-8; 1.
DR Pfam; PF02190; LON_substr_bdg; 1.
DR Pfam; PF00400; WD40; 4.
DR Pfam; PF13923; zf-C3HC4_2; 1.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00464; LON; 1.
DR SMART; SM00184; RING; 1.
DR SMART; SM00028; TPR; 1.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF88697; PUA domain-like; 1.
DR SUPFAM; SSF57850; RING/U-box; 1.
DR SUPFAM; SSF48452; TPR-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS51787; LON_N; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 4.
DR PROSITE; PS50294; WD_REPEATS_REGION; 4.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00175};
KW Reference proteome {ECO:0000313|Proteomes:UP000239899};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}; Zinc {ECO:0000256|PROSITE-ProRule:PRU00175};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00175}.
FT DOMAIN 191..230
FT /note="RING-type"
FT /evidence="ECO:0000259|PROSITE:PS50089"
FT DOMAIN 278..481
FT /note="Lon N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51787"
FT REPEAT 646..682
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 733..775
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 891..932
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 979..1011
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 782..880
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1016..1036
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 783..803
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1021..1036
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1036 AA; 111284 MW; 2F0429886C36594B CRC64;
MQEEKFEKAV NAASDLLRQS AIAGGTAGDP VAAQRHAVLL LARSAAYSAL SERLRSVPAS
QSERRAIFAP DPTQLAALGL KDADAALALL PAGAPDAPQA HLCRGRALAL LERYREAEAA
YTAGLAEQPT HEQLQQGLQQ LRAALADGQA GSGRVVGEVS GQTAGVDGAR EQAAGGQAAR
RLQEVSDDAE CTLCMRLFYD PVTTPCGHTF CKPCFARALD HSISRCPMCR TVLHTGRELT
VTVALKNLLE KSFPEEYAAR RQEELEAAAA APAGGASDAP LPMFVMSLLL PGECMALNIF
EPRYRLMIRR CMEGNRRFAM GTVDRQHQLG QVACEAEIVE CQPLPDGRFY IEIVGRRRFH
PTETWEQDGY RVARPTFIAD EPPAEGSEEA QRLQEAAAEV GQLADGLLER LRSLSQSRRG
IAELLSRLGT KPEPSSPEAF SFWVANLVCS CLAGEGELKC NFLSTRSTLE RLDSQRQLLQ
QLGNVQQPGC TVMKKQPSVP AALCTVAAAN MQGSRLSLIF GRQEAGGRGA ANRLSRERLQ
SLEFSHSKVI RSVHTGGVFC MDLDAGEQRY LLAGAADASI AVYDTQQPSP AAAQQQQEQA
REAASTAAAL TSNAVYLRDS AQVAAARGNA KHTEHAAVLT ITKQSPQAHK FSVSAVAWYP
VDSGLFVSGG YDNEVKVWDT NSLQVVCHFE LAARVHAAAM SACATSHCLV AVGSGDTQVQ
LCDIVSGGFT HSLAGHRAAV WALAWSPCNE WQLATGSCDG QLRLWDIRRA GPLHIFDQHD
TQEQRAAGGQ AAAAQQQQQQ QGEAEGEDEM EQDFWGADAP QPLPPREQDD QQLPPVQQAV
ADVMQRQRSG GSGGGRRPGR GSSGRAAAAA AGGSAAGSER MFLPERVTKY STAHTGSITG
VVPTPDGLHW LSAGTDDRVR LWDAATYKHR LVHYPGAFNR ASRARQLAVS DDSSTLFHPS
GSVVQMFDVQ SGRLVRTLGG GHFDTINCCK WNPAAEELYS GSSDSNIVVW SPAQEAVTAE
RDGWQGRDSD GDDWSD
//