ID A0A2P6TT42_CHLSO Unreviewed; 1024 AA.
AC A0A2P6TT42;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 24-JAN-2024, entry version 22.
DE RecName: Full=cysteine dioxygenase {ECO:0000256|ARBA:ARBA00013133};
DE EC=1.13.11.20 {ECO:0000256|ARBA:ARBA00013133};
GN ORFNames=C2E21_4008 {ECO:0000313|EMBL:PRW57240.1};
OS Chlorella sorokiniana (Freshwater green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW57240.1, ECO:0000313|Proteomes:UP000239899};
RN [1] {ECO:0000313|EMBL:PRW57240.1, ECO:0000313|Proteomes:UP000239899}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UTEX 1602 {ECO:0000313|Proteomes:UP000239899};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004141}; Multi-
CC pass membrane protein {ECO:0000256|ARBA:ARBA00004141}.
CC -!- SIMILARITY: Belongs to the ABC transporter superfamily. ABCG family.
CC Eye pigment precursor importer (TC 3.A.1.204) subfamily.
CC {ECO:0000256|ARBA:ARBA00005814}.
CC -!- SIMILARITY: Belongs to the cysteine dioxygenase family.
CC {ECO:0000256|ARBA:ARBA00006622}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PRW57240.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPG02000007; PRW57240.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2P6TT42; -.
DR STRING; 3076.A0A2P6TT42; -.
DR OrthoDB; 359054at2759; -.
DR Proteomes; UP000239899; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0140359; F:ABC-type transporter activity; IEA:InterPro.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR GO; GO:0004176; F:ATP-dependent peptidase activity; IEA:InterPro.
DR GO; GO:0017172; F:cysteine dioxygenase activity; IEA:UniProtKB-EC.
DR GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR GO; GO:0004222; F:metalloendopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd10548; cupin_CDO; 1.
DR Gene3D; 2.60.120.10; Jelly Rolls; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR003593; AAA+_ATPase.
DR InterPro; IPR013525; ABC2_TM.
DR InterPro; IPR003439; ABC_transporter-like_ATP-bd.
DR InterPro; IPR017871; ABC_transporter-like_CS.
DR InterPro; IPR043926; ABCG_dom.
DR InterPro; IPR010300; CDO_1.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR037219; Peptidase_M41-like.
DR InterPro; IPR014710; RmlC-like_jellyroll.
DR InterPro; IPR011051; RmlC_Cupin_sf.
DR PANTHER; PTHR48042; ABC TRANSPORTER G FAMILY MEMBER 11; 1.
DR PANTHER; PTHR48042:SF11; ABC TRANSPORTER G FAMILY MEMBER 11; 1.
DR Pfam; PF01061; ABC2_membrane; 1.
DR Pfam; PF19055; ABC2_membrane_7; 1.
DR Pfam; PF00005; ABC_tran; 1.
DR Pfam; PF05995; CDO_I; 1.
DR SMART; SM00382; AAA; 1.
DR SUPFAM; SSF140990; FtsH protease domain-like; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF51182; RmlC-like cupins; 1.
DR PROSITE; PS00211; ABC_TRANSPORTER_1; 1.
DR PROSITE; PS50893; ABC_TRANSPORTER_2; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000239899};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}; Transport {ECO:0000256|ARBA:ARBA00022448}.
FT TRANSMEM 296..317
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 329..349
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 369..397
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 404..428
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 434..454
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 466..484
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 524..544
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 686..706
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 712..731
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1..208
FT /note="ABC transporter"
FT /evidence="ECO:0000259|PROSITE:PS50893"
SQ SEQUENCE 1024 AA; 112254 MW; 89A5DAF0FD33C6C3 CRC64;
MGPSGCGKTT LLDTLAGRLA KSARSSGDIR VNGHKSKLSF GRSAYVTQDD VLIGTLTVYE
TIMYSAKLRL PQRMPAEEKE RIVNEVISEL GLESTRDTYI GTWHLRGISG GQRRRVSIGC
ELVTSPSLVF LDEPTSGLDS AAAYYVMAAV RRLAEHCRTV VSVIHQPSSE VYGLFDKLCL
LSDGHVVYFG AADRAADFFA EAGLGVPLNR NPADHFLHTI NRDFLESEDV EANIQKLVKQ
YSSSRISAHV KDHVQALEAA PGDKYTSGGA QPSWLFKTSV LTVRTFLNNL RNVGVFWMRL
AMYVMLCLAI GFVYFQLDDA WKDVFSRTAL LFFVVAFLTF MSIAAFPAFT DDMKVFVRER
LNGYYGVSVF TVANTLASLP FIFLIAVVST VCVYWLANLR GGAGYVWFFI FDLFLSLTVV
ESLMMAIAPL VPNYLMGIAA GAGVMGLYMI VCGFFQPMES MPKPIFRYPL SYMSYHTFSF
IGFMRNEFEG TSGWGCPPGL DGVAPASCGL NGDLVLSYYE IMDINKWICM VILACMAVFY
RGLFFAALKW KEWKTIDTMA VAAANTVQRV KSVVAARQVA RPARSAGRQR LCVVRSQQEG
AAATAEASFS RLADALEDYR RAPISFKQEV SSEVLAAVQQ LADAGALKKW GVVETPPRRN
VLQGELRLVG ISQPEKIAQI SVRNDAAFLF SVVGTTSIAA VVLGQLPGDW GFFSSYLTGG
IALAVLAVGS INPGLLQFAI DQFSQVFPDY KERVVRHEAA HFLTGYLLGV PVANYSLTLG
KEHTDFAEAK LQKRLIEGTL APEQVDQLSV VAMAGAASEA MKFDDVVGQN ADMFDLQRIM
QRQQPKLTDA QQQNQTRWAV YQAASLLRAH SAEYEALQAA MARGASVVRN LVYQDENFEV
LVLCWAPGQG SRIHNHGDSH GWVTVLRGHV EETRAPTSPR EGPAAPPAIP GVLGAVTPCP
QLEKLGKGVA GPAAQLYIND GQALHAHLYA PPTRRVLLYE PEADRVVTRV PGFYSRGGRV
VREG
//