ID A0A2P6V259_9CHLO Unreviewed; 1395 AA.
AC A0A2P6V259;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 22-FEB-2023, entry version 12.
DE SubName: Full=Acyl-coenzyme A thioesterase THEM4 isoform B {ECO:0000313|EMBL:PSC68179.1};
GN ORFNames=C2E20_8219 {ECO:0000313|EMBL:PSC68179.1};
OS Micractinium conductrix.
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Micractinium.
OX NCBI_TaxID=554055 {ECO:0000313|EMBL:PSC68179.1, ECO:0000313|Proteomes:UP000239649};
RN [1] {ECO:0000313|EMBL:PSC68179.1, ECO:0000313|Proteomes:UP000239649}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SAG 241.80 {ECO:0000313|EMBL:PSC68179.1,
RC ECO:0000313|Proteomes:UP000239649};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PSC68179.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPF02000041; PSC68179.1; -; Genomic_DNA.
DR Proteomes; UP000239649; Unassembled WGS sequence.
DR CDD; cd03443; PaaI_thioesterase; 1.
DR CDD; cd00229; SGNH_hydrolase; 2.
DR Gene3D; 3.10.129.10; Hotdog Thioesterase; 1.
DR Gene3D; 3.40.50.1110; SGNH hydrolase; 2.
DR InterPro; IPR029069; HotDog_dom_sf.
DR InterPro; IPR013830; SGNH_hydro.
DR InterPro; IPR036514; SGNH_hydro_sf.
DR InterPro; IPR006683; Thioestr_dom.
DR PANTHER; PTHR34407; EXPRESSED PROTEIN; 1.
DR PANTHER; PTHR34407:SF1; EXPRESSED PROTEIN; 1.
DR Pfam; PF03061; 4HBT; 1.
DR Pfam; PF13472; Lipase_GDSL_2; 1.
DR SUPFAM; SSF52266; SGNH hydrolase; 2.
DR SUPFAM; SSF54637; Thioesterase/thiol ester dehydrase-isomerase; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000239649}.
FT DOMAIN 143..221
FT /note="Thioesterase"
FT /evidence="ECO:0000259|Pfam:PF03061"
FT DOMAIN 406..610
FT /note="SGNH hydrolase-type esterase"
FT /evidence="ECO:0000259|Pfam:PF13472"
FT REGION 305..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 310..324
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1395 AA; 151515 MW; 796AB3BB002F9176 CRC64;
MPGTVPAAAS SGQQRPLSQI DRIKSQKALA AANGSGGVAA HGGETYMRGA RGQAEDLDRL
VKLDWTKELL AQPDMQTLLT CGAMSKAPEP GDDGVNGVDP DHLFLSLLRQ DLIRDLLFLY
NPTERTFRTL MSVGMDVCGH PTIVHGGFTS AMIDETTGGL VYELKKAGEL GEGSAFTARL
EVDYKRPMPS NSDVVCTARV EKVEGRKIWT VAEVADRPGG TVYALVRSSM AFMGGSIESG
LLGSRSGTPS GTPRDSAALR FHSWLPGAAL AVALLLTAGL LGMRTAPVEP GLRLREAHLD
TVTGLESPGD ACPPPRPCPP QKACPPCAAA AAGAAQQRQK QRQQRERAGQ PQRVLDAWVQ
RLTVPWGPLL TEQEAQRGLS YYGSGERLQA VAAKLMAGKP IKVFTLGASV TRGIGTTDRR
YSYASRLFEW IQAAFPHKDH VFVNRGIGGT SSAIYSVCAE HMVQEDADLI VLEFSANDLK
DAPFSHPERK GYEQLVRKLL GMWGRPALIQ LHHYAWWHAV GDGVEDGGLF YHPAAEAQLG
VFAQYYDFPS VSVRGAMWHL MRSRVGQFNP DAARHGATAS PTDYPIPGAE PGTEKQYWYR
DRTHPGDEGH GMLAELLAHV LAKAVMESLS PRPRLHLAGR DADLAPARDA RGLPLPMIPG
NAATPTTLCA IEEDFQDVVD ASKGFAYKPE RPDGRNFVEQ KWGWSAAKPG AWAEMLFDSA
SGFVDLTGNA TNEGAQVSLS YLKSYKGMGT AELACVSGCT CEPQALDGTW ETELSLQQIL
QFWVSRHRRC RVRITVSERP GAVAQRGHKV QLLGIMVSHF SVRLTTYPDQ KESDPRLVAR
APGFACTTLL FALALVGSAT YLATCQRGGL SLCPAWTSLR PAAAPRCGVP QSHPQVINGG
RLFLPGDALR RGTTAYGSGQ RMERLGAKLL AGQPVTVTFL GGSITWGRGG NEGGSFVVRF
TEWLNSTWPH PGHRIINHGL PAVTSALFAA CYDNVPKDSD LVVLDFAVND AAVSPNGRDK
LGYSFSNGQR RGFEQLVRKS LKLRDEPAVV LLQFFSWNAT RDKTEGKVVN GMPTGSQPYG
DSFFWRTIED ELSTVAAYYD APVMSLRNAA YHLLREQQHG FQWNVSLHHL VDSDAPEAEI
ERQKDAQFFW DENHPWDRTG HRAMAELLMA TVSRGVQAAA DAHLAHAACA GGHAGVPLAL
PPLPPLPPPM VTGNYEANAT SCHLQEAFEN VIQEEGGFKY EARAPDEDTF VAQKWGLTAS
KPGASATLRV DTELGGLSGK MDAVQVHLLF LRSWRGMGRA LVECEGGCEC EATELEGHWE
RQATLTDLYT LQATLTDLYT LQVSPHPRCQ LRVTVQEGSA SGEHMFSVSG VVVSSVDARM
ERGKIGQWEY NKART
//