ID A0A2C6KTE9_9APIC Unreviewed; 777 AA.
AC A0A2C6KTE9;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 22-FEB-2023, entry version 17.
DE SubName: Full=Had family iia protein {ECO:0000313|EMBL:PHJ19453.1};
GN ORFNames=CSUI_006715 {ECO:0000313|EMBL:PHJ19453.1};
OS Cystoisospora suis.
OC Eukaryota; Sar; Alveolata; Apicomplexa; Conoidasida; Coccidia;
OC Eucoccidiorida; Eimeriorina; Sarcocystidae; Cystoisospora.
OX NCBI_TaxID=483139 {ECO:0000313|EMBL:PHJ19453.1, ECO:0000313|Proteomes:UP000221165};
RN [1] {ECO:0000313|EMBL:PHJ19453.1, ECO:0000313|Proteomes:UP000221165}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wien I {ECO:0000313|EMBL:PHJ19453.1,
RC ECO:0000313|Proteomes:UP000221165};
RX PubMed=28161402; DOI=10.1016/j.ijpara.2016.11.007;
RA Palmieri N., Shrestha A., Ruttkowski B., Beck T., Vogl C., Tomley F.,
RA Blake D.P., Joachim A.;
RT "The genome of the protozoan parasite Cystoisospora suis and a reverse
RT vaccinology approach to identify vaccine candidates.";
RL Int. J. Parasitol. 47:189-202(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PHJ19453.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MIGC01003435; PHJ19453.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2C6KTE9; -.
DR VEuPathDB; ToxoDB:CSUI_006715; -.
DR OrthoDB; 217676at2759; -.
DR Proteomes; UP000221165; Unassembled WGS sequence.
DR Gene3D; 3.40.50.1000; HAD superfamily/HAD-like; 3.
DR InterPro; IPR036412; HAD-like_sf.
DR InterPro; IPR006357; HAD-SF_hydro_IIA.
DR InterPro; IPR023214; HAD_sf.
DR PANTHER; PTHR19288; 4-NITROPHENYLPHOSPHATASE-RELATED; 1.
DR PANTHER; PTHR19288:SF46; HALOACID DEHALOGENASE-LIKE HYDROLASE DOMAIN-CONTAINING PROTEIN 2; 1.
DR Pfam; PF13344; Hydrolase_6; 1.
DR Pfam; PF13242; Hydrolase_like; 1.
DR SUPFAM; SSF56784; HAD-like; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000221165}.
FT REGION 1..248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 478..508
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 632..696
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8..45
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 56..82
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..121
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 122..142
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..158
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 188..217
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 218..248
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 662..680
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 777 AA; 86323 MW; 460DED424CB8D8AA CRC64;
MGRLSTGAAQ HPYQQLYSSS SPTSLCVSPF KKEHEGSNQN GMASSNKEKK RNRSLSSATK
NISSLSLQPS GKSQLSNRAI RKSQKSRQSH STTHHVDHPS LRLSPSSSFS LTSSSPSTFL
EPDHLRQEDA ETCGTSFKLS REKRSNKKVR KTLHKYDKRL SFRRTSLIPK ATSDKKTAPS
QNGERHGCLQ GDIEKIQEGR KKHHEEGRHK SNEKRGRRSP SSCSSGSSAG ASRSKKERSS
LVTPSSSFIP AKEQEKIASS SSSFALLSSS SSSRRGTKKE GETKMRERHL SSLSDFCWFQ
EKCLCIESLS TELSRQLPQI SPSRFVEKQR EYLSFLKDHA HLSSCTSKIL LSQEDYQKDF
LDHYENFIFD IDGVLLMGKQ SYTGISNTLQ LLRLRKKTIL FVTNSASKSR RICAKVLTDA
GIQAYEHEIV TASYAAAQYI RETYPDVKKV FMIGEEGLKE ELNLAEIQVV SLDTHPLPTS
QTSGVSTPQQ QQQSRTPNTH SSSSSCDGRG VISIESEADF RSVSGCLDPS IGAVVVGWDR
RLSFSKLCLA SLYLQRKIEN SKHISSSSRE NSDGFLPFIA ANRDSYDMVE AYKIPANGAA
VSYLEAASTR KATCVGKPSE WLARWLVSRH LPSPSSSEEE AEEEDQPILD RSNHSNSARD
GNDSLKEQQT SLQASCKQNG VAKNRKKKKK GGVEENKRNL KKTVVCGDRL DTDIELGHVM
NVDTCLVLTG CTNLDVFLKD LIFPQLQTSK RDKKKHSNAS FPTLVLPHLG ILAQDQL
//