ID Q5CRE2_CRYPI Unreviewed; 2069 AA.
AC Q5CRE2;
DT 12-APR-2005, integrated into UniProtKB/TrEMBL.
DT 12-APR-2005, sequence version 1.
DT 27-MAR-2024, entry version 88.
DE SubName: Full=LPS glycosyltransferase of possible cyanobacterial origin {ECO:0000313|EMBL:EAK88018.1};
GN ORFNames=cgd5_3140 {ECO:0000313|EMBL:EAK88018.1};
OS Cryptosporidium parvum (strain Iowa II).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Conoidasida; Coccidia;
OC Eucoccidiorida; Eimeriorina; Cryptosporidiidae; Cryptosporidium.
OX NCBI_TaxID=353152 {ECO:0000313|EMBL:EAK88018.1, ECO:0000313|Proteomes:UP000006726};
RN [1] {ECO:0000313|EMBL:EAK88018.1, ECO:0000313|Proteomes:UP000006726}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Iowa II {ECO:0000313|Proteomes:UP000006726};
RX PubMed=15044751; DOI=10.1126/science.1094786;
RA Abrahamsen M.S., Templeton T.J., Enomoto S., Abrahante J.E., Zhu G.,
RA Lancto C.A., Deng M., Liu C., Widmer G., Tzipori S., Buck G.A., Xu P.,
RA Bankier A.T., Dear P.H., Konfortov B.A., Spriggs H.F., Iyer L.,
RA Anantharaman V., Aravind L., Kapur V.;
RT "Complete genome sequence of the apicomplexan, Cryptosporidium parvum.";
RL Science 304:441-445(2004).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAK88018.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAEE01000007; EAK88018.1; -; Genomic_DNA.
DR RefSeq; XP_626244.1; XM_626244.1.
DR STRING; 353152.Q5CRE2; -.
DR EnsemblProtists; EAK88018; EAK88018; cgd5_3140.
DR GeneID; 3373256; -.
DR KEGG; cpv:cgd5_3140; -.
DR VEuPathDB; CryptoDB:cgd5_3140; -.
DR InParanoid; Q5CRE2; -.
DR OMA; IEREACH; -.
DR OrthoDB; 96at2759; -.
DR Proteomes; UP000006726; Chromosome 5.
DR GO; GO:0016757; F:glycosyltransferase activity; IEA:InterPro.
DR CDD; cd03801; GT4_PimA-like; 1.
DR Gene3D; 3.40.50.2000; Glycogen Phosphorylase B; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 3.
DR Gene3D; 2.60.40.1180; Golgi alpha-mannosidase II; 1.
DR InterPro; IPR013780; Glyco_hydro_b.
DR InterPro; IPR001296; Glyco_trans_1.
DR InterPro; IPR028098; Glyco_trans_4-like_N.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR43651; 1,4-ALPHA-GLUCAN-BRANCHING ENZYME; 1.
DR PANTHER; PTHR43651:SF3; 1,4-ALPHA-GLUCAN-BRANCHING ENZYME; 1.
DR Pfam; PF13439; Glyco_transf_4; 1.
DR Pfam; PF00534; Glycos_transf_1; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF53756; UDP-Glycosyltransferase/glycogen phosphorylase; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000006726};
KW Transferase {ECO:0000313|EMBL:EAK88018.1}.
FT DOMAIN 1246..1435
FT /note="Glycosyltransferase subfamily 4-like N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13439"
FT DOMAIN 1449..1612
FT /note="Glycosyl transferase family 1"
FT /evidence="ECO:0000259|Pfam:PF00534"
FT REGION 1182..1201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1839..1869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1186..1201
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2069 AA; 234829 MW; 7037D5E7ACF00914 CRC64;
MSDREELELD KNVNRMRLEL EEALDILTCH ENEEISSNRE GECCILDSYP LSMTLIRCEN
DKEKLGLIFQ LNEFSRMVKG SYWMVLEDNK DPRNLCMLEF FNYSDGVFRN KMFEVIVDKN
EREILQNYFS HSNSILNMNG GNKSEISNEK WMMTTLKRRR DIENSRNNLG LAFCNTFLKE
SNEKIRSFDL SSLAFMVKFS EVSNISELIN NVETLYEDFR LNKMEESFLT VFSFSIENRL
DGQARFEDEL KKLSLSNSKK DAYMIHIFTN PTDYAMVTST YNWSSYISTL KDLALGEART
EFMTQKLIYN SLENFEDLNI FRTEDSERDL EIKENMMGGE NLDGKSTIAT TLNYYSTDSC
KLVDSANGFS STDLDSVTTS INSSTENLTL SKKGSTLEGL DENKELEVID DIEIFEDEEV
IGFEEDPLLV SEWSNLRFWE SAVVGASDSV AISNIRKKNL LYSNSTINSN NNQNEKLTDN
QEIPVIQESV KEEEFYECRE MMFDTHTLKA KLAKSVWGGA TCSDRNCMQC FALLHAHAQT
NTDKDEKTME NERIWCREGF TKFCEVYKSM GVVELKRKSE LGNGNRAWLI REFIGDELTD
QVFCYGLNRR RRSIKGGLRT MNIYLVGDFN DWNKTSHPMQ LETENICSMN QEIYENYPSI
PHFMKSRVYS IILDENKLES FLSCSPKANS ISFKIRIVTS KADGFTEGHM AEVNNSMGAV
SSQELETYRL LSYSRSLTTS GIKDSTENAL MNCNFYLESQ KEHDGLRNRD LKYPLPRCVL
KPRLGGNLKQ NKDNIFDSNG KEYQIPVTSI VKGLVTNFRQ SPLYIYEANI AFSSKGEFGT
FASFKEKVLP RISRGGYNCL LLTGLLEHYQ SFSNSQYPFN YFTPYSKYGT LEEFKDLMDD
CHSRGIAVIV DFNLTFADIT NSNSMLQPCD REVNNNLSNL QDIIDIDEIN MCSGSEASNG
NMSREGSFIF GNGNSGTNIV RIDELFLDDS VVDLYESFML TNLNTFNPSF FKHTPDVISP
LFAYNKSYSR LNLGYIPMLA QIFSSLHYLL THLGIDGFRF TVPDLSEEQE KYSYISSVLA
LINDTIHTIK PFAITIANEL LLRENKDKAC SELAVPLEYG GMGFDYIWNT SICSSLQDII
LKSPSSLNIK KDIIDELLPH EDKMRNSNKK IRILYKNREK QNISSRGDND NGNNNSYSDM
GESDLHTVEF VSRNLYGIES LEMKTVTQNP LRIAMFSWES LYTHSVGGVS PHVSQLSASL
VRLGHEVHLF TRATTSAYII KVHDGVLYHE CPFQLNSDFV TEIINMCNSF VYYMQRWEDG
LTKVPDHPML NNVKNGYRFN ICHCHDWLAA PVLTSLRRIG NNNRTTILTL HSTEYGRCGN
QSYGGQSGRI SDIEREGSHT ADRLICVSGV FAEEVCRLFG INRNKIKVIF NGINCKTFDS
VAMDDAGEVK RQYGIGPLEP TFLCVARMAL QKGVDLLIEA VPGILKYRND TKFIIVGDGH
QKDEIVRRSH QLGIYNSIRF VGKKSGDDVI RLYKACDAVV VPSRNEPFGI VVLEGWTAGK
PVVATTSGGP RDFLTPNVDG YLVEPCKDSI AWGCCEILKN FDHSRWMGSR GRVKAAYSFS
WDSIARATSF LYFEQCNVLD VTPSLILKPN SPIILQAFGE NALHHHMMVF DGFDKSVFAL
KQLKLLILTT MAFAKNGVFQ SMGSEFGNPD SFDLPRPNNN MDNSKMCCKW DLADNKNLKF
KHLEFFNTLL IRLEKFLSWI AKNNIQDYGV NYNYLSPLTC GSEGELGKNG LTSACYSKSI
SPLTKGTSTS LIPSEISPAS TFSKRVLAKK EAIIKKLEES SGDNTSPVDN ESVIYLDSTP
QSNGKEGSKS IQNIYSRAKS LGKHSKDSLK ILTTDSLYNS SSNINVVLCH ELDKVMVIER
NNCVFVFNFS EKYYNDYGFG ISSELPQSLI IDTQDERFGG DKKYHITGSF CSTSIKSEQG
FKIFSPNSEP SNINSIVNNK FPNNFQHESP YRFVNTKSLK QTVFLNMSPF SAYILAPSNL
VQDFPCETVG DKLFSQNIDS FVDSIGLFC
//