ID A0A1Q9DFZ1_SYMMI Unreviewed; 4315 AA.
AC A0A1Q9DFZ1;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE SubName: Full=Glycosyltransferase-like protein LARGE2 {ECO:0000313|EMBL:OLP94104.1};
GN Name=Gyltl1b {ECO:0000313|EMBL:OLP94104.1};
GN ORFNames=AK812_SmicGene23912 {ECO:0000313|EMBL:OLP94104.1};
OS Symbiodinium microadriaticum (Dinoflagellate) (Zooxanthella
OS microadriatica).
OC Eukaryota; Sar; Alveolata; Dinophyceae; Suessiales; Symbiodiniaceae;
OC Symbiodinium.
OX NCBI_TaxID=2951 {ECO:0000313|EMBL:OLP94104.1, ECO:0000313|Proteomes:UP000186817};
RN [1] {ECO:0000313|EMBL:OLP94104.1, ECO:0000313|Proteomes:UP000186817}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP2467 {ECO:0000313|EMBL:OLP94104.1,
RC ECO:0000313|Proteomes:UP000186817};
RA Aranda M., Li Y., Liew Y.J., Baumgarten S., Simakov O., Wilson M., Piel J.,
RA Ashoor H., Bougouffa S., Bajic V.B., Ryu T., Ravasi T., Bayer T.,
RA Micklem G., Kim H., Bhak J., Lajeunesse T.C., Voolstra C.R.;
RT "Genome analysis of coral dinoflagellate symbionts highlights evolutionary
RT adaptations to a symbiotic lifestyle.";
RL Submitted (FEB-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004606}; Single-
CC pass type II membrane protein {ECO:0000256|ARBA:ARBA00004606}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OLP94104.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSRX01000556; OLP94104.1; -; Genomic_DNA.
DR OrthoDB; 1348065at2759; -.
DR Proteomes; UP000186817; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0016740; F:transferase activity; IEA:UniProtKB-KW.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR NCBIfam; TIGR00756; PPR; 2.
DR PANTHER; PTHR12270:SF25; GLYCOSYLTRANSFERASE-LIKE PROTEIN LARGE; 1.
DR PANTHER; PTHR12270; GLYCOSYLTRANSFERASE-RELATED; 1.
DR Pfam; PF13896; Glyco_transf_49; 1.
DR Pfam; PF01535; PPR; 1.
DR Pfam; PF13041; PPR_2; 2.
DR Pfam; PF00075; RNase_H; 1.
DR SMART; SM00355; ZnF_C2H2; 2.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR SUPFAM; SSF81901; HCP-like; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS51375; PPR; 4.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Reference proteome {ECO:0000313|Proteomes:UP000186817};
KW Signal-anchor {ECO:0000256|ARBA:ARBA00022968};
KW Transferase {ECO:0000313|EMBL:OLP94104.1};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT REPEAT 777..811
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 897..931
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 966..1000
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 1001..1035
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT DOMAIN 2353..2497
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 3127..3421
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 3718..3754
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 507..528
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 619..693
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 821..874
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1438..1469
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4270..4291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1571..1605
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 508..528
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 641..688
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 854..874
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4315 AA; 476846 MW; DC1532E13640F0A8 CRC64;
MVEEIKSYHY ENAALFAEPF EVSVPSTWLN NMAKLELVKP IVPLEHRIAF VLNHKARHEK
THGEISVDTF CKLYEQKFKH ANENEKVSDT FVDTAVTVGR RVLRDPQLAN LLLQADEHVG
SPFNDYSKLQ ILVSKGKTQE GIRWVFNTMW DGYFYGWLKP EHLTKRALEG TSWPGGTAAM
MIGMMELKTL LLTQCCSKMQ WDNSIKQKMV EVFQSHESYR RHCGANDDKV DLTWKASWPK
SALHYFDHVE APDWPDHADI DRLYEDEEKQ RKKARLHFKV VIRKAAGELQ EMEKNLADDD
APEKSQDAEV EKHAHLKVEV PGDAKNKAPP KLDDEASRFK EKAMKLVRTH VLLVNEDVNV
KGMTKALRET KVPEWKGEGC LPASDVILLL DGGVQSAMLG SIKGPDGEVI NKQKSTLMLH
VSETSLRNRR KRDKGHHVYN GERVYVVTSD DGGLPQYRDR CHFSESTTRG EIIGPIQAPS
WEDENETWMV SYGDKEKIYG TQMVRVGGKN EPEEDATAGR RQDADKEPMS YHALPECAVE
EFFHAYNIQY MLDLSVGDGK TCMLAIKRRA ELVGITFNEF HKEQLYRRLE SQVFREFQNA
DSPLYEAALV QLLGKNKRKA VPKPKAKSSP SKKKKKEEDE DEEEGPAEET ASQEDDDEED
EDNEEEDDED MAEESEEEDE EEAKAEEEAA VDGDVPSRRC WMPWTFIFLS GDVAYGWVID
TGTSRKFRLK PVLDKSLGIS SDGVVGDCAR EQCASEKNVV SANVECVDET RQETDRNVVT
FTSMIDAAAR SGDDKAARHW YEMALDMGIQ PTIRRVDIRH SGGAKASKED TDPPCWACLP
PGTSLEDAGP AQGQRGGSTA SGRSTFNTLM SGPARQGNLT MTERWLRTAK FMGASPDRVT
YNTLISAAAR KGDLAAAEHW FKQAVESRVD PNVVTYTTMI SAAGNEGRLK KAQDWYERAA
AKVRPNVAMY NAVLSAAAKQ GALATAESWF ERMEADGILP DTRTFSILMD ACAKSADLEA
AERWFERASV AGSKHDAITF STLINAAANK GDLEAAGKWL EKAQQVQANL DTGALNSLLA
AASRVGDLDI GSWYRRALAA GCQPDETTLA ILGEASAIER NQKEQKSLEA QKKQNEKASR
FGWSAATRAY KKMEGLMAKK QALSKEADQA TIWQKRNALA TAWLEKKAAL QKKGKDVKWL
DSWIKAQTKD DADKRIVITH QSAAALLMSI PPSLSGDVRT EMKMTRDFIK LDEEYDARQQ
QLNTELAEQK LIIAEVQSTP KYVPPTYTPK TDWLGRPKGL TEEQVNQLQG TKLTNVAPPP
LVVQARVVGV GEPVEGNVVQ GIVVEPSPAI QQAQYEQFQL YQRQQQQMKR KLDQQRNYNA
YQDGAYGYDL SFYAYMGLWQ KQTFDIARLG SLRQRSVDYK DIIRRAAYQL AQPEGPAQGY
SQAAQAGPQG KAAGKQGKGK AAATPAEATE TFRVTPWQPQ GVPAAVEPKP APTEDPQMQE
LWGALRASQQ ALPPLVAEAL AKMEGNQGRM VTKSMHAHTK TMGSAKKQIH TLREARRKQD
ASWVEFLNNT VLALEQGAKR YQEAMQKFEE QETEARTRLT NARQAIRELA GKMETEEPNI
VDDAEDLDDS DIELMDTTQT ALTATTEESK VVQAQKKLRV TLDALMSKRP PADPGKATTS
PPKPIIFLPM LPNEMQAQEH IATPRSREVG AVSVGTLSEP TVGNGKSAQA MPKLDSGFPA
VILGQAAPVN SERRSCSSLH SQGRLALSAA QLRLCQDAAS AGPGDGKYSV FDRHRHHEVR
KSIPTWAASD YLNDAIASCA EATICAQFLQ PPIPDLPVPQ ITLTPPGWPL GSLSVAIDAR
KVGGTICTVV LTPGCEKLTV LDAIATECPT LAPVLNFAVA QDAVFLLDSA GRIWEVLPLQ
LDTLQWLSLQ LDVTGVPHLH SSLFLGGMVG SGPTTLTSTA AIASSSSSAP PTVTFILVGG
GTLIRLAPQP WMHASVLDSL TELLFVLALQ GRMPHRPLLQ ISAASPRVPA RLNTYFICFL
IYPEGPDIHV LEDFSVDGHM RAEASQPTSM ALHIRQPTAN WRQEISSRSQ GDYFAEVETP
SRLYELLPQL RYFALPVPVP HMQALAADPL SATLQVRAKE SLRRTLQLRI DDRQRHFGTF
ARSNQPIVVL GPGHPALSLY MDQPLTPGFD EAVEFLQNGE YLPAGTTFRD PLVLEWTTPV
FISVPPGSTR RTLLYPSPHS PHFLQVSVPP NLPLAGLPLP VRRGKRAVFP PDTESEHVIE
EKPLPPARRA SADASSGTSL LQISAELRQV RIPTLSRRTS CHFLTLCRAP GAFCHWRVTP
GISCVLPSCP AEHLDAVLVY TDGSFDPKAR RASWSFVAVG KQGSAICRLG ALAGRVSEPA
TAVNAFRAEL WALLQAVAFI IANNLQQVTL AVDCQAALDV AFGNAQAAAQ DPIGLAVQSL
LFFARTMGLQ VKPWKVEAHS GIPLNEAADA VAKTANGCWI ADGFHFDDSV LRSCIDDGTV
HRLWLTSCTG RLSAQLPPLD NGGCWNLPSC QLRPATEAAT SSIHSETSAP KNWKLAFSVI
TYNCLSARSR PARALLDAGL QASQCAIAGL QEARDPTTGI SSSDHYWIAT SGCDAQGSYG
CQIWISKTRA WGHSSDTALK PVRDSFSLFH AEPRLLVLLI RVGRIKCACV SAHAPTTAAG
TDASRNWWAH LRATCNRIPP GHTLLLMVDA NAAYDRFGPN GDTLESVPCS ENAKQFQQFC
DAVSLQPTAQ VDRLGHRLVS WTSPDGQTQK LLDYVCVPAA WRTRLSAAPN FCLGDLREGY
DHSPIMGSVV ATVCAPSTDT RPRFSPEALR TADGRKIAAA ALASAPPVPW EVDATTHLEV
LLGHVQRFLA KHLPPPPPRP RNPVLSDMSL RLILTRRQVR TVKKRLAMTY ARGLLFQCFA
AWSGRAELAR SQANRLDRLS CRLARAGFAL SALHKGIGES FRADKAQFTR RAMETARGRG
AAEFAHSIRA ILRTGRRFRA PQLLHQISDG NSTAVCEEEI LKLLGEHFAK PERARQITGG
ALAKLFNEVQ PPTHKIDMTS LPSIADIASA TLALKKGKAP GMSGVPAELF QADALATAEL
LFPILAKSVV RCSGPLQHNG GLARAIPKSK QQAGSPAGWR SILLLEPTGK ILQKAYRTQL
VLALEQHKSK NQFGGLPHRR IEDASVMVRA HFARLKAHRQ TGGALFIDSR AAYYSLVRDS
LVRSQTLQTE DQLYQRARAL FPFRDDQDHY VRHMQAGGLI PALQLPEPLV RYLESQLGTT
WFSMEAPSQA PYVSGSGTAP GSPIADLLFS FVYARFLQHV EEILLAEGHY VTMCSTADAS
IMPTWADDTA VLVGPVAPPL LAASLQRVTD LLTRGLSRAG LDPNFGPGKT EAVIHFEGPG
SRDTRRQLLC LVEPGVQFDS HLRGTLHLRL VPTYVHLGTV VSHNLSEEPN LQHRAHLLKQ
LFQPLRRRLL YNDDLTKQEK IRLLEERVFP KFLFGAGFWT PRNTREHDMA LDPLRCAMRQ
AFRPITGVSS TGYCNQEVAA ALGLPTADDC LAHARATTLL HLLKTGSREV WHGLLADGLW
YQLSWEALRT VLGEAWPVAL CTTAPPPPEQ LQSLLPNHCD KLCRNYLRKC KQNHASVALQ
PRCAEASEVP AITRTLHSGL THTCGVCGLS FLNARRLAVH KARKHQQRAK GIQLAWGTRC
ERCGTEFWNT SRLAQHLQHS ATCQSVYDHS DTQPACVETP SKVPLAWLRG ARCADCSFYE
HLRYQWHHQH YYYGYHDQDY HHHYDHFDTA PYDYGECQFA QEFGAEPPQD YWTGLPDDPP
EGESFERVTG DAASSIDFFP GAVYSNVEMY AGGEDDRPER GPHNGSWKTA CLSPAPPLDL
SEWFEEHIAR AEEDAKAAVL GRLGRLPYRR EVLLRTRSNQ RAGPGLCLVT QCSLDRLDRL
RDQLCGWAGE VSAAVFVDHM QDSEAAAAAR RGIRDMCFEA AMQFDGNVPA WTITVLYRLE
DADVKCEAYD RLYPVNALRN VALEHARSDL VFLVDVDFVP SRRLHEVLAS DDGGRRLLNA
LSHGSRKSRG PGALVIPAFE AAKSNTPLPL HGVELRHAAA RGEVEGFHVS HFPCGHRATD
FDRWLGVSKV ESPDAGNFSQ HAHGVDAYLI HYEEHFEPYV VVPRQSAPAY DERFRGYGLN
KISHLYELSQ RGFRFRTCAH PEAFVVAAKH PKSQSWHLVL GPQAEAEQRA RIATHYATFK
AELKKRLRLS PPLPIKPPKA RTTPWRGLPT KRLASGSAPL LPMPLDAPAP LPVAV
//