ID A0A1Q9C0W8_SYMMI Unreviewed; 5113 AA.
AC A0A1Q9C0W8;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 22-FEB-2023, entry version 15.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:OLP76564.1};
GN ORFNames=AK812_SmicGene43485 {ECO:0000313|EMBL:OLP76564.1};
OS Symbiodinium microadriaticum (Dinoflagellate) (Zooxanthella
OS microadriatica).
OC Eukaryota; Sar; Alveolata; Dinophyceae; Suessiales; Symbiodiniaceae;
OC Symbiodinium.
OX NCBI_TaxID=2951 {ECO:0000313|EMBL:OLP76564.1, ECO:0000313|Proteomes:UP000186817};
RN [1] {ECO:0000313|EMBL:OLP76564.1, ECO:0000313|Proteomes:UP000186817}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP2467 {ECO:0000313|EMBL:OLP76564.1,
RC ECO:0000313|Proteomes:UP000186817};
RA Aranda M., Li Y., Liew Y.J., Baumgarten S., Simakov O., Wilson M., Piel J.,
RA Ashoor H., Bougouffa S., Bajic V.B., Ryu T., Ravasi T., Bayer T.,
RA Micklem G., Kim H., Bhak J., Lajeunesse T.C., Voolstra C.R.;
RT "Genome analysis of coral dinoflagellate symbionts highlights evolutionary
RT adaptations to a symbiotic lifestyle.";
RL Submitted (FEB-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OLP76564.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSRX01001988; OLP76564.1; -; Genomic_DNA.
DR OrthoDB; 1707450at2759; -.
DR Proteomes; UP000186817; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.470.20; ATP-grasp fold, B domain; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR011761; ATP-grasp.
DR InterPro; IPR036400; Cyt_B5-like_heme/steroid_sf.
DR InterPro; IPR010869; DUF1501.
DR InterPro; IPR014917; DUF1800.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR43737; BLL7424 PROTEIN; 1.
DR PANTHER; PTHR43737:SF1; CBM6 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF13535; ATP-grasp_4; 1.
DR Pfam; PF07394; DUF1501; 1.
DR Pfam; PF08811; DUF1800; 2.
DR SUPFAM; SSF55856; Cytochrome b5-like heme/steroid binding domain; 1.
DR SUPFAM; SSF56059; Glutathione synthetase ATP-binding domain-like; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50975; ATP_GRASP; 1.
DR PROSITE; PS50800; SAP; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|PROSITE-ProRule:PRU00409};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotide-binding {ECO:0000256|PROSITE-ProRule:PRU00409};
KW Reference proteome {ECO:0000313|Proteomes:UP000186817};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 575..589
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 1034..1068
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT DOMAIN 2430..2636
FT /note="ATP-grasp"
FT /evidence="ECO:0000259|PROSITE:PS50975"
FT REGION 1..136
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 479..571
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 599..628
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 977..1005
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1169..1196
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1477..1552
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1593..1621
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 2272..2299
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 13..28
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..136
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 493..527
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 549..566
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 608..628
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1490..1552
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 5113 AA; 563233 MW; 685D8C8538CBED7E CRC64;
MCRGVLTSAS SRKRNAPEEE EDFPAMQFGM KRRRNAAEAT AEKKRVEAER RKAAKAAAEK
KRAEAERRKA EAAEKKRAEA ERRKEAAAEK KRAEAERRKE AAAEKKRAEA ERRKEAAAEK
KRAEAEKKPL LKRRKRTEQG KEFMVAAGWL EQRGAFGGGS KGFYSGGAGP GVWVSASDIL
FGSCRWQLVG AWVPTEGEQG YDPGTMDQSF NGPAPADAGA QPDGAAAGGV PLDQVDLVVR
RRLEQALNGV FGRLLQTTER AAAAAEVQAN VTKSETLLKS IKCEVWRPAS REEELKTWRE
WWLQFSTWLL AVEPAYEKDL QEMEIDTPVD SDLMDDATLA RSQRLYAVLC SLVKNRPLLI
VRAYDSTKEG FEALRMLRRE MEPKEKTRSL ALMRRLAAWE FHGGQGLYEQ LIKYEEALKQ
YETAAGAPFP SELSLATIVT GLHEPLRSQV QMRLTPTTRY QDIREWVLQY ESLNAPWSSS
LLGGAGNKAR GSKDEAQPMD IDRIKGKDPK GKGKDSKGKK GKAGKNDGYK GWGKWTSKDY
NTGAGKGKND KNNKGKDSKG KPGKDGKGKG QGNICHLCGQ AGHWKNECPW KASTGAGKHG
VRQVEETGQG ANPGSSSQST VSTSASAYRT PSTINQIRAA SAVEMSTPPG CKEALVYDMT
GDDGDLEDFA LEEPGLMMIE AVPVFGQGVP VFFMDATDDD GVWTFAEGAE IEPNEDIHDE
GPMVISMVRA TGAVEVVVDS GADLSVAPLS YANRGQHAAA PRVIMQDAQG KRIYDKGARN
LSLEVETGDG GHVVLKERFN IANVGSVILS LGRLLRNGWN LGSEGGQQRI TRDGVSIPVA
LRRNTLVLSA VIGAISLMDS GPLPPELEEL GGHQGWHILP SGLPVLINNN VSEVPLENSI
WSQDDWAWVT CFIRKEPATR YPSPGDVWVQ AFTVTTEDFE RMGRTMKEID EELGERHDVV
LLFHVEEIPK NLLSEPGSFF KDPDLDEPPF LPEGERDTGG GVGADVEDVL GERDLRGHEE
EGEQDGVLDD VKLSVATPLK ELKLLCDRLG VAKSGSKNKV LKRLRDHRDV MERQLATDVA
KQLYKDRDRD PVAMKTPVLP SARQQELHAL THQPFAPWCA ACVMGRSRQS PHAGETGANQ
GGDGVPGPIQ CVLQIDYCYT FTRSKGEEMD VEGANANSGE APTGDDDAPP EAPPDYQDQH
GLNMVCCEST TGWVMALPLA AKGTTSLKRT TEALTRLSLL IADRDEVVIQ GDPEPSVRQV
LNAFQACRTK LGLKTIVRET QRGSHASNGA AEKAVSTIRA LARTLKAELE SRLKVEISGH
MAIYSWLLKH CSFLHNRYFV TSKGLTPFEV VHGKRYVGKL LVFGEQCIYY AGSRYKGDLQ
WRQGTWVGIN ERNGANIVLT ENGAKESRSI RRLPMESQWA ANAAANALGL PWAYGGKAKR
KKPIYSSQRV PLLPDAASLE ELARAAGRSA AEAIASSTPL GGIPATPAAP AAGNNNNDEV
EKQTQFQEQQ VEKQTQFPEQ QVEKRTQFPE QQVEKRTQFP EQQVEKQTQF PQQVVPAVQP
APWTQVHGDT LTQDEAQWEE WIDEVGAVLD AEHHSEEPEW DEHGDRPPDL SPADLEPLDH
KSDRAEVERL ISMGALRAPR PEEDIGGYAR LTTKIVRDWR KRPMWVRRSR LVAREFRTWA
PWTEELFAPS SSLGAVHQLI SIAMAKGLEV TTIDVKDAYL NVPQKAPVVI TVDRSLVEDG
GVGEAPFILE RLLPGQRVAA AEWFQYVKEM LLEADLENFE KEPTLFRGTG LDDDTNLILH
ADDGILASTQ KAREKVLGVL KRKVEVKVSD PFGPGCELEF LKRRYLWCSE GATMMSGTKH
LEGLLNALGH DIKERDAPAD TSFVEEDKGD ELTDSKRKLY QECVGRLLYL SHTRADIQYS
VSVLAGKMAK PTTTSMRWLV RVAGYLKRVP DLGFLIKPLV PDANLEYSGQ GKLVAGSTVV
LESVSDADWG GCKRTRRSKS SAHFYLAGGL IASHVRTQKS IALSSGESEF VAMVSGATEL
IFLKDCLGFM LKNKMTIEAK VRSDSAAAQG MAQRLGTGRV RHLACGMMWL QASVKSKMLK
IGSISGSTNP ADLGTKVLAG PKVRQLLYYA GARLEDGTAY GQLEAEEAAQ RTRINQIVTN
GVKKPSLKHV LPVLLVLSQI VGSDGASFEG LGLGVAMAGL EDTVAEVSYL ASAVLLRTSI
LLVFPCGLAY VIWKLLGIFF RSGKGSTKEM AVQASMRSRA EQAWADEYTD KVNYLNQILA
EERAEKEQME NALKKRFLED SVVGGALVDG RSTLRWLPWS TDEPAKPSGV VLVDGYGGTK
AMLQEKAAAR GLLVAELWSP PHPKPAPKGP ALKDWLKKLP FSVSGVLAFS DPHALPIVEE
VLSLVGLTGS GDVAADALLE QARRNKHLMQ VVLKDAGLTS CKWVLACSES EVRTFVEAEL
SQKAVMKPFL GGSAGNNVTL LDYGNRPVES EALVRQVFRS RPAGAEFMSV LVQEHLAGDE
YAVDIVVRDG AMAVMGVFKY DVRALNGAEF VVFGVLIEPI QKGSPEEAVA TYAMESLAAL
GMRNGPAHVE VKMTAENGPV LIEANCNRFT GLDMIFDVAE RAVGYTAIDA FLDAADDRVS
AEEWRSRYPA LPSPNLRAYG GVAQMHARES GIYEGGDEEA AAEIMELDSV FKLESRDIAV
GYPLVRPSDS DIVAAKWLEW DSEREQEMQQ EQEQDPEQVD FGKVVDDFVE EFYDQCWQGY
RFVGKGNAVV TYVSRLPGTM EGNRCYVKVK IDQEEHAFDR ALSSFALNMD FPPDYRYDTT
VGAEGLVGAG ASTAGGLSGA LFSGSSGGLS GVSAGRRGTR HSHGSFTHMP MDCAFPAAAD
FANCTGEAAQ GPQEHRRCGG LQPPRALACN SESMCSVLRR PTHSPRESST QPLHFWLSHL
LFVPRVSRDT SRRLYFMAGT MTTLKLLLAM WCGCQSATPP PGACTTGNSG LGLIQRSPAL
KKFELDLASQ EPEFSPVDGG DGRACRGATT GDNKAEYFQI QSSDSIEGCK ALCEGTVGCV
GIEFHTSGRC EVWTREGGIH ASIALSGYTC LKYGVDETVG AFLPVDGGIG RACRGANGND
NNPAHYTIES GLTEMQCKKL CMDTSACVGI EHAGTRCEVW TRPGGIKASR PASGSKCFKL
EFTKVEGSDN MACRGGSAND NNPSHYSIKT TASIDQCKET CRMSSACVGI EFINTRCEIW
TRPQGIEAGF PLAGYTCLKY HGGLKPPAST KQQRDARFLI QATFGPTLAS INNFNVSYQT
WIGQQMAMKP GYHREYYRKR VNPRPVRSAS SLRSGNLLSR CSPGSRWVNH VILKSDRYRK
IKVRGNKIFI DDFFRSDVDP AYLGNGLRQP DTCTDIPPKD WEGKGYTCTN RRYWVDDNCI
KDEVWVEEQY CQKSCFEENL AYDNMDCSPG WAGMDYEGYI CTVDGDATRA WVKLNTKESC
EGSSTETYML NPGVWKANPD STMTQPLAFE VFKPGVLMLT EPPSKCNLGT IIRSSREKPN
QFYMLEERLE LLENTLESPS ATGVSGGKCP TVARTFLNEH TCRLLPGCLP LGQDKLSVAL
TTHNLAKFFT VGGRYVYEVS GLITTKTPCG KTSRWKKCSS GCIATSGLSD SDIDKIADAL
QAASEEGDVR DIDVSCSNVD AESIVAVGSD LFQHVHKDEG SVYDFTDWVL QHPGGAAKIK
QFTGMGYKLI YPSWHPMERW DASFATQVIQ PQYVGKFGSH VQYHNLPATL QTEEIAQALG
AASEATEYSA VCGSPQEVAN DPDMGHLLSF KHGGSDDIYF DQEYHFWWNR DRRAKQMIWT
MHALYADDQL RQRMAWALSQ IYVASVSGLG YSERVEVWVN FYDIFVRNAF GNLRDVLQQV
TYSPVMGSYL TYKGNRAFDE TGRFPDENYA REVMQLFSIG LWKLNPDGTQ MLDSSGQPIP
TYDNEAIMDF ARVFTGFNDQ PNRGNYEHLD GRNWIDPMRM TARHHDKYPK PDLDGNYIGD
GYPLCSDAAD QAFLVKGAKF SFVGASKHLD PDHKEGALVL DSSSGLYQAL CFESGGKCNY
NLTVELDQTI PCTGKECTTV TVQLLNVSGG LYEYLQPTCV NMFFFNGRIT RDSGRRWGWS
QKCRDPESLL AGIACCDGCK NRTDNWMRNR GYTCENAETT YPNMFTSRCN NSDWWSNSKY
CQLACWQKGV GYEGDDCSFG PWHAERVCNY NWERVRLSTA EAVCAAKGMK LCNEKLEGYG
CNYDSMQVWT QEPCSYEVIV SDEGKVASNF TSKTRNNWIK AVWKNGYPKM QNGQCPSGCT
PFGSACSCSM TVEVRAVFTS TPTPSQIQQL KIGAFPPTTP CSTSCSGPVK AYQKDGKFDE
FTVFEYMGSF YSNKESVVVL PGFELRNPPN FMPHVYMSEH HHPKHALAEV EALLDHLFKH
PNVAPFISYR LIQRFGTSNP SPAYVHGVAD AFRKGSYGGK TYSGKYGDLG ATVAAILLHP
EARNPTLASD GSLREPFLKL IHIMRALEYK DDLGSMPALR ELRDRIGQQP YRQPSVFSYY
LPDFKPDNFP EGKVGPEFQI FTPPAIIDFL NGVNSLVDRG LGDCDRGFGE WAPGCSAGKL
TLGQRDCLQP TVDQLDLLLT GGRLHNPQFI RAAYEEANGE DRYEVATQAV LLSPEFNTIG
TPMPMGPRPP QPPKPVEVPG EYKATVMVFL NGGADSFNML VPINCPLNDE YKQVRGNVAL
NNNELHAINV ANQDCAEFGI HHELKTFKEL YDLEELAFTA NVGSLVESLD KASYSSGMGQ
RCVGLFSHSD QQRAAQTLTC QSATSAMRGA GGRLADALAS GSKKLKTMSF SLVGSQPWPQ
GIETNAEIIS NGEPVQLRKK KQLQVIIDNI TSVKHGNIYA EEYSTRLTAA LDFNEQLTRS
LRNAELATSF STDNRGFTRQ LREVARLIAS RTERGVDRDL FFVQLGGFDT HSSVERNLER
RFDELDHGFQ RFVAEMKEQN MFDKIVMATH SDFARTLTPN SGAGTDHAWA GNYVIVGGGI
KGGNVFNQFP KSLLPGAEQD AGRGRLIPKY PFENMMLPIA QWMGLETSQE TMVFPNLNNF
NSSHFISEST LFK
//