ID A0A1Q9D440_SYMMI Unreviewed; 3956 AA.
AC A0A1Q9D440;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=Gypsy retrotransposon integrase-like protein 1 {ECO:0000256|ARBA:ARBA00039658};
GN Name=GIP {ECO:0000313|EMBL:OLP89925.1};
GN ORFNames=AK812_SmicGene28588 {ECO:0000313|EMBL:OLP89925.1};
OS Symbiodinium microadriaticum (Dinoflagellate) (Zooxanthella
OS microadriatica).
OC Eukaryota; Sar; Alveolata; Dinophyceae; Suessiales; Symbiodiniaceae;
OC Symbiodinium.
OX NCBI_TaxID=2951 {ECO:0000313|EMBL:OLP89925.1, ECO:0000313|Proteomes:UP000186817};
RN [1] {ECO:0000313|EMBL:OLP89925.1, ECO:0000313|Proteomes:UP000186817}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP2467 {ECO:0000313|EMBL:OLP89925.1,
RC ECO:0000313|Proteomes:UP000186817};
RA Aranda M., Li Y., Liew Y.J., Baumgarten S., Simakov O., Wilson M., Piel J.,
RA Ashoor H., Bougouffa S., Bajic V.B., Ryu T., Ravasi T., Bayer T.,
RA Micklem G., Kim H., Bhak J., Lajeunesse T.C., Voolstra C.R.;
RT "Genome analysis of coral dinoflagellate symbionts highlights evolutionary
RT adaptations to a symbiotic lifestyle.";
RL Submitted (FEB-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OLP89925.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSRX01000737; OLP89925.1; -; Genomic_DNA.
DR OrthoDB; 2958334at2759; -.
DR Proteomes; UP000186817; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR47266; ENDONUCLEASE-RELATED; 1.
DR PANTHER; PTHR47266:SF28; GYPSY RETROTRANSPOSON INTEGRASE-LIKE PROTEIN 1; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000186817};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 820..835
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 2216..2382
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 697..720
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 761..809
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1026..1065
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1393..1517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1556..1587
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2530..2562
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2574..2657
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3196..3221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3402..3425
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3880..3956
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 699..720
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 781..802
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1028..1055
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1406..1433
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1469..1484
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1485..1513
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1562..1577
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2640..2654
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3402..3419
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3904..3926
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3940..3956
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3956 AA; 438725 MW; 08A5D7422D5F7934 CRC64;
MGAADLSPFF LATNRKVKDG RGPRKAAVLF KGPADSDQGS GSTRWLSRWA ETAGDPDGKF
VGDWLARGKT ALMILVTLPV LLVLAPQAGT VLTDEEKRDF GRGVKMKGHR SFVNYASAAK
EKHVALADLA PAVTEAKEGR GPFNPRSWPQ IDRSSVSPPV STGFAGAENV AGSLVAGASG
LSVEVRFQLD TRLAGMNQKE LQLHRAELFQ VLDSEDKSAC DASDPAGAAG RLQAMTWPRG
FGHPYATRYQ LRKRHWAATP SACSAWEVFP DAALPKELPG VGPHSKWYKV SVPDLVATAK
ASSCELVQAS VRVGDGFRWS PWKACKEVPV LLEKPRAPEG AEALASWKTV CRVEWPLAIA
HAGIQEVEYQ LLVIPDSAHL LPFVGALVVA PAASEDSKSG GASMAYLVAG QLAPEPGDDP
EWTLGPNTWP ADLCVMHQAL NRLEEQALRQ VRLEPGIRIR DILMEFPRPL EFLHQVIGIK
EDPPLLFWLR DLLKEFLEAG DPSLQEAWHR LQRTLHRCPY RGKEKYTIVE RTTRDIAKIA
DQIPMSELLS ANGYDLVLTA LMTKYRPYLD IAGPASVDKF FFAGDRGKGE SFANFIATKE
VARQELETNL GERLNDKVAG RILLRQAHLS ELQRELISLR DQSTLMGFDE IANMLRPLDR
PEMIAQAANA ELGTSASKHY PVIYHDMSEP RQGQVEYEEI NGEESEEEEA QEDDDEVQED
GEEIQIYFED RTYDEEESLY LAAYHSAYAD VRKDLKDRKK ERGFIKHNKA PPQRSRSFPK
GSHKGQGRGR FFRGRSATPH RKGGPKMIKG SMEDLQSRTK CYNCHELGHY ARDCPLKGGA
SGKGANSDRK VNFVVSRGPG GGQVFVHQSP WMRVANTSAG DSGGPRRTIS VFAGIQVRGY
EALVDTAAED AVIGARAMDL KFPVLGGEAK LESVQDVPIA VAGIHALLRF HVLRDGEFTT
PPLLPISFLE AIGSNIDLPS ERLNTAAGYS TPMTRLPSGH RTVSILDFQG RPWTLPSQLR
VADEDPFQLQ RPSSSWTGAS STAAPTSQRT PSAGSSGHED FRGGRSNECV NNAISAISGF
GGPTPTSLVA FRAGEDVEAK AWDLMQNSKW SMQDMDELLH ALHPARLDHF RMIMRKAPRG
KELTRSFSTV FGVYAHGAFV GVTKSTSKYP CLVKYINSWL RHHAPEASFS SFSIGANVQA
PLHRDVNNEV TSKNLTVSFG KHKGGSLWIQ NDDSEVFSKQ SKTVKGPQGQ DLHGSLVRTQ
HRLVSFSPKL WHEVTSWKGS RISLTAYTSR GVHTLKSTDE RQLRDLLFPL TAPQEIPVSC
ALVEFQDDVP LESQCRHAEY DLEHVFDEVN LSCRSVPSPS ASWFERMISR AAKWITNSYS
RPSRPLRKNV LAETAGGPAG GQRDAAERDP SQEEVGHCQG GDGVEKDHGP NLDDVGYGEA
ADGGGRVLPA GPDRGQPEGE SFIVSSDHPK ASHQEDDKRQ PGDRTTVGQI ESSQDLRPRA
ECLSTSPRQA TLSGELGVTM VDVHTMRESM AEERRGGCHP GQAAADLRER AEDRGITRAG
VSEIPPSSSQ PTSPRRQDGR AGQLRETKNV GFGILERDWW NWSDAGSHAE NEQAAQSCYT
AVALKAVNTE TYEINTDDEA DTDEASSITK SSPVPLRKLR SELQRQGWIV KTLLLLITWA
GNGWQAPTSE FLGTPRLRMD WSAEKQSWES LPLEPEAINP PRTSTSTSWC CYIYEDKMQK
FNWLLDYVGH SMKNKFDPSI KVINKATREH LASQVRRLAG DNECQSGAQP AFGNAVDLNR
GWDFSREDHR QAALKTVVAQ RPAIALLGRG PWDEKYSMTK TTVHYILGVD AVDSESWLIL
EHPSQESMLQ MQSLLDRPEV YVLRLQQGGR EKDSEVMLVT NLDFLPELLQ KRMNQRPLPK
TSGGFQDGVG IFVEFLLDDI RKHVKLDGST TTPLRWRLTP FYLTCHHFQP RRHPTTPEQC
GSLDINHLKF TGKRIMEQHF VGQPSKTIVD NWRLDSSSPA SRPWTGITSF ELELQTILPS
SYKDYATWLA AGVAHHLFAY QAAEAAFQRE WLSVFPSHNI LGEEARANRA ASSSDDFDLG
LDNVEAPIRE SEAQVFRGKD RAHAMELDSV DEAVVRNELR EMEVPEPPEL VVSKAEVPPP
PADIRREIYR LHRNLGHPDN ATLVRALKRA GVKAEYLRWI RRQFQCPICK EKKPPAQQRP
AHLAARAMGF NEVVGVDLFF LDRKIFLNVV CWGTNYQVVE LIEDRTSSTV ALAMARSWMA
HCGPPMMVVC DQGTEFTGKD FVDIMADNGT IVHYTDTASP WQNSRTEKAG GIFKSRLAKI
CQDAAVTTEV DYRIAVAETA MAHNRYYDRS GYSPQQRVFG TNLRVPGSLL SDDVVDRDLL
QQPQSDYMKR SAEIREAASK AWMERQDFEA INRAVKTNSR TVDALQIHSG DRVYVWRTTP
EFRGWSGPGV VIQTTENGRS LWISLRGYLL KASREQTRLA SSEESFGAEL RKVLAKSMLE
DLDKGTLKHY RDVQSEGPPV DDIEEFASSE YAPSDTEELS RDEAEQLGIQ PFEQAPMAGD
PSMPAIPEEP VPMEEEASTR MPSEPSPSQP PSAPASRRSS IRVDEASSGE MIFGPIRETR
QPPMPYPMTS SVPSWPSPGQ PHAYLEVTID DDPSGKVKWW SDKAGNRKLP IPTSKRTFSK
EEAQASFNFR EKKMFLSKKV DPPGHIDFRR LPENLKKVFR KSRDKEIKSL LDSGAIKVLS
LEESLQFERD HPNHVLTSRY VDRWKPSEDG ATLPDQFDAY DASLADNGVV APKSRWTVVG
WRDPEVHAIE RSAPTPLTTS IYLAMQTAAT KRWKTFVRDV KTAFLQSMPT TRKQKLAVRM
PSCEHFPSYH PKQLILLLTE IYGLVSGPAW WRRSLLSVLV KDLGYTVSPY DRCVLILKAD
PSEKLEDQKS TQGIVVIEVD DVLEAGGERH RRKMLELESR FRFGKVTSLM DSEAGSGYAG
RRLKQSKDYG FTYSMSDYVS NRLRYVDVTR KVLKKTAETT KLTADEESQL RGVIAALNWA
AREGRPDASA AASILAGCFP SPSMADVMAV NQVVQIVKAR KVDLVIHPIP EDHIRHVVIS
DAAFDPSGKC KPQHGWLQGM TTPALNCGER APISLIAWKS RRMKRKAGNT LLCESIALST
AMGALERQVA TWESFTKSDY DPREGAEDEE DDYDSPTVLA TEDPRYLDPK SIAIADAKSL
FDALHSEQSH GDDDRSALEI AIIQQSLQRL RGRIRWVPHN ENPADGLTKL IGAHMEPMYR
LLQSNQFAVE KEETVLSRGK QSDCRLKRGL ELPWYVGKQA TLKSRREGEG EGDDGARRVV
LELAELCPEL RYAFQVFARY PTVGPRTFHK IYEVGSIARS TPEDEVRELN DKERKGKGNK
GVSPLLMNTK PPQAAVALPL PVQVPCSEES QETMQRWLQD DSPLVLLTWD GLLPEAANPR
QASLVSSAAE SLNPPFEVQA AADAEDAEEA DAAGERPSRQ WFHCPVVTKP FALKKGDASL
PCVAVRELPF RAGRFRLFDP SRMQAGPCTR PMVCLYEQLE PCKAEMLALP RSADSIPRSL
GILLQISFAG ASGSTRTARL AGLLQLRPRG RRGLLGGTDG SVGDIHSIGT QELPAEATFR
EGPKVVREED GLELGNVYEF QVRVGDECRM GPWSKSSPPV RFALSPPVPC EGGGLRILEK
GDRAEVSWAP FQPDAASQAQ LPNLARLPIE YTLSVFAEPG EPGERLLSSL STTSTSCCVA
NLRPLSAYSA SLSARWSRFG VAGDSDSARL FAAFATTGLK GSKLTAELSV RLGAESMGAV
PAVSPASRAS IPIVEGGVPA AVTLDLDPYY TQPRLNHYTP EFVRKPSLPS SRRTQAQDSP
EERPLSPRPQ PTLPRVLVPM PPPKFTTRDP LSFALRPDPR RCALCEERSG ERRQRV
//