ID A0A388LZF6_CHABU Unreviewed; 3457 AA.
AC A0A388LZF6;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 13-SEP-2023, entry version 21.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g45840 {ECO:0000313|EMBL:GBG87686.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG87686.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG87686.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG87686.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG87686.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000627; GBG87686.1; -; Genomic_DNA.
DR STRING; 69332.A0A388LZF6; -.
DR EnsemblPlants; GBG87686; GBG87686; CBR_g45840.
DR Gramene; GBG87686; GBG87686; CBR_g45840.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0005929; C:cilium; IEA:UniProtKB-KW.
DR GO; GO:0030286; C:dynein complex; IEA:InterPro.
DR GO; GO:0045505; F:dynein intermediate chain binding; IEA:InterPro.
DR GO; GO:0051959; F:dynein light intermediate chain binding; IEA:InterPro.
DR GO; GO:0008569; F:minus-end-directed microtubule motor activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0007018; P:microtubule-based movement; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 1.10.8.1220; -; 1.
DR Gene3D; 1.20.1270.280; -; 1.
DR Gene3D; 1.20.920.20; -; 1.
DR Gene3D; 1.20.920.30; -; 1.
DR Gene3D; 3.10.490.20; -; 2.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 6.10.140.1060; -; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 3.
DR Gene3D; 1.10.8.720; Region D6 of dynein motor; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR035706; AAA_9.
DR InterPro; IPR041658; AAA_lid_11.
DR InterPro; IPR042219; AAA_lid_11_sf.
DR InterPro; IPR026983; DHC_fam.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR041228; Dynein_C.
DR InterPro; IPR043160; Dynein_C_barrel.
DR InterPro; IPR024743; Dynein_HC_stalk.
DR InterPro; IPR024317; Dynein_heavy_chain_D4_dom.
DR InterPro; IPR004273; Dynein_heavy_D6_P-loop.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR45703; DYNEIN HEAVY CHAIN; 1.
DR PANTHER; PTHR45703:SF8; DYNEINS HEAVY CHAIN; 1.
DR Pfam; PF12780; AAA_8; 1.
DR Pfam; PF12781; AAA_9; 1.
DR Pfam; PF18198; AAA_lid_11; 1.
DR Pfam; PF18199; Dynein_C; 2.
DR Pfam; PF03028; Dynein_heavy; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF12777; MT; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Cell projection {ECO:0000256|ARBA:ARBA00023273};
KW Cilium {ECO:0000256|ARBA:ARBA00023069};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 243..424
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 775..935
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 30..99
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 265..295
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3285..3333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 129..177
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1648..1710
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2877..2930
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2965..3016
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 30..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 79..99
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3457 AA; 393797 MW; 3DCD35B951926D76 CRC64;
MVDTRRGTST IPYTKEEEAK MVAILQERKE KREAKKKALQ EEQAAKLKKL EEEMAREKER
IKKEEEEKLK EVEEEEDEGP PLERRRGQHS GSKGEEMEKR ISEWVANLSL GEEEEVAMYI
PKDDQEPALK KWEAEKDVLK RQAMEDEQRM EWKLAMMREK KRRVEAASEA VKELEEVQKL
SLKLTPQVDL ERTVEILAQS VERLAKIQEQ QYEFSRSQDI VVRSMRMGFR DFARELIGAV
RAEVNHRLEK TERFCVGAIE GVKAASPKEE EPRPPRRESV KVKFPDSDSG KREENFDNWE
ANGAKYFSKI DLKSGYHQIE VHPDDQYKTA FRTRYGHYKF IVMPFGLTNA PATFQRCMND
LFRPWLCRFL VVYLDDILVF SRTLEEHQGH LRQVLEKLRE ANFKINAKKC DWAKTQVLYL
GQVLDGDGVK PEDSKIAAIR DWPMPRTMTE LRSFLGLANY YRKFVRNFTT IAAPLRRLLR
KETIWKWDKD CTSAMKKLKQ LLIEYTVLKV ADPSLPFVVT TDASQYGIGA VLQQDDGNGY
RPVEFMSARM PSEKVATSTY ERELYALRQA LDHWKHYLLG RHFKVYSDHE TLRWLKTQAK
MTPKLTRWAA EIDQYDFELK PFKGKYNVVA DALSRRADYF GAIVHYLDIG KDLQQKVREA
YAQDPIYSEL LEKVKEAPET EPNYRTTEGL LFEKTNVFDR LCIPNSEEIR SLILGECHDI
EGHFGWQNTL ANLMRAYTWP GMKNDCVKYV RSCKVCQRNK TRARAPLDLL RPLPIPDQPG
DSVSIDFMDT QVKSRHGKSQ VMVIVDRFSK YAVFVPLPSE ARTDLVIHRL FDCWVSENGI
PLSIVSDRDS RFTSQNWQEL MGVYGSKLLM SSDRHPETNG QTEQMNKILQ QVLRMYIRPD
QINWDEMLPK VASAYNNSVH LSTCRTPNEL HKSFQPRRPF EGLNRDQIQR LPPGTREFAV
QHEKELTTVV ENLRKSQHRM IEQANKHRRP SQFQVGDLVW VSSKEFAPEE NISQKLLPTY
RGPWPVLKVK GGEDGPSYTI ELPAHLHTYP VFHASKLLPC QTSDQFPSRK SMIPPDMDGR
YDIDGIVAED VFRTGGRGRP QKQYKADAAA VEAKPLIFNS FMVQNSDAVP VYIDVEDYGK
LRVALEEKLA EYNGDHVAMD LVLFDMAMEH VCRIVRVLNL PRGNVLLVGV GGSGKQSLAR
LASYICNYEV FQITVTSTYG VLDFELDLLQ LYNRAGLKSI AMTLLLNDGQ IVNETFLVYL
NDFLASGYID DLYKPDDKEL VCNSIRNEVK QAGIVDTRDN CWDYFVDKVR KYLHVVLSFS
PVGDKFRIRA RQFPALVNNT VMDCFQPWPQ EALVSVAGRF LSSIPDVRDE IRENLTHHMA
FVHTVVTEAS VSYLEVERRY NYTTPKSYLE LIAFYKALLE KRRNDLKWKR ERLEGGVNKI
KLASEQVADL QVNLKQELIV VEEKKATTDQ LIVNIGQEKA IVDEQKSSSA RDEEECARIA
FEVAEYQVQC EADLAKAEPI IKEAEAALNS LDKRSLGELK SFANPTLEVV QVGSACLVLT
APGGKIPKDL GWYAAKKMMG NVDSFLTFLR DRFDKDNVPV QCVEKVEKDY ISNPNFNSEY
IKSKSIAAAG LCGWIVNICK YFRIYQVVAP KRARLEEANQ KLAQANQKLA GVRAKVDELE
GRVRQLEEAL MRATEDKNMA EAQVDRTRTK VSLADRLVNG LASENERWGE SIAHFGAKEG
KLIGDVLLAA AFVSYAGPFT APYRERLVNS SWTPDLIERQ IPMTPGAKPM DMLSDDTEMA
KWLNEGLPSD SLSMENGAII SNCARWPLMI DPQLQAIQWI RTREGKNGIK IIQLGQPKYM
EVVENCIENG LPLLIENLGE YIDAVLDPVI SRSVVRRGKS RLMKFGEKEV EYDANFRLYL
HTKMTNPHYK PEVAAQTTLI NFSVTERGLE EQLLALVVNK EKPELEIERL NLVRQLNDFK
IQVKELEDAL LFKLSNSQGD ILEDIELIEN LETTKRTSLD IQQKVKLARE TEEGIRTARE
IYRSVAARGA LLYFLIDRLD VLDHMYRFSM ATFIRVMDKG MGLAKTSPTY LVERVAELLD
TSSFAIFKFV SAGLFEKHKL IFAMQLTLSI MRQAGELPPP MLDFFLQGPK NIDPEVAKTI
KSPMADWMTN RMWLCIQALK EIDEFASLPD EMISAAKRWR EWTEMERPEA EPLPGDWKRM
PEFQQLLIVR ALRPDRVTSG ASRYIGNKIG DKYAQSYPFN LEEAYADVRA GTPIFIFLSP
GVDAAKAVEN FGRKMGFKQE NGLYKVVSLG QGQEEPALKA VKHGQETGGW VLLQNIHLTP
KWTAGPLEKV IDKLGEGVHD KFRLFLSAEP SHEIPVPVLQ NSIKLSNEPP EGVRPNLLRA
LGNFSDEVYD SCTKPNELKI IIFTLCLFHS LLLERKKFGP QGWNQNYPFN TGDLTSCAQC
ASNYLENNIR VPWDDMRYIF GEIMYGGHVV DDWDRRTVSA YLSMYMKDEL LDGMELFPSF
STPSSILHQA ELITYVFQNV PAESPIMFGL HPNAEIRFRL AQGDKLMDEV LKLQRLTIGA
AGVMSVQDKA KLVLDDVMER VPRSFNMEDI NSRLADQERT PYTVVFLQEI DRMNLLTAEI
VRSLQELDLG LKGDLTMSEP MEKLMFALAE DRVPMTWETL AYPSLRPLGS WVTNLLDRIQ
QLADWIREMT LPKVTWISGL FNPQSFLTAV MQTTARKNDW ALDKTVIQTE CHVSSATSQC
QISSATSQCQ VTVSRQQCHV TVPDQQCHVS SARSQCHVRL SATSVVTSDT VPRKQCHISS
ATSDTVPPRQ TQCHVRHSAT SDKVPHQQCH VSNMTGLPGQ LPNESLAAYK QRFQAHIEAE
EQRQLAAEAA RVQAEEAAAA EQLQLQADAD ADAQARRKEA QDLLQRHEAN SIDRLKYWHF
EPNGDEATPE EKNKEFLSKL VTRLLHACNY QRSELERQYQ DLTQQHQELA QLRRMVQSHE
DATRALNARL LDLEQAVPGP AAGASSSAPS SCQLEDRVYH VVAMLGDIST FAAPTTTISS
QLHTLKTEVQ QLQTTNADGN PKMYKMPTFN LERFDDYTQQ NPALWWEAFT TQLRILPVAK
HAYIGAVFLN SKGGCQTWLS HLATSHGVDV PDLKDQITWE ELARLWQKRV IVDDAPTLAI
NRLFTMSQGN TATRDWLTEW QKIAAVPNLE LPFTHLRHEF YNRFCAALSQ ALGDRELYST
FSEIIDKARE IIKTNRSAAH EKSPWQLTYV EKVRTGPRQQ HFAAVQQDSG DNPAATPASS
DGDQVAAVQP QSNNKSRNNG KAKSASQAGN GQPGQCPWVK FGLTEAEYKV TKKTVDGVDA
PSREGAYIHG LVLEGCGWDD KNGCLAESQP KELYFTMPVI QIKAAPADKV DVKDTYLCPV
YKTSKRGPSY VFAAQLRSKE GSTKWVLAGV AMLMEII
//