ID A0A388K818_CHABU Unreviewed; 1438 AA.
AC A0A388K818;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g57083 {ECO:0000313|EMBL:GBG66204.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG66204.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG66204.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG66204.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG66204.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000070; GBG66204.1; -; Genomic_DNA.
DR EnsemblPlants; GBG66204; GBG66204; CBR_g57083.
DR Gramene; GBG66204; GBG66204; CBR_g57083.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 1.10.8.10; DNA helicase RuvA subunit, C-terminal domain; 2.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR InterPro; IPR015940; UBA.
DR InterPro; IPR009060; UBA-like_sf.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF00627; UBA; 1.
DR SMART; SM00165; UBA; 3.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF46934; UBA-like; 3.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50030; UBA; 3.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 1..214
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 452..565
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1152..1192
FT /note="UBA"
FT /evidence="ECO:0000259|PROSITE:PS50030"
FT DOMAIN 1224..1264
FT /note="UBA"
FT /evidence="ECO:0000259|PROSITE:PS50030"
FT DOMAIN 1286..1326
FT /note="UBA"
FT /evidence="ECO:0000259|PROSITE:PS50030"
FT REGION 1059..1099
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1328..1388
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 647..687
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 740..767
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1067..1099
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1438 AA; 161270 MW; ADB96C52BDA4CA1B CRC64;
MEEEEGLEVV GHGGRVWVME EEEGLEVVGY GGRVWVMEEE KGLEVVGYGG WVWVMEEEEG
LEVVGYGGRV WVMKEEEGLE VVGYGGRVWV MEEEEGLGVF DYGGRVWVTE EEGELEVVGY
GGWVYGLYKF VVMPFGLCNA PGTFQHAMIR IFHDYLDTFV IVYLDDILIF SKTVEERVAH
LDEVLSLLRQ HKFKINGEKC EFGRTRVLYL GHEISAEGSK SDDGKVANIH DWPHPQSVTE
VKSFLVTTGY YRNFVENYSI VAAPLTDLTR LDTPWKWTER CEAAFRHLKH ALTHYEVLKL
PDLDKPFIVT TDASQYGALM TEFNLAGNVT QYLVEAYRED LLMSEIIRSL EAKDKVTSAE
FELVNDLLFL EKAGNKRLCV PNGESLCSLF LGECHDATGH FGYKKTAANL LQRFWWPTMM
RDAQLYVETC QVCQRDKPHT QAPLGLLKPL PIPEQPGESL SMDFMDTLVT SKSGMRHIFV
IVDRFSKYAR LVAMPETAST EYVIRMFKEN WVRDFGLPKS IISDRDVRFT SELWKAAVAE
QGTQLQMTSG NHPEANGQAE QLNRACHVSS ATSATVLRQQ HCHVSNSAMS ATVPRQQCHV
TSTTPAVPRQ QQCPTTMTVS VLGMPGQLAK EPIAEYRQRF QAQLAPIEAE EQRQAAAEAA
CLQAEAAAAA EKQRLQAEAD ADTQARRKEA QDLLQRHEAA SIEKLKFWHF EPSEHHEDAT
PEEQYKEFLA KLVTRLVYTC NHLQSELANL QQAVRNHKDL HEDATRALDS RVQDLEQVAP
RPDVGESISA PSTRQLEERL DHVVAMLGDI STFAAPATIS KQLDTLKTEV QQLHQLPNKD
GNTSAQHYKM PTFRIEKFDD YTHQDPVPWW EGFTTELRIL FVPEHSYIGA LFLNSKGGCQ
IWLNHLATIH GVQVADLHKK ISWDELTKLW KKRFIVDDAL ALTINRLFSM TQGNTPTRDW
LTEWQKIVAT RDLELPFSHL RRKLYNRSCA ALSLALGDRE QYTTFAEIID KAREIIKTNR
AAAHEKSAWQ PTYVEKGKFG PRPQHVAAVQ PDNIVEDPAA TQASREGDQV AADQPRSNNN
SRGKGKAKTT SPAGNGQSTP WVKFHLTEAE YKWRSRYVRL ELLEGVVAFY AGDLESARRI
LSSAKDKFHQ LQVSNEALVT LLSMGITSRE ARRSLRVSGQ DPCRAAQFVF EQRQKMAEKV
EEDRRLWRER REQKSYGKTP TGKAVDLAKL AEIESLGYPR ALAAEALRQA DNERAKALDI
LLNPDLLVAL EVAICKKKGK KRSREHVDEV SLAILVSQGF SRSEAKSALR ETEGDVEAAL
AGLVSKATDA SSTAATGGPS GEGTCGSEAG GAGEASQTRD GEAVREEDFL SADEDDEGTM
NEVRDVGMED EIARDISGDP FSEYDVDMAE EGKAIEEYLA LVESSCTSAR GNLHAPSS
//