ID A0A388M3U9_CHABU Unreviewed; 2029 AA.
AC A0A388M3U9;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0000259|PROSITE:PS50994};
GN ORFNames=CBR_g48977 {ECO:0000313|EMBL:GBG89268.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG89268.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG89268.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG89268.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., Theissen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG89268.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000726; GBG89268.1; -; Genomic_DNA.
DR EnsemblPlants; GBG89268; GBG89268; CBR_g48977.
DR Gramene; GBG89268; GBG89268; CBR_g48977.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR007021; DUF659.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR24559:SF438; RT_RNASEH_2 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF04937; DUF659; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 1698..1796
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 67..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 433..486
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 499..569
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 760..799
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 968..995
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1507..1537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1057..1104
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 447..486
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 519..540
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 773..791
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2029 AA; 221561 MW; 78648E3E1CEE422F CRC64;
MMLRWTFCDE VFQGTQFQAT RHFTQTNYCK DVSDEALYEI ARQSQQKFEA DQMERFARYA
AERGLDVPRT GGARGGEAGR RPVEGGVGGD GGHGVPDPPL GGGGDETEEG IIHVDREARG
PGEEGVPEWE DVPDGISFNA FHNEAWKAYQ HVLLEQPDSS PRAVLPNHCE IASMRAVETH
RAELTEELEE VRQPFWVTGA TLLSDGRKSR DGRPIVNFLA AGSRGVVVYT TINRESEADD
AVHVLRKWVT IFHEFSFGGP QQINAICTDS ASAYVGAARA LVSPSMPLAL RRITWLPCSV
HVCNKLLSDM GTSCDAFVDA ITRALHEGIH TKKRNKLAFE KVVQLVEITA NVRLTEYRRA
GCGYVLPWQR DEGMQDCQAG LELEPVCSGT RRGMTEEEIA HQVALITRDP IGVSSPPFAD
VVFDRRACIF RPYPREDDSD EESVPEAADD PALHIPREID ETHEDLDSEE TRAHTARRAA
DRAEREMLGG EEEFWGPFEE VASTGGPEVQ ATTPTPTRRE SSMPPPPAPS PAPPSPVSPL
QPATATADRE ELGSSLPQRG LLHRGGAVRQ LRLRSPSPGI LQEEGAPSAA AVESSVAPAA
VPDAMIAASV EEIAAAAAAA VLEEMEASLL EEDPPAAGGA AVAGGTGGGA AAVEVEVAAV
VEVEVAAAVE VEVAAAVEEE EAPSVPAVEG EVPGPVEEEI AAQAEVQRGG DDERLMQQFL
TEELDPVIAG MTRGVARGFG ISDSEMGTHL DFDLSMGLPP SCGGATSTDR APSRDEAPGQ
TLTQTARETR MTESPDAARD IMERKRARLL ASSDPRAQAF AWALKEARLR ETGGDCVQGV
VVVGEDVAEG VATEAVDEAM PGGAKAADVA VEGRPQAVDE AVQAGQRRVE GAIDARHVVE
ETATGPIVPF SGLHLAQGRQ SQAVQMAVHG VPPPVGVRAG GCAPTIETRA QKAAREKRER
EGVLFESVRK GKRSVRPNGR RAQTPLERST DIPSSSMAEI TSAQWRVMLD AASEFEKPFF
QKLYDDAVLR EGEAEAAAKA VSIAAQEALL QVPEANADRF EEKLAAAVAA LVRLRELEDF
ESRVTALEQQ NRELQAEVLS LRQSQLSAPR PPNPRLAAVP VPQTNTALVA SAGGTVASAG
TGASSSSGSA GSSALVIVPG AGPSAQKAPV HTGIPYSAPM VDKRAATLPS KYDGKADITS
WISSMRSFFE VMRTPQEDRR LLPTSGKQAR GVEELSCRAG LLPTSGKQAR GVEELSALST
ARGAEQSVAG PSKEPESDQQ IALEALSDAD SKAYTVRVLY TEPWEESKEV DFHAHMEHWL
QLKQQHRVQG SVELTFFKLL INRRYIRVLI NSGSTTNFFS PNGIRKVGLG MKQVELQNPC
RTQVGNQEVV TSTHVVKGVR ITFDKDRAVT HELNFYVMDK CPFDAVIGLG WLKAHCLRTI
WADNQFLVLD AKGNERTVLL DETRESPVTL LSANKFCRSV RRRKEGEFVH IDLVKPFHVP
STFAAFSSSE GKSTSTPVPS TSTMNTQPTS DSETSKPVVI AVNSQSDFLS PDEDDPPPEI
PASIRLLLNR FPEVLAEPRG VPERPVKHKI EIIEGSVPPK GCVYRMGKGE LEELRRQIDD
MIDRGWIRPS ESEFGAPVLF VPKKGGKLRM CIDYRGLNRI TRKNAYPLPR IDDLLDAAGG
CKVFSKIDLK SGYHQIEYGP PKTLVSDRDT RFISKDWKDF TAQVYDITLN MTSGRHPQAN
GLAEEINQTV TQLLRALIVP DQNTWDKELH KVKGLYNNSI HSATGVTPNQ LQYGWPMRNP
VSYLFPERSP GLMSGMPGYN AKYARLLKVV TAAMNKRQHA MIKHANKLRK EVKFKVGDYV
WVKMSEFSDE EGVSRKLLPL YYGPWQIMKV IGDDFGPSFV IDLPPHLRTY PVFHASKLFP
HIDDETFPYR DPMIPRPIDG GHEIDRIVSH EGRGRNKQYK VHFLYHPLNE FFCIDRKELL
KSAPRVVNAY ERQVADGQTA KFAAKLTPFG LTNALFLIYA YDAFCADSE
//