ID A0A388M9D2_CHABU Unreviewed; 1487 AA.
AC A0A388M9D2;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=DUF659 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g52048 {ECO:0000313|EMBL:GBG91166.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG91166.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG91166.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG91166.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG91166.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000878; GBG91166.1; -; Genomic_DNA.
DR EnsemblPlants; GBG91166; GBG91166; CBR_g52048.
DR Gramene; GBG91166; GBG91166; CBR_g52048.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR007021; DUF659.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR32166:SF81; DUF659 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR32166; OSJNBA0013A04.12 PROTEIN; 1.
DR Pfam; PF04937; DUF659; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 2..151
FT /note="DUF659"
FT /evidence="ECO:0000259|Pfam:PF04937"
FT DOMAIN 950..1064
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|Pfam:PF00078"
FT REGION 279..439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 464..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 518..544
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 612..631
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 680..708
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 736..764
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 778..938
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1083..1105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 311..329
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 353..388
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 399..439
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 469..484
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 787..807
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 817..838
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 897..919
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1487 AA; 163084 MW; BD465638CA63CC0B CRC64;
MIRTTLLDEF DVEVQKCVKP VLATAATYGC TIMTDGWTNI RGQTLCNYLV GTTRGATYVA
TDVMRGKKDA TALAQAWLRR LKSLDIKLAD ITAFVTDSAS SNVSAMXVFQ KDESVKHIFW
IPCVAHVMDL ILEDIGGIDW VATRIAQVRL VTKFFKRHSR AREVLQAFTT KALLLPAETR
FGTYVIMMRR LLLLQSHLMQ VVVDDRWKDT VWSTKKIRDD AAEVTACVGG VSSTPNLITD
WRWRGPRSSC DAIGTFGSST GRLCRTMLPV TGHIRMGAGR RCRGGGCGGG RRGGRGRGGR
GGPGGRRSVA SRGRAVDELV RKPRQRWGEG DFLYESSSSD DEDFFGNGRP AQDGDSDFDD
RHPNVGDDDG ASGDDHGDGG DRPRPAHGDT MXADXAGGEH RTXVDDARDG VHDDGSRPVS
GGDLRRLKRG PREKDTIASC VRKRHGGVAS IPLSRAPATG QISVSEDSFS DHGGEERIEG
KTSPLPAVRD EGSVGALHPL MSAGAAVSIA ATSDVGQPLA AAEQESMLPP RSVQPRPPPA
LMEGSQGTVV VCQGDDLPXR DISHTPAAEQ SAADHVGLAC PTDSLPGTGE LLTMDSALVS
LPDFSMLISP GLPHVSPPKP QQPVVMERGS DGFDVAGGDI ADMITSPMVS ATPTIFRVGK
LREVALGLAE TATRHCRDGQ RSIAQRSLSH SFDGAEEGSP GGGADATEQL VVGSLATARR
DRATVLPTVA FYASGKSTGG MDEQGVRNVG RPSAPSMGKR SIGSVDGARH TAMAEFEDRH
GSALPTKTSD VHATRSAKAS LSQARKKAST RKASRSSSHM RSRERGSGVV HLEDGEIAPD
GDALDVGGRD ATTTDIVGRE GTGNAVAGQK RRGSVLIVHD DSTDVAPGET TDTDDAGDSD
YVPKPRAADG DDGGGRRVRS RTRLGPQGQR AQGTPSAMIP APIDRRAQAQ RLLRKMEATL
QLTHGDEYYK RRNDSKTRTG GRSIVSLIDL YSGYDQFSVY PADRPITAMH TPRGLVHMNV
APQGWTNVVA MVQRSMIRVM QPIRPQITEL YIDDLAVKGP IEKDESELER IPGNKKRADG
MSRIEWGNQG ESNEETPPVD GFLDEGEGMR THVNCWFMFV ESKEMRRGNP IWHPPPEFMR
EPSLILTPLE EENSWGERNT EWMMELALAE NYRIMDEPLT IEVGVQQGDD QVAMMGRMYY
LTNSLLQVQV NDERGIVKVD EIVIGDDYEF DEGKEGEFEQ EEISEEFREE EYDEFYLEMG
LLLSGDKRER EVSEKTLRMR DHYVVRGDHL FIRDKRGSPR RVVCGRNRQI DIVTAMHDGI
VGGHRSFAAT YKKAMEYYYW EGIFEITRDP WKEEAIAQRG AVAGPSGPAL RKGGQRRGQR
VSRNLEELPI YRVGDSLRIF LRDLEQYAFR REWGDREKIA NVSGAGMYKR RIEGVVAGCT
RWKVCKERLW KAIGAFPMDD VEDDLRFDET NLEDFIESLQ LTAERGE
//