ID A0A388LTI7_CHABU Unreviewed; 893 AA.
AC A0A388LTI7;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE RecName: Full=DUF659 domain-containing protein {ECO:0000259|Pfam:PF04937};
GN ORFNames=CBR_g40363 {ECO:0000313|EMBL:GBG85634.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG85634.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG85634.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG85634.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG85634.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000527; GBG85634.1; -; Genomic_DNA.
DR EnsemblPlants; GBG85634; GBG85634; CBR_g40363.
DR Gramene; GBG85634; GBG85634; CBR_g40363.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR InterPro; IPR007021; DUF659.
DR InterPro; IPR012337; RNaseH-like_sf.
DR PANTHER; PTHR32166:SF81; DUF659 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR32166; OSJNBA0013A04.12 PROTEIN; 1.
DR Pfam; PF04937; DUF659; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 1..151
FT /note="DUF659"
FT /evidence="ECO:0000259|Pfam:PF04937"
FT REGION 557..873
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 557..576
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 601..620
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 676..718
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 742..761
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 762..814
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 83
FT /note="D or N"
FT /evidence="ECO:0000313|EMBL:GBG85634.1"
FT UNSURE 244
FT /note="E or Q"
FT /evidence="ECO:0000313|EMBL:GBG85634.1"
SQ SEQUENCE 893 AA; 98347 MW; 02A1CC87842CB76D CRC64;
MRTRLLDEIY HEIXERVAPK KAKWKLTGCT MMTDGATTRS NKPIVNFIAA GEDGPVLIST
VDMSXRDKTG VALAELWEEV IRDINVKHVN AYCTDNAYAN KVAAQRLQDH PDRVISRIPW
LPSAAHCLSL LLRDITRFSW VWPILNNTRK VXMFFKNIHK ALSYHRSFKE QGQLELIRPC
DTRFXSAYQM VERLTDHERV LQMVXVGSVR WRSTLWRGKA GQDERPVRQL LLSASFWDCA
HRVEEVMRPA YSLLRSMDRD GSSPXTLWAF VDSIATRVRD MGLPQADEAE IMERVDYRCG
MMRQPAHALA YLVDPRRRDV SLLADTDSAL VQSALHHLAT YADGGEGSEE HTTMWYGLYL
FQHDDPHADP RPKWWTDPVA IAQTKHGVHP AWWWYLHGGD FPRLQEVAIK LLRARSTSRK
GYVDVWEDLV EEPPKPVVSD PSEEVYAEGM TIPEEVEADL RSRRKEGGDR ASARLLQARD
DDDVEAEDAI YEADDVWAGK DMIEEIAAAG KGKEKVHDDP LVSRVWDRWG SLEAVEDLDG
FLHGSRHTLH IMQDIEECRR PEGGSGRVDE GHHVNAGPLD DGAGGRPSEG PIPGSVVEEQ
RPDVGELEAS RGKEDLCPDV EEQEAAAMEA TPDMEEEEAA AMSHGQPCLS REEMGTGHTE
HASPAPTISK VAPDDSDDDG DVDNDDSDDD GDVDNDDDDD DDEDDDEDDD EDDDDDALMM
IIIIMMMMMV MSVLGDPPPK WKVDEAGGDD KEGREEEKEK EKEEEEKEEV EEGEEEEAVE
EESVEERLDD LGDNVEDVSG EGGGDGDGED DDRGGDAGGD GSGDTGGGGE RGGGGNGDGG
GGGGGGGDDG VGGAGGGNGG GGGSGSDEGE TLSVVIAASA DELGHWRKHW NNV
//