ID A0A388JZ42_CHABU Unreviewed; 3883 AA.
AC A0A388JZ42;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN ORFNames=CBR_g36536 {ECO:0000313|EMBL:GBG63052.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG63052.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG63052.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG63052.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG63052.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000035; GBG63052.1; -; Genomic_DNA.
DR EnsemblPlants; GBG63052; GBG63052; CBR_g36536.
DR Gramene; GBG63052; GBG63052; CBR_g36536.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 2.
DR Gene3D; 3.30.70.270; -; 6.
DR Gene3D; 2.40.70.10; Acid Proteases; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 3.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR33064; POL PROTEIN; 1.
DR PANTHER; PTHR33064:SF40; REVERSE TRANSCRIPTASE_RETROTRANSPOSON-DERIVED PROTEIN RNASE H-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF17919; RT_RNaseH_2; 3.
DR Pfam; PF00078; RVT_1; 2.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 3.
DR PROSITE; PS50878; RT_POL; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 1644..1823
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 2693..2872
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 88..140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 246..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 341..487
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 856..884
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1006..1031
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1086..1105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1236..1400
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3221..3264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 2069..2099
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 88..132
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..266
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 341..372
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1236..1263
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1276..1307
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1308..1332
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1361..1376
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3249..3264
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3883 AA; 432215 MW; 23183CE22E0F3427 CRC64;
MLAKGYWSGL NTGLAARQFF PVTSSSISEH GVETGAASEV GKKQTSEVAS SCPAQTSHEE
NVFSMLFRLH IASIGALFLE DYQPTLGSGL GHDTQEGNGN QQQPNPPLPF LTSFPHQSKS
DSWTSGLSGN QRFLSPGRDD DGRDGAFVDS FWIEMFAGDG YRAHASLGWE GFRMGMNVEG
EDGEAALDRQ QVQKSSEREG KGKEAARRAK KTAKIECSIG QCCLNLCGPQ GCGQVGDIEN
IVKRRMMSEA TERRDGDKDV HGEEDAEGIP TAGSQVNNDH RDEQGEVDGQ AGPAGDEGDY
VRPEFGVPVP ESFAAQESVD AGSTFTDFGS AADRFSVWSS RYGRSSTFSE TGELGTRLSA
EGQIQDVSKG QGGSRAGGSD GEAPSEARSG QKLGMLSRSR SLGSMRDLEK ENLEHNVGES
GGGIDLRRPT DFPATPKAAP VDQRKHTGSK STHSQNSGRR SLSRLKGLYV DGSSGEKNKG
DTWEDALSPR SVGEMSFMTA EAESSDGELM TEMGLLNEMS TSGIMWGKQW AVASLQIGSL
VLRRGDYDGV ADLRAVAEQV MKGSDEIDIT LLVAENLTKV RMKSEGGLVY IDTPVLEGVI
TRAWRYYTSL LQSVSKMEIL KMALLSSRPV SAETVTESGD VSDSHHNASS SSGMSITWQG
EGSMVISDLS VVLARPPLDA NVTGEGLLLE LDSSLNVVMD MQEILLSTQI SGFAARGLLT
SRKPLTVRRA DIIGGNGYHK EVDESVVRDP ELAVQFPTET FYEAERETRQ AGSEEVLFEG
EPRVILDCME SACDLRRAIH QDGSADDLGP SKEGVWEGRV SIGGLHAAVT TYEVQLLTGL
LTPLLRVSSS MTSAENKEDD AGIFPGAMTS SSVPALPDSE TESPEIPYLP HGSVIAIRDV
YEESFVAVEE HSGTLGTFSL TSVRHYGLAG ERALFKVKRL RNKKHKEQPW FCLLSLYARH
STTGRALRLH YQPASALPEI ATTNNGAWEQ WQLLPAKPPD YLDGVIAHDV DEGNKDGDDK
EQRKDKTSAQ RTVYLENKKA GRCLSISDGS VVLGKYSHPL KIKIVALPKE GETDVEFQME
EAEERAKRKK RRKGESSAPE PVQELVRSPT EQVAIALANI PSVNVVIKDG LSITLLYEAA
GGAHLLPLVR ARGEKMEATV QLGTEKTRFI ASCSIGLDYF EVERGKWEKI FSRISLEMLS
RGRVLASSLA RKEWKAPSRT FITFQKGDME RRRRMWTRRT TRIERTTRSQ DGGGEEMDGG
KRGRAGGGGR VGGGGDMDGE EGEDTRKGRR RGGMEDEEVR GGGGDKEDEE GQEEEDEQEE
EEEEEEEEEE EIKRRREEET WSMRKGRRSM SELANLQRAV RNHKTQHEDA TRALDARVLD
LEQAVPGPDA GASSSASSSR QLEERVDHVV AMLGDISAFT ELATISQRFE LLDTKIRLHG
ARRPWSTCPE EVPAYATLAD GHTHKSIDRC IDVVPVYFAP HASEAVSFDI LDTKFDMILG
MSWLRSEDHP VNFFHRTVHI RDRNGVLVPC TVPLPHPSIS CHVVSAASMR ASIIRDDIEE
MGGCFLHALP PRDASSTDSS SDPRITELLD AYSDLFEGPH GVVPDRPIRH EIILEDGVVP
PCGCIYRMSE EELSVLRAQL DDLLEKGWIR PSSSPYGAPV LFVRKKNKDL RLCIDYRKLN
AQTIRNVGPL PRIDDLLERL GGAQFFSKLD LKSGYHQLEI RKDDRYKTAF KTRYGHFEWL
VMPFGLTNAP ATFQAAMTTE FQHMLDRFVL IYLDDILVYS RSLDEHVEHL RTVLERLRQA
KYKANRDXCE FAXQEXEYLG HYVTPQGIRP LADKIEALRV WPEPTNTTDV RSVMGLAGYY
QRFITGYSRI AAPMTRLQSP KVPFVFDDAA RWSFQALKTA MLMAPVLSIY DPTLPTRVTT
DASGCGIGAV LEQHDGDDWH PVEYFSHKVP PINSLDDARK KELLAFVMAL KRWRHFLLGR
RRFTWVTDNN PLTYYKMQDT DTGALPRRSA RLAVRARPVV PPRPKQRLSK RKTPAASSTA
MTVPVSTGVL PACPVQGAGE PLAAYLQRVQ AFTDAVATAK AQEEAAEAER QRLANEAAAQ
AQHTAEADAA ARDQRNASST ESLIHSENQW TMFLQGMIFL PSDAQADPTP AEAEKTHLAN
LMLGMMRGIM WNNTLLQAHL RTEQQQRQKY QQDIAVLTAA IRAEASQQQQ QHQLLNSALA
RINNIEANAT AALGCTMDAT KQLNERIDHV VTIIGDIGDF TIPATISSTV AAVKTDITKL
QTRPDAATKT YKMPHFDISK FDDYNKSDAL AWWQRFLTEA SCRTVPADDM MKALYLQLIG
GGHAWMNHLA ATKKCTIAEP HTHIPWKEFE KLWLTRFMVR NVMNAAMNEV YTCSQGSMPT
RDWTTKWQKI VTTPGFDLSF TNQRSEFFSR SCAGLRSALG NEYDYDSFQA ILDRANLVIQ
TDDKAANERQ SQPHYVAKQG YQRPTHNNVV ISEETVDLHA AAASSSDGRI VAALPPKRPK
RVRKNKATQE TASMSVSFDI LDTKFDMILG MSWLRSADHP MNFQDRTIHI RDRNGVLVPC
TVATAHTSIA CHVVSVARIR AAIARNDVEE MGLALLHALP SPDGPAASPP DPRITHLLDK
YGDVFEAPTG KVSDRPIRQE ITLEAGAVRP RGCIYRMSEE GLEVLRAQLD DLLNKGWIRR
SCSPYGAPVL FAWKKNKDLP LCIDYRKLNS QTVNNADPLP RIDDLLERLG GATYFSKLDL
KSGYHQIEIQ PQDRYRTAFK TLYGHFEWVV MPFGLTNAPA TFQAAMTTEF RDLLDRSVLI
YLDEILVYSS TLDEHITHLR AVLNRLRLAK YKANRDKCEF AKQELEYLGQ YVTPKGIRPL
ADKIQAIMDW PEPRCRTDVH SFLDLAGYYQ RFVESYSKVA APLSRLQSPK VPFEFDDAVR
GAFTTLKAAM QAAPALRIYD PTLPTQVTTD ASGYGIGAVL EQCHEDGWHP VEYFSQKVQN
HTGVPLNCHL GTWHKIQPQT AEDFLIRRMA SRRDIRAALM SSSIVIGLGD PTTGVTISAL
MSVPLTDPGV QIVNFQLRGT AEGDQKSHVP LVVSISKQSQ EGLFVSISPL LTVGNLSGMP
LFVRYWKLGA SDENGKAEDG VMRLDKNEAV DDLMHSFNVR TEGGERSHLF GGLDCRHPEG
EERGVPEAGE VKGYCRGAGG EEEAVRRRDV EVSEEKIMML QREEEEKRRA AEEEAAAEEE
EEEEGEPLER RCGEERGEVS GTKEEDKWRE KKISEWVANL SLGEDEETQL YVSQEEREAF
ARALELIEDP LECQATEDEK KLEWKLKMMR EKKRRRKEAS RIAVEVERVR TGRQELQAQT
EVFAKLDKMM GFLEVLSEAW LEAHQARKGQ EVTLQAMRSG FREFASDVVG HVGSEIRRLR
DGVDKFCAGA IETAKIVATT EATARPHKEP VNYISCKALK KLRLGLKVQK LEDPIVTVLA
DNRTMRVEDY VEGVQAYFRL EKDGKVEKVI HSPTLLVQDD LPFDIVLGLD WGEAAGATLH
LREHECRLPS PSGEVKTARL FHVSRVDNSL AHCCLSAPAF ARLVRKEQLE DQVFVVYVRP
VTEPKEEDRS TDPAIAKLLE EFEDLAEPPT GVVPWPIQQR IEIEPGSRTP KGAVYRMSPR
ELKELRKQLD EPLQKGWIRP SSSPFGAPVL GKLREANFKI NAKKCEGAKT QVLYLGHVID
GDGIKPEDSK IAAIRDKPTP RTLTELRSFL GLANYYRKFV MNFSTIAAPL RRLLKKEAIW
QWDKDCTSAL KKLKRALIEY PVLKVADPSL PFVVTTDASQ YGIGVVLQED DDNGYRPVEF
MSARMPSEKV ATSTYERELK LSGKLRAQEA LPAWEALQGV FRP
//