ID R4GAM1_ANOCA Unreviewed; 1102 AA.
AC R4GAM1;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 2.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000022322.2, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000022322.2, ECO:0000313|Proteomes:UP000001646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000022322.2,
RC ECO:0000313|Proteomes:UP000001646};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000022322.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; R4GAM1; -.
DR STRING; 28377.ENSACAP00000022322; -.
DR Ensembl; ENSACAT00000029739.2; ENSACAP00000022322.2; ENSACAG00000028909.2.
DR eggNOG; KOG1075; Eukaryota.
DR GeneTree; ENSGT00940000163630; -.
DR HOGENOM; CLU_000680_1_1_1; -.
DR InParanoid; R4GAM1; -.
DR Proteomes; UP000001646; Chromosome 2.
DR Bgee; ENSACAG00000028909; Expressed in adrenal gland and 9 other cell types or tissues.
DR GO; GO:0003824; F:catalytic activity; IEA:InterPro.
DR CDD; cd01650; RT_nLTR_like; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR31635:SF196; REVERSE TRANSCRIPTASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR31635; REVERSE TRANSCRIPTASE DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1102
FT /note="Reverse transcriptase domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5032299758"
FT DOMAIN 463..742
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
SQ SEQUENCE 1102 AA; 130827 MW; FABBDA0A1BF247E0 CRC64;
MQVTKMGWAK ALLLHYCFCQ WFKGQSNVPI ANQEAFLKNH HGRQVYESRG SNRARGVAIL
IKNTKDFQVE MVKRDSDGRW VMIKGKIEKI RITLVCIYAP NKNQKKYLKK LFGEIDKFAQ
GEVYCAGDYN LELVTTRQNS NRKLNIREYN MNDLLEGEPE NNRWTYYSGR HDTYSKIDYI
LGKNTNQSVI LEAKTLPIHL SDHAPLRVKV KLEIERKERT WRYNTVLNSR EEEYETIRKE
INKYFETNDN GQVTKDIIWD ASKAVIRGHC ISLEVNLRRR WRNEQDKRIK EIETLQDKHK
RTKDRQIWRE LKEKKNQLEA FEERNIINKY LYVKSYSQSW NIKSMKRMAY YLKKKKEKTW
IKRLKDENGK EQEEQSKIRE IMAKNNKNLY KEERVMQGNL NIREKIAEED RELLNQPITQ
EETIRAIKGL KGGKSPGPDG FPAEYYKAFM EELAPHLTEL FNEIYTKQKV PLTWKVSEII
TIPKKGRDLT QPGSYRPISL CNQDYKIFTK ILAKRLETIM PKIIKEDQYG FVKGRKIGDP
IKNVVIAMNH ANSNKKKMGI LKLDVYKAFD KVNHNYLLRL CQQLNMGENF CKTLQQLYSN
CKAKIRVNNG RTGEIPILNG TKQGCPLSPT LFVIAIEMLA ESIRNSTNWT GYKIEKGGGQ
KEEIRLNFFA DDAMIITSQP LIMVKDIIKK LEEFKNISGL FVNIKKSEIL CLNTPPREQL
EIRKISGLRL GLKKLKYLGV WIYKNPKSIV KGNYKQKWKV IEKQFKNWEG KTLTRVDRIR
ALKMFIIPKL TYLFQTLPNT VDRKTLKTWD RRIRKWAVGG KRPRIRDKWL NAKEEEGGWG
IPSLEVYHDA FQIQRLLEIQ YTKKKWANME KIINNTKEKE IIFREWSRRE EAEFKEPFKG
TIKVWKKRKK KLSPGNENKM ATIWQINYNN NEIIGRTLRS IESRGKRLVE DLYTIDGDPV
EQKEIQNWIG SGNWIQSWAI RQEIFRLREK IREQDSPFEK AIRKCLREEK KVAGDLYEQI
ITVKEEIVEE IFQTWRMDME IEREDIQREI SNIAKEKYNV LREARRKMLQ KWYRTPAQLS
RFLPGMKDRC WHGCEQRWCS IV
//