ID F7DC66_HORSE Unreviewed; 814 AA.
AC F7DC66;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 66.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000017764.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000017764.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000017764.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000017764.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000017764.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F7DC66; -.
DR PaxDb; 9796-ENSECAP00000017764; -.
DR Ensembl; ENSECAT00000021559.3; ENSECAP00000017764.3; ENSECAG00000059841.1.
DR GeneTree; ENSGT01100000263511; -.
DR HOGENOM; CLU_000680_9_4_1; -.
DR InParanoid; F7DC66; -.
DR OMA; ANENWIE; -.
DR TreeFam; TF341604; -.
DR Proteomes; UP000002281; Chromosome 6.
DR Bgee; ENSECAG00000020340; Expressed in trophectoderm and 15 other cell types or tissues.
DR CDD; cd01650; RT_nLTR_like; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR25952; ENDO/EXONUCLEASE/PHOSPHATASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR25952:SF246; LINE-1 RETROTRANSPOSABLE ELEMENT ORF2 PROTEIN; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 70..345
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
SQ SEQUENCE 814 AA; 95746 MW; 241327D082750BF6 CRC64;
MGKFLDSYNL PKLNQEEMET LNRPITSKEI ETVIKNLPKN KSPGPDGFSG EFYQTFKEDL
ISILLKLLQK IEEDGVLPDT FYEANITLIP KPDKDNTKME NYRPISLMSI DAKILNKILA
NRMQQYIKKI IHHDQVGFIP GTQGWFNIRK SINVIHYINK MRNKNHMIIS IDSEKALDKI
QHPFMIKTLN KMDIEGKYLN IIKAIYDKPT ANIILNGQKL KAIPVRTGTR QGCPLSPLLF
NIVLEVLARA IRQEKEIKGI QIGNEEVKLS LFADDMILYI ENPKESTEKL LEIINNYSKV
AGYKINVHKS VAFLYTNNKL TEKELKNSIP FTIATKRIKY LGINLTKEVK DLYNENYKTF
LKEIGDDIKR WKNIRCTWIG RINIVKMSIL PKAIYRFNAI PIRIPRTFFT EIEQTILKCI
WGNKRPRIAK AILSKKNKAG GITIPNFKTY YKATVIKTAW YWYKNRSTDQ WNRIESPEIK
PHIYGQVIFD KGAEGLQWRK ESLFNKWCWE NWTATCKRLK IDHSFSPHTK INSKWIKDLK
IRPETISLLE ENIGSTLFDI SFKRIFSDTV TPQLRETIER INKWDFIRLK SFFKARENRI
ETKKQLTNWE KIFTSHLSDK GLISIIYKDL TLLNNKKTNN PIKKWAEDMN RHFSKEDMNM
ANRHMKRCSS SLIIREMQIK TTLRYHLTPV RLAKTSKTKN DKCWRGCGER GTLIHCWWEC
KLVQPLWKTV WRFLKKLKIE IPYDPAIPLL GIYPKNLITD ISRVRCTPMF IAALFTIAKT
WNQPTCPETD DWIKKMWYIY TMEYYSAIKK DEIG
//