ID F6QT81_HORSE Unreviewed; 851 AA.
AC F6QT81;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 67.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000008581.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000008581.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000008581.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000008581.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000008581.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F6QT81; -.
DR STRING; 9796.ENSECAP00000008581; -.
DR PaxDb; 9796-ENSECAP00000008581; -.
DR Ensembl; ENSECAT00000011032.3; ENSECAP00000008581.3; ENSECAG00000037657.2.
DR GeneTree; ENSGT01100000263511; -.
DR HOGENOM; CLU_000680_9_4_1; -.
DR InParanoid; F6QT81; -.
DR OMA; TWNQPTR; -.
DR TreeFam; TF341604; -.
DR Proteomes; UP000002281; Chromosome 15.
DR Bgee; ENSECAG00000037657; Expressed in inner cell mass and 14 other cell types or tissues.
DR CDD; cd01650; RT_nLTR_like; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR25952; ENDO/EXONUCLEASE/PHOSPHATASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR25952:SF246; LINE-1 RETROTRANSPOSABLE ELEMENT ORF2 PROTEIN; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 76..351
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
SQ SEQUENCE 851 AA; 100257 MW; 96489405144FEE0D CRC64;
MDNLEEMDKF LDSYNLPKLT QEEADNLNRP ITRKEIETAI KNIPKNKTPG PDGFPGEFYQ
TFREDLIPIL FKLFQKIRED GTLPNTFYEA NIPLIPKPDK DTTKKENYRP ISLMNIDAKI
LNKILATRIQ QFIKRIIHQD QVGFIPGTQG WFNIRKSINV IHHINKLRNK NHMIISIDAE
KAFDKIQQPF MIKTLNKMGI EGNCLNIIKA IYDKPIANII LNGQKLNPIP LKTGTRQGCS
LSPLLFNIVL EVLARAIRQE KRIKGIQIGR EEVKLSLFAD DMILYIENPK ESIGKLLEVI
NNYSKVAGYK INLHKSVAFL YASNEPTEKE LKNTIPFTIA TRRIKYLGVN LTKEVKDLYN
ENYKAFLREL DDDIRRWKDI PCTWIGRINI VKMSILPKAM YRFSAIPIRI PMTFFTELEQ
RILKFIWGNK RPRIAKAILR KKNRTGGITI PDFKTYYKTT VIKTAWYWYK NRCTDQWNRI
ESPEIKPHIY GQLIFDKGAE CIQWRKESLF NKWCWENWKA ICKRMKIDHS FSPFTKINSK
WIKDLKVRPE TIRLLEENVG STLFDISIKR IFSDSMPSQR RETIERINKW DFIRLKSFFK
ANENRIETKK QPTNWEKIFA SHISDKGLIS LIYKELSQLN HKTSNNPIKK WAGDMYRHFS
KVDILMDNRH MKRCSSSLII REMQIKTTLR YHLTPVRMTK ISKTNSNKCW RGCGDKGTVI
HCWWECKLVQ PLWKTVWRFL KKLKIELPYD PAIPLLGVYP KSLKSAIPKV LCTPMFIAAL
FTIAKTWKQP KCPSTDEWIK KMWYIYTMEY YSAAKQNKII PFAITWMDLE RIMLSEISQQ
EKDNLCMTPL I
//