ID A0A1X7SVN2_AMPQE Unreviewed; 842 AA.
AC A0A1X7SVN2;
DT 05-JUL-2017, integrated into UniProtKB/TrEMBL.
DT 05-JUL-2017, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|Pfam:PF00078};
OS Amphimedon queenslandica (Sponge).
OC Eukaryota; Metazoa; Porifera; Demospongiae; Heteroscleromorpha;
OC Haplosclerida; Niphatidae; Amphimedon.
OX NCBI_TaxID=400682 {ECO:0000313|EnsemblMetazoa:Aqu2.1.06206_001, ECO:0000313|Proteomes:UP000007879};
RN [1] {ECO:0000313|Proteomes:UP000007879}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20686567; DOI=10.1038/nature09201;
RA Srivastava M., Simakov O., Chapman J., Fahey B., Gauthier M.E., Mitros T.,
RA Richards G.S., Conaco C., Dacre M., Hellsten U., Larroux C., Putnam N.H.,
RA Stanke M., Adamska M., Darling A., Degnan S.M., Oakley T.H.,
RA Plachetzki D.C., Zhai Y., Adamski M., Calcino A., Cummins S.F.,
RA Goodstein D.M., Harris C., Jackson D.J., Leys S.P., Shu S., Woodcroft B.J.,
RA Vervoort M., Kosik K.S., Manning G., Degnan B.M., Rokhsar D.S.;
RT "The Amphimedon queenslandica genome and the evolution of animal
RT complexity.";
RL Nature 466:720-726(2010).
RN [2] {ECO:0000313|Proteomes:UP000007879}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lucas S., Shapiro H., Lindquist E., Tice H., Dalin E., Glavina del Rio T.,
RA Bruce D., Barry K., Pitluck S., Srivastava M., Simakov O., Chapman J.,
RA Mitros T., Hellsten U., Putnam N.H., Fahey B., Gauthier M., Larroux C.,
RA Richards G.S., Stanke M., Adamska M., Darling A., Dacre M., Degnan S.M.,
RA Zhai Y., Adamski M., Calcino A., Cummins S.F., Goodstein D.M., Harris C.,
RA Shu S., Woodcroft B., Leys S.P., Manning G., Degnan B.M., Rokhsar D.S.;
RT "The genome of the haplosclerid demosponge Amphimedon queenslandica and the
RT evolution of animal complexity.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EnsemblMetazoa:Aqu2.1.06206_001}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2017) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A1X7SVN2; -.
DR EnsemblMetazoa; Aqu2.1.06206_001; Aqu2.1.06206_001; Aqu2.1.06206.
DR eggNOG; KOG1075; Eukaryota.
DR InParanoid; A0A1X7SVN2; -.
DR OMA; YEEYCEG; -.
DR Proteomes; UP000007879; Unassembled WGS sequence.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR000477; RT_dom.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007879}.
FT DOMAIN 370..570
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|Pfam:PF00078"
FT REGION 1..63
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 215..286
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 28..63
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 220..286
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 842 AA; 89075 MW; 21614EC017B3E3F3 CRC64;
SCRPGGSSSC HNPFGRGPPA SGPRVSSPLD SDPANSHPSS LSASGQSDTE ALPDPATSST
SGLPSLSTIC HLQVPLLYHV PKAARNSWSG ILSAALEDVV SRPTDLDSWS RLFMLPKCVL
FLPPFRSRRK SHDLLYLIKE RLQSWRNGEF LALWDKVTVR AAQLPRTGSS PQSDANVRRA
RRAVEAGHLS KAIQALSSRG LAPPSHESYL ELLSKHPQSP LPASPLPLAP PDSLPPSSPP
HTPLSPLPSS IPPSSPSVSP PLFSSPSPPP PTPPSPPPLL PTLFSPPLPS PSLTPAAVLQ
AVRSFPLDTA PGPTGLRASH IKEAVCCPSP CRAQSTLQQL TLFVVFLSSG DCPSSVIPHL
CGATLLASLK KSGGLRPIAV GEVLRRLTSK CLSSLVLPQV RHILPPHQVG VGCSNGAESI
VHSLKLILAN QSIPSNSKCC LLLDFSNAFN CINRLSMFSE VRSKIPLLSN WVECCYGAQP
NLLFGDYIIP SCCGVQQGDP LGPLLFSLVL QPIVERLESE VPGLVLNSWY LDDGVLCGSS
DDLLAALTII EDLGPSHGLH LNLSKSLLYL PPDIASNPHP LPSAIPSTSV GFVLLGAPVG
PPDFCRSIVQ ERVESIKASL ELLPLLEDSQ SQFSLLRSCL GLPKLLCALR TTSPDVLSSV
TMDFDAIIFD FLSELVGGSL TSWSRCKAAL PIKLGGVGLR LASQHSSAIF LASVTACSPL
ILSLSGQEVP PSYVSAALVA FASSAELSDL TSISELDLPI SQKSLSGLID KVNYNALLSS
TQDIRSKALL LSSSIPHAGD WIGVLPSPNL GLHLLDCEFR LCLRYCPSCC PLPSQGDPVP
CA
//