ID A0A3M0L402_HIRRU Unreviewed; 1313 AA.
AC A0A3M0L402;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=ribonuclease H {ECO:0000256|ARBA:ARBA00012180};
DE EC=3.1.26.4 {ECO:0000256|ARBA:ARBA00012180};
GN ORFNames=DUI87_00934 {ECO:0000313|EMBL:RMC20088.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMC20088.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMC20088.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMC20088.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMC20088.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the beta type-B retroviral polymerase family.
CC HERV class-II K(HML-2) pol subfamily. {ECO:0000256|ARBA:ARBA00010879}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMC20088.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000093; RMC20088.1; -; Genomic_DNA.
DR STRING; 333673.A0A3M0L402; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR GO; GO:0016032; P:viral process; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 1.10.375.10; Human Immunodeficiency Virus Type 1 Capsid Protein; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR33064:SF29; LRRGT00076; 1.
DR PANTHER; PTHR33064; POL PROTEIN; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF47353; Retrovirus capsid dimerization domain-like; 1.
DR SUPFAM; SSF47943; Retrovirus capsid protein, N-terminal core domain; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW Reference proteome {ECO:0000313|Proteomes:UP000269221}.
FT DOMAIN 976..1157
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 1..49
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 330..397
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 420..495
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 778..802
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1282..1313
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 682..713
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 335..387
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 468..489
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 784..800
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1313 AA; 145891 MW; 3940DDCC4E267142 CRC64;
MMSPPGQAAA LAGTRLPGNP VPQGTGVPAP SNPEVPEQGT GAETTASSCS IPTLAPVTFR
NARGGNNRAE YHPFPRATIK EICKAHRIYG HDGPYFRGLL RADLSAEEVV PADLQYLFSC
LLNPTEYILW VTAWKRLLQE ALPGLLNHVN TRVDAQGIPL TLDHLAGEGQ WAEATDQVVI
PVQCLHVVRE TALTAFFSMQ TQGPAISYSK IRQDQSESFT DFVERLSRAI EAQVKNEMAR
EHILSEIAFS NANDLCRAAI LSLPLHPKPT LPDMLQLFGP TPPVFEGFKS LNVNDIIQWL
VLLVCLFCLA FRDKGRLTWI TTLIPTLENK DAAPQPEPVP QPDPAPQPEP VPQPEPVPQP
DPAPQPDPAP QPDPAPQPDP APQPDTVPQP ASGINHPDWV KEIGEMLREY LSPVVASLAP
AAGKPSPCPD KEESDGAAAE PTDVTTVQVP AEPQGQPQPA AVAPVETKKY KVKSEHPGNK
DKKGGPSQPG EESEVEIITE SLTYESLRNL QKDLARRGRE AYTAWLLRVW DLIGTGVQLD
SSEARNLGPL TQDSGVNQVF VREPGPLSLW ERLLMSVRER FVHRDRMQEH HHRMRWKTLE
EGIQQLREVA VLEVLFGRGG QHDNDPDKVK CTGQMLWNLA TLGPSQYATY IATIHPDTNR
ETVGSVANKL RNYESIICSP MQAQVSAVAK ELREEIREKM EEVTEKMEEM MRRNSSHVAP
VRVTGPRVRA QQPPARERGY TPRADLWYFL RDHGEDMGRW DGKPTSALAA RVRELKERNT
NRGGSTKMKV ASTSHDQAAS QGEARENRVY WTVWIRWPGT SEPQKYEALV DTGSQCTLIP
SEYVGTEPIS IAGVTGGSQE LTLLEAEVSL TGKEWQKHPI VTGSGAPCIL GIDFLRNGYY
KDSKGFRWAF GIAAVEAEGI KKLNSLPGLS ENPSAVGLLK VEEQRVPVAT STVHRRQYRT
NRDAVIPIHK MIRELESQGV VSKTHSPFNS PIWPVRKPDG EWRLTVDYRA LNEVTPPLSA
AVPDMLELQY ELESKAAKWY ATIDIANAFF SIPLAAECRP QFAFTWRGVQ YTWNRLPQGW
KHSPTICHGL IQAALEKGEA PEHLQYIDDI IVWGNTAMEV FEKGEKIIHI LLKAGFAIKQ
SKVKGPAREI QFLGVKWQDG RRQIPAEVIN KITAMSPPTS KKETQAFLGA IGFWRMHIPE
YSQIVSPLYL VTRKKNDFHW GPEQQQAFAQ IKQEIAHAVA LGPVRTGPDV KNVLYSAAGN
NGLSWSLGRR CLGRLGADHW DSGAEATEGQ KPTTLPQRRK SWQPMKEFKL PQR
//