ID A0A3M0JF59_HIRRU Unreviewed; 1058 AA.
AC A0A3M0JF59;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:RMB99478.1};
GN ORFNames=DUI87_24215 {ECO:0000313|EMBL:RMB99478.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMB99478.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMB99478.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMB99478.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMB99478.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMB99478.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000149; RMB99478.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3M0JF59; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0016032; P:viral process; IEA:InterPro.
DR CDD; cd05482; HIV_retropepsin_like; 1.
DR CDD; cd07557; trimeric_dUTPase; 1.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 2.70.40.10; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 1.10.375.10; Human Immunodeficiency Virus Type 1 Capsid Protein; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR029054; dUTPase-like.
DR InterPro; IPR036157; dUTPase-like_sf.
DR InterPro; IPR033704; dUTPase_trimeric.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR034170; Retropepsin-like_cat_dom.
DR InterPro; IPR018061; Retropepsins.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR PANTHER; PTHR40389; ENDOGENOUS RETROVIRUS GROUP K MEMBER 24 GAG POLYPROTEIN-RELATED; 1.
DR PANTHER; PTHR40389:SF3; IGE-BINDING PROTEIN; 1.
DR Pfam; PF00692; dUTPase; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00077; RVP; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF51283; dUTPase-like; 1.
DR SUPFAM; SSF47353; Retrovirus capsid dimerization domain-like; 1.
DR SUPFAM; SSF47943; Retrovirus capsid protein, N-terminal core domain; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000269221};
KW Ribosomal frameshifting {ECO:0000256|ARBA:ARBA00022758};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 110..254
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 971..1046
FT /note="Peptidase A2"
FT /evidence="ECO:0000259|PROSITE:PS50175"
FT REGION 436..465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1058 AA; 115991 MW; CAB6ECB4254DEE86 CRC64;
MANLPTSCCC NCGTGGGDSE IDPTGKIRVH TPHDLKTILS QKAPEWITDS RILKYEITLI
NSENLTLTTS KALNPVQFLS GEPPQELEHD CLELLNFQTK VREDLETIPL PQGRKLFIDG
SSRVIEGKRA SGYAVIEGST RDDMKTLEIG KLPGSWSAQL CEIYALKRGL DLLEGDRGTI
YTDSKYAYGV AHTFGKIWEE RGYLNSKGKD LAHKELRKAI LTSVLKPLEL AIVHCYSPIY
DGENLESLIR VHTNVNPTCF DMARLSTCEE EGKQYWVAKN AATFKQRLTG ECPIREWFCM
EKVNGVKGVK DLLKEKPLTI KKMEIEEPLN KNLFIDLVER ISHELNITNC WICGSTQMSD
VWPWEGISLG PLDILRWKKI KQKPPHIGKR KREQWDLKSK VIGEECIRRV GKRYKNLCRT
ISGDSSGIWP RLKRDRRWDG RSPFGPPGQV ATREQPGTGN EEERRRVSIP AAVWTLPPCQ
VTVRPRDRGQ FWDEVKAKVE GLSGDADAGA RLAVSDVQLV PSRVPASDPV GSTGQDRAGA
AASSEGLQAF PVLQGATHNT YQPLAWQALS ELHDAVGKYD LGSAEVMQVL RSFNASLLMP
FDFRSLARAL FPLVEYDFFE NKWTQLAVRA VERNTTLGPG DPRRMVNIDM LMGTGNYTRA
EGQAGYGPLV QEQCQQTGMA ALVQTLQLAT PQQPFATIIQ GIDEPFLYFA GRLTAAVEKQ
VSDPAARKLM IQSVAQGNCN AACKRIIEAL PGEPSMSDMV GACAKISPSS QQVLAVHTAV
QPAVATIVQP TVTTAVQPTV PAAVHPAVAA AVQPAWVVPQ GVQQQQWGAR ARKKQSAESV
VIDSDKIHKV PLDAFGPLGD GMSAFLMGRS SATIQGIIVH LGLIDADFSG QIHAMVSTPT
PPLTIPKGTR IAQLVPFKSS VSRTKDQSWG DGGFGSTGPP QVRWTAVLTK DRPETLCTVS
MVGATPSEIH LRGLLDTGAD VSILSLAAWP PQWPLTLANT WISGLGGTKQ CYVSQNPVAI
TNPEGQTAII WPHVTEIPQN LWGRDVLAAW GVRLGMDF
//