ID A0A0V0S7N3_9BILA Unreviewed; 1018 AA.
AC A0A0V0S7N3;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KRX22753.1};
GN ORFNames=T07_2090 {ECO:0000313|EMBL:KRX22753.1};
OS Trichinella nelsoni.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6336 {ECO:0000313|EMBL:KRX22753.1, ECO:0000313|Proteomes:UP000054630};
RN [1] {ECO:0000313|EMBL:KRX22753.1, ECO:0000313|Proteomes:UP000054630}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS37 {ECO:0000313|EMBL:KRX22753.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX22753.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDL01000029; KRX22753.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0V0S7N3; -.
DR STRING; 6336.A0A0V0S7N3; -.
DR Proteomes; UP000054630; Unassembled WGS sequence.
DR GO; GO:0042575; C:DNA polymerase complex; IEA:UniProt.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000054630};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 196..210
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 889..948
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 1018 AA; 112490 MW; 50820AADDB008353 CRC64;
MGSGNGVDKS LDLRLIPEFD GSPQQSVVEW LEKLELVCKL RDISDVASVI PLRLTGGAFA
VYLQLNAQER SSVDKVKEAL LAAFAADPFV AYDQFVSRKL GPDESPDVFL AELRRLATLF
GGVSEKALAC AFVAGLPENV RQLLRAGSRM EDLGLSQILT RARAIITDER PVDAPNTCLS
ARGPGVRSPT APPVQRCFEC GGPNHFARDC LARRQGGDPG KRARDRGISA SLLSPRPLSE
ALPAVRMNVG GIRRRVLVDT GCSVCVAHTS CCRNWRKEDI AITTMCGRAM GCEGTGVVQL
RPHGKGPIEV EVIVVRSKPL GYDLILGMNG IAALGGVTVS GGRCVRFGLD DPGVCAAAEA
RISIREKDFT ATYCPSTRSW TAAWKWSDAG EPGVLRNTVE EYPPANVARG AYEDELRKWI
KDGWLVPYDE SEHGPPKGLL PLMAVIQRNK KKVRPVMDFR ELNAHIESHT ADTDVCSQKL
REWRRQGVDV ALIDLEKAYL QIRIDKSLWP YQTVAFKGKR YWLTRLGFGL NVAPLVMKAV
LNCVLSRDPD VRKGTSAYID DILVNESVVA VDRVKRHLAH YGQGERGKLM WRRDNDVGGV
PDVLTRRSVF SYCGKLVSHF PVCGWLRIAA AYIKRKVNDA TTSWDEVIDG DELRGLIQET
ALAVKKHASS LAIGVALEVG GSIVEDAAWL RPDDAQHINM AELDAVIKGL NLALSWQMRR
IRLMTDSATV HRWVTDGLSG KARLKTKASG EMLIRRRAGI ILSLVEEFGL ELEVDLVKSA
CNKADELTRV PRRWLKPPAA GPALACAATA DLGVERMIAD VHHAMGHPGI RRTLYFARRT
DPKVSKRQVR QVISGCEPCK SLDPAPGKWK RRSLEVEEVW QSANVTEQLE VVFYERGAPE
ELLTDNDTAF RGRIFTEFIA RWGVRVRYRC ANAPSGNGIA ERCHRSVKII AVRKNCTVEE
AVYLYNVMPR DGRNPWTAPA NVVHAYAVRV RGVDRATEEP EEKNGRFAVG DSVWVRPP
//