ID A0A3R7P7S4_PENVA Unreviewed; 795 AA.
AC A0A3R7P7S4;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE RecName: Full=Gypsy retrotransposon integrase-like protein 1 {ECO:0000256|ARBA:ARBA00039658};
GN ORFNames=C7M84_018820 {ECO:0000313|EMBL:ROT63311.1};
OS Penaeus vannamei (Whiteleg shrimp) (Litopenaeus vannamei).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Multicrustacea;
OC Malacostraca; Eumalacostraca; Eucarida; Decapoda; Dendrobranchiata;
OC Penaeoidea; Penaeidae; Penaeus.
OX NCBI_TaxID=6689 {ECO:0000313|EMBL:ROT63311.1, ECO:0000313|Proteomes:UP000283509};
RN [1] {ECO:0000313|EMBL:ROT63311.1, ECO:0000313|Proteomes:UP000283509}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Muscle {ECO:0000313|EMBL:ROT63311.1};
RA Zhang X., Yuan J., Li F., Xiang J.;
RL Submitted (APR-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ROT63311.1, ECO:0000313|Proteomes:UP000283509}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Muscle {ECO:0000313|EMBL:ROT63311.1};
RA Sun Y., Gao Y., Yu Y.;
RT "The decoding of complex shrimp genome reveals the adaptation for benthos
RT swimmer, frequently molting mechanism and breeding impact on genome.";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ROT63311.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QCYY01003452; ROT63311.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3R7P7S4; -.
DR Proteomes; UP000283509; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR47266; ENDONUCLEASE-RELATED; 1.
DR PANTHER; PTHR47266:SF28; GYPSY RETROTRANSPOSON INTEGRASE-LIKE PROTEIN 1; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000283509};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 271..287
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 602..699
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..46
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..39
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 795 AA; 88599 MW; 5F523F9772467442 CRC64;
MGRRNGGSRN NSPSRPSTSQ EAASAPSVGR NSPSGQTLLP DSDDRGIERL PEISFKAEVS
SAHPLKRNQE VESWLRSVEN HTRPQTDAAF IKAAKASCKG AAELIVNSPL FDSIVSWVEF
KDKLRQKFRG TGTSSDFYRV LHSQKLQAGQ APMDFYLTLE GMVYQGLRDY PRAIGDPDDL
IRRVFLQGLP RWIREAVVVK EEAELCNLVE TTQRVWSLRE SEGPGVSSPT RSPFRPRVLA
AAEVDPPRKY CRYHKNHEHD SSECRVLSER RKCFACGEQG HFARQCPFVS RQVGRGSDTV
TGEQRSQPLI AITRFFRNLQ VDLVSLMVNG FKTQCFVDTG SEVAHVAVPA FVPPRSGRFV
KCSVPRGMQA AKELMVSGIE CPLVVPRSLQ EYAAEEQALG TELFMGNDND FELGEGFIDD
DFDSFDAVGD FGGADDYILL PDVDLPPCSV VTQTPVGVDQ SGSSTDCPED VQLKDAELGD
LLSEIDLGHL TLEQQQQLRQ LSRAVAATEL HNRDPDDLRR HQLEDPLWKE VIEYLEERNL
PRRRLPLTLE EFELRDGVLY HVRHLPDRVL HQLVVPKTLR GSAMHLAHSS ATAGHLGSTG
HIASYPLERV SADLMELEVT SQGNRYVLAF IDQLTRYVQL IPLPSKDAET VADAFINQFV
TVFGPPPNGM IERTNRVVKN ALATLLEASP LEWDELLPYV RLAMNSAVHR SAREGWARDY
NRRTRKSTFA AEVGDLVLFK DLPRMAGAGR RGALGPRWFG PARICKKTGP VTRRLPEDDP
AVALLASFCR PPDHG
//