ID R7QQY1_CHOCR Unreviewed; 1046 AA.
AC R7QQY1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CHC_T00000480001 {ECO:0000313|EMBL:CDF39891.1};
OS Chondrus crispus (Carrageen Irish moss) (Polymorpha crispa).
OC Eukaryota; Rhodophyta; Florideophyceae; Rhodymeniophycidae; Gigartinales;
OC Gigartinaceae; Chondrus.
OX NCBI_TaxID=2769 {ECO:0000313|EMBL:CDF39891.1, ECO:0000313|Proteomes:UP000012073};
RN [1] {ECO:0000313|Proteomes:UP000012073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Stackhouse {ECO:0000313|Proteomes:UP000012073};
RX PubMed=23503846; DOI=10.1073/pnas.1221259110;
RA Collen J., Porcel B., Carre W., Ball S.G., Chaparro C., Tonon T.,
RA Barbeyron T., Michel G., Noel B., Valentin K., Elias M., Artiguenave F.,
RA Arun A., Aury J.M., Barbosa-Neto J.F., Bothwell J.H., Bouget F.Y.,
RA Brillet L., Cabello-Hurtado F., Capella-Gutierrez S., Charrier B.,
RA Cladiere L., Cock J.M., Coelho S.M., Colleoni C., Czjzek M., Da Silva C.,
RA Delage L., Denoeud F., Deschamps P., Dittami S.M., Gabaldon T.,
RA Gachon C.M., Groisillier A., Herve C., Jabbari K., Katinka M., Kloareg B.,
RA Kowalczyk N., Labadie K., Leblanc C., Lopez P.J., McLachlan D.H.,
RA Meslet-Cladiere L., Moustafa A., Nehr Z., Nyvall Collen P., Panaud O.,
RA Partensky F., Poulain J., Rensing S.A., Rousvoal S., Samson G.,
RA Symeonidi A., Weissenbach J., Zambounis A., Wincker P., Boyen C.;
RT "Genome structure and metabolic features in the red seaweed Chondrus
RT crispus shed light on evolution of the Archaeplastida.";
RL Proc. Natl. Acad. Sci. U.S.A. 110:5247-5252(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG002088; CDF39891.1; -; Genomic_DNA.
DR RefSeq; XP_005710185.1; XM_005710128.1.
DR AlphaFoldDB; R7QQY1; -.
DR STRING; 2769.R7QQY1; -.
DR EnsemblPlants; CDF39891; CDF39891; CHC_T00000480001.
DR GeneID; 17317909; -.
DR Gramene; CDF39891; CDF39891; CHC_T00000480001.
DR KEGG; ccp:CHC_T00000480001; -.
DR OrthoDB; 1697826at2759; -.
DR PhylomeDB; R7QQY1; -.
DR Proteomes; UP000012073; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR42648:SF22; RETROVIRUS-RELATED POL POLYPROTEIN FROM TRANSPOSON TNT 1-94; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000012073}.
FT DOMAIN 312..372
FT /note="GAG-pre-integrase"
FT /evidence="ECO:0000259|Pfam:PF13976"
FT DOMAIN 609..848
FT /note="Reverse transcriptase Ty1/copia-type"
FT /evidence="ECO:0000259|Pfam:PF07727"
FT REGION 192..211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1046 AA; 116583 MW; C1F9B64667214561 CRC64;
MDDQSSGSVR IEKLTASNFY IWKQKIQLLL ALRDVDQYVL DDIPENATSD DRRKWIRGDA
KAKAVIGLTL SDDYLHHVRE CSSAKETWEA ILNVFERHTL LNKLAARRDF HTVSMLPSEK
VLVFINRVKQ LAARLQSMSV EIDDKEIAMA VLNGLPPRFD NLIVALDALG NEDKVFGLEF
VKSRLLQEEQ RESMKTASAS SPHAPALVNH GFRSRNTALK PSAFVSRNDN STAAFAESNF
TCLLSTSTPE KRAANARSWL VDSGCSAHIT FDRSLFVTYE QMQSGSVEMG TKARANVAGP
TVVATASQVG DLYILDVMNE TFSSHVVTMQ TLHERMAHVN VQGIASMIHN NVVSGINSDN
HSPICPACVF GKATRSVIPK QRSSSRAQNC LDLVHSDVCG PLEVQSIGGS RYFITFVDDH
SNWVVHQLTT AYTPEQNGVA ERLNRTLIDL VHSMLSHKQV KLRKLDLRAR EAMFLGYSQC
SKAYKLWDGE LNKIVVSRDV KFDESTCGIH DIPAHESDSI NSDVDVVLLD GDNGNEADTP
NTNEVAVRRS SRVSRPPGSW WRSNFAAALL SHAHVAIEGR NTFKQATNGP RAAFWQTGID
KELASQTKNR TWKLVPPSEV SNILTSRWVF NVKQLPDANG NIVESAKARL VARGFQQVQG
LDYTETYAPV IKFTTIRLLL ALVAHYDLEL HQMDVVTAFL NGDLDEDIYM EQPEGCIDKS
KSDHVCKLLK ALYDLKQAHR QWHAKVNDFL IGELGFETSR SDPCLFIKRI GNTIMLIALY
VDDLLLAGSD IDAIKWMKGE LNKRFEMKDL GKAKVCIGLE IERDRSAKTL SLTQTKYASN
VLDRFKMSTL GSLMYLMVGT RPDLAYAIGK LSQHSANPCE SHWASVKRVM RYVQGTRNLG
IVFDNKSKSP ELLAISWSSR KQTVVATSTC EAEYIALCEA CKEATWLRQV VADVLGLDSD
PTIMMGCDNA GTISYAENES VNRRNKHVDV KYHYVRDAIK RNVISLSHCP TTSMPADILT
KALGRVLFQK FVELLGLAVA TSDKSM
//