ID R4GCV5_ANOCA Unreviewed; 1050 AA.
AC R4GCV5;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 2.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000023180.2};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000023180.2, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000023180.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000023180.2};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000023180.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the beta type-B retroviral polymerase family.
CC HERV class-II K(HML-2) pol subfamily. {ECO:0000256|ARBA:ARBA00010879}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; R4GCV5; -.
DR STRING; 28377.ENSACAP00000023180; -.
DR Ensembl; ENSACAT00000030600.2; ENSACAP00000023180.2; ENSACAG00000028270.2.
DR eggNOG; KOG0017; Eukaryota.
DR GeneTree; ENSGT01040000240511; -.
DR HOGENOM; CLU_000384_6_12_1; -.
DR InParanoid; R4GCV5; -.
DR Proteomes; UP000001646; Unplaced.
DR Bgee; ENSACAG00000028270; Expressed in embryonic post-anal tail and 10 other cell types or tissues.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001646}.
FT DOMAIN 125..304
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 668..827
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 977..999
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1015..1050
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 981..999
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1050 AA; 119921 MW; 5347733C1C08F4FD CRC64;
MWEVQGVTGT FVWDITSLPR YDVILGMDWL AVVNPQVDWV TRKLTLKRQD CCTLNVSQSD
MEGVPAEYGE FSDVFCKKEA DKLPPHRPYD CAIKLAEGAK LPAGRLYALT VPERQALREF
LDENLAKGFI RPSSSPTAAP VFFVAKKTGE LRLVCDYRIL NKYTIRDRYP LPLISELLSR
VQGAKVFTKL DLREAYNLIR IREGDEWKTA FNTCFRCHEF RVMPFGLCNA PAVFQRFMND
VFRDLIDQFL VIYLDDILIF SKDEKEHRQH VKQVLHRLRA NGLFAKASKC VFHVPEVEFL
GHVVSGRELK MDPHKVDAVN SWQELKTKKD VQRFLGFANY YREFIPNFAK LTVPLTQLLR
KKQPFVWGRE AHDAFLQLKS SFQSDNILTH PDVDKPFVVE ADASSYALGA VLSQKDSSGT
LRPCGFYSRQ LTTFEQNYTI WEKELLAIKV AFEVWRHWLE GARHQIVVRS DHKNLEHLQT
AKKLNQRQIR WALFFSRFNF KVQFVEGKAN LRADALSRKP EFKTNEQVVC QTILPTASLC
VVDNELGLHD QILEAQRDDV WTQEQLMLLS AGNRTILPHL QDQDGVLVRR GQVYVPVGAL
RLEVIRAHHD EPMAGHFGRF KTVQLITRSY WWPKMRQDIL RFCDSCAVCQ QSKTPVGRPR
GLLSSLPVPE RPWQIISMDF ISDLPKSGGY TCIWVVVDLF SKLAHFIPCS TIPAAPTLAL
LFTKHIYRLH GAPEVIISDR APQFVSRFWK HFHECLGTKL NVSSAFHPQT DGQSERVNGL
LEQYLRCFCL DQPTAWVKWL PVAEFAYNNA VHTSSQHTPF ELTYGFHPRG GVAPSTNVVS
SDPVYRSSEM AALHDVARRL LLEAKATQKT QADRRRQAGE ELEEGDLVWL SSKHIKQAGG
KFAPRYLGPF PIVKKISSVA FRLRLPSSLK VHPVFHRSLL KLDTSSRRGA IAEGITATSP
PSGEEAFVRG DSVMIEPHGS VTGRRGRKTP RESDTLEERE RKRLRDIFAA PSDEESFEGF
TERMEEGLVS SEEDEMELDS GEGGIGCHWQ
//