GenomeNet

Database: UniProt
Entry: R4GCV5_ANOCA
LinkDB: R4GCV5_ANOCA
Original site: R4GCV5_ANOCA 
ID   R4GCV5_ANOCA            Unreviewed;      1050 AA.
AC   R4GCV5;
DT   26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 2.
DT   27-MAR-2024, entry version 45.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000023180.2};
OS   Anolis carolinensis (Green anole) (American chameleon).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC   Iguania; Dactyloidae; Anolis.
OX   NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000023180.2, ECO:0000313|Proteomes:UP000001646};
RN   [1] {ECO:0000313|Ensembl:ENSACAP00000023180.2}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000023180.2};
RG   The Genome Sequencing Platform;
RA   Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA   Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL   Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSACAP00000023180.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the beta type-B retroviral polymerase family.
CC       HERV class-II K(HML-2) pol subfamily. {ECO:0000256|ARBA:ARBA00010879}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; R4GCV5; -.
DR   STRING; 28377.ENSACAP00000023180; -.
DR   Ensembl; ENSACAT00000030600.2; ENSACAP00000023180.2; ENSACAG00000028270.2.
DR   eggNOG; KOG0017; Eukaryota.
DR   GeneTree; ENSGT01040000240511; -.
DR   HOGENOM; CLU_000384_6_12_1; -.
DR   InParanoid; R4GCV5; -.
DR   Proteomes; UP000001646; Unplaced.
DR   Bgee; ENSACAG00000028270; Expressed in embryonic post-anal tail and 10 other cell types or tissues.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041373; RT_RNaseH.
DR   PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR   PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17917; RT_RNaseH; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001646}.
FT   DOMAIN          125..304
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          668..827
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          977..999
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1015..1050
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        981..999
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1050 AA;  119921 MW;  5347733C1C08F4FD CRC64;
     MWEVQGVTGT FVWDITSLPR YDVILGMDWL AVVNPQVDWV TRKLTLKRQD CCTLNVSQSD
     MEGVPAEYGE FSDVFCKKEA DKLPPHRPYD CAIKLAEGAK LPAGRLYALT VPERQALREF
     LDENLAKGFI RPSSSPTAAP VFFVAKKTGE LRLVCDYRIL NKYTIRDRYP LPLISELLSR
     VQGAKVFTKL DLREAYNLIR IREGDEWKTA FNTCFRCHEF RVMPFGLCNA PAVFQRFMND
     VFRDLIDQFL VIYLDDILIF SKDEKEHRQH VKQVLHRLRA NGLFAKASKC VFHVPEVEFL
     GHVVSGRELK MDPHKVDAVN SWQELKTKKD VQRFLGFANY YREFIPNFAK LTVPLTQLLR
     KKQPFVWGRE AHDAFLQLKS SFQSDNILTH PDVDKPFVVE ADASSYALGA VLSQKDSSGT
     LRPCGFYSRQ LTTFEQNYTI WEKELLAIKV AFEVWRHWLE GARHQIVVRS DHKNLEHLQT
     AKKLNQRQIR WALFFSRFNF KVQFVEGKAN LRADALSRKP EFKTNEQVVC QTILPTASLC
     VVDNELGLHD QILEAQRDDV WTQEQLMLLS AGNRTILPHL QDQDGVLVRR GQVYVPVGAL
     RLEVIRAHHD EPMAGHFGRF KTVQLITRSY WWPKMRQDIL RFCDSCAVCQ QSKTPVGRPR
     GLLSSLPVPE RPWQIISMDF ISDLPKSGGY TCIWVVVDLF SKLAHFIPCS TIPAAPTLAL
     LFTKHIYRLH GAPEVIISDR APQFVSRFWK HFHECLGTKL NVSSAFHPQT DGQSERVNGL
     LEQYLRCFCL DQPTAWVKWL PVAEFAYNNA VHTSSQHTPF ELTYGFHPRG GVAPSTNVVS
     SDPVYRSSEM AALHDVARRL LLEAKATQKT QADRRRQAGE ELEEGDLVWL SSKHIKQAGG
     KFAPRYLGPF PIVKKISSVA FRLRLPSSLK VHPVFHRSLL KLDTSSRRGA IAEGITATSP
     PSGEEAFVRG DSVMIEPHGS VTGRRGRKTP RESDTLEERE RKRLRDIFAA PSDEESFEGF
     TERMEEGLVS SEEDEMELDS GEGGIGCHWQ
//
DBGET integrated database retrieval system