ID A0A3P8QIQ8_ASTCA Unreviewed; 1470 AA.
AC A0A3P8QIQ8;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACLP00000029318.1};
OS Astatotilapia calliptera (Eastern happy) (Chromis callipterus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Astatotilapia.
OX NCBI_TaxID=8154 {ECO:0000313|Ensembl:ENSACLP00000029318.1, ECO:0000313|Proteomes:UP000265100};
RN [1] {ECO:0000313|Ensembl:ENSACLP00000029318.1, ECO:0000313|Proteomes:UP000265100}
RP NUCLEOTIDE SEQUENCE.
RA Datahose.;
RL Submitted (MAY-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACLP00000029318.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the beta type-B retroviral polymerase family.
CC HERV class-II K(HML-2) pol subfamily. {ECO:0000256|ARBA:ARBA00010879}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8154.ENSACLP00000029318; -.
DR Ensembl; ENSACLT00000030006.1; ENSACLP00000029318.1; ENSACLG00000019876.1.
DR GeneTree; ENSGT01050000244855; -.
DR OMA; CADEPRE; -.
DR Proteomes; UP000265100; Chromosome 1.
DR Bgee; ENSACLG00000019876; Expressed in spleen and 1 other cell type or tissue.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 1.10.4020.10; DNA breaking-rejoining enzymes; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR003309; SCAN_dom.
DR InterPro; IPR038269; SCAN_sf.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF02023; SCAN; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00431; SCAN; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF47353; Retrovirus capsid dimerization domain-like; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50804; SCAN_BOX; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Signal {ECO:0000256|SAM:SignalP};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1470
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018328660"
FT DOMAIN 179..257
FT /note="SCAN box"
FT /evidence="ECO:0000259|PROSITE:PS50804"
FT DOMAIN 333..347
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 651..809
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1032..1210
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 261..332
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 488..519
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1470 AA; 162788 MW; FE6A8B1C358302A6 CRC64;
MWKSAVHTCV CVCLCLVYST SGAETQESAG GGMSDPQAAP PAQPLQFLAQ MLAEMATMSR
DQAADQRAHL AALQEQTDRQ TQILERLVGA AATPKPSPLS VAVPRMGDGD DPQIFLETFR
ATAEACQWPR EEWAPRLLPL LSGEAQTAAL SLPPASRSSF PDVSRAVVDR LGLTAEDHRR
RFRACRLATT DRPFAWARQL RDAAVRWLQP GASEGETKLV DKVVLEQFTE GLPAETARWV
RCHRPASLEV AVTLAEDHLA AGAGEPGDAG RRPNKQAPVP APRRRVPAPG QAERLPALSP
TNPFAALVPS RAAEPSGAAP EPRRAAQTPG PECWKCGQPG HLRRDCPLME VGQVFRVAGA
PAPSPGPGGT YSIPVRTRGG IRQALVDTGC TQTLVHQSLV RPGALLEAEW VEVRCVHGDV
HRYPIVPLLL KYKGKMHRVT AAVSPRLSHP LILGTNWPGF HQLLGQYAGV RSRPEAGCSV
CAAFSGDAGS SDTDSGGEEP AGPSRDVPPA PEVSPMGDFP LEQSRDGTLR SAFDQVMAID
GHVVRPEAAQ TYPRFVLLND RLYRVSRDTQ TEETHTQLLV PQGRRETLFQ AAHYNPMAGH
MGYEKTLERI TARFYWPGIR ADVRRWCASC PDCQLVNQPA IPKAPLRPLP LIEVPFERIG
MDLIGPFHRS ARGYRFVLVL VDYATRYPEA VPLRTISAKS VAQALFQVIS RVGIPKEILT
DQGTSFMSRT LRELYELLGI KSIRTSVYHP QTDGLVERLN KTLKSMIRKF INEDERNWDH
WLDPLLFAVR EVPQASTGFS PFELLFGRRP RGVLDLIKES WEDGPSPAKN EIQYVLDLRA
KLHTLGRLSR ENLLQAQQRQ QRLYDRGARL RQFSPGDKVL VLLPTSSSKL LAKWQGPFVV
TRRVGDVDYE VARSDRGGAT QIYHLNLLKQ WREAETASLV SLVKERDELG PEVPNSIIPA
SLPCDDHLTQ AQRADVVALQ QRFADVFSPL PGRTSLIEHH FVTQPGVTVR SRPYRLPEHK
RKIVQRELAE MLRMGVIEES HSAWCSPIVL VAKKDGSIRF CVDYRRVNEV SRFDAYPMPR
VDELLDRLGT ARFFTTLDLT KGYWQIPLSA EAREKTAFST PYGLYQFVTL PFGLFGAPAT
FQRLMDRVLR PHAAYAAAYL DDVIIHSDTW AEHVLRVAAV LESLRGAGLT ANPKKCAVGR
REVQYLGYHL GGGQVRPQVE KTAAIASCPR PRTKKEVRRF LGLAGYYRRF VPGFAQLTSP
LTDLTRKGAP DLVQWTGPCQ AAFVQVKKAL CGEPLLHTPN FSLPFVLQTD ASDRGLGAVL
SQQVRGADRP VLYISRKLAE RERRYSTVEK ECLAIRWAVG SLRYYLLGRS FTLCSDHAPL
QWLHRMKDAN ARITRWYLAL QPFKFKVIHR PGAQMAVADF LSRSRGGVGS AGRLPGLERA
VGVCGGGVAP GAAAGEECSR RLNRAALRGR
//