ID A0A2U1M1H8_ARTAN Unreviewed; 1462 AA.
AC A0A2U1M1H8;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0000259|PROSITE:PS50994};
GN ORFNames=CTI12_AA430010 {ECO:0000313|EMBL:PWA55115.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA55115.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA55115.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA55115.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA55115.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01006861; PWA55115.1; -; Genomic_DNA.
DR STRING; 35608.A0A2U1M1H8; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR029472; Copia-like_N.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF14244; Retrotran_gag_3; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 572..749
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 280..313
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 870..902
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 285..313
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1462 AA; 164292 MW; 104046681D410A2C CRC64;
MDPSNPLYVH PSDGPGSLPI QEKLIGAQNY RSWRRAMEIG LSTKRKLGFV RGTIPRPPIV
PVPPTTPAEN AVRTELWETC NNLVTSWIMS SVSDSIAKSI MFIESASEIW IQLETRFSLS
NGSRKYKLSK ECFEIQQQGS TVSEYYTRMK CVWEELDSML MLPRLVTITP EISNFLSAVE
KQKEEQRLFQ FLNGLDDCYG AQRSQLLLLN PLPSVENACA VIQQEESQKD VFKGGVTIVE
STALFSKQET KGKCSICGYK WHPPDKCWEK VGYPVWHHKH KQSQGQNKFS QSQYKPKSSN
NGGANNGGGS TFKRTAANVT SGASSFTFTS EQFETLMRGV LNDMKNSGTS GADCTDDELE
FVAGMLCLSA ATNHALYYWI LDTGATDHMT PHSKSMLSTK ILKILPKINL PNGQSSEITQ
IGQVKLNNGI VLKDVLCVPS FQFSLLSVPK LTKDNNCVAI FFPNFCVLQD LATRKVLGLG
KKVAGLYHLL NVPMDSVDAK LRNLVDCHVN KGLFSCSAGV YSKSVCPNMF SLWHHRLGHL
SVSRMKNLQC NELSSVDESN DTTCLTCPMA KLTKLPFSRS ESHCTTAFHM LHMDTWGPYK
VPTNGKYRYF LTIVDDYSRA TWIYLMVHKS EAVEVLKFFL KFVDRQFGAK VKCVRSDNAL
EFVKGPCSVF LANQGIEHQT TCVDRPQQNG RAERKHRHIL EVARALRFQA SLPLRFWGDC
VTTATYLINR FPTPLLKNKT PYEMLLKKTP DYSQLRVFGC FAVATNPSRV PDKFAPKGVP
CVFLGYPAHQ KGYKLYNLIT HTCFVSRDVQ FHEHIFPFSD SSATKFFQPN PVPMPKPSTV
YDDYPTIPTP VTNLHTDDPH LAQHEIPVQS VTQTEPAVNN NEPTSTNQTR KSTRQSKPPT
WTKDFIVPTI KPVANQVTSP VLSSQFHCML SVLETQTDPK SFKEAVVKPE WCNAMNGELR
ALEDNGTWEE TELPPGKKAI GCHWLFKTKY KSDGTVERKK ARLVVQGNRQ KKGEDYEETF
APVAKMVTVR SLLAVAAMQG WDIVQMDVSN AFLHGDLIEE VYMQLPLGYV GKGEPVQNVK
SNSNKVCRLK KSLYGLKQAP RQWFAKLSSA LLSFGFQQSK ADYSLFTKKE GTSFTAVLVY
VDDLMITGSD SSQIQMLKDQ LSSTFHMKDL GDLHYFLGLE VTKAESGLFV SQKKYTLELL
QEAGVMSSKP YKLPMDPNLK LQADVGSPLQ DPEVYRRYIG KLIYLTITRP DICYTVQLLS
QFMQNPTSVH MQAVKHLLRY LLNAPGQGIL LAHHSKAHLT AYCDSDWASC PMTRRSTTGY
CILLGESPIS WKSKKQGVVS RSSAEAEYRA MAITCCEVTW LLSLLKDLGI KDLHPITLHC
DNQAAIHIAA NPVFHDRTKH IEVDCHYVRD QVKDGIIKPE YIHTSQQLAD VFTKILTVDQ
HHTLLHKLGV SFSENSQLEG EC
//