GenomeNet

Database: UniProt
Entry: A0A2U1M1H8_ARTAN
LinkDB: A0A2U1M1H8_ARTAN
Original site: A0A2U1M1H8_ARTAN 
ID   A0A2U1M1H8_ARTAN        Unreviewed;      1462 AA.
AC   A0A2U1M1H8;
DT   18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT   18-JUL-2018, sequence version 1.
DT   27-MAR-2024, entry version 15.
DE   RecName: Full=Integrase catalytic domain-containing protein {ECO:0000259|PROSITE:PS50994};
GN   ORFNames=CTI12_AA430010 {ECO:0000313|EMBL:PWA55115.1};
OS   Artemisia annua (Sweet wormwood).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC   Artemisiinae; Artemisia.
OX   NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA55115.1, ECO:0000313|Proteomes:UP000245207};
RN   [1] {ECO:0000313|EMBL:PWA55115.1, ECO:0000313|Proteomes:UP000245207}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC   TISSUE=Leaf {ECO:0000313|EMBL:PWA55115.1};
RX   PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA   Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA   Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA   Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT   "The genome of Artemisia annua provides insight into the evolution of
RT   Asteraceae family and artemisinin biosynthesis.";
RL   Mol. Plant 11:776-788(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PWA55115.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; PKPP01006861; PWA55115.1; -; Genomic_DNA.
DR   STRING; 35608.A0A2U1M1H8; -.
DR   Proteomes; UP000245207; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR029472; Copia-like_N.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR025724; GAG-pre-integrase_dom.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR013103; RVT_2.
DR   PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR   Pfam; PF13976; gag_pre-integrs; 1.
DR   Pfam; PF14223; Retrotran_gag_2; 1.
DR   Pfam; PF14244; Retrotran_gag_3; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF07727; RVT_2; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT   DOMAIN          572..749
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          280..313
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          870..902
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        285..313
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1462 AA;  164292 MW;  104046681D410A2C CRC64;
     MDPSNPLYVH PSDGPGSLPI QEKLIGAQNY RSWRRAMEIG LSTKRKLGFV RGTIPRPPIV
     PVPPTTPAEN AVRTELWETC NNLVTSWIMS SVSDSIAKSI MFIESASEIW IQLETRFSLS
     NGSRKYKLSK ECFEIQQQGS TVSEYYTRMK CVWEELDSML MLPRLVTITP EISNFLSAVE
     KQKEEQRLFQ FLNGLDDCYG AQRSQLLLLN PLPSVENACA VIQQEESQKD VFKGGVTIVE
     STALFSKQET KGKCSICGYK WHPPDKCWEK VGYPVWHHKH KQSQGQNKFS QSQYKPKSSN
     NGGANNGGGS TFKRTAANVT SGASSFTFTS EQFETLMRGV LNDMKNSGTS GADCTDDELE
     FVAGMLCLSA ATNHALYYWI LDTGATDHMT PHSKSMLSTK ILKILPKINL PNGQSSEITQ
     IGQVKLNNGI VLKDVLCVPS FQFSLLSVPK LTKDNNCVAI FFPNFCVLQD LATRKVLGLG
     KKVAGLYHLL NVPMDSVDAK LRNLVDCHVN KGLFSCSAGV YSKSVCPNMF SLWHHRLGHL
     SVSRMKNLQC NELSSVDESN DTTCLTCPMA KLTKLPFSRS ESHCTTAFHM LHMDTWGPYK
     VPTNGKYRYF LTIVDDYSRA TWIYLMVHKS EAVEVLKFFL KFVDRQFGAK VKCVRSDNAL
     EFVKGPCSVF LANQGIEHQT TCVDRPQQNG RAERKHRHIL EVARALRFQA SLPLRFWGDC
     VTTATYLINR FPTPLLKNKT PYEMLLKKTP DYSQLRVFGC FAVATNPSRV PDKFAPKGVP
     CVFLGYPAHQ KGYKLYNLIT HTCFVSRDVQ FHEHIFPFSD SSATKFFQPN PVPMPKPSTV
     YDDYPTIPTP VTNLHTDDPH LAQHEIPVQS VTQTEPAVNN NEPTSTNQTR KSTRQSKPPT
     WTKDFIVPTI KPVANQVTSP VLSSQFHCML SVLETQTDPK SFKEAVVKPE WCNAMNGELR
     ALEDNGTWEE TELPPGKKAI GCHWLFKTKY KSDGTVERKK ARLVVQGNRQ KKGEDYEETF
     APVAKMVTVR SLLAVAAMQG WDIVQMDVSN AFLHGDLIEE VYMQLPLGYV GKGEPVQNVK
     SNSNKVCRLK KSLYGLKQAP RQWFAKLSSA LLSFGFQQSK ADYSLFTKKE GTSFTAVLVY
     VDDLMITGSD SSQIQMLKDQ LSSTFHMKDL GDLHYFLGLE VTKAESGLFV SQKKYTLELL
     QEAGVMSSKP YKLPMDPNLK LQADVGSPLQ DPEVYRRYIG KLIYLTITRP DICYTVQLLS
     QFMQNPTSVH MQAVKHLLRY LLNAPGQGIL LAHHSKAHLT AYCDSDWASC PMTRRSTTGY
     CILLGESPIS WKSKKQGVVS RSSAEAEYRA MAITCCEVTW LLSLLKDLGI KDLHPITLHC
     DNQAAIHIAA NPVFHDRTKH IEVDCHYVRD QVKDGIIKPE YIHTSQQLAD VFTKILTVDQ
     HHTLLHKLGV SFSENSQLEG EC
//
DBGET integrated database retrieval system