GenomeNet

Database: UniProt
Entry: A0A1S3XSF4_TOBAC
ID   A0A1S3XSF4_TOBAC        Unreviewed;       781 AA.
AC   A0A1S3XSF4;
DT   12-APR-2017, integrated into UniProtKB/TrEMBL.
DT   12-APR-2017, sequence version 1.
DT   27-MAR-2024, entry version 25.
DE   SubName: Full=Uncharacterized protein LOC107768221 {ECO:0000313|RefSeq:XP_016442809.1};
GN   Name=LOC107768221 {ECO:0000313|RefSeq:XP_016442809.1};
OS   Nicotiana tabacum (Common tobacco).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC   Nicotiana.
OX   NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016442809.1};
RN   [1] {ECO:0000313|Proteomes:UP000084051}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX   PubMed=24807620; DOI=10.1038/ncomms4833;
RA   Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA   Goepfert S., Peitsch M.C., Ivanov N.V.;
RT   "The tobacco genome sequence and its comparison with those of tomato and
RT   potato.";
RL   Nat. Commun. 5:3833-3833(2014).
RN   [2] {ECO:0000313|RefSeq:XP_016442809.1}
RP   IDENTIFICATION.
RG   RefSeq;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_016442809.1; XM_016587323.1.
DR   AlphaFoldDB; A0A1S3XSF4; -.
DR   STRING; 4097.A0A1S3XSF4; -.
DR   PaxDb; 4097-A0A1S3XSF4; -.
DR   GeneID; 107768221; -.
DR   KEGG; nta:107768221; -.
DR   OMA; SSETHMY; -.
DR   OrthoDB; 1132202at2759; -.
DR   Proteomes; UP000084051; Unplaced.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR013103; RVT_2.
DR   PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR   Pfam; PF14223; Retrotran_gag_2; 1.
DR   Pfam; PF07727; RVT_2; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000084051}.
FT   DOMAIN          381..479
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          575..605
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   781 AA;  87263 MW;  E461F170E032414F CRC64;
     MVKAWITNSV SREIATSVMC LKTAREVWKD INERFGQSNG SKYLQIQREI STTTQGSSDI
     ATYFTKLRSL WDELNSSYVG PVCSCGALPK FIEDQQLFQF LNGFNESYST DESQKESFSN
     VSNFSGDSAS FSVTPAQYNN NRSFTQKVNF EPKKNAPTVS CKFCKKSGHT VEKCYRLHGF
     PPDFKFTKNK RSASCVQSDI SYPQPSSGFS QLPGISAPVQ GFTKEQYQHL LSLFQQVQVS
     PGSAPPIHPD EDSAFAHFAV LTKFHCSLQG PSLKRPLVIG KAVGRLYYLH PDGDLFPSTT
     SSLSSFVDAT CNESIFDIGS IPCNKSVLAS SPVNVPPSYN KTSPVNKMDL LWHQRLGHMP
     FHKMQSISFL SNKVHFNSSV QTFRSDNAFE LGSSSEAISF FTSQGILHQT SIPYTPQQNG
     IVERKHKHLL EVSRALLFQS KLPLKFWGGC VLTATYLINR MSSPLLLKLS PFEKLHGHPP
     SYDHLRSFGC LCFATFPKGG RDKFQSRAIA CIFLGYPCGK KGYKLLNLSK IFVFHSRDVV
     FHEHTFPYSS SFNPSHSNVL PPTFVDIPAH PSVTTLSTSH SEPSSPIVSP SFNSTSPTST
     STTSALPILR KSTRTVTQPS YLKDYICSSV IFSNSAHSNL PSSETHMYEP QFYQQAVTHP
     AWQEDMLKEF FALERADGSV ERYKARLVIR GDTQREGIDF TETFSPVIKI TTIKCLLTLA
     IKRDWTVYQL DVNNAFFHGD LHEEVYMKIP PGLDISSASS STPLVCKLKK SLYGLRQASR
     Q
//
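
The record above follows the UniProtKB flat-file layout: each line starts with a two-letter code (ID, AC, DE, DR, FT, SQ, ...) followed by three spaces, the sequence block follows the SQ line, and `//` terminates the entry. A minimal sketch of splitting such a record in Python is given below. The function name `parse_flatfile` and the shape of its return value are illustrative assumptions, not part of any UniProt or DBGET tool, and the sketch does not handle multi-entry files.

```python
import re
from collections import defaultdict


def parse_flatfile(text):
    """Hypothetical helper: group lines by line code and collect the sequence.

    Returns (fields, sequence, declared_length), where fields maps each
    two-letter code to the list of its line contents in order.
    """
    fields = defaultdict(list)
    seq_parts = []
    in_seq = False
    for line in text.splitlines():
        if line.startswith("//"):  # end-of-entry marker
            break
        if in_seq:
            # Sequence lines are blocks of residues separated by spaces.
            seq_parts.append(line.replace(" ", ""))
            continue
        code, _, value = line.partition("   ")
        if len(code) != 2:
            # Not a flat-file line (e.g. web page header text); skip it.
            continue
        fields[code].append(value.strip())
        if code == "SQ":
            in_seq = True
    sequence = "".join(seq_parts)
    declared = None
    sq_lines = fields.get("SQ")
    if sq_lines:
        match = re.search(r"(\d+)\s+AA", sq_lines[0])
        if match:
            declared = int(match.group(1))
    return dict(fields), sequence, declared
```

Applied to this entry, the declared length parsed from the SQ line (781 AA) can be compared against `len(sequence)` as a quick check that the sequence block was not truncated during copying.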