ID A0A1S3ZSC6_TOBAC Unreviewed; 1035 AA.
AC A0A1S3ZSC6;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Uncharacterized protein LOC107789900 {ECO:0000313|RefSeq:XP_016467267.1};
GN Name=LOC107789900 {ECO:0000313|RefSeq:XP_016467267.1};
OS Nicotiana tabacum (Common tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016467267.1};
RN [1] {ECO:0000313|Proteomes:UP000084051}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX PubMed=24807620; DOI=10.1038/ncomms4833;
RA Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA Goepfert S., Peitsch M.C., Ivanov N.V.;
RT "The tobacco genome sequence and its comparison with those of tomato and
RT potato.";
RL Nat. Commun. 5:3833-3833(2014).
RN [2] {ECO:0000313|RefSeq:XP_016467267.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016467267.1; XM_016611781.1.
DR AlphaFoldDB; A0A1S3ZSC6; -.
DR PaxDb; 4097-A0A1S3ZSC6; -.
DR GeneID; 107789900; -.
DR KEGG; nta:107789900; -.
DR OMA; WIEVIXX; -.
DR OrthoDB; 783436at2759; -.
DR Proteomes; UP000084051; Unplaced.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR45835:SF105; IPP TRANSFERASE; 1.
DR PANTHER; PTHR45835; YALI0A06105P; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF08284; RVP_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000084051};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 395..409
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 1..82
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 288..370
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 415..462
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..77
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 289..324
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 335..370
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 419..436
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1035 AA; 116669 MW; 8DFFC265C811EA2B CRC64;
MAQREIGMDP GEGTSRSPPG QRDRFPLEAH SESPVPLVSA SPALVGAQGD AVPPASPVPL
VPEAARDTGP PAPIVPPSET GEQGMREAVQ LLTRMVSIHE RQLESGADAR RDRIGSSTVR
EFLHLAPPLF TGSSSTEDPQ DFIDHMYRVL RVMHASVTEA VELASFRLRD VAVLWYEAWE
RSRGPDAPPA EWEDFSEAFL AHYLPREVRE ARLDQFLSLK QGDMSVRDYS HKFNSLARYA
PDIVRTMRAR VHHYVDGLGD HLIRDCRVAS LSDDVDISRI QAFAQTTEDL SRRIRDTRRD
REQSKRARTM GSYREPRVDF RPPLHRYPPR SAGSFPPQMQ GQRFDRYIQS GPGQSSGQPE
GRRQERSAQM RQLTPPCTQC GKLHTGQCRQ GSSACFHCGQ TGHYISRCPG LGRGTPAQPS
GFTAASSPSV RAPRPGPQST QGRGRGRGGG DTSGSSGGQN RFYALTGRQD SEASPDVVTG
ILTIHSHAIY ALMDPGSTFS YITPFIAGKL DMRSELLPQP VEVSTPVGDS IVANHVYRDS
CYANIDCRAK LVRFHFPGEP VLEWKAFLGH VITGDGIKVD GQKIEAVMTW PRPLNPTEHG
KVIAYASRQL RKHEQNYPTH DLELAAVVFA LKIWRHYLYG VHIDVFTDHQ SLHRKSMGSL
SHVEADKVKM TKYLCQLASL QVRLVDAEGG RILVQNTAKS SFVTEVKERQ HEDPELIKLR
ESIPQQRQPL FELTGDGVLR YQGRLCVPSV GELRAKILSE AHYSRYAVHP GATKMYRDLR
QIYWWNGMKK DIAEMVAQCP NCQQVKTDGQ AERTIQTVED MLRACVLDFK GSWDDHLPLI
EFAYNNSFQA SIQMAPYEAL YGRKCRSPIG WFEVGEAELL GPNLVQQAME KVKLIRDRLR
TAQSRQKSYA DVRRRDLEFD VEDWVFLKVS PMKGVMRFGK KGKLSPRYVG PYKIIRRIGR
CIGDPSRITP IEDIHIAEDL SYAEVPVVIL DRQVRKLRTK EVASVKVLWR NNNIEEMTWE
AEEEMRKKYP HLFTT
//