GenomeNet

Database: UniProt
Entry: A0A1U7W445_NICSY
LinkDB: A0A1U7W445_NICSY
Original site: A0A1U7W445_NICSY 
ID   A0A1U7W445_NICSY        Unreviewed;      1326 AA.
AC   A0A1U7W445;
DT   10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT   10-MAY-2017, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   SubName: Full=Uncharacterized protein LOC104222439 {ECO:0000313|RefSeq:XP_009771976.1};
GN   Name=LOC104222439 {ECO:0000313|RefSeq:XP_009771976.1};
OS   Nicotiana sylvestris (Wood tobacco) (South American tobacco).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC   Nicotiana.
OX   NCBI_TaxID=4096 {ECO:0000313|Proteomes:UP000189701, ECO:0000313|RefSeq:XP_009771976.1};
RN   [1] {ECO:0000313|Proteomes:UP000189701}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=23773524; DOI=10.1186/gb-2013-14-6-r60;
RA   Sierro N., Battey J.N., Ouadi S., Bovet L., Goepfert S., Bakaher N.,
RA   Peitsch M.C., Ivanov N.V.;
RT   "Reference genomes and transcriptomes of Nicotiana sylvestris and Nicotiana
RT   tomentosiformis.";
RL   Genome Biol. 14:R60.1-R60.17(2013).
RN   [2] {ECO:0000313|RefSeq:XP_009771976.1}
RP   IDENTIFICATION.
RC   TISSUE=Leaf {ECO:0000313|RefSeq:XP_009771976.1};
RG   RefSeq;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase C48 family.
CC       {ECO:0000256|ARBA:ARBA00005234}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_009771976.1; XM_009773674.1.
DR   STRING; 4096.A0A1U7W445; -.
DR   GeneID; 104222439; -.
DR   KEGG; nsy:104222439; -.
DR   eggNOG; ENOG502RRK9; Eukaryota.
DR   OrthoDB; 394414at2759; -.
DR   Proteomes; UP000189701; Unplaced.
DR   GO; GO:0017108; F:5'-flap endonuclease activity; IEA:InterPro.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:UniProtKB-EC.
DR   GO; GO:0033567; P:DNA replication, Okazaki fragment processing; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd09898; H3TH_53EXO; 2.
DR   Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 2.
DR   Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR   InterPro; IPR020046; 5-3_exonucl_a-hlix_arch_N.
DR   InterPro; IPR002421; 5-3_exonuclease.
DR   InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR   InterPro; IPR020045; DNA_polI_H3TH.
DR   InterPro; IPR015410; DUF1985.
DR   InterPro; IPR038969; FEN.
DR   InterPro; IPR008918; HhH2.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR003653; Peptidase_C48_C.
DR   InterPro; IPR029060; PIN-like_dom_sf.
DR   PANTHER; PTHR42646:SF4; 5'-3' EXONUCLEASE FAMILY PROTEIN; 1.
DR   PANTHER; PTHR42646; FLAP ENDONUCLEASE XNI; 1.
DR   Pfam; PF01367; 5_3_exonuc; 2.
DR   Pfam; PF02739; 5_3_exonuc_N; 1.
DR   Pfam; PF09331; DUF1985; 1.
DR   Pfam; PF02902; Peptidase_C48; 1.
DR   SMART; SM00475; 53EXOc; 1.
DR   SMART; SM00279; HhH2; 2.
DR   SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 2.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   SUPFAM; SSF88723; PIN domain-like; 1.
DR   PROSITE; PS50600; ULP_PROTEASE; 1.
PE   3: Inferred from homology;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Reference proteome {ECO:0000313|Proteomes:UP000189701}.
FT   DOMAIN          988..1195
FT                   /note="Ubiquitin-like protease family profile"
FT                   /evidence="ECO:0000259|PROSITE:PS50600"
FT   REGION          763..835
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          891..921
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        766..789
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        805..823
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        899..915
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1326 AA;  150678 MW;  39F2C1E0C978C01D CRC64;
     MEIHKATFII NPRYFLPIPN STHHAILVSP QHRPLQRRKL LKLRPHFCCC SSSHSPVVKD
     PILEAGNESF GKNDYKNITR KKRVFFLDVN PLCYKGSTPS LLSFAHWISL FFSQVSLTDP
     VIAVVDGERG NEYRRQLLPS YKAKRRKHWH QFPGAGKSPR SMVETSHRLV LDVLRNCNVP
     VVKIESHEAD DVIATLVEQV LQRGHRVVVA SPDKDFKQLI SDDVQIVMPV PEFNRWSFYT
     LKHYVAQYNC DPRSDLSLRC ILGDEVDGVP GIQHVVPGFG RKTALKLLKK HGTLENLLNA
     AAVRSVGKQY AQDALIKYAD YLRRNYEVLS LKRDVRIHIE EQWLDERDAR NDLLVXAWCI
     LGDEVDGVPG IQHVVPGFGR KTALKLLKKH GTLENLLNAA AVRSVGKQYA QDALIKYADY
     LRRNYEVLSL KRDVRIHIEE QWLDERDARN DLLVLSNFIT LLKESRILNS QNSSHSNALM
     KQHMKYLLDN KGSKMPKENK LFVKHPPRLS PHICSYTNTN IVSDLKEKLT PKQYKLLSST
     CFGSLLDMDQ CEFALVTGLK CVGDPREFKF NTEVPNRIVQ TYFGGSKLVK KEVLLSCFDE
     KKWGDGNDGD TIKISLLYLI HTWIFSSEKK TTTIPRLHFD LVESGRYSDY PWGTLAFSSL
     IISISKKMDY HKKYYRIVGM PLAMQVWFYE CCSKVDPKIA KRLGYWVPRL LNWRTTVNQP
     TYAYLMDGMF KDQGNMIVYK DIQPSDVELA VIQIPLEGVE VHTIPTNTHS DKHADDQDDS
     DDFSPTPHLH LEKKYNASVG PSSSPPHKKR KEQIIDRSEA GTKHKTPYAG ISEVNQNVFH
     DKKSADSKAD EVSSLRNDLN SFKDYVTGEF TSLRTLINEN FKNISGQIKA NQPTVTRDDG
     IQKSHDVDVQ DNPKGHPKSD VTVGENLENI DVNPVCDEAM VDGNNTFEML CIIPPICVSK
     EVHVSQFELS DHFLPSQIPE ARIVIHHVVK NPTDATPLAS HRNRHPSRSY SLPYESNFDS
     AGTSVKLTPI FDKKHPFEDD LISGPHPTLV IQEYEKWVRE GLLAKHYQKY ADTDSNANVA
     KEENMVFNLE EKMHWVLAVV SFKDRSIKVY DSIRSALHDS YVASEIDKLA KLVPLYLSSS
     GFYKDKQGID WTHDSAYSDK APTDPFEVVF ISDLSQQNPG SMDCGMHVAA YAEFLSTFGM
     VPQIKFDVNL LRQRYGALLW DYAMRNIDAD AVSESEAPSK ILRHITDSDT SVKIINPING
     LSRFFFGLPG GLLHLGGNTT SSATWAEQYS SSIRNLLFST AQACGHGNSS TWNRPQLHFF
     SKRHTV
//
DBGET integrated database retrieval system