ID A0A1U7W445_NICSY Unreviewed; 1326 AA.
AC A0A1U7W445;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Uncharacterized protein LOC104222439 {ECO:0000313|RefSeq:XP_009771976.1};
GN Name=LOC104222439 {ECO:0000313|RefSeq:XP_009771976.1};
OS Nicotiana sylvestris (Wood tobacco) (South American tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4096 {ECO:0000313|Proteomes:UP000189701, ECO:0000313|RefSeq:XP_009771976.1};
RN [1] {ECO:0000313|Proteomes:UP000189701}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23773524; DOI=10.1186/gb-2013-14-6-r60;
RA Sierro N., Battey J.N., Ouadi S., Bovet L., Goepfert S., Bakaher N.,
RA Peitsch M.C., Ivanov N.V.;
RT "Reference genomes and transcriptomes of Nicotiana sylvestris and Nicotiana
RT tomentosiformis.";
RL Genome Biol. 14:R60.1-R60.17(2013).
RN [2] {ECO:0000313|RefSeq:XP_009771976.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_009771976.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase C48 family.
CC {ECO:0000256|ARBA:ARBA00005234}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_009771976.1; XM_009773674.1.
DR STRING; 4096.A0A1U7W445; -.
DR GeneID; 104222439; -.
DR KEGG; nsy:104222439; -.
DR eggNOG; ENOG502RRK9; Eukaryota.
DR OrthoDB; 394414at2759; -.
DR Proteomes; UP000189701; Unplaced.
DR GO; GO:0017108; F:5'-flap endonuclease activity; IEA:InterPro.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0033567; P:DNA replication, Okazaki fragment processing; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd09898; H3TH_53EXO; 2.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 2.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR020046; 5-3_exonucl_a-hlix_arch_N.
DR InterPro; IPR002421; 5-3_exonuclease.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR020045; DNA_polI_H3TH.
DR InterPro; IPR015410; DUF1985.
DR InterPro; IPR038969; FEN.
DR InterPro; IPR008918; HhH2.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR003653; Peptidase_C48_C.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR PANTHER; PTHR42646:SF4; 5'-3' EXONUCLEASE FAMILY PROTEIN; 1.
DR PANTHER; PTHR42646; FLAP ENDONUCLEASE XNI; 1.
DR Pfam; PF01367; 5_3_exonuc; 2.
DR Pfam; PF02739; 5_3_exonuc_N; 1.
DR Pfam; PF09331; DUF1985; 1.
DR Pfam; PF02902; Peptidase_C48; 1.
DR SMART; SM00475; 53EXOc; 1.
DR SMART; SM00279; HhH2; 2.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
DR PROSITE; PS50600; ULP_PROTEASE; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000189701}.
FT DOMAIN 988..1195
FT /note="Ubiquitin-like protease family profile"
FT /evidence="ECO:0000259|PROSITE:PS50600"
FT REGION 763..835
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 891..921
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 766..789
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 805..823
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 899..915
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1326 AA; 150678 MW; 39F2C1E0C978C01D CRC64;
MEIHKATFII NPRYFLPIPN STHHAILVSP QHRPLQRRKL LKLRPHFCCC SSSHSPVVKD
PILEAGNESF GKNDYKNITR KKRVFFLDVN PLCYKGSTPS LLSFAHWISL FFSQVSLTDP
VIAVVDGERG NEYRRQLLPS YKAKRRKHWH QFPGAGKSPR SMVETSHRLV LDVLRNCNVP
VVKIESHEAD DVIATLVEQV LQRGHRVVVA SPDKDFKQLI SDDVQIVMPV PEFNRWSFYT
LKHYVAQYNC DPRSDLSLRC ILGDEVDGVP GIQHVVPGFG RKTALKLLKK HGTLENLLNA
AAVRSVGKQY AQDALIKYAD YLRRNYEVLS LKRDVRIHIE EQWLDERDAR NDLLVXAWCI
LGDEVDGVPG IQHVVPGFGR KTALKLLKKH GTLENLLNAA AVRSVGKQYA QDALIKYADY
LRRNYEVLSL KRDVRIHIEE QWLDERDARN DLLVLSNFIT LLKESRILNS QNSSHSNALM
KQHMKYLLDN KGSKMPKENK LFVKHPPRLS PHICSYTNTN IVSDLKEKLT PKQYKLLSST
CFGSLLDMDQ CEFALVTGLK CVGDPREFKF NTEVPNRIVQ TYFGGSKLVK KEVLLSCFDE
KKWGDGNDGD TIKISLLYLI HTWIFSSEKK TTTIPRLHFD LVESGRYSDY PWGTLAFSSL
IISISKKMDY HKKYYRIVGM PLAMQVWFYE CCSKVDPKIA KRLGYWVPRL LNWRTTVNQP
TYAYLMDGMF KDQGNMIVYK DIQPSDVELA VIQIPLEGVE VHTIPTNTHS DKHADDQDDS
DDFSPTPHLH LEKKYNASVG PSSSPPHKKR KEQIIDRSEA GTKHKTPYAG ISEVNQNVFH
DKKSADSKAD EVSSLRNDLN SFKDYVTGEF TSLRTLINEN FKNISGQIKA NQPTVTRDDG
IQKSHDVDVQ DNPKGHPKSD VTVGENLENI DVNPVCDEAM VDGNNTFEML CIIPPICVSK
EVHVSQFELS DHFLPSQIPE ARIVIHHVVK NPTDATPLAS HRNRHPSRSY SLPYESNFDS
AGTSVKLTPI FDKKHPFEDD LISGPHPTLV IQEYEKWVRE GLLAKHYQKY ADTDSNANVA
KEENMVFNLE EKMHWVLAVV SFKDRSIKVY DSIRSALHDS YVASEIDKLA KLVPLYLSSS
GFYKDKQGID WTHDSAYSDK APTDPFEVVF ISDLSQQNPG SMDCGMHVAA YAEFLSTFGM
VPQIKFDVNL LRQRYGALLW DYAMRNIDAD AVSESEAPSK ILRHITDSDT SVKIINPING
LSRFFFGLPG GLLHLGGNTT SSATWAEQYS SSIRNLLFST AQACGHGNSS TWNRPQLHFF
SKRHTV
//