ID A0A1S4D223_TOBAC Unreviewed; 586 AA.
AC A0A1S4D223;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Uncharacterized protein LOC107825007 {ECO:0000313|RefSeq:XP_016507319.1};
GN Name=LOC107825007 {ECO:0000313|RefSeq:XP_016507319.1};
OS Nicotiana tabacum (Common tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016507319.1};
RN [1] {ECO:0000313|Proteomes:UP000084051}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX PubMed=24807620; DOI=10.1038/ncomms4833;
RA Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA Goepfert S., Peitsch M.C., Ivanov N.V.;
RT "The tobacco genome sequence and its comparison with those of tomato and
RT potato.";
RL Nat. Commun. 5:3833-3833(2014).
RN [2] {ECO:0000313|RefSeq:XP_016507319.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016507319.1; XM_016651833.1.
DR AlphaFoldDB; A0A1S4D223; -.
DR PaxDb; 4097-A0A1S4D223; -.
DR GeneID; 107825007; -.
DR KEGG; nta:107825007; -.
DR OrthoDB; 5549754at2759; -.
DR Proteomes; UP000084051; Unplaced.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR PANTHER; PTHR33067:SF9; ASP_PROTEASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR33067; ASP_PROTEASE DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000084051}.
FT DOMAIN 36..129
FT /note="Retrotransposon gag"
FT /evidence="ECO:0000259|Pfam:PF03732"
FT REGION 225..295
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 341..370
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 381..430
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 225..290
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..370
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 586 AA; 67001 MW; 43DF607DA3021AC9 CRC64;
MAEYEEKENP FADIDEYIDD TNDINNVSDD ALRLRVFKYS LAGEARKWIQ NLPPHSIHSW
PKLVRAFLAK WFPQSKKSKL RDKILFFKKL PGEHLHGAWD RFKLYLVRSP NHGFPDTILL
EKFYMGLDPL NQSIAKNVAD GSFMDKTFTR VTQILDKMAE HNQAWHSEDT TGEITYGTPS
LTNMIKENQE RDQVNVVEDV QPLSNEDCEE ANYVHNSQGD YQRQSYQGYG QQKEWRPNSQ
GQEHQQWQND QGGSSQGNWS NNNNNYANRS SNPYVPPKGQ YSNSSHWKEG SSSESKLENM
LERVLQNQER NDTSMKSMAE LVGSHTVSIQ KLEMQMRDLS REKNPKQKCA LPSDTNANPK
SKWSGPTSHC MTITTRSGKV LQRENEQMVE VDDLEQEVEA QVELPIVDEV EKLPKEVKVQ
EANREEVKEK AFQEMPGFAK YLKYLITKKK TTKNEVVYVT HRVSSIIATT TVQKKEDLGA
FTIPCTIGLR DFAKALCDNG ARINLMTLAI YKQAGLGMPR PTSMRLQMAD RSIKRPVGIV
DDVLVKVGKF LLPTDFLILD CAIDKEIPII LGRPFLATVR ALMDSE
//