ID S8CYM7_9LAMI Unreviewed; 712 AA.
AC S8CYM7;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=Pentacotripeptide-repeat region of PRORP domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=M569_02663 {ECO:0000313|EMBL:EPS72095.1};
OS Genlisea aurea.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Lamiales; Lentibulariaceae; Genlisea.
OX NCBI_TaxID=192259 {ECO:0000313|EMBL:EPS72095.1, ECO:0000313|Proteomes:UP000015453};
RN [1] {ECO:0000313|EMBL:EPS72095.1, ECO:0000313|Proteomes:UP000015453}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23855885;
RA Leushkin E.V., Sutormin R.A., Nabieva E.R., Penin A.A., Kondrashov A.S.,
RA Logacheva M.D.;
RT "The miniature genome of a carnivorous plant Genlisea aurea contains a low
RT number of genes and short non-coding sequences.";
RL BMC Genomics 14:476-476(2013).
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EPS72095.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AUSU01000972; EPS72095.1; -; Genomic_DNA.
DR AlphaFoldDB; S8CYM7; -.
DR OrthoDB; 466167at2759; -.
DR Proteomes; UP000015453; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 5.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 7.
DR PANTHER; PTHR47942:SF30; OS11G0607100 PROTEIN; 1.
DR PANTHER; PTHR47942; TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATED; 1.
DR Pfam; PF01535; PPR; 5.
DR Pfam; PF12854; PPR_1; 1.
DR Pfam; PF13041; PPR_2; 1.
DR Pfam; PF13812; PPR_3; 1.
DR PROSITE; PS51375; PPR; 10.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000015453};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 134..168
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 169..203
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 204..238
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 239..273
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 274..308
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 309..343
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 345..379
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 380..414
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 423..457
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 492..526
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..22
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EPS72095.1"
FT NON_TER 712
FT /evidence="ECO:0000313|EMBL:EPS72095.1"
SQ SEQUENCE 712 AA; 79537 MW; 1C30B7EE7221CE93 CRC64;
ETSVSETPDV HSSFTTQTAE AEAFPSEGRR RRKREKLEVI ICRMMSHRCW TTRLENSIRD
LVPRFDNELL HNVLDGARKP QHALQFFRWV ERSHLIQPDR EAYLKIIETL GHASMLNHAR
CILLDMPKKG VEWDEDLWIS MIASYGRAGI VQECVKLFQK MPEFGVKRTS KSYDTFLRAI
LLRGRYMMAK RYFNKMLNEG IEPTSHTFNI LIWGFFLSRK LCVVKRFFQD MKARGITPDV
ITYSTMISGY FRVKRIEDAE KYFAEMKEMQ IKPTVITYTA LIKGYVDNDR IDDAMRMVEE
MKGVGIKPNA VTYSTLLPGL CNADHISSAA AMLKEVVGKR ITLKDHSIFM RLISSLCETS
NLDAAANVLD SMVRLGVPAE PGHYGVLIKS FFEAGRYDEG VKLLDSLIEK DIVMRPENTH
HLEPDSYNPM IEYLCSNGQT AKAEALFRQL MKLGVLDAAA FNALVVGHSR EGAPASASEI
LRITARRSIP IERASYESVI RSYLDKNDPS EAKLALDGMI ESGHLPDSSL YGSVMESLFD
DGRVQTASRV MKTMLEKNAA TADHAGLFGK ITESLFVRGH VEEALGRVEM LMRSSGMGPD
FDGLLSSLCE KGKTSAATKL LDFCVERDCT AAAVSSTERV LDALLAEGKT LNAYGVLCKV
LGKGSGGCDW EGCKDLIRSL NEEGNTKQAE ILSRMIGGGG SDKGGGRRKK KA
//