ID A0A1R3IQS4_COCAP Unreviewed; 916 AA.
AC A0A1R3IQS4;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE RecName: Full=Pentacotripeptide-repeat region of PRORP domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CCACVL1_10558 {ECO:0000313|EMBL:OMO84916.1};
OS Corchorus capsularis (Jute).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=210143 {ECO:0000313|EMBL:OMO84916.1, ECO:0000313|Proteomes:UP000188268};
RN [1] {ECO:0000313|EMBL:OMO84916.1, ECO:0000313|Proteomes:UP000188268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC TISSUE=Whole seedling {ECO:0000313|EMBL:OMO84916.1};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT "Corchorus capsularis genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO84916.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWWV01009658; OMO84916.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3IQS4; -.
DR EnsemblPlants; OMO84916; OMO84916; CCACVL1_10558.
DR Gramene; OMO84916; OMO84916; CCACVL1_10558.
DR OrthoDB; 449202at2759; -.
DR Proteomes; UP000188268; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR012881; DUF1685.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 5.
DR PANTHER; PTHR47933; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 1, MITOCHONDRIAL; 1.
DR PANTHER; PTHR47933:SF2; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 2; 1.
DR Pfam; PF07939; DUF1685; 1.
DR Pfam; PF01535; PPR; 3.
DR Pfam; PF13041; PPR_2; 2.
DR PROSITE; PS51375; PPR; 4.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000188268};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 501..535
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 638..672
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 673..707
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 708..742
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 15..55
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 139..169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 29..43
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..163
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 916 AA; 103667 MW; 5E82D746427E63A9 CRC64;
MVGISSQSES WLSITNSIDQ TNEDDTTADS ESFDLKRQDE TKIDQDADED DNNYSNSNIV
DTITKKKKKN QVLLEGYVEA VDKEDELTRT KSLTDEDLDE LKGCLDLGFG FSYEEIPELC
NTLPALELCY SMSQKYLDEH HKSPDTSPET TTEAVSSPIA NWKISSPGDH PEDVKARLKY
WAQAVACTVR LCKLSNWDRL QGVGEEAEEE KDVSVLNRAS PTSWLQTSAF AEMITQFILP
GIAEAVENEG KGKKQVKEEQ ACISLPNFSA CSWLALWRHG LLAFSSNLLQ FQTSSCFQCF
SHSCQKPKTA ANHPQQLPPV LASEDVSSPA LLSTSSSLSA QILQCKDLDD LLEDYKDKLN
SKLVLQVLMN YKHLGRVKTL EFFSWAGMQM GFQFDDCVIE YMADFLGRRK LFDDIKCFLL
TILSHRGRLS CSVFSICIRF LGRQGRVAEA LSLFQEMETT FRCKPDNVVC NNILYVLCKK
ATSGELIDLA LTIFHRIDVP DTYSYSNILV GLCKFGRLET ALQVFRKMDR AGLVPTRSAL
NVLIGQFCLL SSKEGAIEKV RVKNVYRPFT ILVPNVSSSS RKGAIEPAVF VFCKVVDLGM
LPSAFVILEL VSELCRLGKM EEAFKVVKAV EQRKMSCLEE CYSLLMQALC EHNWFEEASF
LFGRMLSLGV KPRLVVYNSI ICMLSNAGNM DDAERVFKIM NKQRCLPDTV TYTALVHAYS
KARNWEAAYS LLIEMLGLGL IPNLHTYNEV DKLLRENGKM DLCFKLESKM ETQILLKHCK
VGQLEAAYEK LNSMIRKGFH PPVYACDAFQ QAFQKNEIFE SKMGQVEPFL WFLKHQGYAF
FEQFEILLHL TKFHPVTGKH FYANGQFYLV LHMTSLYLEE IETRKLVPLM VEPSPIVGRD
LLSERAPVQL NALSTN
//