ID A0A1R3JAH3_9ROSI Unreviewed; 593 AA.
AC A0A1R3JAH3;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE RecName: Full=Pentatricopeptide repeat-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=COLO4_18095 {ECO:0000313|EMBL:OMO91797.1};
OS Corchorus olitorius.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=93759 {ECO:0000313|EMBL:OMO91797.1, ECO:0000313|Proteomes:UP000187203};
RN [1] {ECO:0000313|Proteomes:UP000187203}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. O-4 {ECO:0000313|Proteomes:UP000187203};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M., Yahiya A.S., Khan M.S.,
RA Azam M.S., Haque T., Lashkar M.Z.H., Akhand A.I., Morshed G., Roy S.,
RA Uddin K.S., Rabeya T., Hossain A.S., Chowdhury A., Snigdha A.R.,
RA Mortoza M.S., Matin S.A., Hoque S.M.E., Islam M.K., Roy D.K., Haider R.,
RA Moosa M.M., Elias S.M., Hasan A.M., Jahan S., Shafiuddin M., Mahmood N.,
RA Shommy N.S.;
RT "Corchorus olitorius genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO91797.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWUE01016422; OMO91797.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3JAH3; -.
DR STRING; 93759.A0A1R3JAH3; -.
DR OrthoDB; 386124at2759; -.
DR Proteomes; UP000187203; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 9.
DR PANTHER; PTHR47939; MEMBRANE-ASSOCIATED SALT-INDUCIBLE PROTEIN-LIKE; 1.
DR PANTHER; PTHR47939:SF14; OS03G0201400 PROTEIN; 1.
DR Pfam; PF01535; PPR; 3.
DR Pfam; PF13041; PPR_2; 4.
DR PROSITE; PS51375; PPR; 10.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000187203};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 152..186
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 222..256
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 257..291
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 292..326
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 328..362
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 363..397
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 398..432
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 433..467
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 468..502
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 503..537
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
SQ SEQUENCE 593 AA; 66099 MW; F661975617772FB0 CRC64;
MKSGNFHQRP NFESGVPIGK FSHFQAEDFV NLIRPRNGLW LQMTLFSFTT CASRARAASK
VFIPHFHAQF HGGSHPQGNI ELNAIQKHEA WFVKVVCTLF VYSQPLDDCC LGYLSKNLTP
FIEFESVKWL NNPALGLKFF EFSRVNFDIA HSILTYNLLM RSFCHMGLLD SAKLVFDYMK
SDGHLLDSTM LGFMISSFGR AGEFGMARKL LAEVKSDEVV VSSFALNNLL DMMVKQKKME
EAVSLYKENL GSNFYPDTCT FNILIRGLCT VGNVDQASVF LNDMGSFDCV PDIITYNTII
KGLCWANEVD RGHKLLKKVQ SKSDCPPTVV TYTSVISGYC KLGKMTGASA LFSQMLSSGT
LPTVVTFNIL IDGFGKVGDM VSAKSMYEKM ASFGCAADVV TFTSLIDGYC RMGAVDQCLQ
LWNTMKRRHI SPNVYTFAIT INALCKENRL HEARGFLREL HCMNIVPKPF IYNPVIDGFC
KAGNLDEANL IVEEMELKKC HPDKVTFTIL IIGHCMKGRM FEAISIFNKM LAIGCTPDDI
TVRTLLSCLL KAGMPNEAYR IKKWSSVDMN LASSSSLENN APLQINSGVP VAA
//