GenomeNet

Database: UniProt
Entry: PP199_ARATH
LinkDB: PP199_ARATH
Original site: PP199_ARATH 
ID   PP199_ARATH             Reviewed;         822 AA.
AC   Q8RWS8; O22948;
DT   16-DEC-2008, integrated into UniProtKB/Swiss-Prot.
DT   01-JUN-2002, sequence version 1.
DT   24-JAN-2024, entry version 124.
DE   RecName: Full=Pentatricopeptide repeat-containing protein At2g41720;
DE   AltName: Full=Protein EMBRYO DEFECTIVE 2654;
GN   Name=EMB2654; OrderedLocusNames=At2g41720; ORFNames=T11A7.18;
OS   Arabidopsis thaliana (Mouse-ear cress).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX   NCBI_TaxID=3702;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Columbia;
RX   PubMed=10617197; DOI=10.1038/45471;
RA   Lin X., Kaul S., Rounsley S.D., Shea T.P., Benito M.-I., Town C.D.,
RA   Fujii C.Y., Mason T.M., Bowman C.L., Barnstead M.E., Feldblyum T.V.,
RA   Buell C.R., Ketchum K.A., Lee J.J., Ronning C.M., Koo H.L., Moffat K.S.,
RA   Cronin L.A., Shen M., Pai G., Van Aken S., Umayam L., Tallon L.J.,
RA   Gill J.E., Adams M.D., Carrera A.J., Creasy T.H., Goodman H.M.,
RA   Somerville C.R., Copenhaver G.P., Preuss D., Nierman W.C., White O.,
RA   Eisen J.A., Salzberg S.L., Fraser C.M., Venter J.C.;
RT   "Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana.";
RL   Nature 402:761-768(1999).
RN   [2]
RP   GENOME REANNOTATION.
RC   STRAIN=cv. Columbia;
RX   PubMed=27862469; DOI=10.1111/tpj.13415;
RA   Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA   Town C.D.;
RT   "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT   genome.";
RL   Plant J. 89:789-804(2017).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC   STRAIN=cv. Columbia;
RX   PubMed=14593172; DOI=10.1126/science.1088305;
RA   Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA   Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA   Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA   Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA   Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA   Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA   Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA   Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA   Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA   Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA   Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA   Ecker J.R.;
RT   "Empirical analysis of transcriptional activity in the Arabidopsis
RT   genome.";
RL   Science 302:842-846(2003).
RN   [4]
RP   GENE FAMILY.
RX   PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA   Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA   Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA   Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA   Taconnat L., Small I.;
RT   "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT   reveals their essential role in organelle biogenesis.";
RL   Plant Cell 16:2089-2103(2004).
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=1;
CC         Comment=A number of isoforms are produced. According to EST
CC         sequences.;
CC       Name=1;
CC         IsoId=Q8RWS8-1; Sequence=Displayed;
CC   -!- SIMILARITY: Belongs to the PPR family. P subfamily. {ECO:0000305}.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=AAC02776.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC   -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC       URL="https://ppr.plantenergy.uwa.edu.au";
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AC002339; AAC02776.1; ALT_SEQ; Genomic_DNA.
DR   EMBL; CP002685; AEC10025.1; -; Genomic_DNA.
DR   EMBL; AY091135; AAM14084.1; -; mRNA.
DR   EMBL; AY117242; AAM51317.1; -; mRNA.
DR   PIR; C84845; C84845.
DR   RefSeq; NP_850356.1; NM_180025.2. [Q8RWS8-1]
DR   AlphaFoldDB; Q8RWS8; -.
DR   SMR; Q8RWS8; -.
DR   BioGRID; 4108; 2.
DR   IntAct; Q8RWS8; 2.
DR   STRING; 3702.Q8RWS8; -.
DR   iPTMnet; Q8RWS8; -.
DR   PaxDb; 3702-AT2G41720-1; -.
DR   ProteomicsDB; 249160; -. [Q8RWS8-1]
DR   EnsemblPlants; AT2G41720.1; AT2G41720.1; AT2G41720. [Q8RWS8-1]
DR   GeneID; 818771; -.
DR   Gramene; AT2G41720.1; AT2G41720.1; AT2G41720. [Q8RWS8-1]
DR   KEGG; ath:AT2G41720; -.
DR   Araport; AT2G41720; -.
DR   TAIR; AT2G41720; EMB2654.
DR   eggNOG; KOG4197; Eukaryota.
DR   InParanoid; Q8RWS8; -.
DR   OrthoDB; 1203802at2759; -.
DR   PhylomeDB; Q8RWS8; -.
DR   PRO; PR:Q8RWS8; -.
DR   Proteomes; UP000006548; Chromosome 2.
DR   ExpressionAtlas; Q8RWS8; baseline and differential.
DR   Genevisible; Q8RWS8; AT.
DR   GO; GO:0009507; C:chloroplast; IEA:GOC.
DR   GO; GO:0003735; F:structural constituent of ribosome; IMP:TAIR.
DR   GO; GO:0010239; P:chloroplast mRNA processing; IMP:TAIR.
DR   GO; GO:0009793; P:embryo development ending in seed dormancy; IMP:TAIR.
DR   GO; GO:0008380; P:RNA splicing; IMP:TAIR.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 7.
DR   InterPro; IPR002885; Pentatricopeptide_rpt.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   NCBIfam; TIGR00756; PPR; 13.
DR   PANTHER; PTHR47941; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIAL; 1.
DR   PANTHER; PTHR47941:SF22; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIAL; 1.
DR   Pfam; PF01535; PPR; 1.
DR   Pfam; PF13041; PPR_2; 4.
DR   Pfam; PF13812; PPR_3; 5.
DR   SUPFAM; SSF48452; TPR-like; 1.
DR   PROSITE; PS51375; PPR; 18.
PE   2: Evidence at transcript level;
KW   Alternative splicing; Reference proteome; Repeat.
FT   CHAIN           1..822
FT                   /note="Pentatricopeptide repeat-containing protein
FT                   At2g41720"
FT                   /id="PRO_0000356058"
FT   REPEAT          106..136
FT                   /note="PPR 1"
FT   REPEAT          142..176
FT                   /note="PPR 2"
FT   REPEAT          177..211
FT                   /note="PPR 3"
FT   REPEAT          212..246
FT                   /note="PPR 4"
FT   REPEAT          247..281
FT                   /note="PPR 5"
FT   REPEAT          282..316
FT                   /note="PPR 6"
FT   REPEAT          319..353
FT                   /note="PPR 7"
FT   REPEAT          354..388
FT                   /note="PPR 8"
FT   REPEAT          389..423
FT                   /note="PPR 9"
FT   REPEAT          424..458
FT                   /note="PPR 10"
FT   REPEAT          459..493
FT                   /note="PPR 11"
FT   REPEAT          494..528
FT                   /note="PPR 12"
FT   REPEAT          529..563
FT                   /note="PPR 13"
FT   REPEAT          564..598
FT                   /note="PPR 14"
FT   REPEAT          599..633
FT                   /note="PPR 15"
FT   REPEAT          634..668
FT                   /note="PPR 16"
FT   REPEAT          669..699
FT                   /note="PPR 17"
FT   REPEAT          704..738
FT                   /note="PPR 18"
FT   REPEAT          739..773
FT                   /note="PPR 19"
FT   REGION          1..28
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   822 AA;  92414 MW;  B67C530B407AC63B CRC64;
     MATVTNFKLV TPPESSRADK PGATKASDAF QEKKSVSVNY DRGEHEVSVN IGGLRKADIP
     RRYRIRVEND RFQKDWSVSE VVDRLMALNR WEEVDGVLNS WVGRFARKNF PVLIRELSRR
     GCIELCVNVF KWMKIQKNYC ARNDIYNMMI RLHARHNWVD QARGLFFEMQ KWSCKPDAET
     YDALINAHGR AGQWRWAMNL MDDMLRAAIA PSRSTYNNLI NACGSSGNWR EALEVCKKMT
     DNGVGPDLVT HNIVLSAYKS GRQYSKALSY FELMKGAKVR PDTTTFNIII YCLSKLGQSS
     QALDLFNSMR EKRAECRPDV VTFTSIMHLY SVKGEIENCR AVFEAMVAEG LKPNIVSYNA
     LMGAYAVHGM SGTALSVLGD IKQNGIIPDV VSYTCLLNSY GRSRQPGKAK EVFLMMRKER
     RKPNVVTYNA LIDAYGSNGF LAEAVEIFRQ MEQDGIKPNV VSVCTLLAAC SRSKKKVNVD
     TVLSAAQSRG INLNTAAYNS AIGSYINAAE LEKAIALYQS MRKKKVKADS VTFTILISGS
     CRMSKYPEAI SYLKEMEDLS IPLTKEVYSS VLCAYSKQGQ VTEAESIFNQ MKMAGCEPDV
     IAYTSMLHAY NASEKWGKAC ELFLEMEANG IEPDSIACSA LMRAFNKGGQ PSNVFVLMDL
     MREKEIPFTG AVFFEIFSAC NTLQEWKRAI DLIQMMDPYL PSLSIGLTNQ MLHLFGKSGK
     VEAMMKLFYK IIASGVGINL KTYAILLEHL LAVGNWRKYI EVLEWMSGAG IQPSNQMYRD
     IISFGERSAG IEFEPLIRQK LESLRNKGEG LIPTFRHEGT LL
//
DBGET integrated database retrieval system