ID U5GXU1_USTV1 Unreviewed; 1363 AA.
AC U5GXU1;
DT 11-DEC-2013, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2013, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE RecName: Full=Pentacotripeptide-repeat region of PRORP domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=MVLG_00017 {ECO:0000313|EMBL:KDE09610.1};
OS Microbotryum lychnidis-dioicae (strain p1A1 Lamole / MvSl-1064) (Anther
OS smut fungus).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC Microbotryomycetes; Microbotryales; Microbotryaceae; Microbotryum.
OX NCBI_TaxID=683840 {ECO:0000313|EMBL:KDE09610.1};
RN [1] {ECO:0000313|Proteomes:UP000017200}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=p1A1 Lamole {ECO:0000313|Proteomes:UP000017200};
RA Cuomo C., Perlin M., Young S.K., Zeng Q., Gargeya S., Alvarado L.,
RA Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Howarth C., Mehta T.,
RA Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N., Sisk P.,
RA Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C., Birren B.;
RT "The genome sequence of Microbotryum violaceum strain p1A1 Lamole.";
RL Submitted (NOV-2010) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KDE09610.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=P1A1 Lamole {ECO:0000313|EMBL:KDE09610.1};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Butler R., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA Heilman E., Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P.,
RA Mehta T., Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M.,
RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S.,
RA White J., Yandava C., Wortman J., Nusbaum C., Birren B.;
RL Submitted (NOV-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:KDE09610.1, ECO:0000313|Proteomes:UP000017200}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=p1A1 Lamole {ECO:0000313|Proteomes:UP000017200}, and P1A1
RC Lamole {ECO:0000313|EMBL:KDE09610.1};
RX PubMed=26076695; DOI=10.1186/s12864-015-1660-8;
RA Perlin M.H., Amselem J., Fontanillas E., Toh S.S., Chen Z., Goldberg J.,
RA Duplessis S., Henrissat B., Young S., Zeng Q., Aguileta G., Petit E.,
RA Badouin H., Andrews J., Razeeq D., Gabaldon T., Quesneville H., Giraud T.,
RA Hood M.E., Schultz D.J., Cuomo C.A.;
RT "Sex and parasites: genomic and transcriptomic analysis of Microbotryum
RT lychnidis-dioicae, the biotrophic and plant-castrating anther smut
RT fungus.";
RL BMC Genomics 16:461-461(2015).
RN [4] {ECO:0000313|EnsemblFungi:MVLG_00017T0}
RP IDENTIFICATION.
RG EnsemblFungi;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AEIJ01000001; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; GL541643; KDE09610.1; -; Genomic_DNA.
DR STRING; 683840.U5GXU1; -.
DR EnsemblFungi; MVLG_00017T0; MVLG_00017T0; MVLG_00017.
DR HOGENOM; CLU_256852_0_0_1; -.
DR InParanoid; U5GXU1; -.
DR OMA; ETHAGEH; -.
DR OrthoDB; 1410109at2759; -.
DR Proteomes; UP000017200; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 1.
DR PANTHER; PTHR47942:SF63; ATPASE EXPRESSION PROTEIN 3; 1.
DR PANTHER; PTHR47942; TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATED; 1.
DR Pfam; PF01535; PPR; 1.
DR Pfam; PF12854; PPR_1; 1.
DR Pfam; PF13041; PPR_2; 1.
DR Pfam; PF13812; PPR_3; 1.
DR PROSITE; PS51375; PPR; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017200}.
FT REPEAT 760..794
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 869..899
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 937..971
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 30..53
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 79..157
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 175..259
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 31..53
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..119
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 138..157
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 180..194
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 197..243
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 244..258
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1363 AA; 150670 MW; 666EEC1EDB084D0F CRC64;
MSVRPRSRLP SSHVLLEWIR PDRAAAARPA IAVSTTSSWS SPSSTRLNST QVSSAASSPT
LDFLYPSFGF FSRTLPTASP SPSPCPSPST SSSSSSSSSS SFSSSPSPLT SSSAPYSAPS
VDLITGVRIP RGTARRVPVS SAPTSNSRRP SSNLSRACPC GRTTVCMRCR ATSSSAPTHE
THAGEHKQHV RPDSSPIPVD RTSSTTSRSL VSDRSGQKDA SLTEEGSALN GSGLPPAQNG
YPKTQKQRRD KRPISHDSPI DADAISVPWI RDFLRDSIYK FGREPISLRP VPVASYSRRG
LRTRLVSRFE SVTSRLDQLQ ALTTWLVEDP ARLQELELEE RLDLIKLTSS FVRALKLPIL
EPGGPASVEE EARRIVSLRH EAGRKMEEYF HRLVMEDPRH PRSKAMLWID AIALQDRLPA
ALSFDPDVPV DRDDILSKTY LAHLQQIFAH GDERTEAHAK RRKTAEVVTP ILLEACLLRL
YDTVNPVPSN LRRQYGELLG QLGPSPPEYY SSLPSRRMRW VMGPHLVEYL STTGSTLRAL
QIKTMIDEEK ALDPLDDMTR LRMNATLLEG LLGMHYNLDA EVLAVEVLQL ARRVAEAGAM
NDRTKAVVVH AYKMVIRSAA ELGSQNRVDA LAKELGQLPQ AGIALEMVAR NVRAASRSTG
LADARVAMTE AVHVVDPTTA KDRARVVSNL VEAYISCDDL EAAIKALDDY LLAHGDRPTI
GTINSLMFGY ATRHDIDSTY AIFRRLAAGE WGSRASLNPN AGSYEALLCA HANVRDFDAV
IGIMSKMREE GFEITLSAWT TLMNLYVELG QYREAFDIFA FLENSPNPKF VPDTVTFNVM
LKAAVFTETP VVIQLQWFQQ ALQRGLRPNM VTYHTLLQSV CNAGLVDVAE EMFKVMDETL
ARNVASFSET STASTESPSS AESPTSLPVA MDDVRPDVFT FSILLNAYIK AKELAKAQAC
LQEMKSRGIE PTSVTFGIIV ASMVAGAKRA SPTIRARAKT FARNFLQLSP LDVHRKDIPK
SAKRDRVLAR GDELLHIFAP ILQAEAKTAN GHTVLDTFKL VLSSGARPSI ELYTILMDAF
RRDAIVHSAP DSSAMGVADV VTVWNGLHAS VLDAYGYPVQ TQRVVPSLPS IFSDHLRMRS
PTRRISAAHC SDLCLPITIL IDTLSSPPAL EAGHGHDLSR IWGDLCAEGF RFDAGTYNAL
VRAFIRQGEM ERAAWIIENL LLVRSDEASE DEVESRFRSA EQTFWDIWIA RGLGNPRMAG
RIASASRRHA LFDFDQLTPD MLREYMSRPS NPAPMSNITF VEAMKVARIE KLRRLWKVSN
KTWNELNEAL MSGEMREEDL ERRFPMAIDG LFRWRAGRVK PAV
//