ID S9U7G0_9TRYP Unreviewed; 1058 AA.
AC S9U7G0;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=Pentacotripeptide-repeat region of PRORP domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=STCU_07029 {ECO:0000313|EMBL:EPY24739.1}, STCU_09081
GN {ECO:0000313|EMBL:EPY20260.1};
OS Strigomonas culicis.
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Strigomonadinae; Strigomonas.
OX NCBI_TaxID=28005 {ECO:0000313|EMBL:EPY24739.1, ECO:0000313|Proteomes:UP000015354};
RN [1] {ECO:0000313|EMBL:EPY24739.1, ECO:0000313|Proteomes:UP000015354}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23560078;
RA Motta M.C., Martins A.C., de Souza S.S., Catta-Preta C.M., Silva R.,
RA Klein C.C., de Almeida L.G., de Lima Cunha O., Ciapina L.P., Brocchi M.,
RA Colabardini A.C., de Araujo Lima B., Machado C.R., de Almeida Soares C.M.,
RA Probst C.M., de Menezes C.B., Thompson C.E., Bartholomeu D.C., Gradia D.F.,
RA Pavoni D.P., Grisard E.C., Fantinatti-Garboggini F., Marchini F.K.,
RA Rodrigues-Luiz G.F., Wagner G., Goldman G.H., Fietto J.L., Elias M.C.,
RA Goldman M.H., Sagot M.F., Pereira M., Stoco P.H., de Mendonca-Neto R.P.,
RA Teixeira S.M., Maciel T.E., de Oliveira Mendes T.A., Urmenyi T.P.,
RA de Souza W., Schenkman S., de Vasconcelos A.T.;
RT "Predicting the Proteins of Angomonas deanei, Strigomonas culicis and Their
RT Respective Endosymbionts Reveals New Aspects of the Trypanosomatidae
RT Family.";
RL PLoS ONE 8:E60209-E60209(2013).
RN [2] {ECO:0000313|EMBL:EPY24739.1}
RP NUCLEOTIDE SEQUENCE.
RA Motta M.C.M., Martins A.C.A., Preta C.M.C.C., Silva R., de Souza S.S.,
RA Klein C.C., de Almeida L.G.P., Cunha O.L., Colabardini A.C., Lima B.A.,
RA Machado C.R., Soares C.M.A., de Menezes C.B.A., Bartolomeu D.C.,
RA Grisard E.C., Fantinatti-Garboggini F., Rodrigues-Luiz G.F., Wagner G.,
RA Goldman G.H., Fietto J.L.R., Ciapina L.P., Brocchi M., Elias M.C.,
RA Goldman M.H.S., Sagot M.-F., Pereira M., Stoco P.H., Teixeira S.M.R.,
RA de Mendonca-Neto R.P., Maciel T.E.F., Mendes T.A.O., Urmenyi T.P.,
RA Teixeira M.M.G., de Camargo E.F.P., de Sousa W., Schenkman S.,
RA de Vasconcelos A.T.R.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EPY24739.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATMH01009081; EPY20260.1; -; Genomic_DNA.
DR EMBL; ATMH01007029; EPY24739.1; -; Genomic_DNA.
DR AlphaFoldDB; S9U7G0; -.
DR EnsemblProtists; EPY20260; EPY20260; STCU_09081.
DR EnsemblProtists; EPY24739; EPY24739; STCU_07029.
DR OrthoDB; 169103at2759; -.
DR Proteomes; UP000015354; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 3.
DR PANTHER; PTHR47447; OS03G0856100 PROTEIN; 1.
DR PANTHER; PTHR47447:SF24; PPR_LONG DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01535; PPR; 2.
DR Pfam; PF13041; PPR_2; 1.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS51375; PPR; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000015354};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 730..760
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 766..800
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 801..835
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 937..971
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 1002..1058
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1008..1035
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1043..1058
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1058 AA; 120330 MW; A3699AE04800C2C2 CRC64;
MFRCIPSLVT KCPILGGPLG RVRLADAASQ RALVYAQRHF APSAAFFRAN SRHFRPFSSE
DEAQLTAVLS TKPRHVARHI RQSMLSSRRR ITQFRNSVTY LMILLQSQLQ RKQIKPEDAT
QIMESLMKEC VELRQSDMSH LLFRAAVRFR KYGLSVRFPL VKYLFESYRL DNAKELMKNM
ADELRNDKKL QFIAVLAYEF SGHDTAAKEL LAEIPRDLYN TEDICGLVET YGMTAHFDDI
VALLQLLFES HSSGSGESTL DKSAVYSTAV IALRGSTDLM DLVIDTTIAK EIALTESAMG
SVLRTRFLQE DIGSIERVYE IENRLRERLK VQSLGMIAET AVIAKCSEVL SRTHAGGDDL
MLQKVKHLQA VIETSIANDD SDAVDSAYMM ALIKGYGVLG RFAEMKKSFD MLRDAGLLKD
HRLYDEVLKW YAYSYNLKEV IAFKDEMTEK GIYHTPHTYF NVFKVLDKYY PRMVEKYLVE
MQSKGIHIES FMYPTLIRVF AELQNTEAVE KLYREAKQKA SAGGKLNPGV IIQMLKAFQT
NAKRCEAIIH DAEVYGLLSV ESVQAEIIEL YSINDRYSDL KQFLARVPYK SQNMYRVLLR
DASKRKDRAA FHGLLQEMRD GHVELNERLF SVIIMALSRF EDSEGVRKFI HEALESDKIH
SPLFFSDAAA AYMRLGDTAA VDQCWKDLVQ SQMVITMPVY NRFLDLYLSQ NNMTMVQEIL
DTMMKLVPPN PVTATTVVDM LGKMGRLSEM EAVLEEMSKS MNAAPTLVTY HQAMNAYAKC
GDVDKMEEMR DRLVRSGFQE NPVTYNIMFD GYGRAKRYER LAELIEERKE KNIPLEEFGY
VVLLNIYSRA KLREETEALV SDMLESGVPL SSRMLATIAS SFSAVGNIAQ MEHYVALLLA
HPECRLRDVE SVFLVYSRLR DIVKLQELLD TEKLPKSEFI YNVCVAAFAK AGEHTKVAFL
LTQMEKKGYT LSRNTSITLS SLLLKAGKLE LAQTVLKWKG MNPSDVEGHT EGAEEETHPA
AEVDEPLKEQ LQREVRSIGK AGVTAEEGEE SDEETHRM
//