ID B2HM19_MYCMM Unreviewed; 1289 AA.
AC B2HM19;
DT 10-JUN-2008, integrated into UniProtKB/TrEMBL.
DT 10-JUN-2008, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE SubName: Full=PE-PGRS family protein {ECO:0000313|EMBL:ACC40483.1};
GN OrderedLocusNames=MMAR_2033 {ECO:0000313|EMBL:ACC40483.1};
OS Mycobacterium marinum (strain ATCC BAA-535 / M).
OC Bacteria; Actinomycetota; Actinomycetes; Mycobacteriales; Mycobacteriaceae;
OC Mycobacterium; Mycobacterium ulcerans group.
OX NCBI_TaxID=216594 {ECO:0000313|EMBL:ACC40483.1, ECO:0000313|Proteomes:UP000001190};
RN [1] {ECO:0000313|EMBL:ACC40483.1, ECO:0000313|Proteomes:UP000001190}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC BAA-535 / M {ECO:0000313|Proteomes:UP000001190};
RX PubMed=18403782; DOI=10.1101/gr.075069.107;
RA Stinear T.P., Seemann T., Harrison P.F., Jenkin G.A., Davies J.K.,
RA Johnson P.D., Abdellah Z., Arrowsmith C., Chillingworth T., Churcher C.,
RA Clarke K., Cronin A., Davis P., Goodhead I., Holroyd N., Jagels K.,
RA Lord A., Moule S., Mungall K., Norbertczak H., Quail M.A.,
RA Rabbinowitsch E., Walker D., White B., Whitehead S., Small P.L., Brosch R.,
RA Ramakrishnan L., Fischbach M.A., Parkhill J., Cole S.T.;
RT "Insights from the complete genome sequence of Mycobacterium marinum on the
RT evolution of Mycobacterium tuberculosis.";
RL Genome Res. 18:729-741(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP000854; ACC40483.1; -; Genomic_DNA.
DR RefSeq; WP_012393812.1; NC_010612.1.
DR STRING; 216594.MMAR_2033; -.
DR KEGG; mmi:MMAR_2033; -.
DR eggNOG; COG2931; Bacteria.
DR eggNOG; COG3391; Bacteria.
DR HOGENOM; CLU_006485_0_0_11; -.
DR Proteomes; UP000001190; Chromosome.
DR Gene3D; 1.10.287.850; HP0062-like domain; 1.
DR InterPro; IPR000084; PE-PGRS_N.
DR InterPro; IPR048996; PGRS_rpt.
DR Pfam; PF00934; PE; 1.
DR Pfam; PF21526; PGRS; 1.
DR PRINTS; PR01228; EGGSHELL.
DR SUPFAM; SSF140459; PE/PPE dimer-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001190}.
FT DOMAIN 5..93
FT /note="PE"
FT /evidence="ECO:0000259|Pfam:PF00934"
FT REGION 354..442
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 458..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 520..588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 623..816
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 832..977
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 999..1143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1156..1289
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 388..402
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 722..736
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 865..884
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1024..1038
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1289 AA; 103002 MW; 95138F0824E13DB2 CRC64;
MSYLAIAPEI VAAAAADMDG IGSAITAANA AAAIPTSAIA AAAADEISAA IAELFGNHAQ
QYQALSAQIA RFHDQFVRSL SGAAQTYASA EATGAAPLQP VLDLINGPTQ ALLGRPLIGN
GASGAPGTGQ AGGDGGLLIG NGGAGGSGAA GQAGGAGGAA GLIGSGGAGG IGGTGAAGGK
GGNAVLFGAG GNGGIGGTGA VGGTGAAGGA GGNGGFLFGH GGDGGTGGTG ASGVAGANGG
AGQTGLAGTD GGTGGVGGAG GKGGLLFGAG GEGGAGGTGG AGGAGGTGGD GLAATTAGGT
GGNAGDGGGG GTGGNAGDGG AGGQGGLFGN AGTTGAGGDG GNGGNGGLAG NGGAGGNGDA
SNPNGGTGGN GANPGAGGAG GAGGNGSRTG APGTTGNTPT TAAGHGGKGG DGFSPATSGQ
DGGSGGKGGD AGRFGNGGNG GNGGNGAAGD AAGSGHTGGN GGAGGGGGNA GQFGEPGTGG
SGGNGGKGGD GAAGGLAQAG GNGGEGGAGG NAGAGGLSGT GDSTGNAGTG GNGGAGGNGG
TGGDGSDGGP GGAGGKGGNS GPGGTNGTGG HGGNGGTGGD GGDGASGVGL GKAGGTGATG
GNGGNAGSGG AAGTGGTGGT NGSAGTGGTG GTGGNGGAGD KGAVGDSGFA GGTGGAGGTG
GNSGPGGTNG TGGHGGNGGT GGDGGDGASG VGQGKAGGTG ATGGNGGNAG SGGAAGTGGD
ASGGGTNGSA GTGGTGGTGG NGGDGDKGAV GDSGFAGGTG GAGGTGGNSG PGGTNGTGGH
GGNGGTGGDG GDGASGVGQG KAGGTGATGG SGGNAGSGGA AGTGGNGGTN GNAGTGGTGG
TGGNGGDGDK GAVGDSGFAG GTGGNGGTGG NSGPGGTNGT GGHGGNGGTG GDGGDGRNGS
TGAIGGNGGT GGTGGIGGTA GTGGTGGSAG SAGTGGTGGN GGAGDSGASG GTGGTGGEGG
AGIGGKDGGT GGTGGTGGTG GAGVVSGLSA FPGGIGGAGG AGGQGGQATG GGNAGDGGTG
GQGGTGGQGA TSLFEASSGG TGGAGGAGGL GGQAADGGNA GDGGTGGQGG TGGQGAAGPT
VNDTAGDGGA GGDGGVGGAG GQGGQATDGG NAGNAGDGGN GGQGGTGGQG AAAPSAAKAA
GNGGAGGVGG DGGNGADASG GGNGGDGGKS GGGGTGGTGG KSLGNLPSGD GGVGGNAGTG
GNGGNATDGG AGGKGGDGAA GGTGGNAGEF QSLLGIGKGG DGGTGGDGGN GGTGTPVGAG
GLAGTGGAGG FGVFPGGQTG QNGTDGKAG
//