ID G5GBM2_9BACT Unreviewed; 864 AA.
AC G5GBM2;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHG23221.1};
GN ORFNames=HMPREF9332_00973 {ECO:0000313|EMBL:EHG23221.1};
OS Alloprevotella rava F0323.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Alloprevotella.
OX NCBI_TaxID=679199 {ECO:0000313|EMBL:EHG23221.1, ECO:0000313|Proteomes:UP000015993};
RN [1] {ECO:0000313|EMBL:EHG23221.1, ECO:0000313|Proteomes:UP000015993}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F0323 {ECO:0000313|EMBL:EHG23221.1,
RC ECO:0000313|Proteomes:UP000015993};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Izard J., Blanton J.M.,
RA Baranova O.V., Tanner A.C., Dewhirst F.E., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heiman D.,
RA Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A., Murphy C.,
RA Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Prevotella sp. oral taxon 302 str. F0323.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHG23221.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACZK01000016; EHG23221.1; -; Genomic_DNA.
DR AlphaFoldDB; G5GBM2; -.
DR eggNOG; ENOG503495Q; Bacteria.
DR HOGENOM; CLU_334297_0_0_10; -.
DR Proteomes; UP000015993; Unassembled WGS sequence.
DR CDD; cd05483; retropepsin_like_bacteria; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR034122; Retropepsin-like_bacterial.
DR Pfam; PF13975; gag-asp_proteas; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000015993};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..864
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003477060"
SQ SEQUENCE 864 AA; 96536 MW; E62C05F4E2BA5F6F CRC64;
MKRFLNILFL FISIAASSQA TLPDSIANRC AMLAARGEVR QLRPLYRQHK AELPTYVRLF
CDLAIARAEG NNNRTNQCVD SLVSNFPKQV GAVGRLTLTE LKAENLLNDG LYEELQAYAD
EQVRYLKRHN FKAARIDVFR DYARKAIRYG SSGESGRILG MAERRSDFEL LKAYRSDQSL
TAFARLTARA ELARAFNLPD AACASADSLL RFYADSLNAE VQTTYLTTCI ENYISQGLWK
KLAEFCEFAT TLPQLRAVPL EIFSSIAREL RNEQPFALEQ PATPISVPTT RDWPLLLPVS
VNGGSNVFFH LNTGQRLTII TEADARQFGA RVLNVTLAVN TMFGKVNVRP AFLRELRVGS
VRFRNLLVYV FPAEEMTAEM KPEICRILGN RELMRLGEVS IYPEKIVFAP SQMDVRNSQP
NLRLNDNYRL QLQAEYGDAI CGLSLESGSP GNVLSAVVFP PEETDTLDFH LTLNGECALV
PAVELSEVKD GNSCGILGLP FLRGFSLVRF DFGGMKLTIE GKQQYNHRYL TDYTADGDLF
GLERNAPALQ AVTDDSVTSD FLRLLVDKGK NVPNSVVSLA RRLEPALFGS RSEEELLMVE
IEKVRALGGI GHYSEAAADC RKLLDNNAFS GSSRLEIQRY EQLLRAAIPF GAPQYLSTAD
SLAVPFNQAD KTVSIQAGKK SYISSIDITQ PITTLSMKAV QKLGAHVFFR DTQNTYAMLP
EIAIGTARFR NIFCKVVEDK DIKITLGFNL FRLIPQLTLT QNQLILGKRQ TASGTPLRFD
KYLCVQGETN AGYVTLRLKM SGQNSLLRLK DTPVSVAEAK IQASDAVPGD YNEQENPYQG
EISIDYLINR QGRVTFDWEK MRME
//