GenomeNet

Database: UniProt
Entry: G5GBM2_9BACT
ID   G5GBM2_9BACT            Unreviewed;       864 AA.
AC   G5GBM2;
DT   25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT   25-JAN-2012, sequence version 1.
DT   24-JAN-2024, entry version 27.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHG23221.1};
GN   ORFNames=HMPREF9332_00973 {ECO:0000313|EMBL:EHG23221.1};
OS   Alloprevotella rava F0323.
OC   Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC   Alloprevotella.
OX   NCBI_TaxID=679199 {ECO:0000313|EMBL:EHG23221.1, ECO:0000313|Proteomes:UP000015993};
RN   [1] {ECO:0000313|EMBL:EHG23221.1, ECO:0000313|Proteomes:UP000015993}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=F0323 {ECO:0000313|EMBL:EHG23221.1,
RC   ECO:0000313|Proteomes:UP000015993};
RG   The Broad Institute Genome Sequencing Platform;
RA   Earl A., Ward D., Feldgarden M., Gevers D., Izard J., Blanton J.M.,
RA   Baranova O.V., Tanner A.C., Dewhirst F.E., Young S.K., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA   Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA   Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heiman D.,
RA   Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A., Murphy C.,
RA   Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T., Shenoy N.,
RA   Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT   "The Genome Sequence of Prevotella sp. oral taxon 302 str. F0323.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EHG23221.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ACZK01000016; EHG23221.1; -; Genomic_DNA.
DR   AlphaFoldDB; G5GBM2; -.
DR   eggNOG; ENOG503495Q; Bacteria.
DR   HOGENOM; CLU_334297_0_0_10; -.
DR   Proteomes; UP000015993; Unassembled WGS sequence.
DR   CDD; cd05483; retropepsin_like_bacteria; 1.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR034122; Retropepsin-like_bacterial.
DR   Pfam; PF13975; gag-asp_proteas; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000015993};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..864
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003477060"
SQ   SEQUENCE   864 AA;  96536 MW;  E62C05F4E2BA5F6F CRC64;
     MKRFLNILFL FISIAASSQA TLPDSIANRC AMLAARGEVR QLRPLYRQHK AELPTYVRLF
     CDLAIARAEG NNNRTNQCVD SLVSNFPKQV GAVGRLTLTE LKAENLLNDG LYEELQAYAD
     EQVRYLKRHN FKAARIDVFR DYARKAIRYG SSGESGRILG MAERRSDFEL LKAYRSDQSL
     TAFARLTARA ELARAFNLPD AACASADSLL RFYADSLNAE VQTTYLTTCI ENYISQGLWK
     KLAEFCEFAT TLPQLRAVPL EIFSSIAREL RNEQPFALEQ PATPISVPTT RDWPLLLPVS
     VNGGSNVFFH LNTGQRLTII TEADARQFGA RVLNVTLAVN TMFGKVNVRP AFLRELRVGS
     VRFRNLLVYV FPAEEMTAEM KPEICRILGN RELMRLGEVS IYPEKIVFAP SQMDVRNSQP
     NLRLNDNYRL QLQAEYGDAI CGLSLESGSP GNVLSAVVFP PEETDTLDFH LTLNGECALV
     PAVELSEVKD GNSCGILGLP FLRGFSLVRF DFGGMKLTIE GKQQYNHRYL TDYTADGDLF
     GLERNAPALQ AVTDDSVTSD FLRLLVDKGK NVPNSVVSLA RRLEPALFGS RSEEELLMVE
     IEKVRALGGI GHYSEAAADC RKLLDNNAFS GSSRLEIQRY EQLLRAAIPF GAPQYLSTAD
     SLAVPFNQAD KTVSIQAGKK SYISSIDITQ PITTLSMKAV QKLGAHVFFR DTQNTYAMLP
     EIAIGTARFR NIFCKVVEDK DIKITLGFNL FRLIPQLTLT QNQLILGKRQ TASGTPLRFD
     KYLCVQGETN AGYVTLRLKM SGQNSLLRLK DTPVSVAEAK IQASDAVPGD YNEQENPYQG
     EISIDYLINR QGRVTFDWEK MRME
//