GenomeNet

Database: UniProt
Entry: A0A1Y3W1B1_9BACT
ID   A0A1Y3W1B1_9BACT        Unreviewed;       807 AA.
AC   A0A1Y3W1B1;
DT   30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT   30-AUG-2017, sequence version 1.
DT   24-JAN-2024, entry version 19.
DE   RecName: Full=DUF5117 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=B5G13_01890 {ECO:0000313|EMBL:OUN67011.1};
OS   Butyricimonas sp. An62.
OC   Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Odoribacteraceae;
OC   Butyricimonas.
OX   NCBI_TaxID=1965649 {ECO:0000313|EMBL:OUN67011.1, ECO:0000313|Proteomes:UP000196559};
RN   [1] {ECO:0000313|Proteomes:UP000196559}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=An62 {ECO:0000313|Proteomes:UP000196559};
RA   Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA   Rychlik I.;
RT   "Function of individual gut microbiota members based on whole genome
RT   sequencing of pure cultures obtained from chicken caecum.";
RL   Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OUN67011.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NFHW01000001; OUN67011.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1Y3W1B1; -.
DR   OrthoDB; 9776599at2; -.
DR   Proteomes; UP000196559; Unassembled WGS sequence.
DR   InterPro; IPR033413; DUF5117.
DR   InterPro; IPR033428; DUF5118.
DR   InterPro; IPR032534; EcxA_zinc-bd.
DR   PANTHER; PTHR38478; PEPTIDASE M1A AND M12B; 1.
DR   PANTHER; PTHR38478:SF1; ZINC DEPENDENT METALLOPROTEASE DOMAIN LIPOPROTEIN; 1.
DR   Pfam; PF16313; DUF4953; 1.
DR   Pfam; PF17148; DUF5117; 1.
DR   Pfam; PF17162; DUF5118; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000196559};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..807
FT                   /note="DUF5117 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012892699"
FT   DOMAIN          41..88
FT                   /note="DUF5118"
FT                   /evidence="ECO:0000259|Pfam:PF17162"
FT   DOMAIN          100..278
FT                   /note="DUF5117"
FT                   /evidence="ECO:0000259|Pfam:PF17148"
FT   DOMAIN          413..712
FT                   /note="EcxA zinc-binding"
FT                   /evidence="ECO:0000259|Pfam:PF16313"
SQ   SEQUENCE   807 AA;  90359 MW;  CE20D98FF1556DBF CRC64;
     MINMRNMNRL IVIFACLLAL APINLSAQEK TQPAAVKKAP MAYEAFFKKD MQKFNGTFPI
     YRNGEKYYLE IPASTLGRDL LVSGSIVQGG NHGMVSSVTN LLIFHLGRNN TLEVHQQICS
     ERAKGDLAKA IEAANLKPVA TSFPIVAFGQ NKGGYIIDIT SDVNSSGKLF SFPNMKSVNT
     PAADRSGVDS VYVINNGVKF TSLHAQTDLI PGFMHIPPRD QHTTALIEWS LQLLPERHIT
     ARESDPRVGY ATISYADYDR NPYGVKSVRE IQRWHLAIKP EDTERYRRGE LVEPANPIRV
     YLDRTLSSES ERRAVMQAVA EWNRCFEAAG FKNALQVQNG QPEVTVAYHQ IVYSYAMGKS
     QFSQISDSRT GEIFSGNIVL SHKESEDNLP GIQLSIGGYE PAVLTDSMPI VREEYIRYQA
     SKLTGQLLGL QPNWAGSAAF TTKQLRDAAW ARENGISASV TDGCVVNFAA QPGDGIALRD
     LFSKASIYDR WAIEWGYRQY PGMDASAEKK ALNNLAAQAK DNTALYFATK EQADYRVNET
     DLGQNVVETA TLGLKNMERL SSQLSDIFLQ RIAKDDTPWT DYIRILSSFN GLYISYFDMP
     LNYIGGISVE PVLAGYNEQA ISYLPKQKGE EVMAFLNRQV FQGAPAWRTT PVEVDIIGNS
     GETKGTGIFM STFRSLMNPS RLHQLVVAQD KASGQAYTIN DLFKALESYV FLNYSASRPL
     SRYQVQMQYN FVREFVSTFS KLKAKEGSDD LSYFLVNQGQ RMKEKLDYLG KHHQHTYSRT
     YYRGLSVYLT RAMKSGKLSG MFEDAKK
//
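
The feature table (FT lines) in the entry above follows a simple line-oriented layout: a feature key with a `start..end` range, then indented `/qualifier="value"` lines. The following is a minimal sketch of how those lines could be read. The names `parse_features`, `RECORD_EXCERPT`, `FEATURE_START`, and `QUALIFIER` are hypothetical and not part of any UniProt tool. It assumes plain integer ranges and single-line qualifiers, as in this entry, and does not handle fuzzy positions or wrapped qualifier values.

```python
import re

# Excerpt of the DOMAIN features copied from the entry above.
RECORD_EXCERPT = """\
FT   DOMAIN          41..88
FT                   /note="DUF5118"
FT                   /evidence="ECO:0000259|Pfam:PF17162"
FT   DOMAIN          100..278
FT                   /note="DUF5117"
FT                   /evidence="ECO:0000259|Pfam:PF17148"
FT   DOMAIN          413..712
FT                   /note="EcxA zinc-binding"
FT                   /evidence="ECO:0000259|Pfam:PF16313"
"""

# A feature line: "FT", three spaces, the feature key, then "start..end".
FEATURE_START = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# A qualifier line: "FT", indentation, then /name="value".
QUALIFIER = re.compile(r'^FT\s+/(\w+)="(.*)"\s*$')


def parse_features(text):
    """Return a list of dicts, one per feature, in file order."""
    features = []
    for line in text.splitlines():
        start = FEATURE_START.match(line)
        if start:
            features.append({
                "type": start.group(1),
                "start": int(start.group(2)),
                "end": int(start.group(3)),
            })
            continue
        qualifier = QUALIFIER.match(line)
        if qualifier and features:
            # Attach the qualifier to the most recently seen feature.
            features[-1][qualifier.group(1)] = qualifier.group(2)
    return features


features = parse_features(RECORD_EXCERPT)
for feature in features:
    # Positions are 1-based and inclusive, so length = end - start + 1.
    length = feature["end"] - feature["start"] + 1
    print(feature["type"], feature["note"], feature["start"], feature["end"], length)
```

For real work, an established flat-file parser is likely a better choice than hand-written regular expressions; this sketch only illustrates the layout of the FT lines.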