ID A0A1Y3W1B1_9BACT Unreviewed; 807 AA.
AC A0A1Y3W1B1;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE RecName: Full=DUF5117 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=B5G13_01890 {ECO:0000313|EMBL:OUN67011.1};
OS Butyricimonas sp. An62.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Odoribacteraceae;
OC Butyricimonas.
OX NCBI_TaxID=1965649 {ECO:0000313|EMBL:OUN67011.1, ECO:0000313|Proteomes:UP000196559};
RN [1] {ECO:0000313|Proteomes:UP000196559}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=An62 {ECO:0000313|Proteomes:UP000196559};
RA Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA Rychlik I.;
RT "Function of individual gut microbiota members based on whole genome
RT sequencing of pure cultures obtained from chicken caecum.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OUN67011.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NFHW01000001; OUN67011.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y3W1B1; -.
DR OrthoDB; 9776599at2; -.
DR Proteomes; UP000196559; Unassembled WGS sequence.
DR InterPro; IPR033413; DUF5117.
DR InterPro; IPR033428; DUF5118.
DR InterPro; IPR032534; EcxA_zinc-bd.
DR PANTHER; PTHR38478; PEPTIDASE M1A AND M12B; 1.
DR PANTHER; PTHR38478:SF1; ZINC DEPENDENT METALLOPROTEASE DOMAIN LIPOPROTEIN; 1.
DR Pfam; PF16313; DUF4953; 1.
DR Pfam; PF17148; DUF5117; 1.
DR Pfam; PF17162; DUF5118; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000196559};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..807
FT /note="DUF5117 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012892699"
FT DOMAIN 41..88
FT /note="DUF5118"
FT /evidence="ECO:0000259|Pfam:PF17162"
FT DOMAIN 100..278
FT /note="DUF5117"
FT /evidence="ECO:0000259|Pfam:PF17148"
FT DOMAIN 413..712
FT /note="EcxA zinc-binding"
FT /evidence="ECO:0000259|Pfam:PF16313"
SQ SEQUENCE 807 AA; 90359 MW; CE20D98FF1556DBF CRC64;
MINMRNMNRL IVIFACLLAL APINLSAQEK TQPAAVKKAP MAYEAFFKKD MQKFNGTFPI
YRNGEKYYLE IPASTLGRDL LVSGSIVQGG NHGMVSSVTN LLIFHLGRNN TLEVHQQICS
ERAKGDLAKA IEAANLKPVA TSFPIVAFGQ NKGGYIIDIT SDVNSSGKLF SFPNMKSVNT
PAADRSGVDS VYVINNGVKF TSLHAQTDLI PGFMHIPPRD QHTTALIEWS LQLLPERHIT
ARESDPRVGY ATISYADYDR NPYGVKSVRE IQRWHLAIKP EDTERYRRGE LVEPANPIRV
YLDRTLSSES ERRAVMQAVA EWNRCFEAAG FKNALQVQNG QPEVTVAYHQ IVYSYAMGKS
QFSQISDSRT GEIFSGNIVL SHKESEDNLP GIQLSIGGYE PAVLTDSMPI VREEYIRYQA
SKLTGQLLGL QPNWAGSAAF TTKQLRDAAW ARENGISASV TDGCVVNFAA QPGDGIALRD
LFSKASIYDR WAIEWGYRQY PGMDASAEKK ALNNLAAQAK DNTALYFATK EQADYRVNET
DLGQNVVETA TLGLKNMERL SSQLSDIFLQ RIAKDDTPWT DYIRILSSFN GLYISYFDMP
LNYIGGISVE PVLAGYNEQA ISYLPKQKGE EVMAFLNRQV FQGAPAWRTT PVEVDIIGNS
GETKGTGIFM STFRSLMNPS RLHQLVVAQD KASGQAYTIN DLFKALESYV FLNYSASRPL
SRYQVQMQYN FVREFVSTFS KLKAKEGSDD LSYFLVNQGQ RMKEKLDYLG KHHQHTYSRT
YYRGLSVYLT RAMKSGKLSG MFEDAKK
//