GenomeNet

Database: UniProt
Entry: A0A1Y4DRD6_9ACTN
LinkDB: A0A1Y4DRD6_9ACTN
Original site: A0A1Y4DRD6_9ACTN 
ID   A0A1Y4DRD6_9ACTN        Unreviewed;       897 AA.
AC   A0A1Y4DRD6;
DT   30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT   30-AUG-2017, sequence version 1.
DT   27-MAR-2024, entry version 20.
DE   RecName: Full=SLH domain-containing protein {ECO:0000259|PROSITE:PS51272};
GN   ORFNames=B5F74_03230 {ECO:0000313|EMBL:OUO61654.1};
OS   Collinsella sp. An271.
OC   Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales;
OC   Coriobacteriaceae; Collinsella.
OX   NCBI_TaxID=1965616 {ECO:0000313|EMBL:OUO61654.1, ECO:0000313|Proteomes:UP000195889};
RN   [1] {ECO:0000313|Proteomes:UP000195889}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=An271 {ECO:0000313|Proteomes:UP000195889};
RA   Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA   Rychlik I.;
RT   "Function of individual gut microbiota members based on whole genome
RT   sequencing of pure cultures obtained from chicken caecum.";
RL   Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OUO61654.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NFJE01000003; OUO61654.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1Y4DRD6; -.
DR   Proteomes; UP000195889; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02619; Peptidase_C1; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR040528; Lectin-like.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR001119; SLH_dom.
DR   PANTHER; PTHR12411:SF741; CATHEPSIN K; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF18560; Lectin_like; 1.
DR   Pfam; PF00112; Peptidase_C1; 2.
DR   Pfam; PF00395; SLH; 3.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS51272; SLH; 3.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000195889}.
FT   DOMAIN          711..775
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
FT   DOMAIN          776..835
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
FT   DOMAIN          839..897
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
SQ   SEQUENCE   897 AA;  97636 MW;  C65AF264A8AA9C9A CRC64;
     MQSEEAAPLS FPKSARLFPH ARRHAPASFI LASEQPIAPH HPIEEGCSMK RPKGFWKQRC
     ASLIRRSAVA LSLAVGFSLA AAPVASALEA SPLDSHNSAL NFASDTTGVD LLAADELPAS
     FDLRGRGVVT PVKNQGVWST CWGFAAVAAA ETSLLSDLNT TYERTGLDLS ERQLAYFSTT
     ALPDGEEGDR LYNDQGGEGM HNVLLEDDDL PDDETMEDIL GCQPQSAPLL YGGLSAYATS
     LYSSGIGPIS ESLAPYQNDE GILHPSGTMY AASGTWALDE SLRLQTGAQL EESLMLPCPA
     TFDEDGTYSY DERATRAIKE QLTEGRAVSI ALCADQSHAS DELAADGFMN AATWANYGYE
     YAPANHAVTI VGWDDTYAAE NFGTPDPETG EVDPSHRPPA DGAWVVKNSW GAESSEFPNQ
     ASWGDDGYFY LSYYDQTLTM PEAFVLDAEH LGTDGLEPFY TNQYDYLPTC RQSAYSATER
     LSGANIFTAE GPQVIDRLSC ETVKPNTTVT YQLYRLNEGA TGPTDGELLV TLSDTYEYGG
     YHLIEIPESD HDKTRMATGE RFSVVVTEYC NDDATYYVPL QAQVGKQQHD AQVASLYAQE
     NETHALASKA AENISERYFD EHEGATDEDY QAWAQENAQA IQDEIDDYVT VQIEAMAPVY
     GQSVINRGES FVFDSEEVLD WNDAIADFLT EEELALWAFD NPPYKAYGTA VEPPFADIPA
     DAWYFEAVEY AKEHGYMHGY DDTGLFDPEA TVTREQAACV MYNWLGNGAK VEAADLDDVV
     QGTYYSDAVN WAVKNKIMNG YGNGTFGVGD SLTREQFACI LANALSAEPG DVGAIEGMLG
     ADRVSDWAES GVAWAVEHGV MNGVETEDGQ RDLQPQASVS RAQIAAFVMN FLESGVA
//
DBGET integrated database retrieval system