ID D4CA09_9CLOT Unreviewed; 872 AA.
AC D4CA09;
DT 18-MAY-2010, integrated into UniProtKB/TrEMBL.
DT 18-MAY-2010, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=SH3 domain protein {ECO:0000313|EMBL:EFE13084.1};
GN ORFNames=CLOM621_06236 {ECO:0000313|EMBL:EFE13084.1};
OS Clostridium sp. M62/1.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=411486 {ECO:0000313|EMBL:EFE13084.1, ECO:0000313|Proteomes:UP000004936};
RN [1] {ECO:0000313|EMBL:EFE13084.1, ECO:0000313|Proteomes:UP000004936}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=M62/1 {ECO:0000313|EMBL:EFE13084.1,
RC ECO:0000313|Proteomes:UP000004936};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Nelson J., Hou S., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Zhang X., Suruliraj S., Warren W.,
RA Chinwalla A., Mardis E.R., Wilson R.K.;
RL Submitted (FEB-2010) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFE13084.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACFX02000013; EFE13084.1; -; Genomic_DNA.
DR RefSeq; WP_008396009.1; NZ_GG730310.1.
DR AlphaFoldDB; D4CA09; -.
DR HOGENOM; CLU_021092_0_0_9; -.
DR OrthoDB; 9816557at2; -.
DR Proteomes; UP000004936; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR CDD; cd14256; Dockerin_I; 1.
DR Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR Gene3D; 2.30.30.40; SH3 Domains; 1.
DR InterPro; IPR025883; Cadherin-like_b_sandwich.
DR InterPro; IPR002105; Dockerin_1_rpt.
DR InterPro; IPR016134; Dockerin_dom.
DR InterPro; IPR036439; Dockerin_dom_sf.
DR InterPro; IPR018247; EF_Hand_1_Ca_BS.
DR InterPro; IPR003646; SH3-like_bac-type.
DR Pfam; PF12733; Cadherin-like; 1.
DR Pfam; PF00404; Dockerin_1; 1.
DR Pfam; PF08239; SH3_3; 1.
DR SMART; SM00287; SH3b; 1.
DR SUPFAM; SSF63446; Type I dockerin domain; 1.
DR PROSITE; PS51766; DOCKERIN; 1.
DR PROSITE; PS00018; EF_HAND_1; 1.
DR PROSITE; PS51781; SH3B; 1.
PE 4: Predicted;
FT DOMAIN 40..112
FT /note="SH3b"
FT /evidence="ECO:0000259|PROSITE:PS51781"
FT DOMAIN 806..872
FT /note="Dockerin"
FT /evidence="ECO:0000259|PROSITE:PS51766"
FT REGION 260..320
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 344..371
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 632..744
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..298
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 346..368
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 666..693
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 712..744
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 872 AA; 89906 MW; D390D31A326D5143 CRC64;
MKKSVYKRGM AFLLSCVLAF GTCLSILEPT FSLRTFAYAE KQGSVIASSL NVRSGAGTGY
RTVARLSNGS SVTVIGEETA SDGVLWYKIR FTGSQGAQTT GYVSSQYIKL ASQNVPVDSN
FESYMTQQGF PESYKQGLRE LHAKYPNWRF TAFQTGLDWN TAVDEESKIT RNLVARSSIS
SWKSTETGAY DWSTGTWPGF DGSSWVAASR DIISYYMDPR NFLNETYIYQ FMDQAYDSSI
HSKEGLADMV AGTFLEGTAA PGGASGSRGS DQSGSGSGGS SGGPGGDSQS SGSGTGSGGS
PDGGEIAPGQ TGGPGAAAEA SFRYADEEPV LSAVSRPVNL VTAYGPGMEG TSSENGSQGN
SESATGGSSE GRPYVDIIMD AAAQSGVSPY IIASMILVEQ GKQGTGRSIS GTVSGYTGYY
NFFNIEAYQS GSMSAVERGL WWVSQSGSYG RPWNTREKSI LGGAKWYAEN YLNRGQDTLY
LKKFNVQGSS PYTHQYMTNV QAAASEGAEL AKIASLKNTA LEFSIPVFNN MPDTACAQPT
LDGSPNNKLS GLGVDGFALT PTFSRDTESY DLIVDPSVES VTVEASAIDS KATVSGTGTV
ALQSGINDIT VSVTAENGHV RNYVIHVVRQ NNGPTYSDSI DSGVSSGGGI GPGGTAGPGQ
DTSVEIVGPP GSSGSSGTGN SGSEGTGGQS AAPGGTGNGA VEPIAPDGSV GSFGQEENSS
GIQGPDSAGS SSAAPGQNAG YQNSDGYRTV AAQTSAAALA SQMQSEGAGT IVKVYNSSGA
EVTGNVGTGN LVQTYGSDGE PAARYTVVVR GDNTGDGKLN VLDILNAQRH ILGLGSLAGA
CEKASDINGN GKIDITDVLA MQRDVLGIEK LS
//