ID A0A5C5WS20_9BACT Unreviewed; 2330 AA.
AC A0A5C5WS20;
DT 13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2019, sequence version 1.
DT 13-SEP-2023, entry version 9.
DE SubName: Full=Dockerin type I repeat protein {ECO:0000313|EMBL:TWT52883.1};
GN ORFNames=Pla22_05110 {ECO:0000313|EMBL:TWT52883.1};
OS Rubripirellula amarantea.
OC Bacteria; Planctomycetota; Planctomycetia; Pirellulales; Pirellulaceae;
OC Rubripirellula.
OX NCBI_TaxID=2527999 {ECO:0000313|EMBL:TWT52883.1, ECO:0000313|Proteomes:UP000316598};
RN [1] {ECO:0000313|EMBL:TWT52883.1, ECO:0000313|Proteomes:UP000316598}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Pla22 {ECO:0000313|EMBL:TWT52883.1,
RC ECO:0000313|Proteomes:UP000316598};
RA Wiegand S., Jogler M., Boedeker C., Pinto D., Vollmers J., Rivas-Marin E.,
RA Kohn T., Peeters S.H., Heuer A., Rast P., Oberbeckmann S., Bunk B.,
RA Jeske O., Meyerdierks A., Storesund J.E., Kallscheuer N., Luecker S.,
RA Lage O.M., Pohl T., Merkel B.J., Hornburger P., Mueller R.-W., Bruemmer F.,
RA Labrenz M., Spormann A.M., Op Den Camp H., Overmann J., Amann R.,
RA Jetten M.S.M., Mascher T., Medema M.H., Devos D.P., Kaster A.-K.,
RA Ovreas L., Rohde M., Galperin M.Y., Jogler C.;
RT "Deep-cultivation of Planctomycetes and their phenomic and genomic
RT characterization uncovers novel biology.";
RL Submitted (FEB-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TWT52883.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; SJPI01000001; TWT52883.1; -; Genomic_DNA.
DR OrthoDB; 5242130at2; -.
DR Proteomes; UP000316598; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 7.
DR InterPro; IPR002105; Dockerin_1_rpt.
DR InterPro; IPR036439; Dockerin_dom_sf.
DR InterPro; IPR030916; ELWxxDGT_rpt.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR NCBIfam; TIGR04534; ELWxxDGT_rpt; 9.
DR Pfam; PF00404; Dockerin_1; 1.
DR Pfam; PF18911; PKD_4; 1.
DR SUPFAM; SSF49299; PKD domain; 1.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS50093; PKD; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000316598}.
FT DOMAIN 1729..1788
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2330 AA; 250161 MW; 564C22565BF2B766 CRC64;
MSKRNSATNT SKRSTARRVN QNRRRKGRFE RLEDRRLMAA DLIRDLYDVN SAPAPNEGIA
VGNVAYFANE DLYGSELWKS DGTVAGTQRV MDIWEGSQSS YPKKFVAFDD GAAFLARSYN
DLLEFSLSSV WMTDGTEDGT KLLAEGISTY QAPVDVDGKL FFSGYDYAES DGWGIFQFDP
ATGVTSEVLS GLESDPLNGN VQAVGGKLFF THDDEGGGEE LWVSDGTESG TKLLKEIQDN
ATAYNYGYGS YPGDFTPSDG LLFFTATQNS TGQELWVSDG TEVGTFMLKD IHTGISPNYP
TEASIGNSSV PHSLTAVDGT LFFIAEDGTG NRLWKTEGTA ETTIKLSDVD DVAYLTEAGG
TLFFSAYHEE TGTELWSSDG TVDGTALLKD IYESPSGEYG YFYSSYPETL AEYNGQLLFI
VANEAGDKVL YRSDGTSEGT VPVIATDSRF HQQRSIVGNA AGYQFFTLSQ SSEELWRTDG
TTSGTVLVQS NVGGDYDALG NTFLADDFLF FTQPNLSDLL VTEGGEDDSI AVTSNYKSNN
SGTRYSFEAL GQWVTFTDDG IWISDGTSAG TVKLADHDNN LGYGVNGSHF AFGADLGNGS
FVFAGAPYSA DGMYIGAEPW ISDGTPENTN LLVDIPGNYS GSHPFDFQRA GDDVYFTTYG
EHFGYDLWKT DGTELGTELV TNPSGAPHFT YPGRKTAVGD RLYFVAQGGG YSTPHLWTIG
PDGGNAEQVL DVLTGEPILD PGNFIDFNGK LALTDYVSYW TSLQPTPWIV DEAAATQLST
TSVQDDFELF EGELYFASNK DEIWKTNGEV EGTVRVVNLT ESIDGYDYYY SKPGELTAVG
DQLFFTAIDD AENYNDQVSL WALRDGVAST IMEFELASNY ELFDLTAYQD KLIFFANDGV
HGIEPWISDG TAQGTELFKD IHPILGSGSV RKLQSDSLVI DEQLFFVADD GITGDEPWRI
TLPIVEVDAP LFEGDEADVV TATGRIRFAE TIEADFGNVV DNGDETWTWT GSLPDGPANQ
TVTLTATDSD DFITTATFEL IVNNVEPIVV LGNDFTVAEG TEFELDITNL VDPGLDTVTS
WTVDWGDGSS DQYTEAGIVS HIYTDGMTTA NISVSVEDED GSFEDAGSST VTIENVPPTI
VLSGEASIDE GRPYTLTLGD ITDPGDDVVV KWTVDWGDGT TDVYTEGGDV EHVYADGDSS
MLISVTLEDE DGIFPMAETL NVDVINVKPT IELSGDFEVN EGATYSLTLG EITDPGEDTV
TEWIVDWGDG TTDTYTSGGV VDHVYSDGDS FPTIFVTLLD DDGTHQDVAS IDVSVLNVAP
SISISGASSV IEGGTYTLTF VGVADPGDDL VDTWFVDWGD GNTESFSFGP NGGTAEHVYA
DGDSASTISV GVEDEDGLYS DLEVIEVSVI NEAPTTEITG DSTVNEGATY SLTLGDIVDA
GDDTVYRWTV NWGDGNTDTY SQNGLVQHTY ADGDITATIL VTIEDEDGVY VDVASKVVEV
LNVAPQIAFS GPDSVAEGVT YTLNLGAITD PGVDTVFSWT VNWGDGTSDT YTEGGDVQHT
YTDDTPSALI TVGLEDEDGI HEAANTITVE VTNTAPSIEL TGDATVLEGS TYSLALSEII
DLGDDTVSLV TVNWGDGKSD TYDSLELPTA VDHVFQDNAP VYNITVELTD EDGDHTSPQT
LAVEVINAPP TIDLGGDITL SGVGEVNFSR KELAVTDPGL LDTHTVTVDY GDGSDLETFA
LDSSREFQLN HIYRYPGVYT LSVSVDDQDG GIATDQIQVT ADVQLGFALG DTGRGIAVAD
NATGVGYIMY SAEIVQERFT AAPPRLAAQQ LVAVRNVGGQ WQYNDNNAWF DFEAVDGDRL
LASVDFDSDT IATLHGSLGM VAGIASGYTS GDLDVFANQY NGKYGLGEFT VSGSHFVIGG
SGQTSGLGEL GGGIAVKEDA SGTGYLMYSK QSVHDRFVDH APRPANADHV IAVQYADGQW
SYNNDLGWFS FDPAGTDRLL AAVDFDNDTA TSLENSAAVI EGIIAGYPIG DLTFTANEFR
GVANVGEFSV SGTFFENRYP GLYDIGDLKF GVTAQDNGTG TGYLMYTAED VQSRFAANRP
SYAHSSNLIM VRFHAGAWQY DNNTAWVEFT PTTTDRLLAD VDFDNDTIVG LMIGSDGELA
PVEGISVGYV DSDLAFFANQ YKGRINPGDF EVSGTFFTTG TVFESWSDES MSNPDSSHPS
PLLMSSRLDV TGEGNVTALD ALQIVNHVGM QTGESESVLP LQLADRVDRL DVNGDDKVSA
LDALLVINAL ARQEFTPAPL DNGLGNGLDA DEVDEAFADA LSDWQEPSLF
//