GenomeNet

Database: UniProt
Entry: A0A5C5WS20_9BACT
LinkDB: A0A5C5WS20_9BACT
Original site: A0A5C5WS20_9BACT 
ID   A0A5C5WS20_9BACT        Unreviewed;      2330 AA.
AC   A0A5C5WS20;
DT   13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2019, sequence version 1.
DT   13-SEP-2023, entry version 9.
DE   SubName: Full=Dockerin type I repeat protein {ECO:0000313|EMBL:TWT52883.1};
GN   ORFNames=Pla22_05110 {ECO:0000313|EMBL:TWT52883.1};
OS   Rubripirellula amarantea.
OC   Bacteria; Planctomycetota; Planctomycetia; Pirellulales; Pirellulaceae;
OC   Rubripirellula.
OX   NCBI_TaxID=2527999 {ECO:0000313|EMBL:TWT52883.1, ECO:0000313|Proteomes:UP000316598};
RN   [1] {ECO:0000313|EMBL:TWT52883.1, ECO:0000313|Proteomes:UP000316598}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Pla22 {ECO:0000313|EMBL:TWT52883.1,
RC   ECO:0000313|Proteomes:UP000316598};
RA   Wiegand S., Jogler M., Boedeker C., Pinto D., Vollmers J., Rivas-Marin E.,
RA   Kohn T., Peeters S.H., Heuer A., Rast P., Oberbeckmann S., Bunk B.,
RA   Jeske O., Meyerdierks A., Storesund J.E., Kallscheuer N., Luecker S.,
RA   Lage O.M., Pohl T., Merkel B.J., Hornburger P., Mueller R.-W., Bruemmer F.,
RA   Labrenz M., Spormann A.M., Op Den Camp H., Overmann J., Amann R.,
RA   Jetten M.S.M., Mascher T., Medema M.H., Devos D.P., Kaster A.-K.,
RA   Ovreas L., Rohde M., Galperin M.Y., Jogler C.;
RT   "Deep-cultivation of Planctomycetes and their phenomic and genomic
RT   characterization uncovers novel biology.";
RL   Submitted (FEB-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:TWT52883.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; SJPI01000001; TWT52883.1; -; Genomic_DNA.
DR   OrthoDB; 5242130at2; -.
DR   Proteomes; UP000316598; Unassembled WGS sequence.
DR   GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR   GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR   Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 7.
DR   InterPro; IPR002105; Dockerin_1_rpt.
DR   InterPro; IPR036439; Dockerin_dom_sf.
DR   InterPro; IPR030916; ELWxxDGT_rpt.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000601; PKD_dom.
DR   InterPro; IPR035986; PKD_dom_sf.
DR   NCBIfam; TIGR04534; ELWxxDGT_rpt; 9.
DR   Pfam; PF00404; Dockerin_1; 1.
DR   Pfam; PF18911; PKD_4; 1.
DR   SUPFAM; SSF49299; PKD domain; 1.
DR   SUPFAM; SSF63825; YWTD domain; 1.
DR   PROSITE; PS50093; PKD; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000316598}.
FT   DOMAIN          1729..1788
FT                   /note="PKD"
FT                   /evidence="ECO:0000259|PROSITE:PS50093"
FT   REGION          1..27
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..16
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2330 AA;  250161 MW;  564C22565BF2B766 CRC64;
     MSKRNSATNT SKRSTARRVN QNRRRKGRFE RLEDRRLMAA DLIRDLYDVN SAPAPNEGIA
     VGNVAYFANE DLYGSELWKS DGTVAGTQRV MDIWEGSQSS YPKKFVAFDD GAAFLARSYN
     DLLEFSLSSV WMTDGTEDGT KLLAEGISTY QAPVDVDGKL FFSGYDYAES DGWGIFQFDP
     ATGVTSEVLS GLESDPLNGN VQAVGGKLFF THDDEGGGEE LWVSDGTESG TKLLKEIQDN
     ATAYNYGYGS YPGDFTPSDG LLFFTATQNS TGQELWVSDG TEVGTFMLKD IHTGISPNYP
     TEASIGNSSV PHSLTAVDGT LFFIAEDGTG NRLWKTEGTA ETTIKLSDVD DVAYLTEAGG
     TLFFSAYHEE TGTELWSSDG TVDGTALLKD IYESPSGEYG YFYSSYPETL AEYNGQLLFI
     VANEAGDKVL YRSDGTSEGT VPVIATDSRF HQQRSIVGNA AGYQFFTLSQ SSEELWRTDG
     TTSGTVLVQS NVGGDYDALG NTFLADDFLF FTQPNLSDLL VTEGGEDDSI AVTSNYKSNN
     SGTRYSFEAL GQWVTFTDDG IWISDGTSAG TVKLADHDNN LGYGVNGSHF AFGADLGNGS
     FVFAGAPYSA DGMYIGAEPW ISDGTPENTN LLVDIPGNYS GSHPFDFQRA GDDVYFTTYG
     EHFGYDLWKT DGTELGTELV TNPSGAPHFT YPGRKTAVGD RLYFVAQGGG YSTPHLWTIG
     PDGGNAEQVL DVLTGEPILD PGNFIDFNGK LALTDYVSYW TSLQPTPWIV DEAAATQLST
     TSVQDDFELF EGELYFASNK DEIWKTNGEV EGTVRVVNLT ESIDGYDYYY SKPGELTAVG
     DQLFFTAIDD AENYNDQVSL WALRDGVAST IMEFELASNY ELFDLTAYQD KLIFFANDGV
     HGIEPWISDG TAQGTELFKD IHPILGSGSV RKLQSDSLVI DEQLFFVADD GITGDEPWRI
     TLPIVEVDAP LFEGDEADVV TATGRIRFAE TIEADFGNVV DNGDETWTWT GSLPDGPANQ
     TVTLTATDSD DFITTATFEL IVNNVEPIVV LGNDFTVAEG TEFELDITNL VDPGLDTVTS
     WTVDWGDGSS DQYTEAGIVS HIYTDGMTTA NISVSVEDED GSFEDAGSST VTIENVPPTI
     VLSGEASIDE GRPYTLTLGD ITDPGDDVVV KWTVDWGDGT TDVYTEGGDV EHVYADGDSS
     MLISVTLEDE DGIFPMAETL NVDVINVKPT IELSGDFEVN EGATYSLTLG EITDPGEDTV
     TEWIVDWGDG TTDTYTSGGV VDHVYSDGDS FPTIFVTLLD DDGTHQDVAS IDVSVLNVAP
     SISISGASSV IEGGTYTLTF VGVADPGDDL VDTWFVDWGD GNTESFSFGP NGGTAEHVYA
     DGDSASTISV GVEDEDGLYS DLEVIEVSVI NEAPTTEITG DSTVNEGATY SLTLGDIVDA
     GDDTVYRWTV NWGDGNTDTY SQNGLVQHTY ADGDITATIL VTIEDEDGVY VDVASKVVEV
     LNVAPQIAFS GPDSVAEGVT YTLNLGAITD PGVDTVFSWT VNWGDGTSDT YTEGGDVQHT
     YTDDTPSALI TVGLEDEDGI HEAANTITVE VTNTAPSIEL TGDATVLEGS TYSLALSEII
     DLGDDTVSLV TVNWGDGKSD TYDSLELPTA VDHVFQDNAP VYNITVELTD EDGDHTSPQT
     LAVEVINAPP TIDLGGDITL SGVGEVNFSR KELAVTDPGL LDTHTVTVDY GDGSDLETFA
     LDSSREFQLN HIYRYPGVYT LSVSVDDQDG GIATDQIQVT ADVQLGFALG DTGRGIAVAD
     NATGVGYIMY SAEIVQERFT AAPPRLAAQQ LVAVRNVGGQ WQYNDNNAWF DFEAVDGDRL
     LASVDFDSDT IATLHGSLGM VAGIASGYTS GDLDVFANQY NGKYGLGEFT VSGSHFVIGG
     SGQTSGLGEL GGGIAVKEDA SGTGYLMYSK QSVHDRFVDH APRPANADHV IAVQYADGQW
     SYNNDLGWFS FDPAGTDRLL AAVDFDNDTA TSLENSAAVI EGIIAGYPIG DLTFTANEFR
     GVANVGEFSV SGTFFENRYP GLYDIGDLKF GVTAQDNGTG TGYLMYTAED VQSRFAANRP
     SYAHSSNLIM VRFHAGAWQY DNNTAWVEFT PTTTDRLLAD VDFDNDTIVG LMIGSDGELA
     PVEGISVGYV DSDLAFFANQ YKGRINPGDF EVSGTFFTTG TVFESWSDES MSNPDSSHPS
     PLLMSSRLDV TGEGNVTALD ALQIVNHVGM QTGESESVLP LQLADRVDRL DVNGDDKVSA
     LDALLVINAL ARQEFTPAPL DNGLGNGLDA DEVDEAFADA LSDWQEPSLF
//
DBGET integrated database retrieval system