ID D9XSM5_9ACTN Unreviewed; 2101 AA.
AC D9XSM5;
DT 05-OCT-2010, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2010, sequence version 1.
DT 27-MAR-2024, entry version 47.
DE SubName: Full=Mucin-2 {ECO:0000313|EMBL:EFL39432.1};
DE Flags: Fragment;
GN ORFNames=SSRG_02236 {ECO:0000313|EMBL:EFL39432.1};
OS Streptomyces griseoflavus Tu4000.
OC Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
OC Streptomycetaceae; Streptomyces.
OX NCBI_TaxID=467200 {ECO:0000313|EMBL:EFL39432.1, ECO:0000313|Proteomes:UP000002968};
RN [1] {ECO:0000313|EMBL:EFL39432.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tu4000 {ECO:0000313|EMBL:EFL39432.1};
RG The Broad Institute Genome Sequencing Platform;
RG Broad Institute Microbial Sequencing Center;
RA Fischbach M., Godfrey P., Ward D., Young S., Zeng Q., Koehrsen M.,
RA Alvarado L., Berlin A.M., Bochicchio J., Borenstein D., Chapman S.B.,
RA Chen Z., Engels R., Freedman E., Gellesch M., Goldberg J., Griggs A.,
RA Gujja S., Heilman E.R., Heiman D.I., Hepburn T.A., Howarth C., Jen D.,
RA Larson L., Lewis B., Mehta T., Park D., Pearson M., Richards J.,
RA Roberts A., Saif S., Shea T.D., Shenoy N., Sisk P., Stolte C., Sykes S.N.,
RA Thomson T., Walk T., White J., Yandava C., Straight P., Clardy J., Hung D.,
RA Kolter R., Mekalanos J., Walker S., Walsh C.T., Wieland-Brown L.C.,
RA Haas B., Nusbaum C., Birren B.;
RT "Annotation of Streptomyces griseoflavus strain Tu4000.";
RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG657758; EFL39432.1; -; Genomic_DNA.
DR STRING; 467200.SSRG_02236; -.
DR eggNOG; COG3209; Bacteria.
DR eggNOG; COG4842; Bacteria.
DR HOGENOM; CLU_234401_0_0_11; -.
DR Proteomes; UP000002968; Unassembled WGS sequence.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 4.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR028946; Ntox44.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR006530; YD.
DR NCBIfam; NF033679; DNRLRE_dom; 1.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 5.
DR Pfam; PF15607; Ntox44; 1.
DR Pfam; PF05593; RHS_repeat; 4.
PE 4: Predicted;
FT DOMAIN 1966..2071
FT /note="Bacterial toxin 44"
FT /evidence="ECO:0000259|Pfam:PF15607"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EFL39432.1"
SQ SEQUENCE 2101 AA; 229448 MW; 78F5C1E44D96D5D2 CRC64;
GSQGGVTDTT LSSAQPSTNQ DTIQSWDVGQ KWLSVGNNSG TYGKTRAVLK FPTTGIPSTA
TVINNRMFLW GAETTTDTDG ALYELRALTR DFTETQATWN NASSTTAWSS AGGDMSAAVS
STVGQVADVG RREWDATSLM QGWIDDPAGN KGVAVKLKDE SSTGPQERTL FLSAEAADPQ
LRPYMQVIYV DSTTEDTYYA PQTPSRMTPN TTYTVDFTVT NTTSAKWAAG ERELSYTWKL
PDGTDVTTGG NQLATPIPEL LPGKSATIQA KVATPVNSDS GNKRGEYALG WDVRKIADGS
WLSAGTNGIP SLKQSVAVED PTSNQLGLEK FYSYTGKNTG AGSTVMNNLS SGNSVWSYNA
FSNPGRGLTT FARFSYNSLD TSDTVSGAGW SAQLAGPIRL GAPLDFHPNP DPTEVRLPDG
DGTTHVFHKQ ADGTFKAPAG VHYRLSMKDG LDCKPTADPV PDAWTMLRPD GTRFLFGCDG
YLTSVVDKNG NTQTYTYEER KSNNKPTKFL KYITDPAGRK SLSVGYYLKG DATYSYVDDS
GALVSDTNLT NSKIYDHVAS MTDISGRKIS FYYTEKGLLG RMVDGAGSDQ PKTFKFTYDA
TQGNKNVKLV TATDPRGNDT DLAYYAPQNG DDPKYHWWTK TVTDRLGGNT GFTYAANTAN
PKFTDTTVTD AESHATKYVT DDYGRPVQTT NAKSQTTKLS WDADNNVTYL EEANGAKTAY
CYDQKTGYPL WSRDAENNKT GVPDQATACV TDTSKWPADA ATYEYQTRAD GYAADLWRKT
SPEGRAWQFG YDDFGNLKTV TDPKGVATAT AGDYTTSYEY NSYGQLTKAV DANGNPTTNS
DFGPTGYPEK VTDALGEPTT FIYDERGQVT EVTDALGKRT TQTYDAFGRP LVNTVPRSQA
SGEVITTPAP EYDANDNIIT STTPNGAVST AVYDDADQVA SATAPKDTTT SDERRSNYTY
DKVGNLRTTT EPKGTLTTAD STDYVTTNHY DEIYQLTSVV NADGDKISYE YDNVGNATKV
IDPKKNATAD TADYTTKTVY DLNHRVTAVT DAAGNTTKRA YSKDSLVTST TDAENNTTLI
DYDERGKTTE VKVPHSGTTT ITHRTTRYEY DEVGNTTKVI TPRAVEADTT TAFTARTEYD
ALNRPVKQYQ PYDPADTRHN DPNVYTETVY DKVGRVAKTS LPPSEGQTVR NTTAYDYFDN
GWVRSSTDPW DIVTTYDYND LGQQTARTLT SAGGSSNRTM TWGYYPDGKL KSRSDDGVPV
GKSVVLVDNS DTQHTTATGT WAEGDIAGQQ GYDHRTHAAG TGTDAFTWTL SIPEDGTYTA
YVKYPKVTGA ATAATYTLTH GTTTEPAVTR DQTAGTGTWV SLGNYALKQG VDTELKLEQN
STGTVVADAV KLVRDNSADT DNEKKTFTYA YDANGNLTSI DDTSSGAQID AYTIAYTGLN
QVRNVAESLS GTEKKATSYT YDANGQPETV THPTQHSTYT YDLRELVKTV SVGKTATDTD
PKVTSYTYTD RGQKLKETKA NDNTVDYTYY LDGALKSTVE KKADGTTLVA SHTYAYDPNG
NKARDVAKKM NADNHTAYLD STTDYSYDPA NRLTKSVKTG NGAGTETYVH DDNANVISQT
VKGTSTTYQY DRNRLLTATT SGVTADYTYD PFGRQQSVTS QGQVISHSVY DGFDHVVESQ
KMDDTGAMKS TTYTFDPLDR TASKTADGKT TDFTYLGLSG EVLNEEVAGE LTKSYQYSPW
GERLSQVKHN ADGTTEDGYY GYNSHTDVET LTDDTGNTKA TYGYTAYGSN DESDFTGIDK
PDSNDPTKET YNPYRYNAKR WDAQSSTYDM GFRDYNPGLN RFTTRDMYNG ALADMSLGTD
PFTGNRYAFT GGNPTSLIEY DGHRLAECDE QGYTCRMNEA GGWDVEYQNP TGEEAYDDVE
GFMFSEMKQN LESDAFKLIK GNIDSCDTWN PLEYAFSCDP DAAIMIWIEQ VRNDSVWDHK
TILREQLPVE NRDGICCYNK VPGMDADVYY DVWSNVHYGY VGAASGLDQD LLEFGASPGL
NVPGMESLPL VGASDPGDEV TVRMGIDLWN KYGADLTQEQ FHSEMLNMID GLQGTDKVLG
I
//