ID A0A384IHI3_9ACTN Unreviewed; 2395 AA.
AC A0A384IHI3;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Sugar-binding protein {ECO:0000313|EMBL:PZT77983.1};
GN ORFNames=DNK56_22360 {ECO:0000313|EMBL:PZT77983.1};
OS Streptomyces sp. AC1-42W.
OC Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
OC Streptomycetaceae; Streptomyces.
OX NCBI_TaxID=2218666 {ECO:0000313|EMBL:PZT77983.1, ECO:0000313|Proteomes:UP000249154};
RN [1] {ECO:0000313|EMBL:PZT77983.1, ECO:0000313|Proteomes:UP000249154}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AC1-42W {ECO:0000313|EMBL:PZT77983.1,
RC ECO:0000313|Proteomes:UP000249154};
RA De Leon M.P., Montecillo A.D., Siringan M.A.T., Kim S.-G., Rosana A.R.R.;
RT "Near complete genome sequences of Streptomyces sp. strains AC1-42W
RT isolated from bat guano of Cabalyorisa Cave, Pangasinan, Philippines.";
RL Submitted (JUN-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PZT77983.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QKWY01000002; PZT77983.1; -; Genomic_DNA.
DR Proteomes; UP000249154; Unassembled WGS sequence.
DR CDD; cd00161; RICIN; 1.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 2.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR InterPro; IPR049002; Stv.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 2.
DR PANTHER; PTHR32305; -; 1.
DR PANTHER; PTHR32305:SF17; TRNA NUCLEASE WAPA; 1.
DR Pfam; PF05593; RHS_repeat; 2.
DR Pfam; PF00652; Ricin_B_lectin; 1.
DR Pfam; PF21527; Stv; 1.
DR SMART; SM00458; RICIN; 1.
DR SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000249154}.
FT DOMAIN 1733..1869
FT /note="Ricin B lectin"
FT /evidence="ECO:0000259|SMART:SM00458"
FT REGION 1..36
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1966..1991
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2184..2291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2193..2232
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2395 AA; 255421 MW; DF5B11A9FFA987A9 CRC64;
MAVQLPDLPE SARTDQDDSA EKHLTTAPEQ VTTPYTPQAV EEWSPGVGTA DLTDVQPGQT
VPVENVPTIS LGVPEEGDAA ALAGEWTVDL KAPADSQAAE VDGLLMEITP PATADPEAEV
TVGVDYTSFA DLYGPQAADR FGVVLLPNCV VDAPTEGECA PEAPAQEPAA GTTASAQPLA
SEIEVVRPKT TAKLLAAAEK QGETLRTRRV VSASVPVSEL LGSGAGSKAS GAMRAAASEG
TGSRAVGVLD TGASAAGDFT ATPLQSSGSW AAGSSSGAFT YGYQVQVPEA AGGLTPQVAL
SYSSQSVDGR TSATNNQASW IGDGWDYNAG SITRSYASCR QDAKKAGANN ASHKTGDLCW
GSDNATLTLG GTTTELVWDA DQEKWFTANG DGSRVQVVKD TSKDNKDADG EYWIVTTRDG
TKYHFGLNHL PGWSTGKPVT NSTLTVPVFG NHSGEPCYKA GDWKGSECKQ AWRWNLDYVE
DVHGNAMSLW WKKDANYYAR NFNWKAPVSY DRDGYLLHID YGQRKDTLFS AQAPGRVAFE
VDERCYAEGS LTCSPENFAS KDPGKYRIWY DTPADLRCTA GKMCWNAAPS FWSTKRLDSI
RTFAQRRTDT TARQLVDHYR LKQSFPFLRT GPNTALWLET ITRTGYARNG ETDAKVQLNP
VRFEANADDM PNRVMRGDND PRPGFSRLRI GRVINEYGGE TVVSYKQPAG QCATGQDLPG
KSDKAALKSN TRLCYPVFWH PDPEKEDIDW FHKYVVQEVE ELPNVAGAYS TSTKYTYGTA
GWKMAEQEFT KKSTRTYSRF AGFDRTTVIT GADDKAIGSK RTKAVTRFFR GMGDDVAVKD
ITGAHIAYDR EPFAGRIAEE LTYAEATDAD TAWKTRSVTV PEATELASRT RGDGLDPLKA
WRVTEPRQLA YSKDGQGTVH TSETKTTYES TYGLPVRIET LEDAADPGLV GDVSCTDLSY
VHRTDKNLIG LTKQTLSSAT SCSAADFTDL GSLASGSRTA YDGGEYGAAL GGSTRGLVTR
TWSLKADGSG FQPDGTVGFD SLGRVVKTTD PDGKSSTTSY AMTNGQTFGV TETNSLGHSS
VQEIEPGRST ATRSTDANGH VTRFVFDALG RIVEAWAPGR TPESSAVPDF AAEYHISKND
PEEEDRFPPY VVTRSRGHKD RIETSVTLYD GLGRERQSQK EATGGGRLIT DTLYNSSGEV
WQTNNAYLAT GKPSGRLFTP LADTAVPNAT RYTYDGLGRV VTELPILNGS EMPARSTRYE
YGADWSTVIN PSGASSYRVR SDSMGRTTQV DTFTDAERTE YTSVKYEFDD RGLLTKAYSA
QNPKRSWTWT YDGRGRMVSA TDPDAGTTTT TYDHRDRPVT TTDARGVTVW TKYDELSRPT
EQRLGGATGE LLTRSTYDTA PGGKGLPATA ARITDGQEYT MSVGGYTADY QPRSTTLSLP
DTLATTWGLR KSYTSTYNYS DTGLLLDGTI PAAGAFDSEK LVVRYNEDGL PLSVSGKDWY
GSEAKYSPYG QLLRATLGAQ PHRVWALAGF DDASGALTDQ QVYREQNGDT SLVAGKLASH
RSYSYDDAGN VTGIRERSTG IQERQCFTYD PIGQLTEAWT SADLGSCADG PVKAGGALNV
TAGPDDSGYW QSYEYDLLGN RTKLTEKDLT GATAKDAVTT YGYGKADGSQ PHTLTKVTKK
YTAPSGAQVT AEAERLYELT GATKQVSSLE NGDTQDISWT WDGQVERLTG QGSGGKTSYV
GLGGKCLDLN GGKPEENRPV NLYSCNGAAS QKWSFQVEPG QPDPDLGTLT FHGDTWCVAP
AASAAGAGMR INACDGSAGQ KIKRNATGQL VHVATGLCVA VQDGSTANSA STVLTTCAAG
SAAQKWEAQN ETRYIYGPDG SRLLTVQGKQ ATLHLAETEV TTQAAGKLVS TQRNYGAPGG
SVVRQQYGYQ TTSTLVAEVG DHQGSTYAEV AMTDGMPVRV RKQDPFGNER GAAASGTGPR
TRAGFLGTSP DDASGYTQLG ARMYDPAVGR FLSADPVLDI ADPLQANGYA YAHNNPVTLS
DPTGLAVSLT ASETKAALAG VGLTEAQVSQ ARAIMDQSIT SVILSVAWDQ LGELIGKDDA
LGCFGGSVKS CISLVIGATP VGKLGRIPSA IKAIHKTAKA IQALNRAKKK AEKVIAAARA
AERKAIAAKK AAIEKVKKAA QAAKKKAAEK KQTTSNKAVN ETKKTGNPVQ KRAQADSAPK
VASTSATHKS SAGGKAGSGK PGGSAGGSSR SKGGSSGDGG AGKASRGSDE GSSLSRPSGR
PDNEEVFAGH GDWSLRDGWT RIPENTSLAV YGEKGYTISD YTGFRVEVGD PSVLPTRVYG
PGEWVRNYQL HPVSPDLVVH SASTTVDART SLSGLLRPGM GRVHWAACLG SKRPS
//