ID A8M703_SALAI Unreviewed; 2615 AA.
AC A8M703;
DT 04-DEC-2007, integrated into UniProtKB/TrEMBL.
DT 04-DEC-2007, sequence version 1.
DT 24-JAN-2024, entry version 78.
DE SubName: Full=YD repeat protein {ECO:0000313|EMBL:ABV97185.1};
GN OrderedLocusNames=Sare_1280 {ECO:0000313|EMBL:ABV97185.1};
OS Salinispora arenicola (strain CNS-205).
OC Bacteria; Actinomycetota; Actinomycetes; Micromonosporales;
OC Micromonosporaceae; Salinispora.
OX NCBI_TaxID=391037 {ECO:0000313|EMBL:ABV97185.1};
RN [1] {ECO:0000313|EMBL:ABV97185.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CNS-205 {ECO:0000313|EMBL:ABV97185.1};
RG US DOE Joint Genome Institute;
RA Copeland A., Lucas S., Lapidus A., Barry K., Glavina del Rio T., Dalin E.,
RA Tice H., Pitluck S., Foster B., Schmutz J., Larimer F., Land M., Hauser L.,
RA Kyrpides N., Ivanova N., Jensen P.R., Moore B.S., Penn K., Jenkins C.,
RA Udwary D., Xiang L., Gontang E., Richardson P.;
RT "Complete sequence of Salinispora arenicola CNS-205.";
RL Submitted (OCT-2007) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP000850; ABV97185.1; -; Genomic_DNA.
DR STRING; 391037.Sare_1280; -.
DR CAZy; CBM13; Carbohydrate-Binding Module Family 13.
DR KEGG; saq:Sare_1280; -.
DR PATRIC; fig|391037.6.peg.1301; -.
DR eggNOG; COG3209; Bacteria.
DR HOGENOM; CLU_000662_1_0_11; -.
DR OMA; WACGSLI; -.
DR OrthoDB; 291011at2; -.
DR CDD; cd00081; Hint; 1.
DR CDD; cd00161; RICIN; 1.
DR Gene3D; 2.80.10.50; -; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 3.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR030934; Intein_C.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR InterPro; IPR032721; Toxin-deaminase.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR01443; intein_Cterm; 1.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 1.
DR PANTHER; PTHR32305; -; 1.
DR PANTHER; PTHR32305:SF15; PROTEIN RHSA-RELATED; 1.
DR Pfam; PF07591; PT-HINT; 1.
DR Pfam; PF05593; RHS_repeat; 2.
DR Pfam; PF00652; Ricin_B_lectin; 1.
DR Pfam; PF14424; Toxin-deaminase; 1.
DR SMART; SM00306; HintN; 1.
DR SMART; SM00458; RICIN; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR PROSITE; PS50818; INTEIN_C_TER; 1.
DR PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE 4: Predicted;
FT DOMAIN 2462..2488
FT /note="Intein C-terminal splicing"
FT /evidence="ECO:0000259|PROSITE:PS50818"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 57..91
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 219..250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..276
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2254..2341
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..81
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2254..2298
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2326..2341
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2615 AA; 277721 MW; 9452DE84513B211C CRC64;
MRGTGLLFRR PSGRRIPTES GRWAGPTRRA FVLALAATLA VTSLPAAAFA VPAPGMSREQ
VDLPDLPESE RVGRDEGAET DLTTAPEVPI DPYTPQAVDA WQQDSGVVDL TGLEAGDSRP
VDDLPIALGV PEEGDPADLA GTWTVDLAAP EASQDAGVAG LIMKLTPPVA ADPAAEVALS
VDYTPFADLY GPQAADRFGV VLLPDCVYDA PDSGDCATDG GGSGGGAQAR TAAPTDAADA
PTGPAAAQAV PSEVGLVPAD EAPTNSSAAE DEPTRRVVTG TVPVASLLGV QSAAARSGAA
SSAAASSAAS MSTGSSVVGV LDTGASTAGD FTATPLLSSG SWAAGASSGA FTYSYQVHVP
ETAGGLMPKV ALSYSSQSVD GRTSATNNQA SWIGDGWDYN AGAITRSYAN CRQDSKKPGA
NNSTHRTADL CWGSRNATLS LGGMTTELVW DEDEQRWFTA NGDGSTVELR TDTGRANGDA
DGEYWVVTTR DGTRYHFGLH RLPGWSDHGS GADDPVTNSV LTVPVYGNHA GEPCYKAGNW
AGSFCTQAWR WGLDYVEDIH GNAMSLWWGK ETNYYARNFN FKAPVRYDRG GYLSRIDYGQ
RRDTLFTADP LARVGFTVAE RCFDEGELTC SEENFTSSDP AKYRIWYDTP ADLRCTAGKK
CWNAGPSFFT RKRLDKITTS AQRRTNDTTL QAVDEYQLTQ SFPILLTGPN TALWLESITR
TGFARNGTTD AKVTLNPVRF EHNAEDMPNR VKRDHRPGFS RLRVARVINE YGGETVISYK
APTGDCATGT GLPGKDDTAA LKANTRLCYP SYWHPDPEAE EIDWFHKYVV DSIEELPAID
GSSVATVTRY TYGPPAWRLA EREFTKKSTR TYSQFAGFAQ VAVLTGADEP SIGSARTKTV
TRYFRGLGDT VSVPDITGAE IAKDRKPFAG RIAEELTYVS AEDADEEWLT RSVTYPAAQL
LASRERDDGL SPLEAWRVTE PRQVAHARSS GTGDDTRTER VVETRTTFES THGLPTHVES
LGDTGKTGDE SCAFLEYHHR LDKNLIGLSK QVRSSPTTCA AATFDDLTTL SSASRVAYDG
QDYGAALGDG TRGLATGTWS LKGDGSGFQP DGTTGFDAIG RVVTRTDVDG ETSTITYTPA
TGQAFTVSEE NALGHRQTRE VEPGRAVSLK TTDVNGRVSE AKYDPLGRLV EAWAAGRTPS
TAAVPDFRAE YATPAGKPPY ITTFARGHED QIETSVTLYD GLGRERQIQE EATGGGRLIT
DTLYNSSGEV WQTRNAYHTD GSPIGQLFTP LADTAVPNAT RYTYDGLGRV TRELPVLDGV
ETPARATSYT YGDDHSTVVN PAGAASYRIF SDAMGRTTRL DTFTDSGHTE FTSIRYEYDA
RGHLVQATNS VDSTHPWTWT YDQRGRLVTA VDPDSGTTRM TYDRFDRQES VTNGRDITVW
NGYDKLHRPT AQRLDTSTGT MLASYTYDSA PGGKGLPATA TRYTDGLAYT QAIGGYTDDY
QPTSTTLTLP QSIADTWGLR TTYQYDYTYT DTGLPESATL PAVGNLPAEK VLTRYTKDGL
PLSVSGRDWY GAETVYSPYG QVLRSTLGAQ PYRVWTTASY DNASGALTDQ QVHREQVGDQ
SIVAANLVSH RSYGYDAAGN VTSIRERSLG IEERQCFRYD PIGQLTTAWT AADQGSCAAD
PAGGAGAVTA GTDGSGYWQE YEYDLLGNRT KLVEKDLTGD TAKDATTSYD YGRADGGQPR
TLTKVTKDYV TPDGAEVTAV AERLYELTGE TKTVTSVENG DQQVLSWTYD GKVERITGAG
GRGKTAYVGL ADKCIDLSRA VPGNPLQLFP CNGTIAQKWT FAPVPGQANA NLGTLSVYDD
WCVQPAGNTT GSAFATQECS GSTAQHLERL STGQLKHPAS GLCLAVKDEA TDDRTPLVLV
TCDGDSAAQQ WEAQNETRHL YGPGGSRLLT IQDQQATLQL GESVLTVQRG GTLVNTQRSY
PAPGGVVMRY AHLATSSTGL VALAGDHQGS PYAEVGLHNE MPVRVRKQDP FGNQRGAAPI
GVNMQTHTGF LGAQRDDASG YTPLGARLYD PVVGRFLSAD PVLDIADPMQ SNGYAYAHNN
PVTYSDPTGL SVSLTASEKA AALAGAGLSA AQVAQAQATM GRSLMSVILD SAWYMLKEFI
GINDAMNCFG GDMWACGSLI VGAIPWTKVA KIPGVLKAVN RTISAIQAWR AAKRAAEAVL
RAAKAAETAA LNAKKLAIEK AKKAAQAAKK KAAAKAQTTS NKAVNAAKKT GNQVQKNAQA
KSNPKGSSAA SSGAGKSGKS GGGSGKSGGG AGKGAPKGGG DSGASARGNG GSSGASCKTN
SFVPGTRVLM ADGSTKPVEE VQPGDKVLAT DPETGQTVAE TVTAAIKGDG VKKLVKVTID
TDGDRGSETA EVTATDGHPF WVPELGEWID ATDLRSGQWL RTGSGTYVQI TAIDRWDVSQ
ATVHNLTVAN THTYYVVAAD TPVLVHNCGL ADAVAEHRAN ANGGLGVSAK KNIAAMEATI
DGAAPRTTIA TSGVHVNPGE VGMPATRLFT PPNPSRAFDS EVFLFEDLAQ GLRPKSIGTI
NLYSELTVCP SCGGVIDQFR ARFPGIRINI RTGED
//