ID A0A1I2FXM1_9ACTN Unreviewed; 2672 AA.
AC A0A1I2FXM1;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE SubName: Full=Intein C-terminal splicing region/intein N-terminal splicing region/RHS repeat-associated core domain-containing protein {ECO:0000313|EMBL:SFF10135.1};
GN ORFNames=SAMN05216251_108145 {ECO:0000313|EMBL:SFF10135.1};
OS Actinacidiphila alni.
OC Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
OC Streptomycetaceae; Actinacidiphila.
OX NCBI_TaxID=380248 {ECO:0000313|EMBL:SFF10135.1, ECO:0000313|Proteomes:UP000199323};
RN [1] {ECO:0000313|EMBL:SFF10135.1, ECO:0000313|Proteomes:UP000199323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CGMCC 4.3510 {ECO:0000313|EMBL:SFF10135.1,
RC ECO:0000313|Proteomes:UP000199323};
RA de Groot N.N.;
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FONG01000008; SFF10135.1; -; Genomic_DNA.
DR STRING; 380248.SAMN05216251_108145; -.
DR OrthoDB; 291011at2; -.
DR Proteomes; UP000199323; Unassembled WGS sequence.
DR CDD; cd00081; Hint; 1.
DR CDD; cd00161; RICIN; 1.
DR Gene3D; 2.80.10.50; -; 1.
DR Gene3D; 2.90.10.10; Bulb-type lectin domain; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 1.
DR InterPro; IPR001480; Bulb-type_lectin_dom.
DR InterPro; IPR036426; Bulb-type_lectin_dom_sf.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR030934; Intein_C.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR01443; intein_Cterm; 1.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 2.
DR Pfam; PF07591; PT-HINT; 1.
DR Pfam; PF05593; RHS_repeat; 2.
DR Pfam; PF00652; Ricin_B_lectin; 1.
DR SMART; SM00108; B_lectin; 1.
DR SMART; SM00306; HintN; 1.
DR SMART; SM00458; RICIN; 1.
DR SUPFAM; SSF51110; alpha-D-mannose-specific plant lectins; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR PROSITE; PS50927; BULB_LECTIN; 1.
DR PROSITE; PS50818; INTEIN_C_TER; 1.
DR PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000199323}.
FT DOMAIN 1338..1448
FT /note="Bulb-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50927"
FT DOMAIN 2526..2552
FT /note="Intein C-terminal splicing"
FT /evidence="ECO:0000259|PROSITE:PS50818"
FT REGION 70..127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 874..914
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1158..1197
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2331..2420
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 70..96
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 108..126
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 874..912
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1170..1197
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2395..2410
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2672 AA; 278387 MW; CA7C0B8C38941441 CRC64;
MALIVPLAQA DSVGYHYDGK VWADGPLQDQ PVVKGHPVTG ADAVQKARTP YGARALGAYK
AAKPSWPAAA TNSVSLTRPS TSSTTKSSTD VDSRSAAVRA GSSPVLVSAA PTPATKSKSR
TATPDVPGSV QVRLAGRTQA QAADVDGLLV GLSRTDGGTS AGRVKVTVEY GSIAKAYGGG
WATRLHLVAM PACALTTPEA AACQARTPLV SSNDTAAQTL SASVDIPADK AAHPGARTTT
ESAAAPMGVA VAAVSGTGGS QGDYTATSLS ASGAWSQSAS GAFTYNYPIA TPASLGGSAP
SVALSYNSQT VDGETSARNS QSSWIGDGWS YEPGFIERSY KACDDQGIKD SGDECWGGYN
ATLSLGSLNG ELVRDSDGSY HLQADDGTKI ERLTGADNGL WQGEYYRVTT TDGTASYFGL
DHAPGTTSDA STNSAWGVPV YHPKSADPCY DAAKGNKSQC DKPVGYRFNL DFVVDPHGNV
QRYDYATETN YYNMGYGQVA ASGGGGTMTP YIRAGHLTTI SYGYQLDDAR AERDPAARIV
FSTAQRCITS DTVCQASNLS DSTATNWPDV PYDIHCESGD DTSGDGSDVC HTGSPTFWST
NRLKTITTRV KVGSGYKDVD AYDLTQQFPD GGGVIDPVTG KTEHPDEIGQ LQAVMWLASV
QHRGLDTSAG GSGSLTLDPV TFQGIETDNR VDGLTPAAPA LFRPRISSIR TETGESIAVT
YADPQCSRVK HTMPASADSN TMACFPVYWH PTGVKDPVSD WFTKSLVTRI DDNDATKAGS
PAKTTQYEYA QAAWHRDDSD LTDDQYRTWN DFRGFRTVTT TSGTSPNPIT KTVSTYLQGM
DGDYKADKTT RHVTVNGVTD SSWLAGTPVE TDAFKSASDT DPVTKSITDA PETVSTASRP
RTAWTSEDPA PSELSTLPDL AARRLKTSTK HSLAQLSGTK GWRTTTVVTS YDQYGRVTTT
DDKGDESQPS QENCTTVHYA PAPSDNPMML SYPSETVAVS GPCGTVEGAT TTLMHKRIFY
DGDGSITDPG TFGKLGQSWP SDGSTPKVHS LGNMTAVQTI TSYDGSGDPA YVLTGALTYD
RYGRITKSLD GAGASTTTSY SPATGVLPTS VSTVNPFGWK STTTVDPLRG AVTESEDANG
RLTDSTFDAL GRRIATWLPG RDKADNPNSP DKKFSYSVNG AGTQPNPSTV TTQTLREDGS
YSTSVSIYDG MLQLRQQQST TADNSAGRLI TSNSYDSHGW PVSTINSYYD PDHAPSGTMW
AELETTVPSE SKTVYDGLGR PTESQLWAKG AQLWKSTTSY PGAEETDTTP PTGGQTTATF
TNARGQTTAT QVKDTTRDRK LTAGTVIQSG SSVASNSVRL TMQSDGNLVV AAIATGATLW
SSGTSGNAGA AATVHTDGNL VITGTTGTVL WTSNAAVAGS TGAYLVVGDD AGVKLFNSSG
ATTLWTANTN GKATAANATT SYTYTPAGQT SKITDAVGNA WTYEYNLQGQ LVSQKDPDAG
TSSYVYDTYG HLVQTTDPRG QALSYTYDTL GRKTAEYAEP TKAVHDPDNE LSSWAFDTLG
DGTDVKGLPV SSTRYVGGAS GSKYIARING FNTAYQPTST TTIIPAAEGK LAGSYTSAAE
YTPNVGLLEA TSYGADGGLP SERVGYGYDL QGLLTQTGSD NNPYLDAAWY TPFGQVMQST
YGVYGKQLRT AQTYDAATQR LATNTVSLQP STFGPIDSTT YGYDQSGSLN AVSDVQSTGT
TVTGTDTQCF TYDGMGRLAE AWSDTKGITT PAVAGTGQLA SCKTAAPSPT TIGGPAPYWQ
SYTYDLLGDR TQQVSHDTAG NALKNTTQSI AYPSTAAPAS LPNQATAVTT SNPTTGTATS
TLSYTDTSHN SAGVNAGSVT SRKTSTTGPI ISSVKTTAGD PLCLTDASAL TGDGTSQNLS
HCGGVGQNYT IGTDGTVRVV GKCMDTANPP AADTVVVIRT CSSSAPSQQW KLTASGNLVN
VASGLCLTDP SGSQTPGTKQ TLHACGSAGQ TYTTAATGTG LPAGQGQTFT YDAEGRTASV
TTGDGVNPKA TNYLYDADGG LLLQHGPSGT ILYLFGGAEQ LTLNSAGTTV SGLRYYSHPD
GTAITRSSGG AVTYQPTNPQ NTAQLQVDAT SLAITRRSYD PYGKPRGTVP TSWADNHGYL
GQPADPTTGL DLLGARNYDP VLGRFLTVDP VFEAGDPNQM GGYAYAGDDP VNGSDPSGLM
FRAGDSNITP KDQAQTEEYK CEQGGGTPSG CGNYTPYSVH HGWRHALDVG TSLAELVGIG
VVVDAGVDAV GACSATLVIA PECIAAGKTA AGALSGLLGG CADTGCIDGE PTAHGSDDPP
AASGRPQHDL ATGRTPEEAA PPPPGNPEAG TPAEAARPSS PKSATAKKPV ADDATSAEQS
PTTSGGKCSF SPDTPVLLDH GKTKPIVDIA VGDKVESADP KTGKHVGSHA VTATWVNYDT
DLVDVTVQGS DGKPATLHTT SKHPFWDDTT HKWTPAGKLT PGHALNTDKN LHATVLTVHV
TPGAADRYNL TVAQVHTYYV LAGTTPVLVH NCNNAATHDV GNVVDNLDDN VYFHYTSEVG
HSGILADDGS LRIGANSAGK VHVTQEIGSP AEIEQNIFIG NPMYAGKAEY MFAFRMPEGV
ELGPGSQPNE LITRGSLKIP AGNVLFHGRN PF
//