ID U5W0I7_9ACTN Unreviewed; 2583 AA.
AC U5W0I7;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=YD repeat-containing protein {ECO:0000313|EMBL:AGZ41446.1};
GN ORFNames=AFR_15820 {ECO:0000313|EMBL:AGZ41446.1};
OS Actinoplanes friuliensis DSM 7358.
OC Bacteria; Actinomycetota; Actinomycetes; Micromonosporales;
OC Micromonosporaceae; Actinoplanes.
OX NCBI_TaxID=1246995 {ECO:0000313|EMBL:AGZ41446.1, ECO:0000313|Proteomes:UP000017746};
RN [1] {ECO:0000313|EMBL:AGZ41446.1, ECO:0000313|Proteomes:UP000017746}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 7358 {ECO:0000313|EMBL:AGZ41446.1,
RC ECO:0000313|Proteomes:UP000017746};
RX PubMed=24637369; DOI=10.1016/j.jbiotec.2014.03.011;
RA Ruckert C., Szczepanowski R., Albersmeier A., Goesmann A., Fischer N.,
RA Steinkamper A., Puhler A., Biener R., Schwartz D., Kalinowski J.;
RT "Complete genome sequence of the actinobacterium Actinoplanes friuliensis
RT HAG 010964, producer of the lipopeptide antibiotic friulimycin.";
RL J. Biotechnol. 178:41-42(2014).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP006272; AGZ41446.1; -; Genomic_DNA.
DR STRING; 1246995.AFR_15820; -.
DR KEGG; afs:AFR_15820; -.
DR PATRIC; fig|1246995.3.peg.3212; -.
DR eggNOG; COG3209; Bacteria.
DR HOGENOM; CLU_000662_1_0_11; -.
DR Proteomes; UP000017746; Chromosome.
DR CDD; cd00081; Hint; 1.
DR CDD; cd00161; RICIN; 1.
DR Gene3D; 2.80.10.50; -; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 2.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR030934; Intein_C.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 1.
DR Pfam; PF07591; PT-HINT; 1.
DR Pfam; PF00652; Ricin_B_lectin; 1.
DR SMART; SM00306; HintN; 1.
DR SMART; SM00458; RICIN; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR PROSITE; PS50818; INTEIN_C_TER; 1.
DR PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017746};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..2583
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004666258"
FT DOMAIN 2435..2461
FT /note="Intein C-terminal splicing"
FT /evidence="ECO:0000259|PROSITE:PS50818"
FT REGION 956..983
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1178..1200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2226..2353
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 961..975
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1180..1200
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2230..2245
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2275..2292
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2331..2353
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2583 AA; 275504 MW; E4FDDEDEB721AA55 CRC64;
MRGTGPLWWR VRTALVLSLS ATLVATSLPA RALGVPGDGM SREGAAVTLP KLAANTPVDQ
DEAAEADLTT ADEVPVEPYQ PTAVAPWQED SGVVDLTGLE PGASLPVDDL PIALGVPEGA
DPADLAGNWT VDLAAPEASQ ASDVAGLIMK IVPPASAAAD AEVALSVDYT TFADLYGPQA
ADRFGIVVLP DCAYDAPGTG ECESTGEPGE ETDPATATAV PSEVQVVPAS SAAARSFSGA
TASAGKRHII SATVPLDKLL DDSATGGVGG MARAAAASGS GVVGAMDTGA SAAGDFTATP
LLSSGSWAAG SSSGAFTYSY QVQVPETAGG LIPKVNLGYS SQSVDGRTSA TNNQASWIGD
GWDYSAGSIT RTYANCKQDS KRPGANNTQL KTADLCWGSK NATLSLGGMN TELVWDASKN
KWFTANGDGS VIDQEFDTSK ANGDKDGEYW VVTTRDGTKY HFGLNKLPGW TSTDPVTNSV
LTVPVYSNHS GEPCYKGSST DDYKKSSCTE AWRWSLDYVE DVHGNAMSLW WAKEQNYYAK
NFDFKAPVKY DRGGYLSRID YGQRKDSIFS AVAPARVNFS VEERCYDKGS LTCSETNFTS
KNPGDYRIWY DTPANLRCAA PTPKVLCWNA APTFFSRKRL DKITTSAQRR TDTTARQVVD
DYQLKQSFPI LKTGPNTALW LESITRTGYA RNGTTDEKVT LNPVRFESNV ENMPNRVMRG
TNDPRPGFSR LRIGRVINEY GGETVVSYKK PTGDCETGLG LPGKTATGEL KSNTRLCYPA
FWHPDPAAEE IDWFHKYVVE TVEELPAVDG AFGTTTTYAY KNAGWKLAES EFTKKSTRTY
SQFAGFQQLT VLTGEENPDF GSKRTKSVTR YFRGMGDTVG VDDITGTEIA KDREPFAGRI
AEELTYSAAG DADTDWLTRS ITYPQAVELA RRNRDADDIS PLIAYRVLEP RQLTVTKSSG
TGDDKRTERK VETRTTYDPT YGLPTQIESL GDTAKTGDES CGTVDYVHHT TKNIIGLTRQ
LRTSPTTCAA ADFDNLTTLT SASRTAYDGL AYGAALPATT RGLATQTFSL KADGTGFQLE
GTTGFDAIGR VTSKADRDNR TSTITYEPAT GQAFRVREKN PLQHEQVREI EPGRAVGIST
TDVNGHFSQA QFDPLGRMRK AWGPGRATNT VPDFEAIYTT PDPASSQRKP PYVTTKTRGH
EGRIQTSVTI YDGLGRARQS QEEATGGGRL ITDTLYNSSG EVYETNNAYY TAGTPDGQLF
KPDAAVPNTT RYRYDGLGRV VQELPILRGD EMPNRATRYE YGADYSTVIN PTGAGSYRTY
SDALGRTTRV DTFRDAGRST FTSMSYQYDA RGQLEKATNS ENAKITWSWT YDRRGRMIAA
SDPDSGVTST SYDDYDRPLT ATNARGATVW NSYDELSRPK EQRLNNSTGT LLASFGYDTA
AGGKGLPASS TRYTDGQPYT QTIGGYTDDY QPTSTTLSLP ATVASTWGLK PSYSYSYGYT
ETGLVESATL PSVGSLAEEK LLIRYTKDGL PLSVSGRDWY GSETVYSPYG QVLRSTLGAQ
PYRVWAMANY DDASGALTGQ QVYREKSDDK SIVGGNLVSQ RSYAYDDAGN VTAVREHSVG
IEERQCFVHD PLGQLKKAWT AKDQDSCSAG PVGADGTVNV AAGKDNTGYW QEFEYDLLGN
RKKLVQKDIT GASAKDATTD YTYGKADGSQ PGTLTKATKK YVTPAGAAIT AEAERLYELT
GETKSVTSLQ NGDKQDLAWT YDGKIERIAG QGENGKTAYV GLADKCIDLK SSLPVAAQPI
QLYTCNGGLA QKWRFTATPG QSDSTLGTLS IHDGWCLQPA AGTAGSAAQL QKCDGTAAQQ
VNRLSATNQL KHVASGLCFA VKDGITTDAT PIVLAACAGT STAQQWLAQN ETRHIYGPDG
GRLLTMQGKQ ATLYLGEAEL TVQRGGIAVN TQRTYSTPGG AVVRNAYGAG APGLTAVVGD
HQGTPYAEVN LSGTMQVRIR KQDPFGNQRG TVPLGMHIAS NDGFLGTTRD DASGYVPLGA
RLYDPVVGRF LSADPVLDLA DPLQSNGYAY AHNNPVTHAD PTGLSVASVT LTGAEMAAAL
AGVGLSPAQV AEAQANANRS LSSIILSAAW GILSEFIGLT DAMNCFGGDL WACGSLIIGA
IPWTKVLKIG KIAKAIDRTI AAIQAWRTAK KAAEAVLAAA RAAERMALQA KKAAIERAKK
AAQAAQKKAA DKAATTSNAA ANASKKTGNP VHKEAQAKAN PAGASSGAGK GGSKPGGSSG
SSSRSSGGTS GGGGAGKSRD SDGGGDGDGG SCNSFLPGTK VLMADGSAKP IEDVKTGDEV
KTTDPKTGES RDSTVTAEIE GEGLKNLVKV TVDTDGDKGE AEAQVTATDG HPFWVPELAD
WIDATELQAG QWLRTSAGTF VQITAIDRWT TTQAAVHNLT VSDVHTYYVV TGGSSLLVHN
CDATVYRVEG AANERIRVHE DGSVLIRGAS KTLFVGFDNR ARAEEFLAKR LEQGFSDSVI
KSFKVKREFL DYLRADMVPE SMSKQFPTRP ISVDHPATDQ YGLKPFNTRL MLDYIVPNSG
KVG
//