ID A0A438M5I1_9ACTN Unreviewed; 2156 AA.
AC A0A438M5I1;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE SubName: Full=Intein/RHS repeat-associated protein {ECO:0000313|EMBL:RVX41120.1};
GN ORFNames=EDD27_3589 {ECO:0000313|EMBL:RVX41120.1};
OS Nonomuraea polychroma.
OC Bacteria; Actinomycetota; Actinomycetes; Streptosporangiales;
OC Streptosporangiaceae; Nonomuraea.
OX NCBI_TaxID=46176 {ECO:0000313|EMBL:RVX41120.1, ECO:0000313|Proteomes:UP000284824};
RN [1] {ECO:0000313|EMBL:RVX41120.1, ECO:0000313|Proteomes:UP000284824}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 43925 {ECO:0000313|EMBL:RVX41120.1,
RC ECO:0000313|Proteomes:UP000284824};
RA Klenk H.-P.;
RT "Sequencing the genomes of 1000 actinobacteria strains.";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RVX41120.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; SAUN01000001; RVX41120.1; -; Genomic_DNA.
DR OrthoDB; 582519at2; -.
DR Proteomes; UP000284824; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:InterPro.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR CDD; cd00081; Hint; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 2.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR003284; Sal_SpvB.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 2.
DR PANTHER; PTHR32305; -; 1.
DR PANTHER; PTHR32305:SF17; TRNA NUCLEASE WAPA; 1.
DR Pfam; PF07591; PT-HINT; 1.
DR Pfam; PF05593; RHS_repeat; 1.
DR Pfam; PF03534; SpvB; 1.
DR SMART; SM00306; HintN; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000284824};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT DOMAIN 1909..2010
FT /note="Hint"
FT /evidence="ECO:0000259|SMART:SM00306"
FT REGION 1086..1105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1170..1192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1769..1872
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1086..1104
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1776..1797
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1798..1819
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1820..1872
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2156 AA; 233679 MW; F3061496A3A66309 CRC64;
MAFTLRLKVE PKKPVWPQPG KAEVAVPQPG KLADAGALPV KVGQVRGAGL DKVTIETLSP
EAAQKLGGVG IAARLVRADG QTAPGKVRAA FSYAGFRDAY GGQFVSRLQV LRLPACALQQ
PRPRSCVVRP TVVPSVNDLK TGTLTAEVEA APANTRQTTA QAPGLGKDRK KDAAKASDTM
LTAQLAEGSV YMLAANVKGP DGNWGATDLK PSGTWQAGTS GGGFDYEVPL PEPPSPAGEG
PDLSLQYDAS SVDGQGAWTN NQSGVVGAGW DLSAGFIERR YRRCTVSTYY DPDTAELVWI
AQESTSGRAL CWESPDQTDG DSTTNDLTQS ELVLSAGGRS AQIVKDRTSG GWKTVPDFGW
KIEQVAGGAD GNPYWKITDR EGQVWRFGST RDAQWQVPYV GDDNGEPCFD RYWNNAIPPT
CTGVWRWNLD QEADRNENVI DYSYTRETNY FCLPSCVDEV YQTLPYDRGG FLASVSWGHN
TQVAGSTPTA RTTFTTAARD GGDVPTDLRC DQAAGCANDA IAFFSTRKLT TIGTESRNPT
SGVWDPVDRL NFTHAWMYQR TDQGLPFDSV MWLETVQQAG QAASPNVTLP PLDFDAVMLA
GKMDYINTSD WPSQVSWRMV PRIAAIGNGM GGRIEVTYGQ ADPCGGGKGR DGSNYLADQV
GDCYQIDMGT DPTSGFEAWT RYYKQLATKV VERDMVAGSP DMVHSYEFLG SPRWANPVQF
AEPALAPSGT DWRGYAEVRT LQGSGTDPAG YTVTTQTFLR GSELQVTHFD GTAITDAPLL
QGQVLQEQTW QMTSFSPRAY TEVDSTRWEY TLQTTGNGPG SMDPALVLQT RERSRQKVTG
GTWRYTDERT AYNSDGLPTK VNDYGQDGVR TDNSCTTTSY ARNADPGHWL VDFPSVEEKR
AGDDCTAGTL VGKTVTLYDA GTDPATNKPS DGNPTEVRSF AAASTISVSK ATFDDYGWTL
TSTDPLNKTT TTTYTPAVGW PKDGITVTNP RGHTVTTRLS HILGEPTAVT DANGKTAEMD
YDALGRTTAL WKPGQPRSGG TPSATVAYDI PFNGGLGQPT APIKTTVKQL LTGTGTAATW
TTTHSYDDGL GRTRETQTAS PGGGRIVIAT TYDPRGLAEA ISEPVHNSND PGSGLLNPAL
TSPPQWTKTL YDGLERPTAA IAYHEATELR RTSTTYPGTE RSKVTPPVGG KTATVTDAFD
RVVKVEEWSD ATSHADTSYG YDLGDNLTRM TDANGNVRSY TYDWLDRRTA ASDPDAGTSS
HGYDAAGRQI WSIDGKGQKI STSYDDLGRR TAQWAGEPIT GIKLAEWSYD TLAKGQPDAA
TRYTGGQAYT QTVTGYDSDY RPTTTKLTVP ASEGALGGDY VFTTAYDAAG NLRQEGMPAA
GGLASETLTH SYTDLGFAKG LTSDLAGSTF VKDTTFTLTG KLASRTLGAS GQIKRLLERD
PATDWLSRVT TQTKVDTATP ETVQDDRYSY NIAGSIARVL DAASAIPGIT DGQSECFSYD
GLLRLKTAYT TTGSSCTGTG DAQGIDPYSQ AYSYDNVGNI TSLTDNGQTA TYTYPTPGAT
AIRPNAVTAI TRPAGTDTYA YDNAGQLAAR TVAGKQATFD WNPLGQLDRA TIDGQQTSMV
YDTDGERLIR RDPDGSATLY LGAMELRLAG GQVTGKRYYS TADATLVAMR ETGVTWLLAG
MHGSTQLAVN DSTGTISRER YLPFGQRRGA DDLPFTDRGF LGKAEDASTG LTYLGARYYD
PTIAKFISTD PELDLRTPEW ANAYSYAANN PIDLADPDGR RVDTGNRKSD ATFAKTHHAS
GKKKTARERK IHKKRHQQYE RDRKRETERR RQEEQRKQAR ERYLKKDYNQ HKAAQDRYNR
THCSEKRCSD GRGERAGLIG EGEGVIEQFL FRRATRGRGG NHKPKYRPCS SFTPGTKVLT
ADGSSKPIDE IKVGDKVLAT NFTTGETAPK TVVALITSKG PKNMVKISAG GTGSRDNIVA
TDTHPFWVPT ARRWMQAGEL QPTQWLQTSA GTYVQIAAVA KWSANGQRVH NLTVDDFHTF
YVLAGETPVL VHNASPCFSG VSRQKQDQHV YGSKGYNDRV RRGEPTSYFN SRAEADAYAE
YAWKHGKPVP GRPNVRDYDF GKPVGRGPRG GWQTQVRVHI DGSGKVHAHP KGREYR
//