ID A0A1C4Y4D1_9ACTN Unreviewed; 3119 AA.
AC A0A1C4Y4D1;
DT 02-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2016, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE SubName: Full=Intein N-terminal splicing region/RHS repeat-associated core domain-containing protein {ECO:0000313|EMBL:SCF15564.1};
GN ORFNames=GA0074696_3042 {ECO:0000313|EMBL:SCF15564.1};
OS Micromonospora purpureochromogenes.
OC Bacteria; Actinomycetota; Actinomycetes; Micromonosporales;
OC Micromonosporaceae; Micromonospora.
OX NCBI_TaxID=47872 {ECO:0000313|EMBL:SCF15564.1, ECO:0000313|Proteomes:UP000198228};
RN [1] {ECO:0000313|EMBL:SCF15564.1, ECO:0000313|Proteomes:UP000198228}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 43821 {ECO:0000313|EMBL:SCF15564.1,
RC ECO:0000313|Proteomes:UP000198228};
RA Kjaerup R.B., Dalgaard T.S., Juul-Madsen H.R.;
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LT607410; SCF15564.1; -; Genomic_DNA.
DR Proteomes; UP000198228; Chromosome i.
DR GO; GO:0005737; C:cytoplasm; IEA:InterPro.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR CDD; cd00081; Hint; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 2.
DR InterPro; IPR032871; AHH_dom_containing.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR006141; Intein_N.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR003284; Sal_SpvB.
DR InterPro; IPR022045; TcdB_toxin_mid/N.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 1.
DR PANTHER; PTHR32305; -; 1.
DR PANTHER; PTHR32305:SF15; PROTEIN RHSA-RELATED; 1.
DR Pfam; PF14412; AHH; 1.
DR Pfam; PF07591; PT-HINT; 1.
DR Pfam; PF05593; RHS_repeat; 1.
DR Pfam; PF03534; SpvB; 1.
DR Pfam; PF12256; TcdB_toxin_midN; 1.
DR SMART; SM00306; HintN; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 1.
DR PROSITE; PS50817; INTEIN_N_TER; 1.
PE 4: Predicted;
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT DOMAIN 2857..2954
FT /note="Hint"
FT /evidence="ECO:0000259|SMART:SM00306"
FT REGION 38..63
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1176..1218
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1311..1380
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 42..58
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1326..1370
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3119 AA; 333282 MW; 0267C1F5E3FBD538 CRC64;
MWHPPLPGFR LHALTSRWTR LLAVLAVLAL VTSGLSASVA RPRPDRTSTQ SATTPGPACG
ESADTAAMTD VAATLPDYQQ VVAPGIATTV SYGGAKLAIG PKAVRLPTGI GVTVLPPGLT
PKLDSGMTNV TGKTKGGGYR FTPHPQQFAA SIQVTLPYDP ALLTDEFTAQ DIYTYFFDDV
QTCWQVLERV SVDEVNHTVT SLTDHFTDMI NATVTVPEHP EGVQFNPNQI KGIQAADPGS
GINLIAPPSA NNQGENHLAY PIQVPPGRQG LTPQLGLSYS SSSGNGWTGV GWDLSVPSIS
VDTRWGVPRY AGATETETYL LNGEQLTPVA HRGAPVPRSA EKVFHTRVEG SFARIVRHGT
NPKNYTWEVT DKNGIHWHYG AVAGAGGPAT DASLADDAGN VFLWSVREVR DPRGNFMRYH
YAKVDDTGLA GGSVPGRNLY VEKITYTGQG TSEGRYAVSF DRDRERGEAL RVDTSIDARG
GFKRVTADLL REVRVTLDDS LIRRYEFTYK RGAFAKTLLD AVIQYDADDH EFNRHRFSYH
DDIRDSAGQY QAFRAQPWTS PGDGLSSAAL NLTPEQAGNA SALNANTNTS GGGHLYVGVG
SSPSKSGSVG VKVGFSHGSD KGLLALVDVD GDSLPDKVFR DGGSVKYRKN LSGPTGEATF
AAQAQPLNLP GIMGESSNTL TLGVEGYLGA AAAQLDYVNS FATTEQYFND VNGDGIQDLV
DGASVLFGRV GANGVPVYGV SGDTPVPIGQ GEVDATGLFD SFNADRDRLT DSFPLLDSVR
RWVAPYDGTV KVEGAVQLAP GTAAARAESR TADGVRAAVQ KEGTELWSAR IEADDDGAKT
PTGVDAVTVS RGDRLYFRVG SAADGGLDEV TWDPKVSYVG VGDPLDVNGL ASYRFQASRD
FTLGGRAATV KVPLTGTMHL SGDLVKAGAT SDDVTVLITR NGTPVLEQTL AAGTTGSVPV
NLDVDVQQGQ TLQWRIRVDS PVDVDKLTWT PRAYYTAAPG VDRLTDANGK PFIDVHPPYT
LDLYPVDGLT APQPTYTVTS GGTLSVEPTV SFAFGAEHPT ARVAFTVKRR GELVAKRYFQ
VTNGVLTPPG AFDITADTGD ELWFDFSTTD PKLRAFLTGQ SVTVGGSGAP SAFHSAAEEG
AFPQPYRGWG AVGYNGNRDR ATQPINQDDL VVDEHYGDQL PDSVDPQAQK DDFAADPRVD
PPKVTPFTPS PQDGRWGAGE HSWIARSSVS SSRLGVDSIN LPRPQDFAGA TAVPRLSRSQ
QVSLTGSVGG GIGSIGGSLA TGDSTGQVDF LDLNGDQFPD VVGAGAVQYT DPDGALGDTK
GTLPDGAVRR STNVSGNASA GSAARTISTG RGYDSPPGTS AATSAQSGND MPPLGVGGSF
GGTRSDGAFD LLDVNGDSLP DRVYENGKVA LNLGYAFAAA EQWRNPAGLN KGSGTNAGLN
IGFNTDFYGF AGGASFSQGR SSSAGTLADM NGDGLLDQVL SGSPIRVGFN TGNGFEPPVP
FHGSLTDVNG DRNAKLGGGV YFTFGICFTV VCVVINPGAD IATGASRTEQ ALRDINGDGY
ADHLSSTRDN QLVVAENRTG RTNLLAAVDR PLGGRMEFDY TRDGNTYGQP QSRWVLTKVS
VDDGRPGDGQ DVQLVTYDYD GGVYDRRERE FRGYGTVVEK HRDHANGDAV GRSVTRTFRT
DSFYTKGLLE QELTADGAGK PYQETVQTYT TRNVSSPGSP ADLASTTATL FPQNSRTDVK
YHEGAASPGK TTYVTREYDD VGNLTRQFDA ADAGAADDLD ARIVYTSQDP ACQAAYLVGM
PKQIDVRGDG TLMRHRESTI SCTTGDLAQV RVSLADGTTA TTDLEYFGDG NLKSVTGPAN
KTGQRYRLDY TYDDVVNTHV ESISDSFGYR STATHNLKFG LVETTTDFNN QQIRNTYDAV
GRLDQVAGPY EIPENRFTID FEYHPEAAYP YAVTRHVDRE AGGVRDDTID TITFVDGLNR
TTQTKKDAAV PATPDGAPQD VMVVSGRVAY DFLGRAVKSW YPVTEPKGAG NLTFNPAYDP
VTPTEVRYDV LDRTTRTTFP DGTATTMEYG FGPDRDGVTQ FETVVTDANG KSKRTYTDAR
QITTAVKEFN PAGGQPVIWT SYRYNPIGEL VAATDDKGNV TRSTYDNFGR RTSLTSPDSG
TVTSTFDLAN NLVRKVTSKL AAVSKAIEYD YDYTRLKAIR YPVFPANNVS YTYGAPGAPE
NAANRITDVV DGAGKVNRRY GPLGELVKET RTTPAQGSHI QSFTTEYRFD SFNRMLSMTW
PDREKLSYHY NSGGQVDSAR GVKGEFSYDY LKRLDYDKFE QRILLDTGNG TRTRYSYNAE
DRRLSNIQAK LSNGYVFHNL DYSYDNVGNI MSIANDTVAP SGPEVGMQVG GPSTQSYTYD
DLYQLTHAEG SYQPRTPQTD RYRVDLKYDS LHNLTSKSQT HELVSNGNTI VEGKLSYNYG
YAYGSAKPHA PTSIGIYTFQ YDENGNQISR SQQPKPRRQM IWDEENRLAC SHENVQSQTL
PQTPASCDNA GGTPNSARYY YDDQGSRVVK DGAQFHIYPN QNYSTRGNQE FKHVYIGQDK
LITKLVEPDF RREDRQYYSH SDHLGSTGFV TDDQGGMAEH LQYFPGGETW VAEHSSQPVP
HQFTGKEYDQ ETNLYYYGAR YYDPRTQVWQ TPDPVLENYL EGTPNGGVYA SMNLALYTYA
YNNPIRLGDP DGRFPWNRVL GGVKLVGGVA EAAAGVALGA ATSWTGVGAV AGGAVAVHGV
DVAISGARQL FSGEETSSFT SSGLQAAGVS KSNAELIDAG ISIVGSAGAS MATSAIKGAA
TAAPKVAAAA ADDVAARAVP STAEKVVETA AKIPCVGNSF AAGTRVVMAD GSTKPIEDVR
TGDLVLAEDP ETGERGPREV THLIIGQGVK HLVDVEVNGE VITATDKHPF WVAGAGAWVD
AGDLALGDVV RLADGRTAMV DGIAPYTRVD RVHNLTVAGI HTFYVVTGDG RAADAVLVHN
SGPCSVNAKA LAGSLTAANV VRPAETAAHH IVASGAKAAA PARAHLASLG VGINEAANGA
YLPRFVSSAN PLGAAVHSTT HSPAYYAEVN RLILQTKTAA EARNVLAYIG RQLAAGPWP
//