ID F0Z2P1_9CLOT Unreviewed; 5529 AA.
AC F0Z2P1;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 24-JAN-2024, entry version 42.
DE SubName: Full=Putative transcriptional regulator, AraC family with Parallel beta-helix repeat {ECO:0000313|EMBL:EGB91716.1};
GN ORFNames=HMPREF0240_03368 {ECO:0000313|EMBL:EGB91716.1};
OS Clostridium sp. D5.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=556261 {ECO:0000313|EMBL:EGB91716.1, ECO:0000313|Proteomes:UP000003978};
RN [1] {ECO:0000313|EMBL:EGB91716.1, ECO:0000313|Proteomes:UP000003978}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=D5 {ECO:0000313|EMBL:EGB91716.1,
RC ECO:0000313|Proteomes:UP000003978};
RG The Broad Institute Genome Sequencing Platform;
RA Ward D., Earl A., Feldgarden M., Gevers D., Young S., Zeng Q., Koehrsen M.,
RA Alvarado L., Berlin A.M., Borenstein D., Chapman S.B., Chen Z., Engels R.,
RA Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.R.,
RA Heiman D.I., Hepburn T.A., Howarth C., Jen D., Larson L., Mehta T.,
RA Park D., Pearson M., Richards J., Roberts A., Saif S., Shea T.D.,
RA Shenoy N., Sisk P., Stolte C., Sykes S.N., Walk T., White J., Yandava C.,
RA Sibley C.D., White A.P., Crowley S., Surette M.G., Strauss J.C.,
RA Ambrose C.E., Allen-Vercoe E., Haas B., Nusbaum C., Birren B.;
RT "The Genome Sequence of Clostridium sp. D5.";
RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL870816; EGB91716.1; -; Genomic_DNA.
DR eggNOG; COG2373; Bacteria.
DR eggNOG; COG2982; Bacteria.
DR eggNOG; COG4223; Bacteria.
DR eggNOG; COG4932; Bacteria.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_223151_0_0_9; -.
DR OrthoDB; 9776008at2; -.
DR Proteomes; UP000003978; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.1080; -; 4.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.60.40.4270; Listeria-Bacteroides repeat domain; 3.
DR Gene3D; 3.10.20.320; Putative peptidoglycan bound protein (lpxtg motif); 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR047589; DUF11_rpt.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013378; InlB-like_B-rpt.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR042229; Listeria/Bacterioides_rpt_sf.
DR InterPro; IPR041286; MBG_2.
DR InterPro; IPR001434; OmcB-like_DUF11.
DR NCBIfam; TIGR01451; B_ant_repeat; 7.
DR NCBIfam; TIGR02543; List_Bact_rpt; 2.
DR Pfam; PF02368; Big_2; 4.
DR Pfam; PF01345; DUF11; 1.
DR Pfam; PF09479; Flg_new; 3.
DR Pfam; PF18676; MBG_2; 17.
DR SMART; SM00635; BID_2; 5.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 4.
PE 4: Predicted;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000003978};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..5529
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038891983"
FT TRANSMEM 5476..5493
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 267..342
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 351..428
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 436..513
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 517..595
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 597..676
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT REGION 40..137
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 239..259
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4607..4636
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4722..4765
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4840..4865
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5064..5095
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5180..5201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 40..59
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..97
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 100..115
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 116..137
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 239..257
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4748..4765
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5074..5091
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 5529 AA; 595633 MW; 94D1D936E55CFC70 CRC64;
MGIQMGRRKI MKKNFKRWLA LLLAVAMVAT SAVYSSGTAL KATGDPESVE QNDQQGEPQT
VDEGMGGADA EAPDSDTGEN GSKQVIELEK PEGDQDAAGQ EGQDTANSEA QDAKNEVSGT
EQKNDSEPAG DTQEEPAERK FAVVFNRPEV EGGSLTAWAD GSEKKDVTYD GGGKYTEEVT
EGTLLKFQIT VNDKYTLERV TDQNGTEIAA ASSEGNVYTY QITVSDNMEV NALYKEVSQK
ADEKKEEQKD EKIDGESPKI SYYSSNDISP FSITGEESMY VGESQWLEGE GQFGRSHKWT
AEGNGKVTIE PDGYTAEITA DKAGTVIITH TYKFGGTKTE TFNLTILDKI PLKGIEIVGA
DSARVGETLK LSTAPIPSDT TDETSATWSS SNKKIATVDE NGVIKGIAEG KAIITATSTV
NKEIKATKTI NVIRVPMTGI QIHGDSTVNA GDELQLTADI LPENTTDSKT ISWTSDNEDV
ASVSSKGLVT ANRGGTVKIT ALCGEFNDEI EILVTQEVSE VKVDIEDNKV IVGAATNASA
KVKPADATDK TVTWSSDNED VAVVDEEGHI TAVSEGTANI IATASNDITG SAQITVEQPS
VSVTADFLNP VIGDKIQAKA EVNPEWLPDN MKQIKWSSSD KAIADVNEEG VITAKAAGKA
DITASLKANP EISASVEITV HDPAMISSHS LSVEYVYQSG GRVQATFRAQ YVTGEKYEVE
IPEKQGYTTY LQEADGSLGQ EVSGTLTDNI YADQKLIIVY VPDSVEYIVT HRFEQTDGSY
KEEVEKKTGF VGDETEAEAK DLEQYTPSEF ENVVIGADRD INITITYVLA EYTVRFDTDG
GSYVTTQYPK HGDTVNVDAY KPTKEGYDFS GWYKDSGLTE RVTGTQTIKS DRTFYAKWSA
EDVNYTIVYW LENADDNNYS YSKSKNGTAK AGTELTVTAS QAGSINYFTF KNSEKKTVAA
DGSTIINVYY SRNTYTFTFY DNSKLVHTET KKYNADISKM LPFDSKYSGR AWKATSYYSY
ALQTLDRMPA QNIKFNLYQK SSSTLKTIYY YGQNVDGNGY SLIKSVGTYF NYITYEEEYH
PIEGYVRRDT GWKTKDFVNN KVDLYYNRAS YDLSFYNYNS IEKTESVKYE KSLSSYEHEP
ARPNNLPEYY TFQGWYTSEE CADGTEVDWN RTMPANDLVV YAKWASGEYT VEFDTQGGNS
INSQTVAAGS TAMQPSDPER EGYKFEGWYT SKDYSERYDF GKPVVENTTV YAKWKQITNT
TYTIKYIDDR DGSEVFAPVV KTGKVGNTVY VYPKAHDELV PTPSSVSLEL SWDSKKNVIE
FRYSIPGDVY YKVQYLEEGT ETSLLPDKVE HIGGNRVVET APAIEGYEVD SVRKVLNLAN
ETTEEDIKEN VITFYYTAIK NPIISSAVNG TIEPSGTKEV KWGESQTYTY DANPGFVLKS
VTVDGKDVTA SNPSSYTFSN VTAPHSIKVV YEVDFDGFSV SGVEKEYDGN SYHINFTGNT
YPSDQIEYWY NGMLQMGNPE FSDAGEYPVT VKVKRGNDVW QETATVKINA RKATITADSK
EKFFGETDPE FTGNVENLVD ENDLGTVTYV RSNKDEAVGV YKEVLDAKYT PNSNYEVTVT
RGDFEIKTAK VEGASLAAAG GSWPYDGEAH GAAAAVQGAE GYTIYYSVEG GEWTDQVPTV
TNVADGKKTV SVKATREGYE DLTADDVTIE ITAKPVTIKV NNADKFFDSS DPEFTGTVGE
LVAADDLGEV SYKRTNADEE VGVYTGVLTA EYTENSNYEV HVIPGNFEIK TASIPGAALN
ANGGEWVYDG YGHAAEAKVT QAEGYQILYR TEGGAWSADA PSVTDVSEGT KTVSVKAVRK
GYEDLTTNDV TIRIKPKAAT ITVKDSWKYF DAADPAFEGN VSGLIAQNDL GEISYSRSNT
DEEVGIYEEV LTADYDQNSN YTVSIIKGDF EIKTASIAGA ELKAYGGSWE YDGKAHAASA
VVNGADGYQI YYKTDNGQWS EEAPSVTNVS EGTKTVSVKA TRKGYTDLTA ENVTIQITAK
PVDIIVNNSW KFFDDNDPEF TGTVKGLIAD GDLGTVTYRR SNADEKVGVY PGVIVADYEN
NSNYTVNMKA GDFEIKTASI ADARLEASGG SWEYDGKAHA AEAKVKGAEG YTVYYKVGDG
EWTTEAPDVT NVSEGTRTVS VKATREGYQD LTAADVTIQI TAKDITITVD NSSKYFDEKD
PVFTGKVDGL IAKDDLGEVS YIRTNTDEAV GIYADVLNVE YQANSNYKVT VEPGDFEIKT
AKTEGAYVEA AGGSWTYDGK AHKASAAVKG AEGYTVYYKV GQGEWTESIP SVTNVSEGIK
TVSVKATRTG YEDLRAEDVT IAIQPKPVTI TVENSWKYFD AEDPAFSGTA EDLVKAGDLG
EISFRRTNTD EAVGIYADVL TADYEANSNY KVTIDKGDFE IKTASIEGAK LEAAGGSWIY
DGKAHSAKAE IKGAEDYTIY YKTGDGEWST TAPSVTDVAD GTVTVSVKAT RPGYTDLTAA
DVTIQITAKD VQIKVDSSWK YFDESDPEFD GKVTGLIAEN DLGTVSYRRT NADEAVGKYP
GVLTAGYMPN SNYKVTVIAG NFEIKTASIE NAKLEAKGGS WVYDGKSHSA GAEVLGADGY
TIYYKVGDGE WTDKIPSVTD VAEGVKTVSV KATRTGYEDL TAENVTIQIT AKPVIITVDS
SWKYFDAADP KFTGTVGELA AAEDLGDVKY VRTNADEAVG IYTDVLTAEY EENSNYAVTV
EKGDFEIKTA SVAEASLTAE GGSWVYDGKA HEAKAEVSGA EGYTIYYKIG DGEWTTEAPS
VTNVAEGVKT VSVKATRTGY EDLTVSDITL QITEKPVTIT VDNSWKYFDA ADPAFTGTIG
ELAAEGDLGE VSYKRTNAEE AVGTYQDVLT ADYKENSNYK VTIETGDFEI KTASIEGAHL
EAAGGSWVYD GEAHEVKAEV SGAEGYTVYY KTGDSDWTTE APYVTNVAEG RKTVSVKAVK
TGYEDLTAGD VTIEIKAKPV EITVDSSWKY FDAADPAFSG TVGELVTEGD LGDVSYRRTN
TEEAVGVYED VLTADYEENS NYQVTVVTGD FEIKTASIAD ASLSAEGGSW VYDGKAHAAK
AEVAGAEGYT VYYKADGGEW TTEAPSVTDV AEGIRTVSVK ATRTGYEDLT AEKVNIQITP
KAASIIVDDA WKYFDANDPV FNGTVNDLVK ADDLGTVTYQ RTNADEAVGL YNGVITAAYK
ANSNYQVSVI PGDFEIKTAA IPGAGLTAAG GSWEYDGKAH AAEATVQGAE DYTVYYKVGD
GEWTTAAPSV TDVAEGVKTV SVKATRTGYE DLIAEDVTIQ ITAKPVKITV DNSWKYFDAA
DPAFTGTVGE LAAAGDLGQV SYVRTNTDEA VGTYTDVLTA QYKENSNYAV TVEKGDFEIK
TASIAEASLT AQGGSWVYDG KAHEADAVVS GAEGYTIYYK TGDGDWTTEA PGVTNVAEGV
KTVSVKATRT GYADLTAADV TLEITAKPVT ITVDSSWKYF DEADPAFTGT IGKLAAEGDL
GEVSYRRTNT EEAVGTYEDV LTADYEENSN YNVTIKPGDF EIKTASIEGA YLEAAGGSWV
YDGKAHAVKT EVTGAEGYTV YYKTGDSDWT TEAPSVTDVA EGRKSVSVKA VKTGYEDLTA
RDVTIEITAK PVEITVDNSW KYFDAADPVF TGTVGELVTA GDLGDVSYRR NNTDEAVGTY
EDVLTAVYDD NSNYQVTVVT GDFEIKTASI PSANLSAEGG SWEYDGKAHA VKAEIAGAEG
YTVYYKLGDG DWTTEAPSVT DVSDGVKTVE VKATRTGYED LTAKAVTIEI TAKPVTITVD
NKWKYFDAAD PAFTGTVGEL VSAEDLGEVS YRRTNTEEAV GVYADVLTAD YQENSNYQVT
VETGDFEIKT ASISGAVLTA KGGSWVYDGA AHAVETLVQG AEGYTVYYKT GDADWTTKAP
SVTNVADGVK TVSVKAVRTG YEDLTAKDVT IQITAKPVSI IVDNNWKYFD AADPEFTGTV
GELVAQEDLG TVSYRRTNTD ESVGTYQDVL TADFTENSNY QVTVEPGDFE IKTASVEGAY
LDAQGGSWTY NGDAHYANAE VQGAQGYTVY YKTGDGDWTT TAPSVKDVAE GKKTVSVKAV
KTGYKDLTAE DVTIQITAKP VTITVDNSWK YFDAADPVFD GTVDGLIAEE DLGEVVYSRT
NTEEAVGVYP GVLTAGYAEN ENYTVTVVPG SFEIKTASIE NAKLEAAGGS WVYDGKSHAA
RAEVSGAEGY TIYYKTAGDD WTTEAPSVTN VSEGTVTVSV MAVREGYADL TAEDVTIQVT
QRPITITAAS AEKTYDGTAL VRAVSTVGLG ELLPLHSVTA TVEGSQTFVG SSRNVVTDAK
ITTLLGGDDV TGNYKIAYVD GTLTVTDKDV DVDDVVTKTH DNDKTYKYGD TITFDISVKN
IYDEYKTITI TEQEGVTITG SDVFENVAPG EVVTTTAAYT VTEEDILKGS FTNTVTAKFS
DEKKEYEKDD TVDKFEDPTG HLTVTKETTS RPVNGETYAL GETIIYKVVA VNDGNLTLNN
VVVTDELTGD EWTVGTLKPG QSSEAFKAEH TVTEEDILKG TVLNEATATG KSTDPENPDP
EIVPGNTEDP TVDPNGHLTV AKETTSSPAN GKAYALGETI SYQIVATNDG NLTLKNVVVI
DELTGDSWKI DTLKPGESSD VFTASHKVTQ KDILKGSVLN EATAAGESPD PEKPDPGVDP
GETEDPTDNP NGHLTVTKET TSTPENGAAY ALGETITYKI VAANDGNLTL KNVVVTDALT
GDEWTIESLK PGESSEAFTA EYKVTEQDIL NGTVLNVATA KGESPDPEKP DPGVEPGETE
DPTVTQQPSL FVEKTAAPSQ DSGYGLGDTV PYTIKVVNNG NVTISGITVN DDLTGGQWTI
DSLEPNGVEE FTTDYVVTEA DILAGQVVNV ATAGGTAPDG SEVTKEDTET IGTEISNPHL
SISKETTSTP ANGETYALGE TITYQIVATN DGNLTLTNVV VKDELTGDEW TIESLAPGAS
SEAFTAEYTV TSADIQKGSV LNVATAGGET PDPDKPDPGV DPGEKEDPTD DPNPSVAVVK
EVTSTPANGE AYALGETIGY KVTVTNNGNV PVENIKVDDS LVSITDNVIA SLAPGESREF
TYEYTVTEAD IRNGQVVNTA AASTDDPEGP KGSDEVVTPT EPEDADYTVN KTVVNPQDEY
RVGDTIQYQI GVSNTGNVTL HNVVVSDNLQ GAAGEAVFTE VGDNTVEGNK VVIKEIKVGE
TVVLNCEYQI VREDAGASIS NIASVTTDET GETPREDETE ETPVVNIYRL TVHYVDAAGQ
TVAPDYTGEY EVGAAFSITS PTVTGYTPDY RTVNSSADGM PAADVEVTVT YTANPVITPI
IPPVTPTNPA TPVTPVTPAA VTPTTPTAII PPAQTPAAAI DAELTENDEG DYDLTPIQDE
KTPLAKQNLD DHVCCILHFL LMLLALIVLA FYTRSMKKRQ ARIFELREEL ELEKTRRGLD
EEEKDDDAE
//