ID C3G057_BACTU Unreviewed; 760 AA.
AC C3G057;
DT 16-JUN-2009, integrated into UniProtKB/TrEMBL.
DT 16-JUN-2009, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE SubName: Full=Internalin {ECO:0000313|EMBL:EEM72729.1};
GN ORFNames=bthur0009_11860 {ECO:0000313|EMBL:EEM72729.1};
OS Bacillus thuringiensis serovar andalousiensis BGSC 4AW1.
OC Bacteria; Bacillota; Bacilli; Bacillales; Bacillaceae; Bacillus;
OC Bacillus cereus group.
OX NCBI_TaxID=527032 {ECO:0000313|EMBL:EEM72729.1};
RN [1] {ECO:0000313|EMBL:EEM72729.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGSC 4AW1 {ECO:0000313|EMBL:EEM72729.1};
RX PubMed=22645259; DOI=10.1101/gr.134437.111;
RA Zwick M.E., Joseph S.J., Didelot X., Chen P.E., Bishop-Lilly K.A.,
RA Stewart A.C., Willner K., Nolan N., Lentz S., Thomason M.K.,
RA Sozhamannan S., Mateczun A.J., Du L., Read T.D.;
RT "Genomic characterization of the Bacillus cereus sensu lato species:
RT Backdrop to the evolution of Bacillus anthracis.";
RL Genome Res. 22:1512-1524(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEM72729.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACNG01000039; EEM72729.1; -; Genomic_DNA.
DR RefSeq; WP_000646257.1; NZ_CM000754.1.
DR AlphaFoldDB; C3G057; -.
DR HOGENOM; CLU_023024_0_0_9; -.
DR Proteomes; UP000001380; Chromosome.
DR CDD; cd06920; NEAT; 1.
DR Gene3D; 2.60.40.1850; -; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 2.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR025875; Leu-rich_rpt_4.
DR InterPro; IPR003591; Leu-rich_rpt_typical-subtyp.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR006635; NEAT_dom.
DR InterPro; IPR037250; NEAT_dom_sf.
DR InterPro; IPR001119; SLH_dom.
DR NCBIfam; NF033190; inl_like_NEAT_1; 1.
DR PANTHER; PTHR46652; LEUCINE-RICH REPEAT AND IQ DOMAIN-CONTAINING PROTEIN 1-RELATED; 1.
DR PANTHER; PTHR46652:SF3; LEUCINE-RICH REPEAT-CONTAINING PROTEIN 9 ISOFORM X1; 1.
DR Pfam; PF12799; LRR_4; 3.
DR Pfam; PF05031; NEAT; 1.
DR Pfam; PF00395; SLH; 3.
DR SMART; SM00365; LRR_SD22; 9.
DR SMART; SM00369; LRR_TYP; 5.
DR SMART; SM00725; NEAT; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF52058; L domain-like; 1.
DR SUPFAM; SSF158911; NEAT domain-like; 1.
DR PROSITE; PS51450; LRR; 7.
DR PROSITE; PS50978; NEAT; 1.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Leucine-rich repeat {ECO:0000256|ARBA:ARBA00022614};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..760
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002925804"
FT DOMAIN 35..159
FT /note="NEAT"
FT /evidence="ECO:0000259|PROSITE:PS50978"
FT DOMAIN 584..647
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 649..704
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 705..760
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REGION 153..185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 153..170
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..185
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 760 AA; 85883 MW; 8F0E4FBE62DAFF83 CRC64;
MKALVVATTL AIPFAAYSTP ALAAIKIEAN QSVAASDRTY DTEIKIYKDQ KDEPSMVSQY
IKDPKVAIVA GKKIVTVTMQ DSDYFQYLRI EDRNQPGVFH DVKVLSEDKR KNGTKVIQFE
IGEFEKKHNM QMHILIPAIG YDHKYQVQFE IKDPTVGDKE TEKPDDNSNS GNTEMDKPVD
NQNMITDNKL RELVNKKVFN RKDVNTPITK EELLQVKNLF LNTNEILDYS ALKYMPNLKS
LTVANAKIKD PSFFANLKQL NHLALRGNEF SDVTPLVKMD HLDSLDLSNN KITNVAPLIE
MKNVKSLYLS GNQIEDVTAL AKMEQLDYLN LANNKITNVA PLSALKNVTY LTLAGNQIED
IKPLYSLPLT DLVLTRNKVK DLSGIEQMKQ LEELWIGKNE IKDVTPLSKM TQLKQLHLPN
NELKDITPLS SLVNLQKLDL EANYISDLTP ASNLKKLVFL SFVANEIRDV RPVIELSKTA
YINVQNQKVF LEETEVNKEV KVPIYEKDGK ISTKIRLKDE GGTYSNDAVK WSTPGEKVYE
FGVKDPFADT GIFFTGSVIQ NVVESKADNT SKEDNTSKED AKVEVVEFKD VPKGHWSEEA
IHYLAKENIF KGYGNGQFGF GDSITRGQVA SLVQRYLKLE NKVEQKERFT DTKGHMFEQD
IATVAQAGIM QGDGTGEFRP DGVLTRYEMS VVLYKVFQLK EDGNNKVNFK DVPTGHWAEG
YVKALVDNNI SKGDGKERFL GDDFVTREQY AQFLYNAITK
//