ID A0A0R2H0R9_WEIVI Unreviewed; 1251 AA.
AC A0A0R2H0R9;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE SubName: Full=Large repetitive protein {ECO:0000313|EMBL:KRN46513.1};
GN ORFNames=IV50_GL000790 {ECO:0000313|EMBL:KRN46513.1};
OS Weissella viridescens (Lactobacillus viridescens).
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Lactobacillaceae; Weissella.
OX NCBI_TaxID=1629 {ECO:0000313|EMBL:KRN46513.1, ECO:0000313|Proteomes:UP000051992};
RN [1] {ECO:0000313|EMBL:KRN46513.1, ECO:0000313|Proteomes:UP000051992}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 20410 {ECO:0000313|EMBL:KRN46513.1,
RC ECO:0000313|Proteomes:UP000051992};
RX PubMed=26415554; DOI=10.1038/ncomms9322;
RA Sun Z., Harris H.M., McCann A., Guo C., Argimon S., Zhang W., Yang X.,
RA Jeffery I.B., Cooney J.C., Kagawa T.F., Liu W., Song Y., Salvetti E.,
RA Wrobel A., Rasinkangas P., Parkhill J., Rea M.C., O'Sullivan O., Ritari J.,
RA Douillard F.P., Paul Ross R., Yang R., Briner A.E., Felis G.E.,
RA de Vos W.M., Barrangou R., Klaenhammer T.R., Caufield P.W., Cui Y.,
RA Zhang H., O'Toole P.W.;
RT "Expanding the biotechnology potential of lactobacilli through comparative
RT genomics of 213 strains and associated genera.";
RL Nat. Commun. 6:8322-8322(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRN46513.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JQBM01000002; KRN46513.1; -; Genomic_DNA.
DR RefSeq; WP_057745371.1; NZ_JQBM01000002.1.
DR AlphaFoldDB; A0A0R2H0R9; -.
DR PATRIC; fig|1629.5.peg.795; -.
DR OrthoDB; 393128at2; -.
DR Proteomes; UP000051992; Unassembled WGS sequence.
DR Gene3D; 1.20.5.420; Immunoglobulin FC, subunit C; 8.
DR InterPro; IPR020840; Extracell_matrix-bd_GA.
DR InterPro; IPR002988; GA_module.
DR InterPro; IPR009063; Ig/albumin-bd_sf.
DR InterPro; IPR019931; LPXTG_anchor.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR Pfam; PF01468; GA; 8.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR SMART; SM00844; GA; 8.
DR SUPFAM; SSF46997; Bacterial immunoglobulin/albumin-binding domains; 6.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Reference proteome {ECO:0000313|Proteomes:UP000051992};
KW Secreted {ECO:0000256|ARBA:ARBA00022512}.
FT DOMAIN 1214..1251
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 42..63
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 387..413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 788..1173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 799..828
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 844..1045
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1056..1120
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1121..1148
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1251 AA; 130748 MW; 3C8E5C6E254CA821 CRC64;
MSKAVADGSL KTSTSASQSL KTSASISASK AALKKVTRVA AGATSTSTPV PGHQANRESG
IDAGQANGFL NTVDRALKMV GSSLADLGLS TDISGILDGT IAITQGKDGS VKVTVDQGFL
SAFGKMDGVA YYWNPKANNG AGAWVEIPGS RDKLQTSGTD TIEIPFNPSD YGFEEGDTVY
FQVGVNTSGL VQSVGDAAKR EISRLAPGIA GDAANALLQG AINAFEATIG TGNNYEYSDV
KGAPVIKDYN AMSESASASL LESLDASISG SMETSYSMSQ ASESYGQDSL DSFSATDASL
SASIQSSLAD SASTSKSVSL SVSNFNELEG ALWNSNSKSA STRDSLQVSE SDSASLSEST
ASVSAFRSQS TDSVSASKSF ASASNQAVAD DGGKSLSQRL QSQNDSRSES TSFSNAVAES
VSIQNSLNTS VSAAWKSAQD SASREVTTGI WPVQRIDSQK SLQISLSMSI SQADSTQASA
SVSTEVTPGS LAASKSISEA NSTSLSASKS NYDAELQAST DASVSASTSD SMDSAAKSAS
AVDASAAASA SANDASLEAS TSANNSLNAS LETSAKSSLA GLVSGSTSIS ESTSQSIVDS
IASQISLNIA EGSASVSRSN SIQLSGIDAA NSRSARDEIG ILLPNRSKST SISLILSDSV
SKSVSESNAQ WPSVSASASK SAASVNSEFH QSLADSTSTS NSEVASESAA VNSASAEVST
STTGLIIDKY LPNYENNEKQ DAKNKIDALE NLTDAEKDDF KKQVDDATTT PEINGVVTDA
IAKDLDNAKQ SAKDTIDGLT NLDDAEKKDF NDKVDAAKTT DEVKQIVQDA KDANAAKQDP
QALEDAKNAA KDEINKLPNL TDDEKKAAQD AVDASKTTDE VAKNLQDAKD QDQANKDLQD
AKDAAKDEIS KLPNLTDDEK KAAEDAVDAS KTTDEVDSAL KDAKDQDQAN KDLQDAKDDA
KNQIDKLPNL NDKEKQDAKD AVDDAKTTDE VDDALNDAKD KNEANQDPKE LEDAKKDAKD
QIDKLPNLTD KEKEDIKKDI DDAKTTDEVD EIVEDAKDQN EANQDSKNLE DAKNDAKNQI
DKLPNLTDQE KDDLKKRVDE ATSTDDINNI LDEAKRQNDA KTTDPSNPTN QDKGTDGKGN
QGTDGQAQDG KNLEDVKSNG KNQIDGLQNL TEQEKNDFKA RIDAATSEAE VNAIINEAKA
LSAQRAADKK QAQLPNTGVE DAQNAGLLGG SMLGALALFG LGKRNKKREE K
//