GenomeNet

Database: UniProt
Entry: H3SPL5_9BACL
LinkDB: H3SPL5_9BACL
Original site: H3SPL5_9BACL 
ID   H3SPL5_9BACL            Unreviewed;       791 AA.
AC   H3SPL5;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   24-JAN-2024, entry version 39.
DE   SubName: Full=Surface protein C {ECO:0000313|EMBL:EHQ58991.1};
DE   Flags: Fragment;
GN   ORFNames=PDENDC454_27628 {ECO:0000313|EMBL:EHQ58991.1};
OS   Paenibacillus dendritiformis C454.
OC   Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX   NCBI_TaxID=1131935 {ECO:0000313|EMBL:EHQ58991.1, ECO:0000313|Proteomes:UP000003900};
RN   [1] {ECO:0000313|EMBL:EHQ58991.1, ECO:0000313|Proteomes:UP000003900}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=C454 {ECO:0000313|EMBL:EHQ58991.1,
RC   ECO:0000313|Proteomes:UP000003900};
RX   PubMed=22461558; DOI=10.1128/JB.00158-12;
RA   Sirota-Madi A., Olender T., Helman Y., Brainis I., Finkelshtein A.,
RA   Roth D., Hagai E., Leshkowitz D., Brodsky L., Galatenko V., Nikolaev V.,
RA   Gutnick D.L., Lancet D., Ben-Jacob E.;
RT   "Genome Sequence of the Pattern-Forming Social Bacterium Paenibacillus
RT   dendritiformis C454 Chiral Morphotype.";
RL   J. Bacteriol. 194:2127-2128(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EHQ58991.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AHKH01000186; EHQ58991.1; -; Genomic_DNA.
DR   AlphaFoldDB; H3SPL5; -.
DR   STRING; 1131935.PDENDC454_27628; -.
DR   Proteomes; UP000003900; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00198; vWFA; 1.
DR   Gene3D; 2.60.40.2110; -; 1.
DR   Gene3D; 3.10.20.320; Putative peptidoglycan bound protein (lpxtg motif); 3.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR036168; AP2_Mu_C_sf.
DR   InterPro; IPR009459; MucBP_dom.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR10579; CALCIUM-ACTIVATED CHLORIDE CHANNEL REGULATOR; 1.
DR   PANTHER; PTHR10579:SF43; CHLORIDE CHANNEL ACCESSORY 1; 1.
DR   Pfam; PF06458; MucBP; 5.
DR   Pfam; PF00092; VWA; 1.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF49447; Second domain of Mu2 adaptin subunit (ap50) of ap2 adaptor; 1.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000003900};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          64..303
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          371..400
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        379..396
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         791
FT                   /evidence="ECO:0000313|EMBL:EHQ58991.1"
SQ   SEQUENCE   791 AA;  86186 MW;  93B293A313409C7B CRC64;
     MLSVAFGTIP GTLGAQAANS GELEWPNPGA VKLTKEANPV EGKLDEWDIT LTVEGKNLKS
     GSSDIVLVID KSGSMTQKPN QNRLPKAKDA AKKFVDNLLI EDSDTRIAVV TFNKTSDQVS
     DFKGLDQKTE LKRAIDNIQA TGGTNIQAGL HEAQELLKTS QAKNKVIVLL SDGEPTYSYK
     ASAAVKGSWG NNSHQFILSD FNYKNIIGSG SSYSLGSSWY GLGKETCGWW SCSYEFEIKD
     NGIGTISEAK LAHDAGIHIY SIGLDVGSNS NATNTLKDVA NKGYYSGTSD SLERIFSELA
     SKIAFAAENA VVTDPMGDMF DLKLKGSSFG PDDYKASQGE VTWNPQTETF TWNIGNVTEG
     NPATLTYRVK MDHSKNPDPK QLYPTNKTTT MNYTDAKGDR TSKDFEVPRV GLGKGSILVK
     AYKVNANGKP VNSDGQEVER PDLAQELYSR YHEEDGSDAL EVGKSYTVPA PTVPGYTLRV
     GDNPTKVDLT VSKPSPIIWF GYNDVPNKLT IVHQSGDKVL ERSDADKKPG ESIDVTSKNF
     AGYEFANVEV SEGSGLTVDN GHVTGVMPGK DVTITFHYTA KDQSVKVRYV DRATGKDLLE
     PSVKTGKTGE ILTLEAAHVA GYEAEKPTTV EYVLKAGENP DHVFYYNGTE QSVKVRYVDR
     GTGKELLAPI VKTGKTGEEL TLKAEEIAGY EAEEPKTVTY ELKADENPDH VFYYNGLGQS
     VKVRYVDRVT GKDLLEPSVK TGKTGEELTL KAEEIAGYEA EEPKTLTYEL KAGENPDHVF
     YYKAQKQTVT V
//
DBGET integrated database retrieval system