ID A0A0M2S209_9ACTN Unreviewed; 1225 AA.
AC A0A0M2S209;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN ORFNames=LQ51_10160 {ECO:0000313|EMBL:KKK06079.1};
OS Micromonospora sp. HK10.
OC Bacteria; Actinomycetota; Actinomycetes; Micromonosporales;
OC Micromonosporaceae; Micromonospora.
OX NCBI_TaxID=1538294 {ECO:0000313|EMBL:KKK06079.1, ECO:0000313|Proteomes:UP000034330};
RN [1] {ECO:0000313|EMBL:KKK06079.1, ECO:0000313|Proteomes:UP000034330}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HK10 {ECO:0000313|EMBL:KKK06079.1,
RC ECO:0000313|Proteomes:UP000034330};
RA Talukdar M., Das D., Borah C., Deka Boruah H.P., Bora T.C., Singh A.K.;
RT "Draft genome sequence of Micromonospora HK10, isolated from Kaziranga
RT National park, Assam, India.";
RL Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the TolB family.
CC {ECO:0000256|ARBA:ARBA00009820}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KKK06079.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JTGL01000104; KKK06079.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0M2S209; -.
DR STRING; 1538294.LQ51_10160; -.
DR PATRIC; fig|1538294.3.peg.1945; -.
DR HOGENOM; CLU_007390_0_0_11; -.
DR Proteomes; UP000034330; Unassembled WGS sequence.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 3.
DR Gene3D; 2.120.10.60; Tricorn protease N-terminal domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR011659; PD40.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR36842; PROTEIN TOLB HOMOLOG; 1.
DR PANTHER; PTHR36842:SF1; TOL-PAL SYSTEM PROTEIN TOLB; 1.
DR Pfam; PF07676; PD40; 8.
DR Pfam; PF00092; VWA; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF82171; DPP6 N-terminal domain-like; 2.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 3: Inferred from homology;
FT DOMAIN 496..656
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1..55
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 317..350
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 13..28
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1225 AA; 126316 MW; 21DE1C1358FBC90D CRC64;
MVLGLLAGPA VAAAPEPPGP PAPAPPPAVD AGYRLGYTSP ERPTLLVGGG TGPAQPLLTA
AERGDAEQDA DARAGRLVWV GRRAVDGSAD RNGALYLRRP GAAPARLLGG AGTVGQPALS
PDGRRVAFTS DRAGNADIWV IGVDGTGLRR LTDHPAEDSW PTWSPDGGRI AFASTRADLA
GDIWVVPATG GTPVPITDGP AADGQPAWSP DGRRIALSTT RFAPAGAPDV RTVATVAPTG
GPVTRLVPGP GDAAEPAWSG DGARLAFTTT RDDPAGDVYL LRDGRVTPVA TGPLPQHDPA
WRGADLLWTG TDEADSTDVW SADATGGDRR DHTARPGRNE TGPAFSPDGT RLAYSAEQAD
GGARVVLADA TGADPQVLAP PGTADGDRDT DPTWSPAGDA IAFSRQPADS DEPSRILVVA
VADARLLAEV PMPAYLRGSD AEPAWSPDGR RLAFARYATT RDSDLETPVV DRPALPGSTF
TVTQSVRTPE IPPRPDIVFL VDDTASMAQP GEGGASVIQQ LRARLPEVIK NVRSSQPEAR
FGLATFSGVD GEGGHDPLMY FPRQPVTADD AAVRRAVDGL TAQSPYGTEN WFYALRQLAR
NDRIGFRPDS SRVVVLISDT DSVDKTVPPP EEGSIAEADL TRELQAAGIA LIGVPIVGAD
FERGLNYDGA AGRITAATGG RLTDDSDPGG VIDAVQRAIR ELTVTVRPAA TCADGLSVRF
DPDPARVAAG QPAVFRETVT LAPGAVPGTV LRCTVRFDLE PPEAGADAVQ ELIVRVVPPG
LPLVRVDDVR VAPAGPDGAR VSYQASAVDA TGRPLPVRCV PASGSLFPIG QTVVTCTATD
GAGRTGSDTA LVVVADPATQ GSRIWLAGLD AGPGGTLTVT DQRDLSARVG PACPSRSADR
APAWSPDGTA LAFADSAFDL CVVTPEGAGA RHPLAAADRA GRMVADPAWS PDGRRIAVAL
SRPESPGLRA AEPTDIVVLP GRGGPATPVI RTVGSQPAFQ RLPVPELSLA VSVGGLPGYL
GGDPLPVTFT VRNATRLPAD NVWLDVAAPA PLLPLTTADP RCDAGLRLCR LGTLGPGAQQ
VVTVVLPARA AVTATVTGRL TATVRQVPAS RIAQAPVRVL APRVRLDPAI GPPGFVTTAL
GADFPPGARV RLSWTPGITT TPDTVTVGAD GSFRTPVLVL RKDTLGPRDL TANRVAGRPF
APVRAPEPFL VVPRDLGPPV FSGRG
//