ID A0A1F5AIG9_9BACT Unreviewed; 844 AA.
AC A0A1F5AIG9;
DT 15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT 15-FEB-2017, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE RecName: Full=VCBS repeat-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A2W03_14015 {ECO:0000313|EMBL:OGD18253.1};
OS Candidatus Aminicenantes bacterium RBG_16_63_16.
OC Bacteria; Candidatus Aminicenantes.
OX NCBI_TaxID=1797273 {ECO:0000313|EMBL:OGD18253.1, ECO:0000313|Proteomes:UP000178699};
RN [1] {ECO:0000313|EMBL:OGD18253.1, ECO:0000313|Proteomes:UP000178699}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=27774985; DOI=10.1038/ncomms13219;
RA Anantharaman K., Brown C.T., Hug L.A., Sharon I., Castelle C.J.,
RA Probst A.J., Thomas B.C., Singh A., Wilkins M.J., Karaoz U., Brodie E.L.,
RA Williams K.H., Hubbard S.S., Banfield J.F.;
RT "Thousands of microbial genomes shed light on interconnected biogeochemical
RT processes in an aquifer system.";
RL Nat. Commun. 7:13219-13219(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OGD18253.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MEYB01000117; OGD18253.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1F5AIG9; -.
DR Proteomes; UP000178699; Unassembled WGS sequence.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 1.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR028994; Integrin_alpha_N.
DR PANTHER; PTHR44103; PROPROTEIN CONVERTASE P; 1.
DR PANTHER; PTHR44103:SF1; PROPROTEIN CONVERTASE P; 1.
DR Pfam; PF13517; FG-GAP_3; 3.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 2.
PE 4: Predicted;
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..844
FT /note="VCBS repeat-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5009516740"
FT REGION 825..844
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 844 AA; 92329 MW; 2E9661CEFD10B20A CRC64;
MDTRRIVPFA LALLALGASA ADPPPPRYFV EVTLIRPARL PNTPCDLVMN FQDYLDELGV
GGEFDRHSIS VEARNPKTGR FEPADFRLDE RFKYGDSGRL LWLIRDPSMT VFRVGFDVAA
RPPRRPPDYV PAIGAGDELM FNTGEPAPLV AMSAPLLADL TGDGITDVLA INHYSDRFGW
PEDGILLQPG IREDDGGLAV RDFFRLHFVP EGGGASDLRP LHARYNWVWP VDWDADGLTD
LLYISMAQGA TGPNALPENR DLFASPGYIT FLKNTGTKLA GEAPLFMEAG RYPAAELTEN
AYVPSLACAD LDGDGRNDLV GVRTSPDGAV RAVSLYFYRY SGAGAGLLPG LDPPALLKTA
DGRPVSATQN AHLVSLGDMD GDGRPDVIGN ELSPSQVYWF RNLGGSPPRF GERTRVAGLP
EDLKGYRWVS WKAGPGLIGL ESSRLFARKM REKGPAFEPA GRLREVSGPV RGGLQEKPEW
VDWDDDGDPD LLAGEFAGTI QLYENAGAPG RPKFLPPVPV AAAGRPVRIT RDGVFGGKHW
HGMAGYPSVA CADWDGDGLF DLIVPNETNR VFWYRNIGRR GSPAFGERRQ ILPDGFVDSA
ERLEKTRRLA EDPGVPNHPY PLEPDIPFFW RTRLAIADYT GDGLTDLIAL DGLKNLVLYP
RYRNAQGEFR VGRGEAAVDN LGKPILSPHF FKLRDVDWDG DGLVDIVATQ NLFGPDQRSL
LFLRNVGTKA KPAFARPEAI QLWGQDIRYS SHGLQPSFLD FDGDGSLDFV GCTESGLYVL
FRRAALTGPK PVAVAGAPQL LPAQRPQSGA LFEIWYQPPL LASYSHSPSA PKLTARVESN
PRPQ
//