ID A0A3P1XYM9_9BACT Unreviewed; 3295 AA.
AC A0A3P1XYM9;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE SubName: Full=Tandem-95 repeat protein {ECO:0000313|EMBL:RRD63809.1};
GN ORFNames=EII26_09725 {ECO:0000313|EMBL:RRD63809.1};
OS Fretibacterium sp. OH1220_COT-178.
OC Bacteria; Synergistota; Synergistia; Synergistales; Aminobacteriaceae;
OC Fretibacterium.
OX NCBI_TaxID=2491047 {ECO:0000313|EMBL:RRD63809.1, ECO:0000313|Proteomes:UP000267192};
RN [1] {ECO:0000313|EMBL:RRD63809.1, ECO:0000313|Proteomes:UP000267192}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=OH1220_COT-178 {ECO:0000313|EMBL:RRD63809.1,
RC ECO:0000313|Proteomes:UP000267192};
RA Coil D.A., Jospin G., Darling A.E., Wallis C., Davis I.J., Harris S.,
RA Eisen J.A., Holcombe L.J., O'Flynn C.;
RT "Genomes From Bacteria Associated with the Canine Oral Cavity: a Test Case
RT for Automated Genome-Based Taxonomic Assignment.";
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RRD63809.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RQYL01000023; RRD63809.1; -; Genomic_DNA.
DR OrthoDB; 595640at2; -.
DR Proteomes; UP000267192; Unassembled WGS sequence.
DR Gene3D; 2.60.40.1200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 6.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR010221; VCBS_dom.
DR NCBIfam; NF012211; tand_rpt_95; 2.
DR NCBIfam; TIGR01965; VCBS_repeat; 6.
DR Pfam; PF17963; Big_9; 6.
DR SUPFAM; SSF81995; beta-sandwich domain of Sec23/24; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000267192}.
FT REGION 68..112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 179..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 400..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 537..565
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3091..3168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3223..3262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 195..273
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3295 AA; 337304 MW; 62F3F45BF83A504E CRC64;
MAQTILAVKV LRVEGEAFVR RPNGSLVPIR EGDVLHKGDV LVTRGGSVYF EGPEGDVFGI
PPYRSFSIQP ENLAPQPEED IDAGAPRASA STALPTPEED VESSPSHNMG GFGSFRAPRV
DYLQDMRFFD QGNLGRDVNE IFRFSDRHTE NPRIAYEFDL DRVKFPLYGV EIDYPQAGRP
DSYLPNHPRD QFTDVPPVKP LPPAPGPQPG PQPGPQPGPQ PGPQPGPQPA PGPQPGPEPQ
PGPQPGPQPD PGTPGGPGGP TPPPPPPGLN TPPTAQPDVN TVVRGGPDAP GHDDGSNATT
IVAGNVIPND NDPDGDTLTV VGVVPGTQPS ASGGVGGGVG GAYGSIKINP DGTYTYTLDN
TNPAVQALTG TDRLTDTFTY TVSDGKGGTA TTTVTITIGA TANTPPTADP DTNTVPEDGP
AATGNVIPND SDPDGDTLTV VGVVPGNQPS ATGNVGSNVP GEYGTIVINP DGSYTYTPDP
NNPDVQGLGG GETLTDTFTY TISDGHGGTD TTTVTITING ANDAPVARPD VNLAVRGADD
ASGHDDGNPN TSVVAGNVIP NDTDPEGDPL TVVGVAAGTP GTPPTGDVGV GTGVPGTYGT
IVVNPDGTYT YALDNTNPAV QGLTGTNTLT DTFTYTISDG KGGTATTTVT ITIGATANTP
PTAVPDTNTA TEDGPAVTGN VIVPQSPGDR ADSDPENDPL TVVGVAKGDT GTATGNVGTN
VPGDYGTIKI NPDGSYTYTP DNSNPDVQKL GEGDTLTDTF TYTISDGKGG TATTTVTITI
NGTNDAPEAQ PDVNTVTKNA ADQPGYEDGS AATTVVAGNV IPNDSDPEGN PLSVVGVVAG
NQPSASGNVG GGVSGDYGTI VINPDGTYTY KLDNTNPDVQ SLAGGNTLTD TFTYTISDGK
GGTATTTVTI TINGADTEFG VTPDANTVTE DSKPVAADNV LRNDSTPVGD DLTVVGLAKG
DTGTDLDAPA TIGSNIAGDH GTIKIDRYGN YTYTLDNGKP EVQALAVGET LTEKFTYTVV
DPSGAMKHTT LTITINGTND VPDITVDAGD SDRANLTETN APLTADGTLS LEDVDTHDVV
AVAKTGVSVN TGESTYTGPL PAGLTEAELL KMFTVAGGLD GTQSSVPNVI RWSFNSGDAD
LPAEVEAFKF LPKDQTLVLD YTVKATDPHN ASDTQVVKIV ITGINDTGTL GGNIVRNTNE
DTNITSGNVL TDGAGFVQDP DQGESLAVRD FTIDGMAGTH AVGTPVNVTN AANEVIGTLT
IQSDGAYTFE PKTNYSGPVP QVTFTANIHG VGTDTRTLDI TVNPVSDAPG LTASSKDVLE
DNTVALGLKA PTITDRTDLT GGADNHDNPE RLGVITLSAI PAGAKIFNGA DELVPAGGVL
KIVLTDDPND HVAGAGATPG VVAMTKAQYE ALTITPPAHR HEDIDIKVSV TEYEVDDAGN
VLPGVPGATS TTTAHVEVKA VTDDVSLTVT NPDLGQQNED GLINVTDRLT AQFDDLDGSE
DRWLVLEAPA GGGVRVRIGG VDYVAAAGEQ VKIPVPSLST GTALPAIELG AAGNVSGNLD
GIKLKLVAQD RDGDSTLNPG GTTQDPHHGL AGVTEAEDRA ESKVAEVTLN LKVAPIAGDV
APNASAQMVE DAGRTAETSG AKFLANIALT DNDGSETITG ITIDAIPAGW VLKDHNGQTV
NAGHTVDMAQ YKNYTAVAPN HSSKDEVVRV TVTTTDTAPG FGPATGTQAV DVTIKVTPRA
EALKKVGDSW VTQDLDGDGT SDLTMGGGKI YTTRALEDTP FTINQDGFVF SQGWANQDAD
EQTFARLTPQ IWDAKAGKFV DAVGAKFTWD GGSGTYRGSG VNIPMDKLDS VKFEAPKNAA
GAFQIKVQAY TKDFDADAPG DNSQANTQLS GEAWLKNLIV APKADDVTFA SAQVKGFEDS
NIDLKGKINP ISPDPDETFN ITIRNVPAGA VLTYKGAVLT AQPDGSYKID GFDKNADFSV
KPPLNDNTNF TLQVSAVSVD TVTVPKLDGS GNPIPGQTET ITSTNPVATT LDLLVEVKGV
ADPATLVTAP LATTEGTAEA NDGKIALKNA FQTIAPKDAD GSETVSLVIS GLAEQFDMAG
AKFLGGSGEG RRWLVTKEAL DDESASVVVK KNYSGTINLK ATAVTVENDG NANPNAAEKN
VQIEVAATPE AKMNLEASGK EDAPAKLDFT IQPQNDDADE TLEAVWIKAA DVDSQNITLT
LGAAGAALAA DADGWYKIQK ADLENVFAKG PANKDGDFSF GVKYTVGDAP ADGTLGMETQ
QFDGSYTLKL APVTDATDSD VTAISQGVGT MTLNGTAGLK ITDVADAGPN KGQATFSVDI
KVDQQPDPNA GNTADTDGSE QLAYFVVDNV PYGVSIKGGT LVGKGTPEAG DLSGDFPVNR
WVVEVPEGTG AFTGGTFSKT LEFTLDNNSG SFDNAFRSYT MRATAFSKDG KAIELAESDA
EWKLETAILN QGDVNYEPPT LTAQKLSPTA TEDLEIALKE LLKVEITNIG NDPAASYVIS
LKGLDPRIEV EGMTKTTIDG ETVWTASAPG DTTVPATQNG MDHLLDSIKL KLPEHWNDNK
AGDGLDIQFD ASVRGYTVAS NPNRPDTYAS ADVTDIRPAG VTPVTDEATM TLDAPVVDEG
AAGGVPITID LTNEADGASA TIVGGKMYVQ IQEPAHGTGG TLEYQGTAIA QTNVNGVPGV
DNGRYYVVEG VNVGGDQGSG TVSLTYRPAA HASGDFKVTA HVVSQETGAA NTVASTVEQT
VHVNAVNSGY DLTVGNAAGQ EDTRIQIQVG GTGLVDADGS EQVVSATLTN VPDDYKVFVG
ADAGSAQEAK NVGGGTWALT LTPDGKLPGY IAVQAPQNIS ETAAAVKLTV YSGEKGQTTV
KANEAVFDVK VEAVADGLTI NPTQTFGKAG ESIPIHLNAT VRDTDGSETV TVALKGLGAG
VTFNNGSASY DVHTDTYTVS GIAYNKVPEL AFTRNAPMSG TVNVTAKTVE TSNGDTSAAV
SSSFQVDIKP GTAPSGITGR SAFFSALPEG AEGMEPILDA DGNPIDPTLD PAAAALGASL
LGEANNLPEG TDAEAEAAID EAAALLPVGA DATNTDIPDL PDTADSGTSE PAEGTALPDL
ADGAADTSAA VGEPAGTADL PAPDPIDPVD TGTEAGVPAD NLPDLDDAPL ADAADAVSEP
GPDLDDVPSV DAAEVAPAET AGEDAPDLGN APAEDALDLN GVADMGSDAA DEGAPELGGL
TADDVLDLGG GDLPLPGEDA PEAIPEVESA PEFYAPPVPD AAVTIAQEMD DAIQP
//