ID N6Y363_9RHOO Unreviewed; 275 AA.
AC N6Y363;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=Cell-surface large adhesin {ECO:0000313|EMBL:ENO76578.1};
GN ORFNames=B447_17581 {ECO:0000313|EMBL:ENO76578.1};
OS Thauera sp. 27.
OC Bacteria; Pseudomonadota; Betaproteobacteria; Rhodocyclales; Zoogloeaceae;
OC Thauera.
OX NCBI_TaxID=305700 {ECO:0000313|EMBL:ENO76578.1, ECO:0000313|Proteomes:UP000013140};
RN [1] {ECO:0000313|EMBL:ENO76578.1, ECO:0000313|Proteomes:UP000013140}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=27 {ECO:0000313|EMBL:ENO76578.1,
RC ECO:0000313|Proteomes:UP000013140};
RA Liu B., Shapleigh J.P., Frostegard A.H.;
RT "Draft Genome Sequences of 6 Strains from Genus Thauera.";
RL Submitted (SEP-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ENO76578.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMXB01000038; ENO76578.1; -; Genomic_DNA.
DR AlphaFoldDB; N6Y363; -.
DR eggNOG; COG3064; Bacteria.
DR Proteomes; UP000013140; Unassembled WGS sequence.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000013140}.
FT REGION 31..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 74..89
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 275 AA; 28224 MW; 60D953D6B448A49D CRC64;
MSLLRSKALL AVVAERVAAA ERKLADLLAR QLERGEPGAP GPAGERGAEG PRGAPGHPGP
RGEKGEPGPM GPRGEKGPQG PPGPAGPPGE KGDRGPQGEP GPVGPAGPAG RAGDVGPTPR
HEWRGSKLRF EQSPGTWGDW TDLRGPRGPV GPAGGGGGSG GGDLARLLPG GAHVEPAGLA
VLQAGQWVQL PWTAFISMIE GALEMGETNM ARRVDFVGDT LIYRGEAAPG ADEAAPVWRI
KRIQFGADGD VTETWADGVA EFAHVWTDRA SLTYL
//