ID Q118Q6_TRIEI Unreviewed; 3298 AA.
AC Q118Q6;
DT 22-AUG-2006, integrated into UniProtKB/TrEMBL.
DT 22-AUG-2006, sequence version 1.
DT 27-MAR-2024, entry version 90.
DE SubName: Full=Filamentous haemagglutinin family outer membrane protein {ECO:0000313|EMBL:ABG50018.1};
GN OrderedLocusNames=Tery_0573 {ECO:0000313|EMBL:ABG50018.1};
OS Trichodesmium erythraeum (strain IMS101).
OC Bacteria; Cyanobacteriota; Cyanophyceae; Oscillatoriophycideae;
OC Oscillatoriales; Microcoleaceae; Trichodesmium.
OX NCBI_TaxID=203124 {ECO:0000313|EMBL:ABG50018.1};
RN [1] {ECO:0000313|EMBL:ABG50018.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IMS101 {ECO:0000313|EMBL:ABG50018.1};
RG US DOE Joint Genome Institute;
RA Copeland A., Lucas S., Lapidus A., Barry K., Detter J.C.,
RA Glavina del Rio T., Hammon N., Israni S., Dalin E., Tice H., Pitluck S.,
RA Kiss H., Munk A.C., Brettin T., Bruce D., Han C., Tapia R., Gilna P.,
RA Schmutz J., Larimer F., Land M., Hauser L., Kyrpides N., Kim E.,
RA Richardson P.;
RT "Complete sequence of Trichodesmium erythraeum IMS101.";
RL Submitted (JUN-2006) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP000393; ABG50018.1; -; Genomic_DNA.
DR STRING; 203124.Tery_0573; -.
DR KEGG; ter:Tery_0573; -.
DR eggNOG; COG3210; Bacteria.
DR eggNOG; COG4995; Bacteria.
DR HOGENOM; CLU_224734_0_0_3; -.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 4.
DR InterPro; IPR024983; CHAT_dom.
DR InterPro; IPR008638; FhaB/CdiA-like_TPS.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR002884; P_dom.
DR InterPro; IPR012334; Pectin_lyas_fold.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR NCBIfam; TIGR01901; adhes_NPXG; 1.
DR Pfam; PF12770; CHAT; 1.
DR Pfam; PF01483; P_proprotein; 1.
DR SMART; SM00912; Haemagg_act; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF51126; Pectin lyase-like; 4.
DR PROSITE; PS51829; P_HOMO_B; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670}.
FT DOMAIN 876..1030
FT /note="P/Homo B"
FT /evidence="ECO:0000259|PROSITE:PS51829"
FT REGION 236..263
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 400..496
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2804..2828
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2844..2870
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..420
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 421..444
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 475..489
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3298 AA; 342249 MW; 40699DC48D56BA8A CRC64;
MHSGQTANFV TTPDTRNVLG RVKGGASYIN GLIQVLGSSS NLFLMNPAGI MFGPNASLNV
PASFSVTTAT GIGFDQNNFW FKAMGTNDYS NLVGNPSGYR FDVSKPGSIV NEGNLTLKPQ
GNLTLSGGTV VNTGELSSPG GNITVTAVEG GSTLKISQPG HLLSLEVPLE DGENISNIDP
LSLPELLAGG GDIVEATSVV VKENGDVVLN GSNTMVAETP GTATISGKID VSTTATSLKE
KGRVPSQRKK EGAASPGKID VPPSLKEKRD VAKAGKVNVW GDRVALIDTN IKADGKNGGG
TVLIGGDFQG LGIVPNSQHT FVNNNSFISA DAITNGDGGQ VRIWSDGITN FAGNISAKGG
TSSGNGGLVK IGGKEQLIFD GKVDVTAALG TKGRILLDPE SVTVGEDNSE GEKEIVEDNS
EVSETENTDN YDGEIIEDNS EVSETENTDN YNGEIVEDNS EVSETENTDN STTKNTDNLE
DKETENPLDP FAADENSDVT ISADNLGELS GNVIIDADND ITINERIETD SSVELKAGRS
ININADIDTK SGNGNIDLLG NNDEMNLANR SDGKGSINQL DGTILNAGSG GINIKLGSLG
EVGDINLGNL RTTGKVLVDA NGGNIVRVSE NSLINAGSVL FRTSGNGGIG FLGQPLRLDV
QNLEAVSGSG GVFFDVGNVN IGGVSEDVVG IATFGGDVDI KSAGNVTLNE TISSNEVVEN
NSEGGTTENT EVVDGGGGIN IEAAGDIVAT GSGIKGGGEA VSLSGTNIRI NDEFDETSGD
ADVKLSATND IVVEDIEDDV LEFMPGSGEI EFRADKDGDG FGLVKMLDNK PDVGSNPDIF
ENGADTIKTN GRGLTIAGAG LVLGNVDTSW LPIYSGGGEL LKAIDVDEGG PIPPEGTEGT
TTFTFTVDGD LGTVENIDVR FSAAYTHTGD LDVSLESPQG KVVQLFAGVG RWGDNFQDTV
LDDNASRSIG ISNAPFDGRY SPQGSLVDFN GENPKGIWTL KVKDTNIFSN LADGNLYRAG
ETAPWGTAIG TQLLLHNPLV KIAGGIESGK GGAINLEATH GDISVGNIRS LSEAANGGRI
DLNANKDIIT GLINSSSASV QGNGGAIDLD AGGNITTQSL NSRSYSWEGN SGNGGTIDLD
AGGNITTQSL YSSSYSGSGN SGDGGAIDLD AGGDITTTQD LDSSSYSWES NLGNGGAIDL
DAGGNITTQV LDSRSFSWEG NSGNGGAIDL DAGGNITTQI LDSCSYSGSY DSGNGGAIDL
VAGGDITTQN LYSDSFSRSG NSGNGGAIDL VAGGDITIQD LYSFSYSGSG NSGNGGDITL
NAKQINPPED GNAEKLTIYT FSAGKKESEE GKGGDVNITT NNLSNTDILT LSSHSGSGKV
TIESQTQEPL QIKDSSIITS EKITITMPWG EEIQVETGDT QSGDVSINSS GDLNLSNVTI
ESDTESNQAA GDVNIYSRGN ITLENTDIIS TTNSQGNAGQ ITLESNQNIE LTNNSKILAN
TEGTGNAGQI NIEANKLILD QNTKLITETA RAGNPGNINI QANTIDIGEG AKASTTVLTG
STSTGEGGNI TINTNELNVT GKLGIFAETE ASKNAGILRI SPYKNNPDLD ITFKNEGFIS
ASTSSTGNGG NIFIKAPENI NITGQGFIAT KTSGTGNAGI IDIKTNNLRI SNGVKINAST
EDQGNAGEIK INTTDFTLEK GTSLTTETSS AGLAGNIEIN TKNLTIGQNA QISATALEGA
SNKEAGGNIT INANNLDISG KLGIFAETAG ESPAGTLTLN PYKNNPNLNI EFKEKGFISA
RTTSSGNGGN INIQAPEKIN ITGDGKISAE TTGSGNAGTI NIQTENLNLS EQVAISAETN
SQGQAGNIEI NSQTVTIGKG TEISATAGKK ATSTGDGGNI TINTNDLEIS GKLGIFAETK
GASNAGTLTI TPYQTNPDLN SKTDPNINIT FTDQGFISAS TKSLGKGGDI NILAPENINI
TGDGRITVES EGSGDAGIIN IETENLTIAE NTKISASTSD SGNGGEIKIN SSETFQLQGR
ILTETTGTGD GGIINIEAGE ITAPNSKISA KTTDAGNAGT IDITAQGDIT TGVVTSAAKN
DIETADGGSI SITSEQGKIN ATRAIQSFSE GGNAGNVTLK AQTDITANTI SSHGKQEGGQ
ITIRSETGNI DTSSGKFLAN YSGGGDAGDI TMEAPQGNIT TNNIYSYADG DGGQINIKAG
NNINIEGKSN IISASKSPSD GSSDMPGKGG DITLEAGNNI NTRTAKIYSG ANEGDTGKID
ITADNAIETG KIDLVSGFVK EKEKVNENFT IIPKGEGEAT QGKAGDIRLR SRNSTIDTTG
GTINSRSPDG TGNIIINAKG NISTGKLEAS ALNPDKPTTG GDVNITSEQG EINATQNIET
FSEQGIAGDV NITAFGQIQT NNISSQGMKR GGDINIRSDS ESSIDAAGVL QTYSDAGTAG
NVNLTSPGDV NISGIRSEGM EQGGDINIRS ERGEINSTGD IDSYSKQGKG GYVKVDALER
VNLANVSSYG MTESGDLIIQ SQQAKVNTGN VTTQALEGKS GRIVINGTEV GTGNLSSIGT
TSAGEIKVTA TDGSIKTYNV EIRSDGTIGV LSLKATEDIN TGDQTAIAGE GDVFIDNDAG
DDLTTGDQTA ITGEGDAFID NDAGDDLTTG DQTAITGGGD VFIDNDAGDD LTTGDQTAIT
GGGDAFIDNH AGDELTTGDQ TAITGGGDAF IDNDAGDDLT TGEKTVITGN VTATVTNFIQ
NDGVDQNLDT TVTNFIQNDG VNQNLDTTSV ISNNNIPNNQ VLNQENFSNN NNNNIENNST
TNSSNNKNIL SNLTQSQRSE LISNSTLSNN NQTTNTNTAQ EQTESSINST TDTQKILNII
DTVNTNSLTV ATGSDQVITM LEQNLTNEYS NYFGTDFKEQ FINQKTPREI LTDMAAKTGK
ESAVVYINAY PEELQIILYT KDGQPILKTI PEANRKKLEK VVINFLKLTT SPAYRDFNSY
LSPAKQLYDW FIAPISAELE AANIDTLLFS MGEGLRILPV AALHDGKQFL IEKYSLSLIP
SISLMDTNYR PLQGTQVLAM GASKFINEKP LPAVPVEIET ISEQLWEGSK FLNEEFTKNN
LLTQRKNYPY PIIHLATHAT FNRGKPSNSY IQLWGNEQIK LDQVRELGWS TPSVDLLVLS
ACRTAVGNRE AELGFAGLAV AAGVKSALTS LWTVSDEGTL ALMTEFYTHL NNVSIKAEAL
RQAQLAMLQG QVLITGGELR GSSTRGGVEL PSAFANVNNQ NLSHPYYWAG FTMVGSPW
//