LOCUS WP_011009975 1686 aa linear BCT 08-JAN-2024
DEFINITION endo-alpha-N-acetylgalactosaminidase family protein [Clostridium
perfringens].
ACCESSION WP_011009975
VERSION WP_011009975.1
KEYWORDS RefSeq.
SOURCE Clostridium perfringens
ORGANISM Clostridium perfringens
Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
Clostridium.
REFERENCE 1 (residues 1 to 1686)
AUTHORS Naumoff,D.G.
TITLE GH101 family of glycoside hydrolases: subfamily structure and
evolutionary connections with other families
JOURNAL J Bioinform Comput Biol 8 (3), 437-451 (2010)
PUBMED 20556855
REFERENCE 2 (residues 1 to 1686)
AUTHORS Willis,L.M., Zhang,R., Reid,A., Withers,S.G. and Wakarchuk,W.W.
TITLE Mechanistic investigation of the
endo-alpha-N-acetylgalactosaminidase from Streptococcus pneumoniae
R6
JOURNAL Biochemistry 48 (43), 10334-10341 (2009)
PUBMED 19788271
COMMENT REFSEQ: This record represents a single, non-redundant, protein
sequence which may be annotated on many different RefSeq genomes
from the same, or different, species.
##Evidence-For-Name-Assignment-START##
Evidence Category :: HMM
Evidence Accession :: NF024309.4
Evidence Source :: EMBL-EBI
Source Identifier :: PF12905.11
##Evidence-For-Name-Assignment-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..1686
/organism="Clostridium perfringens"
/db_xref="taxon:1502"
Protein 1..1686
/product="endo-alpha-N-acetylgalactosaminidase family
protein"
/GO_function="GO:0033926 - glycopeptide
alpha-N-acetylgalactosaminidase activity [Evidence IEA]"
/calculated_mol_wt=187697
Region 234..292
/region_name="Big_4"
/note="Bacterial Ig-like domain (group 4); pfam07532"
/db_xref="CDD:400079"
Region 347..1492
/region_name="endo_SpGH101"
/note="SpGH101 family
endo-alpha-N-acetylgalactosaminidase; NF040533"
/db_xref="CDD:439743"
Region 733..998
/region_name="Glyco_hydro_101"
/note="Endo-alpha-N-acetylgalactosaminidase; pfam12905"
/db_xref="CDD:432868"
Site order(802,864,866,904,930..931,944,950)
/site_type="active"
/db_xref="CDD:271203"
Region 1603..1684
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1673..1674,1676..1677)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
ORIGIN
1 mgrkcmnkki aaiiaaaviv gqlpisvlat pvneagdein sesaeiltns deeaeayiqn
61 ydrpegitwt klagsgsvev tdgflsvtnn gdyrimedqs pnikngeles kftvggsqtg
121 iifratesny gminynsgtg wvienknswe ditgpklnng dvvtvkatfv ekhltvnvsv
181 ndgefetiyd kesdliplqa gkvgyrgwgn akttkfdyik yapmtidkgp ivsinevnve
241 typrvkpilp ssvtvnheng mssikdvswn yipkesyskp gtfkvegtve gtdvkaianv
301 tvssdlayye tnfeteetrg dwqvvqgggs psyeegkvki pmngvsiavd mnspevknft
361 yetdfsvdnn ggrigllfry vsetewgavc ydngswvwkt gdgkygnfpg tftpepgkty
421 riklkvedtn itmwvdgeki gqvavsnlpd vrgkvgltgw fgnknvtldn lvveelggim
481 apevgpleeq siesdsmkvv ldnrfptvir yewkgtedvl sgasvddlea qymveingek
541 ripkvtsefa nnegiytlnf edigmtitlk mtvnenklrm evtdiqegdv klqtlnfpnh
601 slasvsslnn gktasvlttg dwnnineeft dvakakpgvk gktyafindd kfavtinnnt
661 ieggnrvvlt tendtlpdnt nykkvgisng twtykeilqd ttdqgsklyq gekpwsevii
721 ardenedgqv dwqdgaiqyr knmkipvgge eiknqmsyid fnigytqnpf lrsldtikkl
781 snytdgfgql vlhkgyqgeg hddshpdygg higmrqggke dfntlieqgk eynakigvhi
841 nateytmdaf eyptelvnen apgwgwldqa yyvnqrgdit sgelfrrldm lmedapelgw
901 iyvdvytgng wnahqlgeki ndygimiate mngpleqhvp wthwggdpay pnkgnaskim
961 rfmkndtqds fladplvkgn khllsggwgt rhdiegaygt evfynqvlpt kylqhfqitk
1021 msenevlfen gvkavrensn inyyrndrlv attpensign tgigdtqlfl pwnpvdeans
1081 ekiyhwnplg ttsewtlpeg wtsndkvyly elsdlgrtlv kevpvvdgkv nlevkqdtpy
1141 ivtkekveek riedwgygse iadpgfdsqt fdkwnkesta entdhitien esvqkrlgnd
1201 vlkisgnega dakisqsisg leegvtysvs awvkndnnre vtlgvnvggk dftnvitsgg
1261 kvrqgegvky iddtfvrmev eftvpkgvns advylkaseg dadsvvlvdd friwdhpght
1321 nrdgyvfyed fenvdegisp fylspgrghs nrshlaekdi sidanqrmnw vldgrfslks
1381 nqqpkeigem lttdvssfkl epnktyefgf lyslenaapg ysvniknrdg ekivsiplea
1441 tgsnyaqdif tktksvthef ttgdfagdyy itlekgdgfk evildniyvk eidksiespe
1501 lahvnlntve hdlevgqsvp fainalmnng anvnleeaev eykvskpevl tiengmmtga
1561 segftdvqvn itvngnkvss ntvrvkvgnp eveeeevivn pvrnfkvtdk tkknvtvswe
1621 epektygleg yvlykdgkkv keigadktef tfkglnrhti ynfkiaakys ngelstkesi
1681 tvrtar
//