ID G3W493_SARHA Unreviewed; 885 AA.
AC G3W493;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 86.
DE RecName: Full=Metalloendopeptidase {ECO:0000256|RuleBase:RU361183};
DE EC=3.4.24.- {ECO:0000256|RuleBase:RU361183};
GN Name=TLL1 {ECO:0000313|Ensembl:ENSSHAP00000010248.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000010248.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000010248.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000010248.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Evidence={ECO:0000256|PROSITE-ProRule:PRU01211,
CC ECO:0000256|RuleBase:RU361183};
CC Note=Binds 1 zinc ion per subunit. {ECO:0000256|PROSITE-
CC ProRule:PRU01211, ECO:0000256|RuleBase:RU361183};
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3W493; -.
DR STRING; 9305.ENSSHAP00000010248; -.
DR MEROPS; M12.016; -.
DR Ensembl; ENSSHAT00000010338.2; ENSSHAP00000010248.2; ENSSHAG00000008850.2.
DR eggNOG; KOG3714; Eukaryota.
DR GeneTree; ENSGT00940000157225; -.
DR HOGENOM; CLU_005140_0_0_1; -.
DR OrthoDB; 2873870at2759; -.
DR TreeFam; TF314351; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0004222; F:metalloendopeptidase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00041; CUB; 5.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd04281; ZnMc_BMP1_TLD; 1.
DR Gene3D; 3.40.390.10; Collagenase (Catalytic Domain); 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 5.
DR InterPro; IPR015446; BMP_1/tolloid-like.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024079; MetalloPept_cat_dom_sf.
DR InterPro; IPR001506; Peptidase_M12A.
DR InterPro; IPR006026; Peptidase_Metallo.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR034036; ZnMP_TLD/BMP1.
DR PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1.
DR PANTHER; PTHR24255:SF34; CUB AND SUSHI DOMAIN-CONTAINING PROTEIN 3 ISOFORM X1; 1.
DR Pfam; PF01400; Astacin; 1.
DR Pfam; PF00431; CUB; 5.
DR Pfam; PF14670; FXa_inhibition; 2.
DR PIRSF; PIRSF001199; BMP_1/tolloid-like; 1.
DR PRINTS; PR00480; ASTACIN.
DR SMART; SM00042; CUB; 5.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00235; ZnMc; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF55486; Metalloproteases ('zincins'), catalytic domain; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 5.
DR PROSITE; PS51864; ASTACIN; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS01180; CUB; 5.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01187; EGF_CA; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU01211};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW Hydrolase {ECO:0000256|PROSITE-ProRule:PRU01211,
KW ECO:0000256|RuleBase:RU361183};
KW Metal-binding {ECO:0000256|PIRSR:PIRSR001199-2, ECO:0000256|PROSITE-
KW ProRule:PRU01211};
KW Metalloprotease {ECO:0000256|ARBA:ARBA00023049, ECO:0000256|PROSITE-
KW ProRule:PRU01211};
KW Protease {ECO:0000256|PROSITE-ProRule:PRU01211,
KW ECO:0000256|RuleBase:RU361183};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PIRSR:PIRSR001199-2};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT DOMAIN 20..219
FT /note="Peptidase M12A"
FT /evidence="ECO:0000259|PROSITE:PS51864"
FT DOMAIN 221..333
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 334..446
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 446..487
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 490..602
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 602..642
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 646..758
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 759..875
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 113
FT /evidence="ECO:0000256|PIRSR:PIRSR001199-1,
FT ECO:0000256|PROSITE-ProRule:PRU01211"
FT BINDING 112
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000256|PIRSR:PIRSR001199-2,
FT ECO:0000256|PROSITE-ProRule:PRU01211"
FT BINDING 116
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000256|PIRSR:PIRSR001199-2,
FT ECO:0000256|PROSITE-ProRule:PRU01211"
FT BINDING 122
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000256|PIRSR:PIRSR001199-2,
FT ECO:0000256|PROSITE-ProRule:PRU01211"
FT DISULFID 82..104
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01211"
FT DISULFID 84..85
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01211"
SQ SEQUENCE 885 AA; 100300 MW; 73D87CDC1C6D337C CRC64;
DPGASVKFSG QNEKNRVPRA ATSRTERIWP GGVIPYLIGG NFTGSQRAMF KQAMRHWEKH
TCVTFIERSD EESYIVFTYR PCGCCSYVGR RGNGPQAISI GKNCDKFGIV VHELGHVIGF
WHEHTRPDRD KHVTIIRENI QPGQEYNFLK MEPGEVNSLG ESYDFDSIMH YARNTFSRGM
FLDTILPSRD DNGIRPAIGQ RTRLSKGDIA QARKLYRCPA CGETLQESTG NFSSPGFPNG
YPSYTHCIWR ISVTPGEKIV LNFTTMDLYK SSLCWYDYIE VRDGYWRKSP LLGRFCGDKL
PEVLTSTDSR MWIEFRSSSN WVGKGFAAVY EAICGGEIHK NEGQIQSPNY PDDYRPMKEC
VWKIIASENY NVGLTFQAFE IERHDSCAYD YLEIRDGSSE NSPLIGHFCG YDKPEDIRST
SNTLWMKFVS DGTVNKAGFA ANFFKEEDEC AKPDNGGCEQ RCVNTLGSYQ CACEPGYELG
PDKRSCEAAC GGLLTKLNGT ITTPGWPKEY PPNKNCIWQV VAPTQYRISM KFEFFELEGN
EVCKYDFVEI RSGLSSDSKL HGKFCGTEVP EVITSQFNNM RIEFKSDNTV SKKGFKAHFF
SDKDECSKDN GGCQHECINT VGSYVCQCRN GFVLHENKHD CKEAECEQRI HSPSGVITSP
NWPDKYPSRK ECTWEISATP GHRVKLAFSE FEIEQHQECA YDHLEVFDGE TEKSPILGRL
CGNKMPDPLI ATGNKMFLRF VSDASVQRKG FQARHSTECG GRLKAESNPR DLYSHAQFGD
NNYPVQVRCE WLLISERGSS VELSFQTFEV EEEADCGYDY VQLFDGHDST ARNLGRFCGS
GPPEEIYSSG DAILIYFHTD DTINKKGFHI RYKSRKYSAT THTKK
//