ID T1J7P7_STRMM Unreviewed; 1428 AA.
AC T1J7P7;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0008006|Google:ProtNLM};
OS Strigamia maritima (European centipede) (Geophilus maritimus).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Myriapoda; Chilopoda;
OC Pleurostigmophora; Geophilomorpha; Linotaeniidae; Strigamia.
OX NCBI_TaxID=126957 {ECO:0000313|EnsemblMetazoa:SMAR009702-PA, ECO:0000313|Proteomes:UP000014500};
RN [1] {ECO:0000313|Proteomes:UP000014500}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Brora {ECO:0000313|Proteomes:UP000014500};
RA Richards S.R., Qu J., Jiang H., Jhangiani S.N., Agravi P., Goodspeed R.,
RA Gross S., Mandapat C., Jackson L., Mathew T., Pu L., Thornton R., Saada N.,
RA Wilczek-Boney K.B., Lee S., Kovar C., Wu Y., Scherer S.E., Worley K.C.,
RA Muzny D.M., Gibbs R.;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:SMAR009702-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (FEB-2015) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00059}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH431938; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 126957.T1J7P7; -.
DR EnsemblMetazoa; SMAR009702-RA; SMAR009702-PA; SMAR009702.
DR eggNOG; KOG3627; Eukaryota.
DR HOGENOM; CLU_252662_0_0_1; -.
DR Proteomes; UP000014500; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd11304; Cadherin_repeat; 1.
DR CDD; cd00041; CUB; 2.
DR CDD; cd00112; LDLa; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.60.40.60; Cadherins; 2.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR PANTHER; PTHR24252:SF13; TRANSMEMBRANE SERINE PROTEASE 12; 1.
DR Pfam; PF00431; CUB; 2.
DR Pfam; PF00057; Ldl_recept_a; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00192; LDLa; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF49313; Cadherin-like; 2.
DR SUPFAM; SSF57424; LDL receptor-like module; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50268; CADHERIN_2; 2.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS50068; LDLRA_2; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|PROSITE-ProRule:PRU00043};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000014500};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1188..1211
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 18..99
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 107..167
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 230..468
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 630..729
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 915..975
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT REGION 1251..1283
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1346..1398
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1261..1283
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 170..182
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 177..195
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 189..204
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 1428 AA; 160370 MW; 01A44933667CBE43 CRC64;
MCDYNVICKD VADWEIACGQ KQPALLNSPH RVIKSESVNG VYSPNLSCSW LISAPSRQEI
KLTFLKLNME NSKNCSNDRI DIYDGELSEN HKIHSFCGKI KINEPDCSDV IHLNDTHGVI
TSPDYDENNP FNKYPPNTTV QWKITAPEGH FIQLHFLFMN IERSNSCERC GPNELMCATK
NCISNEKRCN GVNNCPNQID EINCIIPKSS LRECGIPAVA VTDIGPRPRI IGGRVSKHGN
WPWQGALSIV SFNWEFQCGA IIIHPEWILT AAHCFRKISK PDRWALQVGK HNLSAREESA
HERSIDKIII HPKFGDKVPS SPFRNANDIA LVHVRRPLEY TDYVIPACLP RPFEPIEKDI
KIHVTGFGKT ENFTNIGLLK QVELPVINQS ICNEIHKNTL SDSMFCAGGQ SGHDSCQGDS
GGPAVVNYFG RWKLLGIVSW GDECAVEDKP GVYTKVSHFI PWIEETTGIR FTSTFANNLK
DYCRIETTSG RSERFTILER TAKGVAIGHL IATHDATISS ADVTLKLVTD NDEFSRFISI
GAGESTKDPT RTQFTFFIKE EYDLFEHSQV AELIQILNCA PNTAPATDDI HLFITFTDEN
DRPEFSLTSM TYKLPRLPLK LPFSKMYTPI YVTDKDKENT DNGNLEFFVW TDANLKTKSQ
VLSAEGEKVG TNKYLVNLYL LQLLDAATSP EYNVILMAKD TGTKPLDATI PIKIIVESEE
KENPNLPKFE NTAKYYFSMN SLSGEVSLVD ERPLIENRLT SVSLFIQVVV NNDKLLSDFS
ILTIKIIYAP QFERDSYSVY TYRDADLGTI LWTANAGPLL VYSVNKDTSN LQGYYMLDSK
SGSLSISSSL EKAGDKKIEI IASTVGVTVG YHSTEVTIHV LDRKNDQVLE KKCSIEENVE
NALCDGLTMT QSDNYQIISG NNDQVFEISN SQLRCKPQLD YEKTNFYNLV LRSETADQTI
YLTVNVKDVK EAPYFETVSH IFAIAPWIQP DNRITTFRVS KIIAQDPESI FTKTSDLEYV
FTNPDNLSNV FRLNSISGEL HWMKTPSEKF YEAEIEATNK RQDPQSTYLN IKARYSCSDI
FTLKLIVQSN SELDTNPKSI EKDLSTILKA QVVLILSKPD LNSDSESTFV YVLFAVVNDV
VLSYQQIEKN LITSKEETEE KLGKFILQPH SSLLGAQENE TGQPEQKLVI IIILSVIVCA
MLVAIIGIVL FKKCSSSAPL QNDTSSKEVL RNGQIFSDSG VKIDIDKIDG KTKSSKMDEG
QGRDNPSFQV SSEEVQTTTP SPTVKTATLT RLVNENEITF GEITEEHADD LHEIQVLEMD
YELPAIDYKP GKRKSVSFID ANTVFTAPEE DEDDHENKND NENSQIEPVD APTASPPIDK
IVENEESGEK NQEDALTPQV QIKCGIIEEL YDPENQDKIE EFVLSAYM
//