ID F6TBJ5_HORSE Unreviewed; 390 AA.
AC F6TBJ5;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 24-JAN-2024, entry version 62.
DE RecName: Full=Pepsin A {ECO:0000256|ARBA:ARBA00039700};
DE EC=3.4.23.1 {ECO:0000256|ARBA:ARBA00011924};
GN Name=LOC100059273 {ECO:0000313|Ensembl:ENSECAP00000006263.3};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000006263.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000006263.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000006263.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000006263.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000006263.3};
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- FUNCTION: Shows particularly broad specificity; although bonds
CC involving phenylalanine and leucine are preferred, many others are also
CC cleaved to some extent. {ECO:0000256|ARBA:ARBA00002318}.
CC -!- SIMILARITY: Belongs to the peptidase A1 family.
CC {ECO:0000256|ARBA:ARBA00007447, ECO:0000256|RuleBase:RU000454}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F6TBJ5; -.
DR Ensembl; ENSECAT00000008373.3; ENSECAP00000006263.3; ENSECAG00000040481.2.
DR GeneTree; ENSGT00940000155036; -.
DR HOGENOM; CLU_013253_3_0_1; -.
DR Proteomes; UP000002281; Chromosome 12.
DR Bgee; ENSECAG00000040481; Expressed in triceps brachii.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR Gene3D; 6.10.140.60; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 2.
DR InterPro; IPR001461; Aspartic_peptidase_A1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR012848; Aspartic_peptidase_N.
DR InterPro; IPR033121; PEPTIDASE_A1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR PANTHER; PTHR47966; BETA-SITE APP-CLEAVING ENZYME, ISOFORM A-RELATED; 1.
DR PANTHER; PTHR47966:SF22; PEPSIN A-3-RELATED; 1.
DR Pfam; PF07966; A1_Propeptide; 1.
DR Pfam; PF00026; Asp; 1.
DR PRINTS; PR00792; PEPSIN.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 2.
DR PROSITE; PS51767; PEPTIDASE_A1; 1.
PE 3: Inferred from homology;
KW Aspartyl protease {ECO:0000256|RuleBase:RU000454};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR601461-2};
KW Hydrolase {ECO:0000256|RuleBase:RU000454};
KW Protease {ECO:0000256|RuleBase:RU000454};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 70..387
FT /note="Peptidase A1"
FT /evidence="ECO:0000259|PROSITE:PS51767"
FT ACT_SITE 88
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-1"
FT ACT_SITE 279
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-1"
FT DISULFID 101..106
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-2"
FT DISULFID 270..274
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-2"
FT DISULFID 313..346
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-2"
SQ SEQUENCE 390 AA; 42135 MW; 4D4C50DE3B092D07 CRC64;
LSWIIFLYLP GLPHVPLVKR KSLRQNLREN GLLEDFLKQH PRNPASKYFP KEAATLAATE
GLENYKDVSY FGTISIGTPP QEFTVIFDTG SSNLWVPSTY CSSLACSDHN RFNPEDSSTY
EATSESISIT YGTGSMTGVL RYNTVRVSTC WLTTSSCPAP GPASGDPGGH IPSSFLYYAP
FDGILGLAYP SISSSGATPV FDNIWDQGLV SQDLFSVYLS SDDESGSMVI FSGIDSSYYS
GSLCWVPVSE EAYWQITVDS ITMNGESIAC SGGCQAIVDT GTSLLAGPPS AIDNIQSYIG
ASEDYSSEAV ISCSSIDSLP DIVFTINGVE FHLSPSAYIL EEDDSCISGF EGMDLDTSSG
ELWILGDVFI RQYFTIFDRA NNQICLAPVA
//