ID A0A1A6FW21_NEOLE Unreviewed; 393 AA.
AC A0A1A6FW21;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=A6R68_11534 {ECO:0000313|EMBL:OBS57337.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS57337.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS57337.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS57337.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS57337.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS57337.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01117157; OBS57337.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6FW21; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd01100; APPLE_Factor_XI_like; 2.
DR Gene3D; 3.50.4.10; Hepatocyte Growth Factor; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR000177; Apple.
DR InterPro; IPR003609; Pan_app.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR PANTHER; PTHR24252:SF7; HYALIN; 1.
DR Pfam; PF00024; PAN_1; 2.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00005; APPLEDOMAIN.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00223; APPLE; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS00495; APPLE; 1.
DR PROSITE; PS50948; PAN; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT DOMAIN 191..274
FT /note="Apple"
FT /evidence="ECO:0000259|PROSITE:PS50948"
FT DOMAIN 288..393
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:OBS57337.1"
FT NON_TER 393
FT /evidence="ECO:0000313|EMBL:OBS57337.1"
SQ SEQUENCE 393 AA; 43646 MW; 6AB166A4A32F5F0E CRC64;
CVTKLFKDTC FQGGDISTVF TPSAKYCQLV CTHHPRCLLF TFMAESSSDD PTKCFEKEMN
RKQSIKDIAL GIFSLTRCLQ TLTLTVSWPQ MLLSVDAFVL ITLAVCFLRS LPKNGPKNLK
DIFASLKHLK VDYQAHALQR ATPFLVSVSS TAGIVPQVNS EGLLSRAQQE MDECEGYTVQ
IWHVLYCLVF CHPSFYNDTD FLGEELDIVD VKGXESCQKM CTNAVRCQFF TYSPARGSCN
EGKGRCYLKL SLNGSPTRIL HGTGGISGYT LRLCKMDNVC TTKIKPRVVG GTASVQGDWP
WQVTLHITSP TKGHLCGGSI IGNQWILTAA HCFSGVETSK NLRVYGGIVN QSEINEDTAF
FRVQDIIIHD QYKMAESGYD IALLKLESAM NYT
//