ID A0A151PBI0_ALLMI Unreviewed; 293 AA.
AC A0A151PBI0;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Cathepsin O isoform A {ECO:0000313|EMBL:KYO46279.1};
GN Name=CTSO-1 {ECO:0000313|EMBL:KYO46279.1};
GN ORFNames=Y1Q_0021808 {ECO:0000313|EMBL:KYO46279.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO46279.1};
RN [1] {ECO:0000313|EMBL:KYO46279.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO46279.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO46279.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03000533; KYO46279.1; -; Genomic_DNA.
DR RefSeq; XP_019337006.1; XM_019481461.1.
DR AlphaFoldDB; A0A151PBI0; -.
DR GeneID; 102566412; -.
DR CTD; 1519; -.
DR OrthoDB; 5472948at2759; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR PANTHER; PTHR12411:SF947; CATHEPSIN O; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 2.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..293
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018713085"
FT DOMAIN 105..292
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 293 AA; 32460 MW; 5FEB07692CFDF46E CRC64;
MQPLALWLVA CLLPGGGGPG LLGSHNRTRA PRGGGRQSAE AALWESVKRI TNLNSLSSND
NTTAFYGFNQ FSHLFPEEFK AVYLRSRPDK LPKYAEVPKR KETPLPKKFD WRDKNVVTEV
KNQNTCGGCW AFSVVGGIES AYAIKRNNLE ELSVQQTQVK LVKSSEYSFK GQTGLCHYFS
LSDFGVSIMG YAAYDFSGQE EEMMRKLVHW GPLAVTVDAI SWQDYLGGII QYHCSSGKAN
HAVLVTGFDR TGIIPYWIVR NSWGRTWGID GYARIKIGGN ICGIADTVAS VFV
//