GenomeNet

Database: UniProt
Entry: W5JPK0_ANODA
LinkDB: W5JPK0_ANODA
Original site: W5JPK0_ANODA 
ID   W5JPK0_ANODA            Unreviewed;       585 AA.
AC   W5JPK0;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 52.
DE   SubName: Full=Coagulation factor XI {ECO:0000313|EMBL:ETN64810.1};
GN   ORFNames=AND_003431 {ECO:0000313|EMBL:ETN64810.1};
OS   Anopheles darlingi (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN64810.1};
RN   [1] {ECO:0000313|EMBL:ETN64810.1, ECO:0000313|Proteomes:UP000000673}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA   Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT   "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT   the genome of the newly sequenced Anopheles darlingi.";
RL   BMC Genomics 11:529-529(2010).
RN   [2] {ECO:0000313|EMBL:ETN64810.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL   Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:ETN64810.1}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=23761445;
RA   Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA   Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA   Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA   Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA   de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA   Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA   Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA   Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA   Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA   de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA   Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA   Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA   Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA   Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA   Camargo E.P., de Vasconcelos A.T.;
RT   "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL   Nucleic Acids Res. 41:7387-7400(2013).
RN   [4] {ECO:0000313|EnsemblMetazoa:ADAC003431-PA}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADMH02000867; ETN64810.1; -; Genomic_DNA.
DR   AlphaFoldDB; W5JPK0; -.
DR   STRING; 43151.W5JPK0; -.
DR   EnsemblMetazoa; ADAC003431-RA; ADAC003431-PA; ADAC003431.
DR   VEuPathDB; VectorBase:ADAC003431; -.
DR   VEuPathDB; VectorBase:ADAR2_004112; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   HOGENOM; CLU_004497_6_1_1; -.
DR   OMA; KKTLCHA; -.
DR   OrthoDB; 3448513at2759; -.
DR   Proteomes; UP000000673; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   PANTHER; PTHR24260; -; 1.
DR   PANTHER; PTHR24260:SF154; GH18608P-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 2.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 2.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 2.
DR   PROSITE; PS50240; TRYPSIN_DOM; 2.
DR   PROSITE; PS00134; TRYPSIN_HIS; 2.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..28
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           29..585
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5010155761"
FT   DOMAIN          42..289
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          319..584
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   585 AA;  65546 MW;  FC8CF1338C3541C3 CRC64;
     MARFWWCYAL VFGSLLLLLS SLFAPGQAQT CGRRMVDLMA LIHKGNAAKA GYWPWHAALF
     ENKIRSFEYM CGGSIVSQNL ILTAAHCLLT ERGLIAAERL LVQVGRNRLK LADTRGQEHE
     AHQLITHPDF DVNDLVHDIA LIKLATDISF TNYIQPVCLW DRNVDLQNIV GTKGFIVGYG
     FDDSDKVSDY LKDAEIPVVD SFTCINSNPE AFGLKLKSKM YCAGARDGVS ACNGDSGGGM
     FFTYGNVWYI RGLVSFIPLR DQVALCDPNQ YTVFTDVAKY LDWIREHYRE PSSGSSGGGS
     PIESPLDQSS KLRLLNLDVC GKSRYMNRAE SSKPVFLGYP WVALLEYAVD GEREKQTLCH
     GTLISDRYVL TAGHCITQLP KRFRLTTVRL GDYDIKTTRD CESVNGEQEC APPVQVMRIE
     SAIVHSGFNT PKYANDIALI RLRERAVISQ SNVQPICLPV SNELRSFKPT TYTLTAWPIG
     GNVLGRADRQ VVDSVECQAN YTRYSITLEK TFRQICLKQD QSSGTRCTFP KSAAPLQTIQ
     QLNGENRYVM HGLLSYGPKD CSQAYPDVYT YIGPYMEWIL TNIHE
//
DBGET integrated database retrieval system