GenomeNet

Database: UniProt
Entry: A0A3Q0GVE3_ALLSI
ID   A0A3Q0GVE3_ALLSI        Unreviewed;       651 AA.
AC   A0A3Q0GVE3;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   24-JAN-2024, entry version 24.
DE   SubName: Full=LOW QUALITY PROTEIN: complement factor I {ECO:0000313|RefSeq:XP_025062148.1};
GN   Name=CFI {ECO:0000313|RefSeq:XP_025062148.1};
OS   Alligator sinensis (Chinese alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=38654 {ECO:0000313|Proteomes:UP000189705, ECO:0000313|RefSeq:XP_025062148.1};
RN   [1] {ECO:0000313|RefSeq:XP_025062148.1}
RP   IDENTIFICATION.
RG   RefSeq;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00196}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_025062148.1; XM_025206363.1.
DR   STRING; 38654.A0A3Q0GVE3; -.
DR   KEGG; asn:102384439; -.
DR   InParanoid; A0A3Q0GVE3; -.
DR   OrthoDB; 4629979at2759; -.
DR   Proteomes; UP000189705; Unplaced.
DR   GO; GO:0016020; C:membrane; IEA:InterPro.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00112; LDLa; 2.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 3.30.60.30; -; 1.
DR   Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 2.
DR   Gene3D; 3.10.250.10; SRCR-like domain; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR048722; CFAI_FIMAC_N.
DR   InterPro; IPR048719; CFAI_KAZAL.
DR   InterPro; IPR003884; FacI_MAC.
DR   InterPro; IPR036055; LDL_receptor-like_sf.
DR   InterPro; IPR023415; LDLR_class-A_CS.
DR   InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001190; SRCR.
DR   InterPro; IPR017448; SRCR-like_dom.
DR   InterPro; IPR036772; SRCR-like_dom_sf.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24253:SF91; COMPLEMENT FACTOR I; 1.
DR   PANTHER; PTHR24253; TRANSMEMBRANE PROTEASE SERINE; 1.
DR   Pfam; PF21286; CFAI_FIMAC_N; 1.
DR   Pfam; PF21287; CFAI_KAZAL; 1.
DR   Pfam; PF00057; Ldl_recept_a; 2.
DR   Pfam; PF00530; SRCR; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00057; FIMAC; 1.
DR   SMART; SM00192; LDLa; 2.
DR   SMART; SM00202; SR; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF57424; LDL receptor-like module; 2.
DR   SUPFAM; SSF56487; SRCR-like; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS01209; LDLRA_1; 2.
DR   PROSITE; PS50068; LDLRA_2; 2.
DR   PROSITE; PS50287; SRCR_2; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00196}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000189705};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW   ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..651
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5017942710"
FT   DOMAIN          138..255
FT                   /note="SRCR"
FT                   /evidence="ECO:0000259|PROSITE:PS50287"
FT   DOMAIN          406..642
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   REGION          327..355
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        211..221
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00196"
FT   DISULFID        247..259
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        254..272
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        266..281
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        284..296
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        291..309
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ   SEQUENCE   651 AA;  73028 MW;  9FCA32F2CEC97E1D CRC64;
     MKVLLLLGFS CVFFCTYGGK VFQNQTSEYN XXXXXXXXXX XQQVQPSHKE THLVGECLKE
     KYTQDSCEKV FCLPWKRCVS GNCICKLPYQ CPKNGTSVSS INGKTFRTYC QLKSYECQRP
     EAKYMSKGDY IHGEKFDVSL NYGDSESEGV IQVKLVNNTE KLFLCNSKWS MNEANVACRH
     RGFETGAEYH RKTFTVPEHN NTSSCCLQVI CRGVETSLAE CKLIKRSPPG GVKNFATVAC
     HKVQRECTPQ EFCCANKKCI PLTETCNGIN DCGDLSDELC CKECKIKSFL CNSGVCIPKK
     YLCNKEIDCL TGEDELQVNC EDEQNVELET QDQDDEGQNP ETETQGQQSG NKEADHKVII
     RTRAKGKAQE VIPNYNADEE RRAIKTFLPE LKCGITNHTV TRRKRIVGGN PAAKGEFPWQ
     VAIKEEGGTG PSVYCGGVYI GGCWILTAAH CVRATRVHQY RIWSGLLDTI QYNREIDTFK
     LNKVIIHEKY NAGTYENDIA LLEMKSMDKG KPCSLAYSTP ACVPWSEYMF KPGHRCKISG
     WGLERDSAKQ FVLKWGYIDI LSNCTEIYKD RFFKGMECAG THDGSIDSCK GDSGGPLICF
     DSNNVAYVWG VVSWGENCGV AGYPGVYTKV ASYFDWISHQ VGRALISKYN V
//