ID G3QLG4_GORGO Unreviewed; 576 AA.
AC G3QLG4;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 27-MAR-2024, entry version 80.
DE SubName: Full=Complement factor I {ECO:0000313|Ensembl:ENSGGOP00000003316.4};
OS Gorilla gorilla gorilla (Western lowland gorilla).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Gorilla.
OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000003316.4, ECO:0000313|Proteomes:UP000001519};
RN [1] {ECO:0000313|Ensembl:ENSGGOP00000003316.4, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Scally A.;
RT "Insights into the evolution of the great apes provided by the gorilla
RT genome.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGGOP00000003316.4, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22398555; DOI=10.1038/nature10842;
RA Scally A., Dutheil J.Y., Hillier L.W., Jordan G.E., Goodhead I.,
RA Herrero J., Hobolth A., Lappalainen T., Mailund T., Marques-Bonet T.,
RA McCarthy S., Montgomery S.H., Schwalie P.C., Tang Y.A., Ward M.C., Xue Y.,
RA Yngvadottir B., Alkan C., Andersen L.N., Ayub Q., Ball E.V., Beal K.,
RA Bradley B.J., Chen Y., Clee C.M., Fitzgerald S., Graves T.A., Gu Y.,
RA Heath P., Heger A., Karakoc E., Kolb-Kokocinski A., Laird G.K., Lunter G.,
RA Meader S., Mort M., Mullikin J.C., Munch K., O'Connor T.D., Phillips A.D.,
RA Prado-Martinez J., Rogers A.S., Sajjadian S., Schmidt D., Shaw K.,
RA Simpson J.T., Stenson P.D., Turner D.J., Vigilant L., Vilella A.J.,
RA Whitener W., Zhu B., Cooper D.N., de Jong P., Dermitzakis E.T.,
RA Eichler E.E., Flicek P., Goldman N., Mundy N.I., Ning Z., Odom D.T.,
RA Ponting C.P., Quail M.A., Ryder O.A., Searle S.M., Warren W.C.,
RA Wilson R.K., Schierup M.H., Rogers J., Tyler-Smith C., Durbin R.;
RT "Insights into hominid evolution from the gorilla genome sequence.";
RL Nature 483:169-175(2012).
RN [3] {ECO:0000313|Ensembl:ENSGGOP00000003316.4}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00196}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABD030032666; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030032667; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030032668; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030032669; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030032670; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030032671; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030032672; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_018880873.1; XM_019025328.1.
DR AlphaFoldDB; G3QLG4; -.
DR MEROPS; S01.199; -.
DR Ensembl; ENSGGOT00000003391.4; ENSGGOP00000003316.4; ENSGGOG00000003371.4.
DR GeneID; 101152970; -.
DR CTD; 3426; -.
DR eggNOG; KOG3627; Eukaryota.
DR GeneTree; ENSGT00930000151042; -.
DR HOGENOM; CLU_006842_19_6_1; -.
DR OrthoDB; 4629979at2759; -.
DR TreeFam; TF330647; -.
DR Proteomes; UP000001519; Chromosome 4.
DR Bgee; ENSGGOG00000003371; Expressed in liver and 6 other cell types or tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0045087; P:innate immune response; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00112; LDLa; 2.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 3.30.60.30; -; 1.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 2.
DR Gene3D; 3.10.250.10; SRCR-like domain; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR048722; CFAI_FIMAC_N.
DR InterPro; IPR048719; CFAI_KAZAL.
DR InterPro; IPR003884; FacI_MAC.
DR InterPro; IPR002350; Kazal_dom.
DR InterPro; IPR036058; Kazal_dom_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001190; SRCR.
DR InterPro; IPR017448; SRCR-like_dom.
DR InterPro; IPR036772; SRCR-like_dom_sf.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264:SF40; HYALURONAN-BINDING PROTEIN 2; 1.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF21286; CFAI_FIMAC_N; 1.
DR Pfam; PF21287; CFAI_KAZAL; 1.
DR Pfam; PF00057; Ldl_recept_a; 2.
DR Pfam; PF00530; SRCR; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00057; FIMAC; 1.
DR SMART; SM00192; LDLa; 2.
DR SMART; SM00202; SR; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 2.
DR SUPFAM; SSF56487; SRCR-like; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS51465; KAZAL_2; 1.
DR PROSITE; PS01209; LDLRA_1; 1.
DR PROSITE; PS50068; LDLRA_2; 2.
DR PROSITE; PS50287; SRCR_2; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00196}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000001519};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..576
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014154892"
FT DOMAIN 60..108
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 114..215
FT /note="SRCR"
FT /evidence="ECO:0000259|PROSITE:PS50287"
FT DOMAIN 333..567
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DISULFID 186..196
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00196"
FT DISULFID 229..247
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 241..256
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 259..271
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 266..284
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 278..293
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 576 AA; 65172 MW; D65C98D5BCC0C6DF CRC64;
MKLLHVFLLF LCFCLRFCKV TYTSQEDLVE KKCLAKKYTH LSCDKVFCQP WQRCIEGTCI
CKLPYQCPKN GTAVCATNRR SFPTYCQQKS LECLRPGTKF LNNGTCTAEG KFSVSLKHGI
TDSEGIVEVK LVDQDKTMFI CKSSWSMREA NVACLDLGFQ QGADTQRRFK LSDLSINSTE
CLHVHCRGLE TSLAECTFTK RRTMGYQDLA DVVCYTQKAD SPMNDFFQCV NGKYISQMKA
CDGINDCGDQ SDELCCKACQ GKSFHCKSGV CIPSQYQCNG EVDCITGEDE VGCEEETEIL
TADMDAERRR IKSLLPKLSC GVKNRTHIRR KRIVGGKRAQ LGDLPWQMAI KDASGITCGG
IYIGGCWILT AAHCLRASKT HRYQIWTTVV DWIHPDRKRI VIEYVDRIIF HENYNAGTYQ
NDIALIEMKK DGNKKDCELP RSIPACIPWS PYLFQPNDTC IVSGWGREKD NERVFSLRWG
EVKLISNCSK FYGNRFYEKE MECAGTHDGS IDACKGDSGG PLVCMDANNV TYVWGVVSWG
ENCGKPEFPG VYTKVANYFD WISYHVGRPF ISQYNV
//