GenomeNet

Database: UniProt
Entry: H3AWN3_LATCH
ID   H3AWN3_LATCH            Unreviewed;      1691 AA.
AC   H3AWN3;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   01-MAY-2013, sequence version 2.
DT   27-MAR-2024, entry version 68.
DE   SubName: Full=Complement C5 {ECO:0000313|Ensembl:ENSLACP00000014054.2};
GN   Name=C5 {ECO:0000313|Ensembl:ENSLACP00000014054.2};
OS   Latimeria chalumnae (Coelacanth).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Coelacanthiformes; Coelacanthidae; Latimeria.
OX   NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000014054.2, ECO:0000313|Proteomes:UP000008672};
RN   [1] {ECO:0000313|Proteomes:UP000008672}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT   "The draft genome of Latimeria chalumnae.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLACP00000014054.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AFYH01007848; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01007849; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01007850; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01007851; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01007852; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01007853; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_014346298.1; XM_014490812.1.
DR   SMR; H3AWN3; -.
DR   STRING; 7897.ENSLACP00000014054; -.
DR   Ensembl; ENSLACT00000014153.2; ENSLACP00000014054.2; ENSLACG00000012370.2.
DR   GeneID; 102364499; -.
DR   KEGG; lcm:102364499; -.
DR   CTD; 727; -.
DR   eggNOG; KOG1366; Eukaryota.
DR   GeneTree; ENSGT00940000155670; -.
DR   HOGENOM; CLU_001634_4_2_1; -.
DR   InParanoid; H3AWN3; -.
DR   OMA; YKRIIAC; -.
DR   OrthoDB; 4033541at2759; -.
DR   TreeFam; TF313285; -.
DR   Proteomes; UP000008672; Unassembled WGS sequence.
DR   GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR   GO; GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
DR   CDD; cd00017; ANATO; 1.
DR   CDD; cd02896; complement_C3_C4_C5; 1.
DR   Gene3D; 1.50.10.20; -; 1.
DR   Gene3D; 2.20.130.20; -; 1.
DR   Gene3D; 2.40.50.120; -; 1.
DR   Gene3D; 2.60.120.1540; -; 1.
DR   Gene3D; 2.60.40.1930; -; 3.
DR   Gene3D; 2.60.40.1940; -; 1.
DR   Gene3D; 6.20.50.160; -; 1.
DR   Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 1.
DR   Gene3D; 1.20.91.20; Anaphylotoxins (complement system); 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR   InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR   InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR   InterPro; IPR011625; A2M_N_BRD.
DR   InterPro; IPR011626; Alpha-macroglobulin_TED.
DR   InterPro; IPR000020; Anaphylatoxin/fibulin.
DR   InterPro; IPR018081; Anaphylatoxin_comp_syst.
DR   InterPro; IPR041425; C3/4/5_MG1.
DR   InterPro; IPR048843; C5_CUB.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR001599; Macroglobln_a2.
DR   InterPro; IPR002890; MG2.
DR   InterPro; IPR041555; MG3.
DR   InterPro; IPR040839; MG4.
DR   InterPro; IPR001134; Netrin_domain.
DR   InterPro; IPR018933; Netrin_module_non-TIMP.
DR   InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR   InterPro; IPR008993; TIMP-like_OB-fold.
DR   PANTHER; PTHR11412:SF83; COMPLEMENT C5; 1.
DR   PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR   Pfam; PF00207; A2M; 1.
DR   Pfam; PF07703; A2M_BRD; 1.
DR   Pfam; PF07677; A2M_recep; 1.
DR   Pfam; PF01821; ANATO; 1.
DR   Pfam; PF21309; C5_CUB; 1.
DR   Pfam; PF17790; MG1; 1.
DR   Pfam; PF01835; MG2; 1.
DR   Pfam; PF17791; MG3; 1.
DR   Pfam; PF17789; MG4; 1.
DR   Pfam; PF01759; NTR; 1.
DR   Pfam; PF07678; TED_complement; 1.
DR   SMART; SM01360; A2M; 1.
DR   SMART; SM01359; A2M_N_2; 1.
DR   SMART; SM01361; A2M_recep; 1.
DR   SMART; SM00104; ANATO; 1.
DR   SMART; SM00643; C345C; 1.
DR   SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 1.
DR   SUPFAM; SSF47686; Anaphylotoxins (complement system); 1.
DR   SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
DR   SUPFAM; SSF50242; TIMP-like; 1.
DR   PROSITE; PS01177; ANAPHYLATOXIN_1; 1.
DR   PROSITE; PS01178; ANAPHYLATOXIN_2; 1.
DR   PROSITE; PS50189; NTR; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008672};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..1691
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003580687"
FT   DOMAIN          703..738
FT                   /note="Anaphylatoxin-like"
FT                   /evidence="ECO:0000259|PROSITE:PS01178"
FT   DOMAIN          1545..1690
FT                   /note="NTR"
FT                   /evidence="ECO:0000259|PROSITE:PS50189"
SQ   SEQUENCE   1691 AA;  190052 MW;  B2E73E924633967C CRC64;
     MMKFFYIFFV LALCERSICQ EQTYLLTAPR VFRVGATETV VVQTFGYDDG FSVNIAIKSF
     PDKKTTFASE SLSLSNANNF QGSVTLTIQP KDLPRKDSSK VYVYLEASSP GFTKEEKVLV
     SYQNGFLFIQ LDKPIYTPDQ SVKVRVYSLN EDLKPARKPI TLTFVDPEGV KVDIIEGDDF
     TGVVSVPHFK IPSNPRYGIW KTEAAYKSDY TTSAIAEFEV KEYAMPSFSL SIEPERNFIS
     YDRFENFNIA IKASYFYGKK VSRADVYVRF GIIDEQQEKT MLSGVIVAQL TEGVAEIFLD
     SKTEFARLGR ESLEDLDGSY LYITVSVQEY SGGHTEEAEL AEVKYVMSPY SINLVATPSY
     VKPGLPFHIQ VQVKDTLGQP VGNIPVTLTA VAFDENHEEI SLVDENSEQG KRDTLRANGV
     ALFVVNIPMT VNTLQFKVKT ADTTLTEESQ ISREYEARVY SSLTNSYLYI HITGSPTGLR
     VGNYFNVNLL ASSPYKSKIK QFGYQVMSKG KLVIFGTIDF SEGVVVHNLN IEVTSEMVPS
     ARLIVYYVVT GEGSAELVVD SVWINVEEKC TSKKQVVLSK DAEVYKPGKD MFLSIEAEPS
     SFVALSSVDS AIYGVRTKAK KSIERMLQHI EKSDLGCGAG GGKNNADVFR LAGLTFMTNA
     NAKALEEHDE PCTVIMRPKR SVNFEKQVEQ ELSRFKNPTY KKCCLDGIKA YPITETCDQR
     ARRIRKGEQC FKAFRYCCQF ANKLRFQSHT HTTLGRMNIV ASLEVEETQV RSYFPESWLW
     EVHEVTARSG SKRLAVTLPD SLTTWQIQGI GISDNGICVA DPLKVQVFKE VFLKMQMPYS
     VIRGEQIELR GSIYNYKEVP NTALVSMTVG DGICLFKGSA TGSKGTQSPP IKMVVRGSSV
     SSVSYVILPL ELGLHTINFT LKSQYGNEIV MHTLRVVPEG IKKEQNVGVT LDPQGIYGFI
     KRRHEFRYMT PKNVVPKSDI NRTVSVKGEI MGEIIATVLS AEGLNLLNNL PRGSGEAELM
     SIVPVYYVFY YLEKSDNWKI LGSKTLTIRM NMRRKMIEGV TSILSFKVKG GHAYSMWKDG
     VPSTWLTAFV VRIFGELNEY VPLDEMSVCN SVMWLIEDCQ KSNGLFKETS NYQAVKLQGT
     IPKESEEREI YLTAFTVIGI QKAFHMCPTR GIQDAIFKAI DILSNKWKNV QSTYTLAITA
     YALAVQNRRT LAARFAFASL KKEALVKGNP PVYRFWKETS SQVDTSTPSV VTARIVETTA
     YALLVTILNG DMNYAKPIIK WLSEQQRYGG GFYSTQDTII ALQSLTEYAV LLKRSELDMI
     IKVSNKKHGE FLHFEMTEEK SLVRAEEVPK DDDLVISTGS GTGISTVHIK TVYHAVSTSE
     ENCDFSLRIQ GTPNIDPLSA GTRKRRESEP LQRIEACAKY LPRKNEEFTE SSHAVMDIGL
     VTGLAAEEED LDTLANGVDN LISDYKIADG HVILQLDQIP SDDYICVAFR VREMFNVGML
     SPAVFTVYEY HTPDRRCSIF YNPYGNEKLV KLCQGNECKC MEVECSQMQK EINLTVSAND
     RIEAACKEDI VYAYKVNILS AKEDGNFVKY SASILDIYKK GADRVKQTMV VTFIKKKTCT
     DVLKIGKHYL IMGTEGIETK NYMTLEYDYP LDSKVWIELW PSEQDCDVDT CTDFIKTLEE
     FSENVLFFGC S
//