GenomeNet

Database: UniProt
Entry: I3JUR1_ORENI
ID   I3JUR1_ORENI            Unreviewed;      2281 AA.
AC   I3JUR1;
DT   11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 2.
DT   27-MAR-2024, entry version 56.
DE   SubName: Full=Zonadhesin {ECO:0000313|Ensembl:ENSONIP00000012606.2};
GN   Name=LOC100696410 {ECO:0000313|Ensembl:ENSONIP00000012606.2};
OS   Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX   NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000012606.2, ECO:0000313|Proteomes:UP000005207};
RN   [1] {ECO:0000313|Proteomes:UP000005207}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Broad Institute Genome Assembly Team;
RG   Broad Institute Sequencing Platform;
RA   Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL   Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSONIP00000012606.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 8128.ENSONIP00000042179; -.
DR   Ensembl; ENSONIT00000012616.2; ENSONIP00000012606.2; ENSONIG00000010024.2.
DR   eggNOG; KOG1216; Eukaryota.
DR   GeneTree; ENSGT00940000163883; -.
DR   HOGENOM; CLU_001167_0_0_1; -.
DR   TreeFam; TF316399; -.
DR   Proteomes; UP000005207; Linkage group LG18.
DR   GO; GO:0016020; C:membrane; IEA:InterPro.
DR   CDD; cd06263; MAM; 4.
DR   CDD; cd19941; TIL; 2.
DR   Gene3D; 2.60.120.200; -; 5.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000998; MAM_dom.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR025615; TILa_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR23282; APICAL ENDOSOMAL GLYCOPROTEIN PRECURSOR; 1.
DR   PANTHER; PTHR23282:SF123; MAM DOMAIN-CONTAINING GLYCOSYLPHOSPHATIDYLINOSITOL ANCHOR PROTEIN 1; 1.
DR   Pfam; PF08742; C8; 2.
DR   Pfam; PF00629; MAM; 5.
DR   Pfam; PF01826; TIL; 2.
DR   Pfam; PF12714; TILa; 1.
DR   Pfam; PF00094; VWD; 4.
DR   PRINTS; PR00020; MAMDOMAIN.
DR   SMART; SM00832; C8; 2.
DR   SMART; SM00137; MAM; 4.
DR   SMART; SM00215; VWC_out; 2.
DR   SMART; SM00216; VWD; 3.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 5.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR   PROSITE; PS50060; MAM_2; 5.
DR   PROSITE; PS51233; VWFD; 3.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..2281
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5025411864"
FT   DOMAIN          80..172
FT                   /note="MAM"
FT                   /evidence="ECO:0000259|PROSITE:PS50060"
FT   DOMAIN          222..399
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          603..783
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          893..1050
FT                   /note="MAM"
FT                   /evidence="ECO:0000259|PROSITE:PS50060"
FT   DOMAIN          1077..1233
FT                   /note="MAM"
FT                   /evidence="ECO:0000259|PROSITE:PS50060"
FT   DOMAIN          1295..1452
FT                   /note="MAM"
FT                   /evidence="ECO:0000259|PROSITE:PS50060"
FT   DOMAIN          1532..1689
FT                   /note="MAM"
FT                   /evidence="ECO:0000259|PROSITE:PS50060"
FT   DOMAIN          2071..2281
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   REGION          1242..1267
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1690..1946
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2120..2168
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2138..2161
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2281 AA;  250474 MW;  71B7A13229ABCFA5 CRC64;
     MLGGIRTDLV VTVAVLFLTQ TGTQCRAEND FLLVTLPEWR AHSEYVIQCF DDRRANLNCD
     WTASEKGAGD FAVNVSESGP LGIEGKACLE FWYLVPVAAN GSELRVLLKN SVGSVEIWTS
     PALPQNAWRQ AFVPLNITEP GSQVVFEAVQ GLSIEKEITF KQIGVRKGSC GQQCESNTEL
     WTDVTTRCLC SAGQLSCFPS QCPKGQLCGP QRGGPNGTST FGMCTIHSQA DGSTFDGVLF
     RFTTACTYVL AKTCSPTETL PMFTVEVVNE QNDNTSLPQV QKININMRIY RVSLLKSQTD
     RVVVNGIWRK LPLSLNSGTV NIRSNPATIV LATTFGLIVS FDSTGVVHVT LPSTYSDKVC
     GMCGNFNNHK EDDFGNLNAQ NATALAESWQ TGEPASSCET ILFPQCDPVE EAEYASEQYC
     GGLFSTTGPF ADCLSVVGAD SYIRGCVFGM CSTRGDPAVL CETLQVYTDI CNKAGIAVPM
     LRNSTLCPIQ CGENSHYNSC ADGCPEVCSN LDIAGSCGSC VARCECDSGF KLSGEKCVPA
     EDCGCWYNGK HYEKGETLVE GDCVQQCQCM GNNNVQCTQM QCADNEVCKV KDGVKGCFLF
     KPTTCSVYGD PHYITFDGLA YDFQGGCSYI LTTTCGGGSS VQFTVSGHNT HPPLQNFTRS
     KLQAVALQVE DLYLILNQSG EVYVNYKQVQ LPYSTKGTYG SVWVYVKNNH IILETTFGLK
     MQIDGENRLF LQVDEHYKYE LCGLCGTYSG YQDDDFVTPG GQNVSEAFEF GDSWRVPNNS
     ECLARPNNPR LCDKDEKDTA YNECSVLFGS GFMPCHEHIH PSIYLNSCVY DYCATSGDQH
     TLCESLKSYA TACQVSGVEL PNWQTGTACE DLSTSTAPPT TASPTPDQTF CPLNCNFEKD
     LCGWEQLMQD TFDWTKHSGP TPTNLTGPNQ DHTTGAGFYM YLEGDSVTHG DSARLMSSAC
     NFNGPLCLYF WYHMYGSATA MALNIYLLKD SKTTKLWSMM DNQGPEWHLG RADVKVSGPF
     QIIIEGIRGS NYQSDVAIDD ISIHFGSCSD GFPVLGSGTK PSTTTTEMLP LHQICNLDCN
     FDSNLCSWNQ MITDAFDWTW QRGSTPTLMT GPSADHTGDG HYLYIEANSA TYGDTARLIS
     SECSDSGPQC LQFWYHMYGT ADTMGLHVYL LQNRIANQIW RKQNDQGNMW HLAQVDIEPT
     TAFQIIIEGR RGSNDQSDVA IDDLKLYRGH CSNLVGGITT QSARPDLNTT TPPSVTVQTT
     AAPQPPMVTT TFQPPIINTT VPSNNRPPLQ PVCQLHCNFE QDLCQWNQLI TDAFDWTRQS
     GSTPTINTGP STDHTTGGGH YLYIEANSAT YGDTARLISS ECSDSGPQCL QFWYHMYGTA
     DTMGLHVYLL QNRIANQIWR KQNDQGNMWH LAQVDIEPTT AFQIIIEGRR GSNDQSDVAI
     DDLKLYHGHC SDLVGGITTQ SARPDLNATA PPSVPVQTTA APQPPMVTTT FQPPMINITV
     PPIVEETTNL INKNNVTEAP DNRPPLQPVC QLHCNFEQDL CQWNQLITDA FDWTRQSGST
     PTINTGPSTD HTTGGGHYLY IEANSATYGD TARFISSECS DSGPQCLQFW YHMYGSADTM
     GLHVYLVQNR TANQVWKKQN DQGNMWHLAQ VDITATDNFQ IIFEGRRGSN TQSDVAIDDV
     SLHRGRCAEL VNPTTTTTQS KPQTTARPQP STTTQLKPET TAGPQPSTTT QLKPETTAGP
     QPSTTTQLKP ETTAGPQPST TTQLKPETTA RPQPSTTTQL KPETTARPQP STTTQLKPET
     TARPQPSTTT QLKPETTAGP QPSTTTQLKP ETTAGPQPST TTQLKPETTA GPQPSTTTQL
     KPETTARPQP STTTQLKPET TARPQPSTTT QLKPETTAGP QPSTTTQLKP ETTAGPQPST
     TAGPQPTINK PQSTTARPQQ TTLPVPTPSC PRYSHYTTCV PACSPTCVFL NGPPHCSDNG
     VCVPGCVCDD GFVMKMKICV PLERCGCVDK NGTKHQFNEQ WYTDHCSQKC ECEEDDGIGK
     IDCDDEDECD GNAVCLQNEK GNYYCHSTDF DECTIKKDPE YRTFDKMKHD FEGEDSYVLV
     RTSNLPNNLP DVYIESINTP VLDQGGDSQH ENDSSSEEEQ NSKDDDDDDD DDDDDDDDDD
     SKEHDEHHRL QELKIRVYNH TMEFKKNRKL IVDGKPTDTP VSVTGGLKIW KRSSRIYLQT
     DFGLSVAFNG HHSAEITLPH IYRSKVGGLC GNFDAQKNND RMKPDGTIAR STQEFGESWR
     V
//