ID G3PZ76_GASAC Unreviewed; 2674 AA.
AC G3PZ76;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 68.
DE SubName: Full=Otogelin {ECO:0000313|Ensembl:ENSGACP00000022916.3};
GN Name=OTOG {ECO:0000313|Ensembl:ENSGACP00000022916.3};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000022916.3, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000022916.3, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000022916.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 69293.ENSGACP00000022916; -.
DR Ensembl; ENSGACT00000022959.3; ENSGACP00000022916.3; ENSGACG00000017343.3.
DR eggNOG; KOG1216; Eukaryota.
DR GeneTree; ENSGT00940000157490; -.
DR InParanoid; G3PZ76; -.
DR OMA; YREPPGK; -.
DR TreeFam; TF330609; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR GO; GO:0046556; F:alpha-L-arabinofuranosidase activity; IEA:InterPro.
DR GO; GO:0046373; P:L-arabinose metabolic process; IEA:InterPro.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.80.10.50; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR007934; AbfB_ABD.
DR InterPro; IPR036195; AbfB_ABD_sf.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF228; OTOGELIN; 1.
DR Pfam; PF05270; AbfB; 1.
DR Pfam; PF08742; C8; 2.
DR Pfam; PF01826; TIL; 2.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 2.
DR SMART; SM00041; CT; 1.
DR SMART; SM00215; VWC_out; 1.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF110221; AbfB domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 3.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS51233; VWFD; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 38..224
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 411..587
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 880..1049
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1875..2059
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2589..2674
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1449..1470
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1502..1588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1600..1806
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1600..1623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1646..1704
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1715..1735
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1758..1790
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2674 AA; 289417 MW; E53BDC90F963186F CRC64;
SCLNGGRCVH PESCDCSLYQ ATGHRCQTVP NPGLEREMTC RSWGQYNFET FDGLYFHFPG
RCTYTLLRDC EETPQASIVI RVSNPPPRLP PGSCPDGSGR APPLPFVEHH TTEVRVAVRS
LSPPLSLLCS LQLPHHVHDL QLERISQYVL VTRQRGFTLA WEGGSGSVYL KLSPELVGRA
CGLCGNFNAD VQDDLRTSYG VLTHDVEMFG NSWAEAEPHG ARCPAVPSGF SSPCAAAEAH
VLLKVEEVCA VMLEPPFQSC HEFVSPLSFM AGCSNDLCMS GPEGDVVCQV LSEYARACAH
ADHPLHGWRQ HFPQCAARSC PAGLQHRECI SCCPASCNLE RTCIDSRLAC LDGCYCPDGL
IYEDGGCVEA SDCPCEFHGL FYHSGHTLQE ECSNCTCVGG VWNCTEHSCP GECSVTGDMY
FQSFDGRLYT FAATCQYVLA KSRNSGKFTV TIQNAPCGAN LDGSCIQSVN LVIDEDPRTE
VTLSHLGEVF LAGQYRVSLP YSDDLFHIQE LSSMFLQVRT AFGLRLQYGW AQFRVYLQAD
APWKDDTAGL CGTFNGNIQD DFLSPSGMIE STPQLFGNAW RVSSACSSSL SPPQLDPCDA
HQQAAAYAWE MCDVLNQDLF SACHQHLSPA TFHQQCRSDT CRCGTPCLCS ALAHYARHCR
RFSVVVDFRS QIPDCAVTCP ATMQYGTCVS SCQRRCSALS VPPRCGGECE EGCVCPPESF
YNHRTHTCVH RSECPCSFLG ADYEPGDVIM TSAGVQLCLD GKLVLSSSRP PTDMLCPPGQ
RYQNCSEGAD GVLLSRGVAC ERTCESSLLN LTCSSHEPCV AGCTCPPGLL KHGDECFEAA
SCPCLWKGKE YYPGDRVSSP CHQCVCQHGT FRCAFRACPA MCSAYGDRHY GTFDGLLFDY
VGACEVHLVK SSGDVMLSVT AENVGCFDGG VVCRKSLVVN VGGSSVAFDE DSGEPNPSSA
IDRKQRMFLW PAGFFTVVHF PDEDVTVLWD RKTTVHIQVG PRWQGKLSGL CGNFDTKTVN
EMRTPDNIDS ATPQEFGNSW TAAECVSSPD IRHPCSLSPL REPFAKRRCG VLLSEVFQAC
HPVLPITMFF SRTNTPAPRA SEEENTAERS QDAAAHHCIC IHVIVANCKK KKKFSISRIS
LIAGHFRERL RPPGKEYVSI VLLPLNNTHV SSCLPLPPPP AVPMTPSSSP TDTSRVSFEA
ADRPNYFLSA GAGGRVSLSK WEEGEAFREG ATFVLHRNTW SPGYDALESH ARPGFFLHAT
PARLHLLKYR HADSFRKATL FRLTGPSPDA LPVPRCQWRY DSCVVPCFKT CGDPSAEACA
GIPQVEGCLP VCPPHMVLDE VTRRCVFVED CEECLISITD IYSVTVTDHD FQTSRRCATR
PRILMSRRVR LLRQCLVQKT PLCRVYRRQR SSAHPAYTAR MSEHRDAYST WKTQPEVTYK
TVLLEVPSAE TTEETRRHAK STPPVTSATP PEITTEVAAM TTEILTSAPP TPAPGVPETL
LTTTSTRPPP GVLTTTLSPG TVQSVTETTR PSTGRAEVST SRAEVSTTRA EVSTSRAEVS
TSRAEVPTSR AEVWQTTPTP SRSPPGSTTV WDVATTHSMP TERFTPTSTT TAPTTTVGST
PGVPATTRVP PRITSPERPS VRPVTARVTA ATSPPTSSTG STPVGPGLTT TSATLPASAG
PVATLSSTKR ADTTTTSTST PDLTAVGEPA PIPTLTTPVV STTVSPTTVR ESPAPEGTLT
TKAPRPATPP PDAHLVSTQK PFATSSPAAA TSVQTTARGA AANRTSAARP PASTAHAAEA
TASRATRTCT VGGTSPPYAE IVDDCTKYIC VNNQLVLFNK SQSCPFTSDP PNCGLLGFAV
LVNGDKCCPK WDCPCRCSVF PDLNVITFDG NSVAVYKAAA YIVTRLPNET VSVLVQECPV
DSESPLLWNF TNLCLVALNI THQSDHVIVN RLQRRLYVNS RYAKPRFKKF GFEVYDTGNM
YLIRSPAGLK LQWYHSTGML VIDTDSSGSK LHAMGLCGES PFRTLKHLLL YGLLETSSPA
ACSPALKPPL LYGLLETCQS ACSPALKPPL LYGLLETCQS ACSPALKPPL LYGLLETCQS
LCSPALELPL LYGLLETCQS ACSPALELPL LYGLLETCQS ATCQSACSPG LELPLLYGLL
ETCQSPCSPA LEASPALWLL RDWSISVCTD SAGVPRAHGD VWKASGRCCM YRCDNDVVLP
VEYDCAAAPP PPPCRRAAEV LVGLADDASC CPRKVCVCNQ SLCDSAPPRC KYGEKLVSYY
RADSCCPDYV CECDPDLCES DVPACRGDQT AIATRADGSC CLAHICMCSS CTGAPPLCQD
GEVLTVDGNA TDRCCPAYQC VCEPSRCPVL VCPVGMSVAS GSSPGRCCPE QTCECSCEKI
ASPKCGLGEA AQLDRASPSD PHNQCACKRY KCVREAVCVF GERGVLRPGQ TLVERHADGV
CHSRQCSRSL DPASGFHLLR TASINCSAHC RPNQVYVPPK DQSTCCGCKN ISCLYEQENG
TAVLHKPGKS WVSNCVKFDC VETLSGPTLI SHSYSCPPFN ETECMKVGGT VVSYVDGCCK
TCKEDGKSCQ KVTVRMTIRK NDRRSNRPVN IVSCDGKCPS ASIYNYNINT YARFCKCCRE
TGLQRRSVQL YCSGNATWVS YSIQEPTECS CQWS
//