ID A0A087YMF0_POEFO Unreviewed; 2520 AA.
AC A0A087YMF0;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE SubName: Full=Otogelin-like {ECO:0000313|Ensembl:ENSPFOP00000019203.1};
GN Name=OTOGL {ECO:0000313|Ensembl:ENSPFOP00000019203.1};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000019203.1, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000019203.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01001941; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 48698.ENSPFOP00000019203; -.
DR Ensembl; ENSPFOT00000019225.1; ENSPFOP00000019203.1; ENSPFOG00000019067.1.
DR eggNOG; KOG1216; Eukaryota.
DR GeneTree; ENSGT00940000160698; -.
DR OMA; ASGPCLC; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0046556; F:alpha-L-arabinofuranosidase activity; IEA:InterPro.
DR GO; GO:0046373; P:L-arabinose metabolic process; IEA:InterPro.
DR CDD; cd19941; TIL; 4.
DR Gene3D; 2.80.10.50; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 5.
DR InterPro; IPR007934; AbfB_ABD.
DR InterPro; IPR036195; AbfB_ABD_sf.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF225; OTOGELIN-LIKE PROTEIN; 1.
DR Pfam; PF05270; AbfB; 1.
DR Pfam; PF08742; C8; 4.
DR Pfam; PF01826; TIL; 3.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 4.
DR SMART; SM00041; CT; 1.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF110221; AbfB domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS51233; VWFD; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000028760}.
FT DOMAIN 70..246
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 429..606
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 897..1073
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1705..1887
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2435..2520
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1462..1517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1549..1582
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 2449..2498
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 2520 AA; 277360 MW; 432C51D8366F88FF CRC64;
PTQTYRPLLP ENSLSRSVLH NYTARDESSE YCGCLNGGWC QEDSVCDCSQ FQALGDRCQI
IPNQGQDRDG ICRSWGQHHF ETFDGIYFYF PGTCSYILAQ DCHSATPEYT VWIHNSRVCE
GSVYSCPRAL SLFFPNEEEI HISGYQVHQG GRRLSLPQTV GGVFIERLAD YLLVKSVFGF
SLAWDGGSGV YLKMSEEHQG TPCGLCGNFN HIADDDLTTA RGIRTDEPAV FANSWTVDLP
HERACPSVDL DFNGPCHSES DMDDAIEKCS ALLFFPFLSC HENIDPNPFV ASCVSDLCVA
DDEETFCRAL VEYTRACSHV GYPVREWRDS FPSCNDGCEE SFVYRDCISC CPPTCTFEKE
CLGTNLHCLD GCYCPDGLIL QNGTCIPASQ CPCVYHGTSY VQGQVLQQGC SVCVCMGGVW
NCTENNCTAE CSVVGDMFVT TFDGRMFLQP GACQYVLAKS RGSSKFTITL QYTTCEEHQV
CIQSVTVVLD EDASHQITLT REGEVIIGVN PTPTLPYVDG CTDAVNVHRL TSVVTQLKTG
IGLRLLYDGQ GGRVYLQLGS QWRGQTLGLC GTFNGNLRDD FLSPAGMIEG TPQLHANAWK
VSSACVAPVN LPIVDPCETN QHNAYYASQC DVLVETVFAP CHGYISPNIY QQQCRYQACR
CGSSCLCTAL AHYAYLCSKH GVDLNFRSQV SECGVVCLGG MQYHSCVSSC GRSCRGLANT
ETCSPADCAE GCACLDGSYY DDVRRRCVQL SQCHCYSMGG VSQPGEVTFT ASGPCLCRNG
KMECEPEEKE PDSVEEGECP EGKVYHSCRE QRGGVACAPT CRNLMLNLTC PPNTLCIPGC
VCPPGLVLHQ GECYYPENCP CAWLGLEYLP GQVVETSCYR CVCHRGYFNC SYSPCPAVCT
VYSDRHYHTF DGLEYDYQTD CQVYLLKSAG ETEVSIVAQN KDCYESGIVC MKILVIHVGL
TKIYFTDNSG NPSPSTVVGR ESEFEIWNSG YYTVVHFSSQ DLTILWDRKT TVHIRAGPHW
KGLLSGLCGN FDSVTVNDMT TSSHMEVNNA QTFGNSWSLG QCDIDYIVER PCERDLGRQP
YAKRECALLY SDVFAPCHNV VDVAWFYRNC LTDTCNCNRG GDCECLCTSI AAYAHKCCQQ
GVTIHWRSPS VCPYDCEYYN QELGDGPFSL VSAVFNDTMF GVNRTSSAVF PLTRERPGQL
PAPGILFNFM ITTGLWRNRT SRVPVVSLES AERPNYFFTV SGRSRLQLEQ WSRGPEFSRK
ATFIQHQGLF LPGYASFEVV SQPGIFLTLT RGAARAQRYS TAEGFKTSSS FTLEDSPFVI
PYRMMCEWRY QACASPCVHT CSDPDATRCQ FLPPVEGCFP RCPKNMVLDE VTRRCVYSED
CECKHDFVFK SCVMLPSTPT PFAYVTRSNR TTTPPPTPTT TAKATTISST TVSTVTTKPV
TTLLTTSPTT TPASTTTIVT TTPFETSTPS TTVMSPTPTT EITSTTPSTS PTTTPEVTTP
TTPLATSPTT LPPTTTVITT VPTTTVPETT PTTTSQPQTT ITTEISTSVE TTTALPSTEP
TTVTSTEMIT SPETEEVSTR IPWPTGPCTV SVTSPVRLNG LNSLLLSPQC LALSCSLEHR
EKITIYGSPG RIFGVSLQIC QLCPQPPFSY RIDECAELIC FNGELLFHNS SLHCRYNTAP
PQCSLLGLPI LINTDPCCPL WQCPCRCTVM SDLRVITFDG NNVALYDNGS YILVNLPRET
IIGTVEKCPT SQSPTGGTSG LCFKKLNITT SNYRIIVNRL DRKVTVNYRP AKLPFSRHSL
YVEDTGSMYL IHTPGSISIQ WYHSTGIMVL QYNAPSHAFV PTRGLCGCCD GNPEDDLKLP
NGTVVREVGD MMLFLQAWRV HLTDETEHTR RVGDNCTTGD CSTCLSMLTQ RAFTPCHSKV
PPEQFCDIMW AGDLHYKDHQ CDFLAAYVAV CYTHQVCISW RRHNFCPLRC PPGKEYQPCV
STCNSRTCLN KDYYEETTCS FIREECVCRS GTILHRADSP YCVTEDRCVC TDNEGNPRAP
GEVWNGSTRS CCLYKCMENG SVVAVEPDCS AVPTPLCERE GEYVLDVLEE GSCCPKKICE
CNMTICDSEA PPCDNGNRLV IGYSALSCCP EYRCECDPMA CPPVFAPDCR EDQFLVEVRG
QKSCCYSYLC VCESCIEPIP SCSHGEILAV DPNTTNSCCP QYHCVCDVNQ CPKSSLSCAP
GLSLVETAAP GDCCPQHHCE CQCEDRALPM CQLVSKGEVQ VEVPDSSTNC GCPQHACQKA
EVCLFQGVTV LGPGQSLVQY FEGELCYTVQ CLPYKDPQTG FYAMEITSVN CSQKCGSHQA
YVPSTDPQVC CGSCKNISCT FTNENGTAEF FTAGSSWVEN CTRYDCVETA VGAVILASGV
ICPPFNDSEC IQSGGVVQSY VDGCCKTCKE DGKTCKRVAI RTTIRKDDCR SNAPVTVYSC
DGKCPSATIF NFNINSHARF CKCCRESGLQ TRTVSLYCSR NASMVDYNFQ EPLDCSCQWN
//