GenomeNet

Database: UniProt
Entry: W5MX16_LEPOC
LinkDB: W5MX16_LEPOC
Original site: W5MX16_LEPOC 
ID   W5MX16_LEPOC            Unreviewed;      3634 AA.
AC   W5MX16;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 64.
DE   SubName: Full=Cubilin (intrinsic factor-cobalamin receptor) {ECO:0000313|Ensembl:ENSLOCP00000012925.1};
GN   Name=CUBN {ECO:0000313|Ensembl:ENSLOCP00000012925.1};
OS   Lepisosteus oculatus (Spotted gar).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC   Lepisosteus.
OX   NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000012925.1, ECO:0000313|Proteomes:UP000018468};
RN   [1] {ECO:0000313|Proteomes:UP000018468}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT   "The Draft Genome of Lepisosteus oculatus.";
RL   Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLOCP00000012925.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AHAT01017530; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AHAT01017531; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AHAT01017532; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AHAT01017533; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AHAT01017534; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 7918.ENSLOCP00000012925; -.
DR   Ensembl; ENSLOCT00000012952.1; ENSLOCP00000012925.1; ENSLOCG00000010501.1.
DR   eggNOG; KOG4292; Eukaryota.
DR   GeneTree; ENSGT00940000155299; -.
DR   HOGENOM; CLU_000172_1_0_1; -.
DR   InParanoid; W5MX16; -.
DR   OMA; RGFTVRW; -.
DR   Proteomes; UP000018468; Linkage group LG9.
DR   Bgee; ENSLOCG00000010501; Expressed in mesonephros and 6 other cell types or tissues.
DR   GO; GO:0005768; C:endosome; IEA:UniProtKB-KW.
DR   GO; GO:0005765; C:lysosomal membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0031419; F:cobalamin binding; IEA:UniProtKB-KW.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   GO; GO:0008203; P:cholesterol metabolic process; IEA:UniProtKB-KW.
DR   GO; GO:0015031; P:protein transport; IEA:UniProtKB-KW.
DR   CDD; cd00041; CUB; 27.
DR   CDD; cd22201; cubilin_NTD; 1.
DR   CDD; cd00054; EGF_CA; 5.
DR   Gene3D; 2.10.25.10; Laminin; 7.
DR   Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 27.
DR   InterPro; IPR000859; CUB_dom.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR013032; EGF-like_CS.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR018097; EGF_Ca-bd_CS.
DR   InterPro; IPR024731; EGF_dom.
DR   InterPro; IPR035914; Sperma_CUB_dom_sf.
DR   PANTHER; PTHR24251:SF37; CUB DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR24251; OVOCHYMASE-RELATED; 1.
DR   Pfam; PF00431; CUB; 27.
DR   Pfam; PF00008; EGF; 2.
DR   Pfam; PF12947; EGF_3; 1.
DR   Pfam; PF07645; EGF_CA; 2.
DR   Pfam; PF12661; hEGF; 1.
DR   SMART; SM00042; CUB; 27.
DR   SMART; SM00181; EGF; 7.
DR   SMART; SM00179; EGF_CA; 6.
DR   SUPFAM; SSF57196; EGF/Laminin; 4.
DR   SUPFAM; SSF49854; Spermadhesin, CUB domain; 27.
DR   PROSITE; PS00010; ASX_HYDROXYL; 1.
DR   PROSITE; PS01180; CUB; 27.
DR   PROSITE; PS00022; EGF_1; 2.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50026; EGF_3; 4.
DR   PROSITE; PS01187; EGF_CA; 2.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Reference proteome {ECO:0000313|Proteomes:UP000018468};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..3634
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004866913"
FT   DOMAIN          128..161
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          343..388
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          390..426
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          428..464
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          470..584
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          588..700
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          706..818
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          819..930
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          934..1047
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1053..1166
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1170..1282
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1283..1395
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1397..1512
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1516..1628
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1629..1745
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1749..1861
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1862..1974
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          1988..2101
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2102..2223
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2227..2346
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2348..2460
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2464..2577
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2582..2699
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2701..2813
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2817..2931
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          2932..3047
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          3049..3162
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          3169..3285
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          3289..3404
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          3406..3518
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          3522..3634
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   REGION          23..44
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2164..2189
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          99..126
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   DISULFID        354..371
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        416..425
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        454..463
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        2348..2375
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
FT   DISULFID        3049..3076
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
SQ   SEQUENCE   3634 AA;  396620 MW;  F28EB278305006D8 CRC64;
     AGQLPWLLVI FLIIAGLKCS SEQYGGSKQK RDISKDQPRM TSSEGNLIFK TGDYKYIIFQ
     PGNQGSVKIG EENLSQLIDQ IKKNKDDITF IKANGGGTSQ NVTDQINQLR TRMTTLEGKV
     QNIEQTIQRK TCSSNPCQNT GTCLSLLDSY HCICPSNWQL GLEPGSSSVS STNCHNIYPF
     SNMSALFSSR CNCPPEWYGP HCTSRYDDCE GGSQDLCEHG ICIDSDRVIP NQPKYKCICD
     AGWTTLPGNP ACTADINECN LPNKPCSTNP HVECFNTIGS YYCGACPAGW QGNGYSCQDV
     NECTTNNGGC STSPMVQCLN TMGSYHCGPC PPGYEGDGKT CTQANVCSVN NGGCHPLATC
     ISSSGSLLPI CTCPPGYAGN GYGPAGCTPI SNICQSHSPC VNGQCMSTTT GYICTCDPGW
     SGVNCTENIN ECVSNPCQNG GTCVDGINGY TCTCTDHWMG LQCQTPQQVC GGYLTAVAGT
     FSYPNFPGND QYDHQVSCAW VVRTDSDKVL RITFPHFDLE NSANCNYDFL QIHDGESASG
     FMIGRYCGTN VPTELFSSHN SLYFWFRSDH SVSAGGFTVA WESRDPECGG EMTSTYGSIT
     SPGYPGNYPP NRDCYWTVTV NPGLLITFAF GTLSLEHHDN CNFDFLEIRD GLLPEDPVLG
     RFCSSVSPPP LQTTGPLAWI HFHSDFTVSD RGFHITYTTS PSDPGCGGTY TNSEGVIISP
     NWPNSYANNR QCIYVIRQPV NEIVFLNFTN MDLESHSGCA FDYVEVRDGS LETDPLIGKY
     CGSSLPAPIT STSNALWLRF KSDASVTRGG FRATYQVACG GTFSGSGLIR SPYHPNPYPH
     DKRCEWVITQ PTGYVVTLNF LSFDIESGTS CGFDYVEVRD GATGSSPLLG RYCGLQIPSL
     IQSTQRSMFI LFHTDSSVTN HGFTAEYNYA EAGCGGTLTE SSGTVTSPGH PTNYPHGANC
     TWYITAAPGH IIRLSFTSFN LEYHHECLYD YVDVYDNGTT ATGSLLGRYC GRSVPPSLTS
     TDNLLTILFV SDASLATEGF SAGYVTINAT TDCAEVYTSP TGTLTSPNYP NNYPMNRECV
     YKIIVETNRQ IMLNFTDFML EGPYPPCSYD YIEIRDGGYE TSPLIGKYCS EQPPPILISH
     SNRFWIKFRS DSSVTAKGFM AHWDGTLTGC GGTLTTNSGS FSSPNYPLPY HPNAECYWTL
     KAPGGSLIQL QFGDFHLESN SNCNYDYLAV YDGNSSDSRP LAKLCGNELP APIRSSREKM
     YIKLRTDNSV SAGGFLASYN QICQGVAIAN QSTGFVESPN YPNTYPHYAQ CSWTIQASAG
     NTINYTFTAF DIEPLCDYDY IKVYDGPNDQ YPLIGTFCGD TPPPANTTSG PSLHVVFRSD
     SWVAHNGFQM LWYQNGCGGD LFGPQGFFNS PGYPNRYPDN RECIWHIETA PGSSIQITIY
     EFDIEFHADC NYDLLEVYGG PDLLSPRLAR LCTTRPPGQP LHVSTTGNFV TVLFKSDLYV
     SGKGFNASWQ EVQGGCGGLF TAPTGEIHSP RYPNNYPDNV DCSWVISVDR HHRIILNFTD
     LDIEQHGTCA YDYIAIYDGP SASAPILGHL CGQQRPSPLT STQNTIFLRF RSDSSYNHRG
     FRAQFSETCG AIIQSDDLGG AIASPRYPAS YPANQNCSWI IKAQEPFNHV TLSFTDFNLE
     NKNNNCSTDV VEVLDGDNYA APSMGRYCGA DLPHPVTSFS NALVVNFVTD GQDSEKGFRA
     VYASSTSACG GHLHMETGAF NSPNYPEAYP PNVECVWSIT SSPGNRLQLS FISFDIQQSS
     GCSSDYLEIR EGNSTGQLVG RFCGGLLPAN YTSVIGHILW LKFVSDSSVP GAGFRATFAH
     LYGNEISGVS GQIASPLWPR NYPVDAHYRW TITVDAGFFI QVHFLDMDIE DLYDCYFDNL
     NIFDGPNAHS YSLGKFCGLS LPPLVTSSGS TMTVEFHSDE SVSGRGFLIE WQAVQSSGPL
     PTIAPGTCGG ALMTSESPMF LFSPGWPENY PENIECTWII RSPGSTVEFN LLSVDIEADI
     TCFYDSLVIR DGETNLSPLL ATICGRELPG PVHSSGDTMF LRFTSDGSYS GAGFNASFHK
     GCGGLFHADR GVISSPRYPE NYSPNLDCTW HVVVTSGLTI SLEFESQFHV QGLNTQCTTG
     DYLELRNGPD GSSPPLGPSG GNGHYCGTSS PSSLHTTDNT LFVHFVSDGS SEALGFKFTF
     EARGLVCGGG IELSDSDPPG YVMSPNYPSN YPQNVDCIWV ITVPSGRVIQ LDFEDEFYIE
     SSARPSCLYD YLELRDGATS SADLIARLCG IQLPSTQHST GSVMYLRFRT DSSVTHKGFK
     AKYSIATCGG RYVGQSGIIK SPGFPDSSYP DGSYCEWYLQ GPTGHYLTLT YIAMDLQTST
     DCSADHVEIR ENNATGHLLG RHCGSALPSP VVTADSYAYV KFVSDGSENA LGFSLRFDAS
     VEECGGDLNA PLGTITSPNY PNLYPHSRVC EWRITVPQGR RVTLTFNDLH LEDHNTCLYD
     YVAVYNGLLS SAPTLARYCG TVSTGTLVKS SGNTMAVVFV TDASVSNGGF SADYSSNEPA
     VCGGILNAPG GGNFTSPGYD GVSNYTDNLS CEWLIQNPTH INSSIHISFD DFHLEHHQTC
     EWDYIEFRLG DSNGELVTRY CGQSVPNMPL VVFAPQLWVH FFTDPYVEDI GFKARYSFSG
     CGGLQSGESG VVSSPNYPNP YDNLSRCAWL LEAPEGHRIT ITFNYFDVEQ HPQCSWDSVT
     IMNGGSPGSP VIGHYCGTSS PGTVQSGSNN LVIVFIADHT VSRGGFYATW TADSSGCGGI
     IHADSGNIKS PGHPQNFPAN SECVWRIIAH EGNHLEMNFS SDFEIPDSGG QCQSSYVKVW
     AGGADQENSL LATGCGTTAP GLVVAPFNVI TARFQSQETI GKGFSAFFTT RCGANFTAPE
     GRIVSPNYPE HYPHDSNCNY LINAGQQKVV ILHFQTFQVE GRCLSQSTCA YDGLKIYSGT
     FPTAPLLTTL CGSEIPGPFT TFGPMLLNFY SDNSINDNGF LAQYEVVSCG GVFNNSAGTV
     SSPTLSITNY HNNMNCTYHI IVSQDRVVEL KFNTFHLEAS SNCRYDYVAV YNGNDSNASL
     LGRFCGRELP LTLRSSFNHL FIMFITDFSI GAGGWRATYR ETLGPQQGCG GYLTSLTGMF
     GSPDINYDGK YEAGLDCLWN IVVPPNKVVN LTFTSFHLES PSGTSCRYDY VKIYDGDNVN
     SPLVGTFCGA LVPASFTSTN NFLTIRFITD GSVAFSGFNA TFTAVDLLCG GLLNATTSPQ
     TITFPQHPNN YPDFTNCRWT LDSPTQEHVR LSVQHFHLQS TQNCAQNYLE FKDWPGGDYG
     QSHKFCGTDV TVPDFYSYGR TVLVSFKSDE YMNGNGFSLT YQIASCSRVY EQAFGYLKSP
     GWPDMYPHNI ECEIILHAPE NNSISLFFNS FDLETHTSCA YDFLEIKNGS TSSAPLLGRY
     CGQTLPSPVF PMSNLLYLHF KSDFSNARNG FEITWTSTPT GCGGVLYGDH GSFASPNYPG
     TYNNGTDCEW TITAPVGRVV TVTFFYINID DPGDCQNNYL KLYDGPNSNT QPIGPYCGVE
     TSIAPFTSSF HQVFIQFHSQ YVTQPSGFRL TWRS
//
DBGET integrated database retrieval system