ID W5MX16_LEPOC Unreviewed; 3634 AA.
AC W5MX16;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=Cubilin (intrinsic factor-cobalamin receptor) {ECO:0000313|Ensembl:ENSLOCP00000012925.1};
GN Name=CUBN {ECO:0000313|Ensembl:ENSLOCP00000012925.1};
OS Lepisosteus oculatus (Spotted gar).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC Lepisosteus.
OX NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000012925.1, ECO:0000313|Proteomes:UP000018468};
RN [1] {ECO:0000313|Proteomes:UP000018468}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT "The Draft Genome of Lepisosteus oculatus.";
RL Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLOCP00000012925.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHAT01017530; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AHAT01017531; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AHAT01017532; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AHAT01017533; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AHAT01017534; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 7918.ENSLOCP00000012925; -.
DR Ensembl; ENSLOCT00000012952.1; ENSLOCP00000012925.1; ENSLOCG00000010501.1.
DR eggNOG; KOG4292; Eukaryota.
DR GeneTree; ENSGT00940000155299; -.
DR HOGENOM; CLU_000172_1_0_1; -.
DR InParanoid; W5MX16; -.
DR OMA; RGFTVRW; -.
DR Proteomes; UP000018468; Linkage group LG9.
DR Bgee; ENSLOCG00000010501; Expressed in mesonephros and 6 other cell types or tissues.
DR GO; GO:0005768; C:endosome; IEA:UniProtKB-KW.
DR GO; GO:0005765; C:lysosomal membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0031419; F:cobalamin binding; IEA:UniProtKB-KW.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR GO; GO:0008203; P:cholesterol metabolic process; IEA:UniProtKB-KW.
DR GO; GO:0015031; P:protein transport; IEA:UniProtKB-KW.
DR CDD; cd00041; CUB; 27.
DR CDD; cd22201; cubilin_NTD; 1.
DR CDD; cd00054; EGF_CA; 5.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 27.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR24251:SF37; CUB DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24251; OVOCHYMASE-RELATED; 1.
DR Pfam; PF00431; CUB; 27.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF12661; hEGF; 1.
DR SMART; SM00042; CUB; 27.
DR SMART; SM00181; EGF; 7.
DR SMART; SM00179; EGF_CA; 6.
DR SUPFAM; SSF57196; EGF/Laminin; 4.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 27.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01180; CUB; 27.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000018468};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..3634
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004866913"
FT DOMAIN 128..161
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 343..388
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 390..426
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 428..464
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 470..584
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 588..700
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 706..818
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 819..930
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 934..1047
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1053..1166
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1170..1282
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1283..1395
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1397..1512
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1516..1628
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1629..1745
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1749..1861
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1862..1974
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1988..2101
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2102..2223
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2227..2346
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2348..2460
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2464..2577
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2582..2699
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2701..2813
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2817..2931
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2932..3047
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3049..3162
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3169..3285
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3289..3404
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3406..3518
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3522..3634
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT REGION 23..44
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2164..2189
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 99..126
FT /evidence="ECO:0000256|SAM:Coils"
FT DISULFID 354..371
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 416..425
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 454..463
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2348..2375
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
FT DISULFID 3049..3076
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
SQ SEQUENCE 3634 AA; 396620 MW; F28EB278305006D8 CRC64;
AGQLPWLLVI FLIIAGLKCS SEQYGGSKQK RDISKDQPRM TSSEGNLIFK TGDYKYIIFQ
PGNQGSVKIG EENLSQLIDQ IKKNKDDITF IKANGGGTSQ NVTDQINQLR TRMTTLEGKV
QNIEQTIQRK TCSSNPCQNT GTCLSLLDSY HCICPSNWQL GLEPGSSSVS STNCHNIYPF
SNMSALFSSR CNCPPEWYGP HCTSRYDDCE GGSQDLCEHG ICIDSDRVIP NQPKYKCICD
AGWTTLPGNP ACTADINECN LPNKPCSTNP HVECFNTIGS YYCGACPAGW QGNGYSCQDV
NECTTNNGGC STSPMVQCLN TMGSYHCGPC PPGYEGDGKT CTQANVCSVN NGGCHPLATC
ISSSGSLLPI CTCPPGYAGN GYGPAGCTPI SNICQSHSPC VNGQCMSTTT GYICTCDPGW
SGVNCTENIN ECVSNPCQNG GTCVDGINGY TCTCTDHWMG LQCQTPQQVC GGYLTAVAGT
FSYPNFPGND QYDHQVSCAW VVRTDSDKVL RITFPHFDLE NSANCNYDFL QIHDGESASG
FMIGRYCGTN VPTELFSSHN SLYFWFRSDH SVSAGGFTVA WESRDPECGG EMTSTYGSIT
SPGYPGNYPP NRDCYWTVTV NPGLLITFAF GTLSLEHHDN CNFDFLEIRD GLLPEDPVLG
RFCSSVSPPP LQTTGPLAWI HFHSDFTVSD RGFHITYTTS PSDPGCGGTY TNSEGVIISP
NWPNSYANNR QCIYVIRQPV NEIVFLNFTN MDLESHSGCA FDYVEVRDGS LETDPLIGKY
CGSSLPAPIT STSNALWLRF KSDASVTRGG FRATYQVACG GTFSGSGLIR SPYHPNPYPH
DKRCEWVITQ PTGYVVTLNF LSFDIESGTS CGFDYVEVRD GATGSSPLLG RYCGLQIPSL
IQSTQRSMFI LFHTDSSVTN HGFTAEYNYA EAGCGGTLTE SSGTVTSPGH PTNYPHGANC
TWYITAAPGH IIRLSFTSFN LEYHHECLYD YVDVYDNGTT ATGSLLGRYC GRSVPPSLTS
TDNLLTILFV SDASLATEGF SAGYVTINAT TDCAEVYTSP TGTLTSPNYP NNYPMNRECV
YKIIVETNRQ IMLNFTDFML EGPYPPCSYD YIEIRDGGYE TSPLIGKYCS EQPPPILISH
SNRFWIKFRS DSSVTAKGFM AHWDGTLTGC GGTLTTNSGS FSSPNYPLPY HPNAECYWTL
KAPGGSLIQL QFGDFHLESN SNCNYDYLAV YDGNSSDSRP LAKLCGNELP APIRSSREKM
YIKLRTDNSV SAGGFLASYN QICQGVAIAN QSTGFVESPN YPNTYPHYAQ CSWTIQASAG
NTINYTFTAF DIEPLCDYDY IKVYDGPNDQ YPLIGTFCGD TPPPANTTSG PSLHVVFRSD
SWVAHNGFQM LWYQNGCGGD LFGPQGFFNS PGYPNRYPDN RECIWHIETA PGSSIQITIY
EFDIEFHADC NYDLLEVYGG PDLLSPRLAR LCTTRPPGQP LHVSTTGNFV TVLFKSDLYV
SGKGFNASWQ EVQGGCGGLF TAPTGEIHSP RYPNNYPDNV DCSWVISVDR HHRIILNFTD
LDIEQHGTCA YDYIAIYDGP SASAPILGHL CGQQRPSPLT STQNTIFLRF RSDSSYNHRG
FRAQFSETCG AIIQSDDLGG AIASPRYPAS YPANQNCSWI IKAQEPFNHV TLSFTDFNLE
NKNNNCSTDV VEVLDGDNYA APSMGRYCGA DLPHPVTSFS NALVVNFVTD GQDSEKGFRA
VYASSTSACG GHLHMETGAF NSPNYPEAYP PNVECVWSIT SSPGNRLQLS FISFDIQQSS
GCSSDYLEIR EGNSTGQLVG RFCGGLLPAN YTSVIGHILW LKFVSDSSVP GAGFRATFAH
LYGNEISGVS GQIASPLWPR NYPVDAHYRW TITVDAGFFI QVHFLDMDIE DLYDCYFDNL
NIFDGPNAHS YSLGKFCGLS LPPLVTSSGS TMTVEFHSDE SVSGRGFLIE WQAVQSSGPL
PTIAPGTCGG ALMTSESPMF LFSPGWPENY PENIECTWII RSPGSTVEFN LLSVDIEADI
TCFYDSLVIR DGETNLSPLL ATICGRELPG PVHSSGDTMF LRFTSDGSYS GAGFNASFHK
GCGGLFHADR GVISSPRYPE NYSPNLDCTW HVVVTSGLTI SLEFESQFHV QGLNTQCTTG
DYLELRNGPD GSSPPLGPSG GNGHYCGTSS PSSLHTTDNT LFVHFVSDGS SEALGFKFTF
EARGLVCGGG IELSDSDPPG YVMSPNYPSN YPQNVDCIWV ITVPSGRVIQ LDFEDEFYIE
SSARPSCLYD YLELRDGATS SADLIARLCG IQLPSTQHST GSVMYLRFRT DSSVTHKGFK
AKYSIATCGG RYVGQSGIIK SPGFPDSSYP DGSYCEWYLQ GPTGHYLTLT YIAMDLQTST
DCSADHVEIR ENNATGHLLG RHCGSALPSP VVTADSYAYV KFVSDGSENA LGFSLRFDAS
VEECGGDLNA PLGTITSPNY PNLYPHSRVC EWRITVPQGR RVTLTFNDLH LEDHNTCLYD
YVAVYNGLLS SAPTLARYCG TVSTGTLVKS SGNTMAVVFV TDASVSNGGF SADYSSNEPA
VCGGILNAPG GGNFTSPGYD GVSNYTDNLS CEWLIQNPTH INSSIHISFD DFHLEHHQTC
EWDYIEFRLG DSNGELVTRY CGQSVPNMPL VVFAPQLWVH FFTDPYVEDI GFKARYSFSG
CGGLQSGESG VVSSPNYPNP YDNLSRCAWL LEAPEGHRIT ITFNYFDVEQ HPQCSWDSVT
IMNGGSPGSP VIGHYCGTSS PGTVQSGSNN LVIVFIADHT VSRGGFYATW TADSSGCGGI
IHADSGNIKS PGHPQNFPAN SECVWRIIAH EGNHLEMNFS SDFEIPDSGG QCQSSYVKVW
AGGADQENSL LATGCGTTAP GLVVAPFNVI TARFQSQETI GKGFSAFFTT RCGANFTAPE
GRIVSPNYPE HYPHDSNCNY LINAGQQKVV ILHFQTFQVE GRCLSQSTCA YDGLKIYSGT
FPTAPLLTTL CGSEIPGPFT TFGPMLLNFY SDNSINDNGF LAQYEVVSCG GVFNNSAGTV
SSPTLSITNY HNNMNCTYHI IVSQDRVVEL KFNTFHLEAS SNCRYDYVAV YNGNDSNASL
LGRFCGRELP LTLRSSFNHL FIMFITDFSI GAGGWRATYR ETLGPQQGCG GYLTSLTGMF
GSPDINYDGK YEAGLDCLWN IVVPPNKVVN LTFTSFHLES PSGTSCRYDY VKIYDGDNVN
SPLVGTFCGA LVPASFTSTN NFLTIRFITD GSVAFSGFNA TFTAVDLLCG GLLNATTSPQ
TITFPQHPNN YPDFTNCRWT LDSPTQEHVR LSVQHFHLQS TQNCAQNYLE FKDWPGGDYG
QSHKFCGTDV TVPDFYSYGR TVLVSFKSDE YMNGNGFSLT YQIASCSRVY EQAFGYLKSP
GWPDMYPHNI ECEIILHAPE NNSISLFFNS FDLETHTSCA YDFLEIKNGS TSSAPLLGRY
CGQTLPSPVF PMSNLLYLHF KSDFSNARNG FEITWTSTPT GCGGVLYGDH GSFASPNYPG
TYNNGTDCEW TITAPVGRVV TVTFFYINID DPGDCQNNYL KLYDGPNSNT QPIGPYCGVE
TSIAPFTSSF HQVFIQFHSQ YVTQPSGFRL TWRS
//