ID A0A2T7PRG2_POMCA Unreviewed; 2613 AA.
AC A0A2T7PRG2;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Cubilin {ECO:0008006|Google:ProtNLM};
GN ORFNames=C0Q70_02978 {ECO:0000313|EMBL:PVD36008.1};
OS Pomacea canaliculata (Golden apple snail).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Gastropoda;
OC Caenogastropoda; Architaenioglossa; Ampullarioidea; Ampullariidae; Pomacea.
OX NCBI_TaxID=400727 {ECO:0000313|EMBL:PVD36008.1, ECO:0000313|Proteomes:UP000245119};
RN [1] {ECO:0000313|EMBL:PVD36008.1, ECO:0000313|Proteomes:UP000245119}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SZHN2017 {ECO:0000313|EMBL:PVD36008.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:PVD36008.1};
RA Liu C., Liu B., Ren Y., Zhang Y., Wang H., Li S., Jiang F., Yin L.,
RA Zhang G., Qian W., Fan W.;
RT "The genome of golden apple snail Pomacea canaliculata provides insight
RT into stress tolerance and invasive adaptation.";
RL Submitted (APR-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00196}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PVD36008.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PZQS01000002; PVD36008.1; -; Genomic_DNA.
DR STRING; 400727.A0A2T7PRG2; -.
DR Proteomes; UP000245119; Miscellaneous, Linkage group lg2.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00041; CUB; 5.
DR CDD; cd00054; EGF_CA; 9.
DR Gene3D; 2.10.25.10; Laminin; 15.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 5.
DR InterPro; IPR005533; AMOP_dom.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR003886; NIDO_dom.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR001190; SRCR.
DR InterPro; IPR017448; SRCR-like_dom.
DR InterPro; IPR036772; SRCR-like_dom_sf.
DR PANTHER; PTHR24034; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24034:SF166; VACUOLAR-SORTING RECEPTOR 1; 1.
DR Pfam; PF12662; cEGF; 5.
DR Pfam; PF00431; CUB; 5.
DR Pfam; PF07645; EGF_CA; 6.
DR Pfam; PF06119; NIDO; 1.
DR PRINTS; PR00258; SPERACTRCPTR.
DR SMART; SM00042; CUB; 5.
DR SMART; SM00181; EGF; 20.
DR SMART; SM00179; EGF_CA; 15.
DR SMART; SM00539; NIDO; 1.
DR SMART; SM00202; SR; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 6.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 5.
DR SUPFAM; SSF56487; SRCR-like; 1.
DR PROSITE; PS50856; AMOP; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 11.
DR PROSITE; PS01180; CUB; 5.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 11.
DR PROSITE; PS50026; EGF_3; 9.
DR PROSITE; PS01187; EGF_CA; 6.
DR PROSITE; PS50287; SRCR_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00196};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000245119};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2484..2509
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 60..171
FT /note="SRCR"
FT /evidence="ECO:0000259|PROSITE:PS50287"
FT DOMAIN 177..285
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 284..405
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 411..514
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 515..625
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 629..746
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 918..1064
FT /note="AMOP"
FT /evidence="ECO:0000259|PROSITE:PS50856"
FT DOMAIN 1512..1552
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1689..1730
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1731..1771
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1904..1942
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1986..2027
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2139..2180
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2184..2222
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2223..2261
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2262..2299
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 2518..2537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 141..151
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00196"
FT DISULFID 629..656
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
SQ SEQUENCE 2613 AA; 287538 MW; 634B15C632CC4168 CRC64;
MQTPMAQPQG KLSEAPYPAM ERKLRYQNAL FPVGMETATH YKLLESAVIQ ASLNHETKTI
RLANGKTPYE GRIEVFTKGV WGLITDYSYS NRPVHNAIAA GLACRNLFGP DYGGSVVPKE
EASVRFGVPR GRAILMSNIV CNGNEKSLLD CDRSTTDTST TLEDSLSVVC KTVPKGCNGY
LEIATGNLTS PGYPQSKVYT TDTDCVWKIV NKVAKIRLSF TKVDFNGSTL TTSNSLEVFD
SNGYLSLSSV TSSTNVTSSS SEIYVWLKVP SGSEGIQFHV NWDAAPCFES ISSSGYIRSP
GYPQQYPTNL NCTYHLLKRR GLHASISFSN FALSNMVQGV CGDYLQVFDG DTELSPSIRG
PLCNTTRPAT ITTTNVNSTK VIVKFTSDAV NTSVENGFEA YFRSLEPDPI RPGNYTEDSG
TFTTPGYPNV YNGEEDVIWT ITTKPYKTIT INFHHVHLKG CCNDYIMIYD QVRSTSYYGG
NIPSFATDTN SVQVTMQSSL DNEMNVINAS WITECRQVFT AFSGSILSPN DTTFDSASAD
CTYIIRQQPG YYITLQFSTI NFERDIKCSN YIEIHDGSTD LFPLLGERYC GQTLPPTLNS
TQNIVLLKFA TQTGKNSFSL RYSSHPKECS SGLMKDSDGS FTSPGFPLGY PSNTYCVWTI
NLDQKDLQIE LTFDDFSVGS QGTQCQYAFV RVYDADTVDA NKVMPTLCGT TLPTKLRSTG
HSMTVVLNVF GSLTNANGFR ATYAGRRLNG LFPYGTAARD RALGSNTYET MEVTIEDGFP
FGDSLREAVY IGTNGIISFQ KGSAWPRLDP TSVVVCVYCA YISNIQGVYY HLYNDVSENR
GVLEKASSEV QDFTGMFSFS ARTVLVVTWD RVQQSFPSLS MGDSTFQVAL VTDGAASYAI
VQYRLGNKGS WIYELGRIPS DYRQVCRDWH SRNKVYEKRK ERETGFANLP QCPCSRRGFF
RHIVNQWRFH RSDEQNHFEC YILNVARTRL FAPLGKECCY STDPLKPEQM DMLITSRPFA
GASLAYNPLI FEQTDLYETE DRLAHVACCL NSPDRFCRRF YQLRPVGDCD PNPALGWSPL
YGDPHVTTLD GRQYIFNGLG EFTMATVRTA HVTFTLQART QRAELRPGTP TNGTVFTAVG
AEENGVRVFL QIDPNTNTSK KSSVCARALY HVFCVVSALI LFANNIDYTR QLQQEGDNFV
LDHEGAVDNS STVMRYEARM GPKDYNHHEF QPVFLDELDP VVRREAEGLC GTQNLPCIYD
FLATGNKDFA LSTADQQKLS DDLMKQSKNS LPSVKVPAAA YVTVGVQKTV FIKGSDPDVN
DRLTYRIVDD AKGAVNINSS SGAITIFLKT LDPIKIQVYV VDEVGGQSTV ESLPLVVCSG
CEGHGLCNES DPRDVSGDSY FQYAVCQCQP AWSGNNCELD KDGCSDNPCH PLQTCADVPA
NEQGSSDVGY RCSDCPPGFS SRNGSQDCID IDECGNKALD KCDMICINTY GGYECSCQEG
FRLSTDGRTC ADVNECLEST HDCRQVCNNT EGGYRCECAP GYSYDEITKD CIIDSSLSGV
CQASGCSQGC SVATDPSSSS SSSNSIPLCF CYQGYDLDPR DNKTCVDRDE CRDKVCDQVC
NNTAGGFSCS CYDGFALNKD ERTCSACPYL KYGPECRQTC TCSGRSVACH PVRGCVCQDG
WTGKSCEEDV DECAENPDVC GTQKRCVNNK GSYSCVCRPG YATTDDGVCQ DIDECKLNIS
DCQQECVNVP GYFNCDCKYG YRLTADRRHC SKVNECTNSL DHCEMLCENT YGSYFCYCKQ
GYRLNNDNRT CSDINECTEH SDNCPQTCHN IDGGYRCACE DGYIYDDATN KCKIDPTVQN
VCRISSCSKY CRVKQDPSTN SSIPECFCDR GYELDSKDNQ TCKDHDECND SLCTQLCTNT
DGSFSCSCNR GYTLNDDGRT CSPCPHLHYG VGCHEVCRCR GRGKDCDKVR GCICNDGWTG
SQCQDDIDEC AGNSSICGQE GICINNLGSY TCLCRPGYQD DGQGNCTDIN ECTNGLARCS
GICNNTDGGY VCSCPAGEKL DADGKTCKGF ESFNFYRQGL LTVMYTLMLI VLIPPACPNG
TYGENCTHTC NCGRGAERRD NEIGCVCRPG WQGTDCDTDT DECLNQDVTD NCTLTHAVCV
NSPGGYKCEC HVGFLRDQFG VCQDINECPS SPCSQGCINT NGSYTCTCHS GFQFNTTIKD
CEDINECLNS PCSQGCVNTK GSYTCTCYSG FKINSATKDC EDVDECLNSP CTQLCTNIDG
DYYCSCSSGY MLVENGKCEA ITSIALTISI NIVVNVSDLV NEKSNEYQKW NESVTNSLEF
RTEYGSFMVL VRTLIVFLTT VIKLIHSTVS ASSQLTQYFS TNVVGFMSFV VTKLRAGSVI
ADGIATTKSS SVGSLTVAML HLTGSRLTIN NQTGDVQVAV QQNLAQSKSK CELFELISPC
GEGTECDSTA EVPICRTVEN SRSWVLITSL AVGIPAGICL VLIVSCCLVR HKRKTSKRAK
NNTSTEEIPR SRSSSQAHAY DTLQPAFSTI PRIYLQAAYC KAQVRDANVH NILEPTVEFV
LVLVVAVTTS TYMRWFWKSR AVVGGYGVFV VTL
//