ID A0A2A4JDL6_HELVI Unreviewed; 3737 AA.
AC A0A2A4JDL6;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=Cubilin {ECO:0008006|Google:ProtNLM};
GN ORFNames=B5V51_3588 {ECO:0000313|EMBL:PCG69876.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG69876.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG69876.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG69876.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG69876.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG69876.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01001864; PCG69876.1; -; Genomic_DNA.
DR STRING; 7102.A0A2A4JDL6; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00041; CUB; 23.
DR CDD; cd22201; cubilin_NTD; 1.
DR CDD; cd00054; EGF_CA; 6.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 25.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR24251:SF28; NEUROPILIN AND TOLLOID-LIKE, ISOFORM B; 1.
DR PANTHER; PTHR24251; OVOCHYMASE-RELATED; 1.
DR Pfam; PF00431; CUB; 23.
DR Pfam; PF00008; EGF; 3.
DR Pfam; PF07645; EGF_CA; 2.
DR SMART; SM00042; CUB; 25.
DR SMART; SM00181; EGF; 8.
DR SMART; SM00179; EGF_CA; 6.
DR SUPFAM; SSF57196; EGF/Laminin; 3.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 25.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS01180; CUB; 26.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..3737
FT /note="Cubilin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013354229"
FT DOMAIN 143..179
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 181..222
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 412..448
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 450..489
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 495..609
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 613..726
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 732..844
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 845..961
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 965..1078
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1084..1199
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1203..1316
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1317..1447
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1449..1559
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1563..1687
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1688..1806
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1810..1920
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1921..2034
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2048..2167
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2168..2288
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2292..2408
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2410..2532
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2536..2644
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2768..2875
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2879..2992
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2993..3105
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3107..3219
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3226..3352
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3356..3512
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3515..3624
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3630..3736
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DISULFID 169..178
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 212..221
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 438..447
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 479..488
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 613..640
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
SQ SEQUENCE 3737 AA; 416915 MW; 4A46E32BC5D1C69B CRC64;
MSSTLGKWLL LAAICFVPLD CEIYEDRPKI KTEGGNLILE PAFDKNLYLR VNGPKSKIFV
GDTNILGDNS INGNVPDDAQ NQIPNNGNSD TNLNGILQRL ERLEIKESTL PNDLLLNITM
LWRRVNNIRR KVINLQALLD EGTRDDCQSN PCEHGGTCLN IVRGYHCLCP SNWEGKNCDI
DVNECRNYAG TDLGCQNGAT CINRPGTYEC LCKSGWFGFH CTRKAKDCTG GDFEMCGHGT
CVQVSSGEGV QCICHQGWTT NGTGVACLTD VNECESYQGP RCSVNPKVDC INLPGSFRCG
QCPSGYEGDG YSCYDIDECT TIPNGGCSPL VTCHNTIGSR ICGLCPPGYQ GDGVTCTWRG
TCNIGHGGCH PSAQCIESAT PTGQSAQCIC PEGMVGDGMG ISGCYVPTSG NYTDSCASNP
CGVHGQCHAL RSGYTCICAR GYGGASCDSA ADACRANPCH NGGACRLDDS AAAGFRCECA
AQYTGSLCQV RTKSCGGVLD NEQGSIVYPL TNTTYNHNSQ CAWVIHTAPD KVINVTFSKF
NLEASPECMY DFLQIHDGRK SSSQLIGRFC GNNYPKGGNI VSSHNNLYFW FHSDKTVAKD
GFALHWTSIS PVCGGHVDAT THGHISSPGS PGPYPPNRDC YWHLSTSLGK RINLHFFALD
IESHSNCSFD YLAIYDGEHL TDPLIERYCN TTQPAPVQSA SSDMLIHFHS DAYGSGHGFQ
IAYAPMEGIP GCGGYYTTNT GELVSPTRDG LYLSNLFCEY KIKTSLDTKI KIEFKSFKLE
RSFRCKYDYL KIYDGPSSDS RLVGKFCGTT YPKSYTSTSN NLYFVFRTDR SLPSEGFRIT
YTAICENTIV GDSGVIKSPG YPFSYPDNKL CEYVIRTDPG KAIQLTFQDF DIEDNRYNDC
RYDSVEIRDG HDSNATLLGR YCGGAEHTPP VQTSTLNYMY ISFKSDLSVS GTGFYANYTT
IDTECGGIHR DTTGLINHPS SDATYKNYQS CKWLLIAPEG MHIKLTWNRF DIEEMPSCGS
DYLQLVEIDD NNENNVLGKF CGSRAPPALT TSTNRLMLRF ESDSSIRSSG FSVSYTFLDQ
KTHCGGAYIK SHGFIYSPGW PKAYEPNRDC TWTITVPVGQ QIMLNISQFH LERPIRDKCN
LGDYLEIRNG ATENSPLIGQ YCGSFESKRI VSLSNALHLH FHSDFYLSGT GFKLEWDGTI
TGCGGTLTSP SGSVSSPNYP NEYNENAECF YRIVTSAGSR IRITFTELDL ERTLNCRDDY
VQIYDGRDST ANSIGKHCFM SPELANIETT SNYAFIKFRS DIYQGGKGFL FNYQTICNTN
VTGRYGVIES PGYPSNYEMN LNCLWTIQVP KGNTINVTFT HFDIYRSRRF GYPAWRSYRP
YPNRVVYGDC QLDYLQTKEL SDPNYSEKLC GSTIPAMITT RSNALQIKFT TGIYVARSGF
RLEWISSGCG GHIIKRMGTV AIDRSKTNEK ELECEWLIET QVGKSIVLTF SEIYMLESAN
CTTDAIEIYN GQNTLAPLLT KICHRDYVSV QSDSNFMLVR LSKRSTLRDV HFSSDFRSVN
SKCGGVMNSK SGMIYSKNYP QNYDNNLDCL WYISVPAFHR IELNFIDLDL YTLFRSKIAN
DQSCGDTIQI YDNEYLSPTG ANNSFRICPS TPLNETQFVS EHNNIVVQFK TDAFGTAKGF
KANFTVACGA TIKAEHEGII QIDNFVHHGS TSCIWNILAD SPDKKIYLTF TLMSLPKDNN
VITNRTCPSS YLRVLDGDDN KAPIIGEYCG RKVPPMIVSR GSALTVEFGS YTGRINGIFA
AHYSPLSNTC GGVLTSEEGT IASPNFPLPY PVNTDCEWFL RSSPGNTAYV QFEQFDLRFS
EGCNDDYVEI RETNGAGRLL GVYCSSDIPS NHSTAAQIYI KFHGNSQPSG RGFLLQYGFE
RENDIRGDGG EISSPLFPTR YLGSGEYSWR VFTTDSDTVS VTIDVLEIYS HSEVNYNKLI
IYDGYDSTAP ILEDLTGVLV EPKVVQSSSN VIFITLKMDE SNAGSKFHMT WTKSSRDTYS
PNEAKINCGS NATVTILPGN TTAIKSPNYP QDYDDNLNCE WVFKSQLGRH LTVSFSEFNI
EETTACFADS VSVYSADTLG NWKPLIEDTC TSEAVSAGLN SSTYMKIKFK TDSSVVRKGF
VAQVSSKCGG IMTGLSGEIG PTWLDVQTLY RYRFKVQCDW TVKVRPGRTI KLDFDHFNIT
NKKDGDCETY VILRNGDSAD APLLASGKYC GFDHEIKADI TTSSNALFVR YVKSTYDFQT
FKINYEEKSF ECGSTASLTA DHPWEIITSP KYPEIPVPFS ECEWVFSGPP GEILRIDFID
RFDLLDSEDC DKEVVEIRMG SSRFSPLNGR YCNDRPPTIK SIDNTMYIKY STALTEPRNG
FKANISIDIC GGTIISDAGE IKSPGYPHMQ VLPYGTVCTW TVIGPPRHVF RIKPEDVQLP
LSESDCATKL TIEEALPANN TITILRTLCN DYNLAPIETS RNEFNVKLYI GKPDAWDQTS
QNRGFKISFN SSRPMCGGTV TSREGFLTSP SYPLETTLRY CQWIIKVPDK SRKVRLEILD
MDEERHRIGI FNDITFKTIV QTIPNKDYIP GTKVFESSGN TMAIYVWLNR TAPSHRFKAK
FSSDEPALCG GELTDRNQVI LSPDLNRSYT CEWHYNSPTT YNDYPTTNLT YNSVYLTGSV
NSSMSRTRCR FFDPQIFIKS KDDTRFTREI CGNADVNIRI PSQVLDITAT KSKLNSLYFH
LEFNSQPCGG IVRVGYDPVN ILNIPEFYNS TLDCAWIVTG PSNRVEVKIE GSFTFDCEGE
FLKISTSLRQ DAPIIGDYCK GKALDTLLTH FRYLYIQYHS NIKNTTNLKL MVRSVTEQCG
GLLSSYQSTF ATPNYPKHYL PDQECAWEIR ADVGFRVSLK FVERFVIEDT ANCTKDAVII
YDWKDDVFTE IARVCGRTTP PIYNSTFNRM KVVLRTDADK NLDGFKAEWE QICGGQYTAT
EKEQILYSPG FPYAYHQNMY CHYDMLAPGN KIVLKFLDFE LEGGAPDCLS DNLTLIVDSN
YNYDYRVYCG SDIPPITTES DKVSLIFQSD KYIQRRGFRI SYAIYSCGGR VNSTTVLSSS
LSEIYETNLN CTWFIEGPAT KRVVLKFLYI DLESQRDCYS DFIAVYDGFV IDEEKRIGLL
CGHFNSTTVL RSKGNTALLQ FSTDPSINYR GFKVEVYFSY SEAAQCGGYI NLTSGSSQTL
KSPFMGHPVY ENFLDCDWSL ISSPDTVIKI EFTSFHVSPC QNVNQTAIGY SKCDCDLVEI
KDGLNPNSLV IGTYCGHSLP PQLTSSGNTM SVRLATDGEI GSQGFVATIT TQRALCGQSN
LVVGETPQRL KSPGYETGSV PRGMHCVYLL DASSNPYQLL RVTVVTMDLR PPVAEGADVN
RCNRDKLIMA SSTLHPNVTI GKDYILNNQD SDFFSRNSFY DVDLHFPTQF EICGHREGTD
FYLYGSTSLN LITSSEIDSN VYKGLEIEFA YVGFCGRNYS EPNGRIQSTN TNSPINPSDC
YTLITAPVNY TVSVYFISIV PLYWNEDCYF EIFDGNNVTA PRLLKIYSEF ESNTPVFSTG
RYLLLHNHEN DNDRISFDLN YVTTNKGRGC GGKLQSEVGQ VTSPMYPNIY RQISTCEWEL
ETPTGTHLLL RFSVFDLGIT CDQNYVKLVD SKGVVVRTFC EENPADYMSP DNYVKIVFVT
TMNNGGTGWV ADFIGLE
//