ID E2B4S3_HARSA Unreviewed; 3712 AA.
AC E2B4S3;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=Cubilin {ECO:0000313|EMBL:EFN89302.1};
DE Flags: Fragment;
GN ORFNames=EAI_06193 {ECO:0000313|EMBL:EFN89302.1};
OS Harpegnathos saltator (Jerdon's jumping ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Ponerinae; Ponerini; Harpegnathos.
OX NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237};
RN [1] {ECO:0000313|EMBL:EFN89302.1, ECO:0000313|Proteomes:UP000008237}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN89302.1,
RC ECO:0000313|Proteomes:UP000008237};
RX PubMed=20798317; DOI=10.1126/science.1192428;
RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G.,
RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D.,
RA Wang J., Liebig J.;
RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos
RT saltator.";
RL Science 329:1068-1071(2010).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL445595; EFN89302.1; -; Genomic_DNA.
DR STRING; 610380.E2B4S3; -.
DR InParanoid; E2B4S3; -.
DR OMA; RGFTVRW; -.
DR Proteomes; UP000008237; Unassembled WGS sequence.
DR GO; GO:0005768; C:endosome; IEA:UniProtKB-KW.
DR GO; GO:0005765; C:lysosomal membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0031419; F:cobalamin binding; IEA:UniProtKB-KW.
DR GO; GO:0008203; P:cholesterol metabolic process; IEA:UniProtKB-KW.
DR GO; GO:0015031; P:protein transport; IEA:UniProtKB-KW.
DR CDD; cd00041; CUB; 26.
DR CDD; cd22201; cubilin_NTD; 1.
DR CDD; cd00054; EGF_CA; 5.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 26.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR24251; OVOCHYMASE-RELATED; 1.
DR Pfam; PF00431; CUB; 26.
DR Pfam; PF00008; EGF; 3.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF07645; EGF_CA; 1.
DR SMART; SM00042; CUB; 26.
DR SMART; SM00181; EGF; 8.
DR SMART; SM00179; EGF_CA; 6.
DR SUPFAM; SSF57196; EGF/Laminin; 6.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 26.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS01180; CUB; 26.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000008237};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 122..158
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 160..201
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 393..429
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 431..467
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 473..606
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 610..719
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 725..837
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 838..954
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 958..1071
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1077..1189
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1193..1304
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1305..1423
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1425..1537
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1541..1653
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1654..1771
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1775..1884
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1885..1998
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2016..2137
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2138..2252
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2256..2373
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2376..2493
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2497..2610
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2747..2853
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2855..2974
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 2975..3087
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3089..3203
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3210..3331
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3335..3477
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3479..3583
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 3589..3701
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DISULFID 148..157
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 191..200
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 397..407
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 419..428
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 457..466
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EFN89302.1"
SQ SEQUENCE 3712 AA; 413518 MW; 8FC4439D025962C9 CRC64;
RPVLESRDGH LIISSSKDRN ITLKIIGSGY VNVNEINLLH VASAAQNATR VIDRWRMGYL
AEMESSLQQL TTIVTGSTGL QRRVALLERN IDTNITNRIG NRTRIILKEL EDSLRAMQRL
LRENECQSNP CQNGGTCEDL YDAYQCHCPS NWEGPNCMTD VNECARFLGT DLGCQNGATC
RNLPGSYRCD CLPGWFGLHC TQKTSICNTE NSEALCGEHG VCVAKSSSPQ GYACICDQGW
ESDGTSPACT KDVDECAADH PPCSVNPPVP CRNTRGSFTC GACPHGYSGN GYYCTDIDEC
LINNGGCSIS PYVSCMNTMG SRVCGSCPIG YRGDGVSCIF VGGCSINNGG CHLLATCTEN
PSLTSSYVLC RCPAGYVGNG MGPNGCQLAD VSVNTACSSN PCVHGTCVPN GANGFTCTCS
PGYSGVTCNT PADPCMPNPC RNNGVCVVLN GQATCECTST FTGSRCETQR QACGGVSRNP
VGVLQFPIGG STYQHRLSCA WVLITEPTKV LNITFTAFHL EQSTDCKFDF LQVRCSSGSQ
GSLSSTFTPE IHDGKNAGGQ MINRFCGETL PNGNGNIVSS HNSLYLWFHS DGSISHDGFT
FHWNSIDPIC GGILENDYGT ITSPGSPGRY PPNRDCFWRI FVPESKRIQF HFGQLMLEEH
PTCQNDYLEI TGIEDERLGL YCNHTHPPPL ITPTSEAKVH FHSDGAGQDV GFQIHYTMVE
GIPGCGGTYT TASGTISSPG HSSTYLPNMQ CEWKIQLPPG ERIRAMWLRF DLEESLSCHF
DYVEVYDGPH TSSELIGRYC GSELPSAIKS TTNTLVVVFS SDWSFESEGF AISYETFCGG
EFRDETGIIH SPFYPNNYPG SKTCIYEIIQ PVNQGIVLNI LDMDIEGLST DCFYDYIEIH
DGDNENATKL ATLCGNEYHI PPTPFISTHN YMFIKFTTDN SLEGRGFKAN YTTIDRRCGG
LLKTSGEIIK PPTEDGHYLN DEDCIWTIQA PPGYGVQLNW LSFNLEMHRR CVLDYVNIYE
NYTSPSENII ATYCGNKVPL GFTTQSSTVT ILFHSNTYGT SDGFVATYLF VDLTKICGGH
YMKSTGVIRS PGYPDDYENR KECVWTIEAQ ARHRIILTVN HFELEDHSSC IFDYLEIRNG
GYETSPLIGR FCGEEIPTEI PSQASQLYIK FVSDFSRRQK GFEIQWDSTT EGCGGTMNAV
TGDIISPNYP EPYLHNAECY WKIAVAAGSL VQITIVDLEL EHNERCRFDF IEIFEGISHR
VRKGRYCGAS YPKIIQSATN EMTIRFRSDF TNSARGFHLK YETQCHNRLH GFYGVIESPN
FPNKYEHAMN CSWIIEAPIG NKINLTFSHF DLEGRGDSDN ICQFDYLEIK EGEGDTPNSE
LGRFCGSVNL PLKISSTQNQ IFINFVTDSF IAFNGFRLEW LTHGCGGILS KPVDNFTSPG
YPSGYPTNIV CEWLIDVDYA HSIELTFHEV NTEKHKNCYY DKIQIYAGQD ADAPKLTELC
HTEKPVVYTS PGNKMFVKFQ SDISYAGRGF RASYRTVPIK CGGKFTTSSG IIHSANYPQN
YPHSQNCEWL IEVDNSHVVN VTFLDFDIEN SRNCTDDYVK IFDGPTKDDA LLGTHCRNQL
PPSYISTGNQ MLVVMRTDSI ISAKGFKAQY SRACGARITV KDHGSFTYPN DNGDNGNCTW
ILTAENPADH VTLTFTHMEV DPADFSRIWN DSCYLSYIEA FEGDSTDGPS LGKWCNNVAP
PPVTSTGSSL TLHLFIRYDF HGHFAATYSV LNTACGGNYT SEQGMITSPS YPNSYPLNSE
CVWILNTSPG NRISLTFTEF DVETSENCDL DYVEVRENSG IGKLINVLCG KDAAPITSSS
KLWLKFKSDD SGTAKGFVAE YSFIGGNELQ GPLGRITSPL YPKPYRRTAN FSWRITVDME
SIIQIQFRDI KIENIVDTCI FTSVTVYDGY DDEAPILIQA CGFSVPDPVE SSSNIVYITM
TSDYIRQGNW FDLTWLEIPK DVPSAQDTKE IKLSECNKEV ALMGEHNYTY SFSSPGWPIG
YAHNLRCNWV FTSPPGTHLV LRILSMNLEE TMNCIADSVS VYSGLALIST SDAHLKSKLC
LANSTMTSIR TTNVMTVKFE TDSSVNKTGF NAYVYRDCGG NLTGPNGIIE HDNSSSRSTW
HYTCDWTVEV KPGKTIKVDI IDMSISQTAD HTCNDNYLLL KNGGDMFSPL LGNGKYCGEV
LPAELQTIGN LLFVRAVRNG PHLRFKLTYR EVSMSCGGEF ILTNKQKKWE ITTPNYPNIP
PTYAECTWKA IASAGERLSI HFPERFDLSY STDCEREYVE IRDGGTDTSR SLGKFCKDVA
PNSKTTTGNM MYIHFYTDLP EPKNGFKAVI TSGDVCGGIL RGVSGVIESP HYPHFYPTNQ
SCWWWIIGPT DHTLKLQFRD IHLPGYRICN ATDHVEIGEK TVETDKPSSI GSYCGLLKPD
VIETSSNEAY VMFVSDNRDY INYRGFSINF TATQETCGGS LTAMSGIIKS PGYPNPRTRP
RYCDWRIELP QGYQVVVNIL DLDIVSLSGP THVGYSLSFY NDFRYKSKIK TLGRISHDST
EEISSSSNTM IIGYWSSAGH RGFKLRYYSR VPAPCGGRLS GREGNITGPT TRPFNESSYI
CNWKLVSPLY RFIPGNLAHF TNIANTANNN FTLTIKVVGD LGEFEISPRG KCIYPEYIEL
HGVGVLCGNI TQPRYLRSPK PVNELTVMNG TFGKHMSYNI QYQWQPCGGI LQGPSHTVTS
PRNISYPINC VWHVNYPSNS ETISLTFARF NLGSCEQGYI IIRNGGSTSP EIGKFCGNIK
PYNITSASNQ LWIEYFAADE PNDFEFNLNV ASVGCGGTLR GLSREILSPE YPRQYPNNSE
CTWEIMGDNG YHLGLTFIER FSLETSPNCE KDYVQIFDWI NGEESSNSEW KSLGKVCGRN
TPAPFNSTSN RMKVTFHSNE AVQSDGFRAV WNENCGGVFQ ATKNIKMIQS PSYPYSYPPN
AFCNYTIVAP NEDIIVEFTD FHLERGHNCR FDNITVITGD LYELQTVDTY CGNNKPAIAR
SISRIEIIFM TDKFVQRSGF QFKYFLNQCG GIITKPSELK PLMHGEEYFG RMNCTWVIKA
PQGKSVLVRF EKFILEFSTG CYFDNVAIYE GDLVEKDRRI ALICGNLTEN LPTFKSEFNS
MVVNFNADSS RHFEGFTAKV LFTTSPAEGC GQIINMTSTQ SKSFRTQQAA TYQAFEECHW
TVMTSPGKNI RFTINSIDIK NSTNSTGNTN KCTGDFLEVR DGAGSYGDLL GQYCGNVQPP
PILSTLNMLW IRLYTDGTAE GAGVTATLEV IDSTCGLSSR TINETRQVLT SPGYPNTYTP
GLRCRWILRH PDTYSDRMRI QFIDFDMEDS SNCENEYLEI SEEMKYINEG FGKNFIYNGV
QKHPITIEIG SRFPYSTYKY CGGQLPHDYY SYSNSIQVVF KSIFNGHKGF KLEYSMASCD
RNYTSEQGRI IHQGFTNCLI TITVPENRTI SLYFNQFSIY DSEHCTHYAL QVHDGDSSGP
LLTTLCSGAL PSPIFSTGNK LTLRSWAENT NSYQSYDIIY TTTDAGRGCG GRIFNYGGRF
TSPLYPNIYR NNTVCTWDVS VPRGFKVILE FAVFDIGTRK NCENNNLKIY DAVPSGELLS
NTYCGGDDPA RFEAASDRIL VRYTSTVNNI GTGWVITFMA QSTTKPIDIV NN
//