ID A0A158P213_ATTCE Unreviewed; 3036 AA.
AC A0A158P213;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE RecName: Full=Multiple epidermal growth factor-like domains protein 8 {ECO:0008006|Google:ProtNLM};
GN Name=105627145 {ECO:0000313|EnsemblMetazoa:XP_012063821.1};
OS Atta cephalotes (Leafcutter ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=12957 {ECO:0000313|EnsemblMetazoa:XP_012063821.1, ECO:0000313|Proteomes:UP000005205};
RN [1] {ECO:0000313|Proteomes:UP000005205}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21347285; DOI=10.1371/journal.pgen.1002007;
RA Suen G., Teiling C., Li L., Holt C., Abouheif E., Bornberg-Bauer E.,
RA Bouffard P., Caldera E.J., Cash E., Cavanaugh A., Denas O., Elhaik E.,
RA Fave M.J., Gadau J., Gibson J.D., Graur D., Grubbs K.J., Hagen D.E.,
RA Harkins T.T., Helmkampf M., Hu H., Johnson B.R., Kim J., Marsh S.E.,
RA Moeller J.A., Munoz-Torres M.C., Murphy M.C., Naughton M.C., Nigam S.,
RA Overson R., Rajakumar R., Reese J.T., Scott J.J., Smith C.R., Tao S.,
RA Tsutsui N.D., Viljakainen L., Wissler L., Yandell M.D., Zimmer F.,
RA Taylor J., Slater S.C., Clifton S.W., Warren W.C., Elsik C.G., Smith C.D.,
RA Weinstock G.M., Gerardo N.M., Currie C.R.;
RT "The genome sequence of the leaf-cutter ant Atta cephalotes reveals
RT insights into its obligate symbiotic lifestyle.";
RL PLoS Genet. 7:e1002007-e1002007(2011).
RN [2] {ECO:0000313|EnsemblMetazoa:XP_012063821.1}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (APR-2016) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADTU01006956; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_012063821.1; XM_012208431.1.
DR EnsemblMetazoa; XM_012208431.1; XP_012063821.1; LOC105627145.
DR GeneID; 105627145; -.
DR KEGG; acep:105627145; -.
DR eggNOG; KOG1388; Eukaryota.
DR InParanoid; A0A158P213; -.
DR OrthoDB; 5471913at2759; -.
DR Proteomes; UP000005205; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0048513; P:animal organ development; IEA:UniProt.
DR GO; GO:0016043; P:cellular component organization; IEA:UniProt.
DR GO; GO:0048731; P:system development; IEA:UniProt.
DR CDD; cd00041; CUB; 1.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00055; EGF_Lam; 4.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 4.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR46093; ACYL-COA-BINDING DOMAIN-CONTAINING PROTEIN 5; 1.
DR PANTHER; PTHR46093:SF17; MULTIPLE EGF-LIKE-DOMAINS 8; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13418; Kelch_4; 3.
DR Pfam; PF13854; Kelch_5; 1.
DR Pfam; PF00053; Laminin_EGF; 4.
DR PRINTS; PR00011; EGFLAMININ.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00180; EGF_Lam; 5.
DR SMART; SM00612; Kelch; 5.
DR SMART; SM00423; PSI; 9.
DR SUPFAM; SSF57196; EGF/Laminin; 3.
DR SUPFAM; SSF117281; Kelch motif; 3.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS01248; EGF_LAM_1; 3.
DR PROSITE; PS50027; EGF_LAM_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Laminin EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00460};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000005205};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..3036
FT /note="Multiple epidermal growth factor-like domains
FT protein 8"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007629891"
FT TRANSMEM 2890..2912
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 34..146
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1292..1333
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1415..1469
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 1473..1607
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 1635..1671
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 337..371
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 337..365
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1437..1446
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1661..1670
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 3036 AA; 340654 MW; B396AD0CEC8CEC38 CRC64;
MGWTISTTAW IVIWSCLLPG FYLAPQASPK QVPCDKTRKV FTESWGIISD GPMGSNYTQD
SHCEWLIKAN HSRQFITLSF RTMGTECSYD YVFVYDGDSF RSPLLGSFSG KTEPQQVTSS
SGYMLILLYS DTNYVLDGFH AEFSVTDCPN NCTHHGKCIN NTCFCENDWG SKDCSRALCP
NNCNQAGICG IKRCKCKDGY SGQSCSLHKT HPEGNRWHWL SHSEGGLRPR AAHTAIYVQE
TDSLYVFGGY DLNYILSDLE VYRFSTSEWE DEYGSILEGA ASAEYLDPTL IATELQRHPG
AKEIYGLPTT SLFWKVLYSI KDNNTFGLLD RTADSQHSRE FRNMPKETEK SRSTIRIDRN
GDPRRKHPRN IRPRYSQMST RYRRNLENLY RERDATDMQN EDVKWEEQIV AGSAEENVFV
PEFTDSKLTT SDSLKEEVTE PTVDELPKPS PRYGHAACKY QDGFVIYGGK VEDGTLSNEL
WHYNVIKRMW TLRAKNSPFY PPSLTRHTLT LAGDYIYLFG GSTVDGEFSS SLYKIKLRLS
DPTAINERWI EVRPRGGKEL DVRVVAHSTV YHHATNSLLV YGGVVASVAR FSKLSDRMFV
FQLDRKVWSE IHYPRAHLRE TYVPRERAFH TCNIIGNYLV VFGGYSHRHN KEEICYDNQM
YLYHLGCHTW VSHEVLGLND KDSRYPKQQG VFAHAADVRN GNTLLLVGGY HGNVNADLLA
YTLPPMLAPG DEDYIEPEQI CSRHKSLMEC AANPECGWCS ADEICYGRTI GSNCTTNLQT
TRCPGVCPAL GDCHSCLIHG QPGGGWGTNF RGRKSVSNKL NLGTCTWCVQ NARCHHKDDN
YGVCGLRDDT PSQIPGWWGS KGTEITKAEE CREMDKRPGL TFLKYKPPVN FSQPDSVAIV
NATTVDFNVP SMQGAKTESA LGGEMIARLT GFLRPPNYFW DSAAEHLKIC VSYNSATLHV
SRNDDPDKLE LVANLTAETS QCIPTTWPGE DSMELQSGRY LLDFESKRMV TTSYAYASKM
EIVHNKNMEN GKVFTFEYLE PYQNGSCHQY NNCLHCLTDS SCGWCDITNE CLPRSVNETE
SCVKDIEWDK ERGETIREWH YLTITPSACA NCSNYISCES CVNTKLCEWW TEEARCARIG
RLPNAVVSLF ECPIPCRQRS NCTKCLDERG RCVWCEATQE CFSFSVYTSE YQFGLCREWM
DQAGLMGVTS RSGSSLTGND QCKSCSRHSN CSSCLHSLSC GWCYSLENPI TGVCVQGDFN
QAHVNCSAII NEYRNSSLNA DESGWAYAQC PDVDECDLGL HDCHPDALCT NTHGSFSCQC
KRGFNGDGKE NCTKTCYERC VNGYCSEAPD YKCECNLGWT GPDCRTNCGC YNHSTCLQGP
GICDECQDWT IGRYCEECKA GSYGNATTPL GCRECNCNGH GDVELDVCDR QTGMCFCRDN
TEGDKCQRCK RGYYGDPRNG GMCYYGCMSR GMLGGEGNGK QGLGSRHSQS SLWDNYVGDS
PTRECLWIVG PETELSLDAT TPTIQSVIQF TIHDDINVSC QENSVYVYDG LPEFVSSTGG
HQSQLLGVYC TESTNYPVTV EAKSGFLTVH YKQLDEVEGF NASYVIMTCN NCPGNRECRN
GNCLCKSGYV GINCNVEICP ENCTASEKRG VCDKGYGRCV CVPGYGGRDC SIPLKNYQLT
FTELFNSEYL ADHQDHLRKT LPRFGHSLVA DRRGSLWMFG GYSLSHGPLN DIRLFDTKNN
TWMQVTVEST SEASMPQGRY FHAAEIVHSR QQIYVYGGLS MKDEDVQGLS NNTLSDFWKF
SLQNQRWSQI VQDELKKEPL PLAGHTLTLR RDGESESLIL IGGFSPKYGY LDVIWEFNLE
TETWDTVNTV GNGPVGVYGH STVYHSKSDS LYVFGGYTYA INRTFISNRL YALNYKTRTW
SVLPPFEDEI TDGNSLPQAR FLHSAVTTDE YMVIFGGRQN PHNTSDSLIA YKYSCNLWIR
LITKDTETIG SPPPPAYAHA MTHADPESNA VYVIGGFDGG IKSHVTLINI PEDLCNLWKD
KITCRKYFGC SFCAVTTLNG TNISLCFSNE VSSNRGDKCD INVTQAQRSN GIFCNSDWMA
SRKCQTFRTC TECLAEWPYY KDNEPVCKWC TNCPHGKCIP SDKDCNDQTQ DMMIRCDTSV
SNVNQCGERL CPASDCEKCY GLDECVWTRQ VLKTNELGME LTGEPVYDWN CVKINIFERT
SIKMSSTQCE KRCSDHKDCR SCLKGTGAEG GWSECRWSTQ LNECISPSYQ PLYCAGGVCG
LVLRNADMDH CPEPCSVFKQ CSTCLKHSHC GWCSLDSANV TGQGICTEGS LEAPADHPAG
GTCEMLYRQH FPEVEPPITT TLHPYHVDSS ETQDNVTMSL LTTVSKPDFS WHYVRCPPEN
ECENGHHTCS PKSEKCFDLE EGFECKCGDG YKTETTWGND FGKKICVPMC TQGCVRGTCV
KPDLCRCDFG YVGSNCSIQC QCNGHSDCAG PDKLDNCTKC HNNTMGKQCE KCLPLYVGNP
ADNGQCVPCL EYCNGHTRIC INDNVTVPDP NSVDKMSIEK LSRQLEEGPV AKAKCINCGN
NTRGDKCGEC MTGYFRGTED LRDVCRPCEC HGHGFTCDPV TGEKCNCGNN TESEPSCMSG
PIKGTNMGGT PCWMVQCSKC RENYAGTPTM GHQCYKTVTV DNKMCFDSKL IASYVSFDEC
KMKPKPLNPG QTVFYMVQPR FMNVDIRVMV DVTQGALNLF LSPRDDSFVV ILNSTTGYQE
VELDNRFRLR PDQPSNRFRI VEFHPHLGAA VNGSTMEPIR WNTGQQYFVM EHWLEDNLAS
FLTIERRNTF LVIHNLTNRL VLTLPQDKHE LGQTKFHIVL QAIDPVNPEL NGRAAYGMIF
FRQDQLHIDL FVFFSCFFSC FFLFLAGCVV AWKAKQSADL RRARRRHVVE MLHMAKRPFA
SATILYDRDG GECSPNSPQR KGRRGKHVSF HSDVRPVAVE PTDDGVAAVA TVFIRLPGGR
QAPVKLALGS SLILLTRVYP VNSRVFLRRR NSHATN
//