ID A0A158N9I8_ATTCE Unreviewed; 2272 AA.
AC A0A158N9I8;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
GN Name=105617240 {ECO:0000313|EnsemblMetazoa:XP_012054196.1};
OS Atta cephalotes (Leafcutter ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=12957 {ECO:0000313|EnsemblMetazoa:XP_012054196.1, ECO:0000313|Proteomes:UP000005205};
RN [1] {ECO:0000313|Proteomes:UP000005205}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21347285; DOI=10.1371/journal.pgen.1002007;
RA Suen G., Teiling C., Li L., Holt C., Abouheif E., Bornberg-Bauer E.,
RA Bouffard P., Caldera E.J., Cash E., Cavanaugh A., Denas O., Elhaik E.,
RA Fave M.J., Gadau J., Gibson J.D., Graur D., Grubbs K.J., Hagen D.E.,
RA Harkins T.T., Helmkampf M., Hu H., Johnson B.R., Kim J., Marsh S.E.,
RA Moeller J.A., Munoz-Torres M.C., Murphy M.C., Naughton M.C., Nigam S.,
RA Overson R., Rajakumar R., Reese J.T., Scott J.J., Smith C.R., Tao S.,
RA Tsutsui N.D., Viljakainen L., Wissler L., Yandell M.D., Zimmer F.,
RA Taylor J., Slater S.C., Clifton S.W., Warren W.C., Elsik C.G., Smith C.D.,
RA Weinstock G.M., Gerardo N.M., Currie C.R.;
RT "The genome sequence of the leaf-cutter ant Atta cephalotes reveals
RT insights into its obligate symbiotic lifestyle.";
RL PLoS Genet. 7:e1002007-e1002007(2011).
RN [2] {ECO:0000313|EnsemblMetazoa:XP_012054196.1}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (APR-2016) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADTU01009655; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_012054196.1; XM_012198806.1.
DR STRING; 12957.A0A158N9I8; -.
DR EnsemblMetazoa; XM_012198806.1; XP_012054196.1; LOC105617240.
DR GeneID; 105617240; -.
DR KEGG; acep:105617240; -.
DR eggNOG; KOG1217; Eukaryota.
DR eggNOG; KOG4297; Eukaryota.
DR InParanoid; A0A158N9I8; -.
DR OrthoDB; 2880384at2759; -.
DR Proteomes; UP000005205; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00033; CCP; 11.
DR CDD; cd00054; EGF_CA; 5.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 11.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.10.50.10; Tumor Necrosis Factor Receptor, subunit A, domain 2; 2.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR003410; HYR_dom.
DR InterPro; IPR001759; Pentraxin-related.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR011641; Tyr-kin_ephrin_A/B_rcpt-like.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR19325; COMPLEMENT COMPONENT-RELATED SUSHI DOMAIN-CONTAINING; 1.
DR PANTHER; PTHR19325:SF573; SUSHI DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00008; EGF; 3.
DR Pfam; PF07699; Ephrin_rec_like; 2.
DR Pfam; PF02494; HYR; 2.
DR Pfam; PF00354; Pentaxin; 1.
DR Pfam; PF00084; Sushi; 10.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00895; PENTAXIN.
DR SMART; SM00032; CCP; 12.
DR SMART; SM00181; EGF; 8.
DR SMART; SM00179; EGF_CA; 5.
DR SMART; SM01411; Ephrin_rec_like; 3.
DR SMART; SM00159; PTX; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 11.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 6.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 5.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 6.
DR PROSITE; PS50825; HYR; 2.
DR PROSITE; PS51828; PTX_2; 1.
DR PROSITE; PS50923; SUSHI; 11.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000005205};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..2272
FT /note="Sushi, von Willebrand factor type A, EGF and
FT pentraxin domain-containing protein 1"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007628825"
FT DOMAIN 95..276
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 393..453
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 467..525
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 526..592
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 591..673
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 674..759
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1161..1197
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1199..1240
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1242..1278
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1280..1313
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1316..1358
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1360..1396
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1601..1663
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1664..1721
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1722..1789
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1790..1873
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1904..1964
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2087..2151
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2152..2214
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2215..2268
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DISULFID 395..438
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 424..451
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 496..523
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1187..1196
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1230..1239
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1268..1277
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1348..1357
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1386..1395
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1692..1719
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1760..1787
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1935..1962
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 2185..2212
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
SQ SEQUENCE 2272 AA; 255486 MW; 879B9FA357394731 CRC64;
MGFSRILPLL IGALSLFSNF AECVDDDNGG SINSVENNTN DNNVSSTFLN FDQKTNEDQR
KHLDDTDRML SKMDVLSRLL KMHIDQLRNK TDQVEMVFLV DASGSVGAEN FRNELNFVTK
LLSDFTVDAL AARVALITFG GRGSVYRNID QISRHGPNDH KCYLLNKQFS NITYSGGGTY
TRGALLEALA ILEKSREAAS KMVFLITDGF SNGGDPRPAA HLLKNTGAII FTFGIRTGNV
EELHDIASHP EYTHSYLLDS FAEFEALARR ALHSDLKTGQ YVAVTLPTDC NSLCSDSMNR
TCCDELATCT CGTATGHYAC ICPPGYFGSG LRGFCQPCPN GTYASTNTSG DSTAACVSCP
DANHITIKVP ATSVVDCVCA SGFITDGYKC EAITCPKLRI PDNGYLVKAS ACSNVVHAAC
GVRCRIGFHL TGDSIRLCGK DGAWSGNEPQ CLLKTCSALR SPTHGRVKCE HDEDYQQQFQ
ENSTVYPIDT RCQFRCDVGY QLRGSKVRNC LPLSRWDGLK VTCKAVKCKS LPQIANGDIT
PEICIGPAKV PFATNCTITC NEGYILEGPT DRMCIGRTGI WSQRHSVNRC VDKTPPSLEC
PANIIEETEK GQNYAYVNWT VPKVTDNADA PPIVWTKPHI VLPWKAKIGT RNVVYVAQDA
SGNKARCKFK VKVLDREPPT IENCIDPPTL YTDFDSGLAN VTWDEPVFYD NSRIAVRVNQ
SHQPGQDLFF PVGHTKVFYN ATDKYGNRAS CVLNITVEDV CKSLKAPTNG RLNCSSSDDR
ETQCVVACED DYDFAIEPMN FNIVNNELLL KCNSSNHMWD SNYLPECSET QTPITISQEG
DVILQSNGSA ICDNQPALGE LNKNIANDLK SKFLEICDND IECDLVSFDP KCEDDLSLSK
DIEDNLIRRR RFEPKNKQAR STDTFKDTET TLFERLKRAA IRLSSEPNKN NTRSKRKRNR
IEIKFKFIGK IIEENYENPK RGVQKLRERI DAMTQVGKLN LLDNKTNQEI AKLALNLYFV
FKEPQDLCDL GSVLKRHGCV KCPAGTFYNS STRTCQPCPF GEYQDAIASL TCVPCPEYTF
TKRMHARTLK DCIPVCRPGY YSRRKRYHGS RVGMEPCFAC DIGFYQPNYG QSQCLPCPSN
VTTEKRGSID ISDCLPIRDE EIDDCRTDPC LNGGQCLRDE SGYVCECREY YVGLKCEEFK
DPCDSSPCLN EGMCTTWQYL NNSVMYECVC KSSYTGDNCE IYVDECYTNP CQNGGRCMST
ENDFVCECRD GFEGQFCEVS MDHCEHMPCE EGSVCRTVNG TWQCLCKPGF LGRHCNLLPC
DWLPCHTNAI CVNVKEENAT RKSYRCECPD GYTGEDCATK INHCEYSPCL NNGRCINFVL
DYICECPIPF TGRDCEIELS SDYVMHFTKS GTTDYVATKG PARDFLQLSV CLWLQSLDTF
NYGTILSYAT TFYDNAFTLT DYNGLVLYIN GEKIVTDVRV NDGNWHFLCV TWESESGSWR
VFVDGILKDN SIGLAQGAVV RANGSLVIGQ EQDRLGGGFS ESEAFLGRLG LLDMWDVVLN
ESDVTKLWNS CEKYHGNLIA WAQMRQYIHG DVVILSSPFC HGCPLPVMPF KGNIKVSEDL
SEITYYCDNG YVVRFGNKEY RSVRRKCLKH GQWEGYNTPI CMKIKCGFPG YFPRGHIYGK
SYLFEDEIYY SCNEGYELRG NPHRICNSDG KWIGLPPICI GMTCKNLLAP ENGDIEYILE
ENERDDVTIL QAGQQLEFKC NPGYRLIGER YLTCLDIGIW DHKRPSCTPY GCPLPKQIEH
GYIIPSNSDQ TSVRDPERNI IDNSSERTYH YNDIIGFSCH RGYKFRNNHT LTEFKLQCSA
NGTWTGFIPD CVPRTCPWPD RVADARMFLK KRDNITVEIP MEEDATWKPD RRSNESENEI
SPETFISGAE ILIVCDLGYE LVGDQVRMCT EEERWSSTFT SCKPRNCSIE EHPIFKFFKK
LGNETALENS NTDVILFELD EKRYKKNVTH QYKDFDIFVE RNSYKGRIVL TCRNGAQMNF
HKLIANETIS NITWMCNTIA KWEVSNLLMK ESILEQLLND STDICNRSCA PPQIPEYGYI
DNGNNTDNVN NRRTINSVVI FKCRHGYILE GAEQSICLSD ARWSALPSCK PVACGKPPIL
ANAILKSDVD ETQNYTFGNM ISYQCVPGYR VFGQANLRCL GSGKWSRLNG RCSKISCGKP
QIQHGIALYG RSYLFQDQLT YICLDGEKKG MITCQANGKW NELPKCDGNR NV
//