ID A0A016TA31_9BILA Unreviewed; 3228 AA.
AC A0A016TA31;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN Name=Acey_s0122.g1086 {ECO:0000313|EMBL:EYB99506.1};
GN Synonyms=Acey-lgx-1 {ECO:0000313|EMBL:EYB99506.1};
GN ORFNames=Y032_0122g1086 {ECO:0000313|EMBL:EYB99506.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB99506.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYB99506.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001458; EYB99506.1; -; Genomic_DNA.
DR STRING; 53326.A0A016TA31; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0016810; F:hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.370; Glycoside hydrolase/deacetylase; 1.
DR InterPro; IPR006150; Cys_repeat_1.
DR InterPro; IPR006149; EB_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR011330; Glyco_hydro/deAcase_b/a-brl.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR002509; NODB_dom.
DR PANTHER; PTHR45985; -; 1.
DR PANTHER; PTHR45985:SF11; LIN-12 AND GLP-1 X-HYBRIDIZING; 1.
DR Pfam; PF01683; EB; 17.
DR Pfam; PF01522; Polysacc_deac_1; 1.
DR SMART; SM00181; EGF; 31.
DR SMART; SM00289; WR1; 23.
DR SUPFAM; SSF81995; beta-sandwich domain of Sec23/24; 1.
DR SUPFAM; SSF88713; Glycoside hydrolase/deacetylase; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
PE 4: Predicted;
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT DOMAIN 1791..1828
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2479..2516
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 254..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 498..524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2270..2312
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3228 AA; 346841 MW; EC6CDBD08580742D CRC64;
MLVEMLTLRT LVNKWYQRSS SGVRPLCRSF YTLERATTFG GSRTGSFLTS FLIRVGRFRS
LDSISITFEI VIRNRIMREL LAAAFVFTFV IASDVGLNLP CNREMDGLLA ADPDGNPSAF
LSCQSNGAGS TGYWERRVCP DEMVFDFINQ RCQQRKHRKQ PMLNIAILNN SCAHGETCIG
GTVCDLERLR CLCPYGTVPQ LETLSCIKPH SSYGGFGNFE KSPSTPAGFL PNPSQTNPPP
FTFNFNPLFG KETYPNYGNQ NSNNKYGNAN GASGNEFSSL PSGNNKYGAN GQWNGAASGS
GNNAGGYNGV NSFSELLPKP GVESQPFVFK PNFFASTSAP VPTKKVPTLA RPGQSCRDNE
MCIGGSLCTL PIALCLCPGE LEEKDGECVL PASATIQIEK VGIGALCSDL AECDHGSTCV
MGRCACVAPL IQHEGRCVLR QERKEVGPGE LCDNGEVCVR GSVCDTVIPV CVCPPNTDLS
NGDCVHISSI RPAIAAPAPT YTPPTAPAPQ LPQLPATQLP PQPQPVFLPS TTPAPVFVPS
TTQLPQQPMQ PSPFGGHYGQ SQNVYQEKPV TAPIYPAQPS HVTYSNYQST MPPPTVRPTS
KPNLMKISLG GSKQSGVGVP CSLNTDCMIG AYCNGNTNPP SCQCLSTHVN IEGRCERVVY
PGQVGCRSDL QCSAAYTGTK CVDRICVCPE GYKEVDQTCV PEQANPNERC GYVHGHPECS
KGFSCINQIC VCPTTHTITN GFCSSNTTDH PLTQCDEACE PPRKCLNNRC VCPDGVSCAS
AEVSRRRRRG VVEQSMVCWP GASQCSAGNG VCIDNVCHCT HGFVEVNGVC APEIVRIGEN
CDPNGISPRC PENAVCKDGI CQCATPGGCD RDTFMAGSRL MDGRCTMDRE CPDGQCVAGR
CQCNDGFALQ NGACTSITGA FKNINGQCTA DDRCSGGSTC RGNVCQCVDG SSELHGRCRQ
SPGGRCSYGQ TCDGGSSCEF GLCRCPDGHI IDAGKCVMGK AEPGKSCQHG QKCVHGSVCR
FGMCMCIAKY MASKGRCVRR ENVLVPTSPA PSSTTSSATT SVRVGAVKGP GFECHEKDLC
SGGSRCRDGF CVCNEFEVII NEQCVGSHEQ ANEIIDKLLV SAPGQPCDAR TNCTGGSICI
NRTCTCEQGN IDNSGTCTEA KKEYGTNAAP PKKSPDQFQP GFTCTLTIEC PYRTECLRGV
CRCKKGETIV DNTCRKAIHQ VLPGGKCDPR KGYDCVGEAH CFYGVCTCTR HLVNNGKECA
TIEEVEMVTP GKRCGLGQTC SGGARCLDGF CRCPEDEVPD VNKKCVKKSQ VYPVFNKYPT
TSEATSIYYP NPVTATLPSA SANSALTLDK AKELEAFEAL LKANPSFSEI DAKIAQTIYG
HICRSKDECP ANSFCFQQLC RCMLGYRATG GYCEPITDGC AISNNPGKPC GTIAHPGEDC
TRNQVCSYNS YCGLFSGVCE CPSGMATMNG RCERTTVAPG LGCVTSKNCH SSSYCDNGLC
LCKTGYQLIN NFCVLPESAG GAAAGAALKQ SNPNSQEGSI LPRTIHSYPS GMIPLDFTKP
PFNFASKPEI KTVYANVNSQ QATSSSYTPS QVPRFQSFPV PFAKGASTKP GDVESERTKS
TSRLRIAMPG DYCGDDSICI GNSLCQKQFC RCPPNTFAEN GICSIRRRLS PGPKHSNDHE
EFVDLNSQED SAESRQFAAP LENCQNFEFC TGGSECLSIQ GMGLVCQCPT NTIFLEDECV
DAPRNAELAG IGESCQDGEI CLGGSRCIQN ICMCDDDKHD ILGICVTTAR PGDDCSDGQI
CVDGAVCAAS VKTCVCPPGR MSKMGRCVEN GQPQDSTSRM AIPGEPCGSQ SSCADNSFCS
GERICVCMPR FANLNGRCVP SNMVRSPGEE CHLDNMCTGG STCSGGACTC PLGQLLMDAR
CVHVSDEIRR RPSQNECEVD GDCAEHYQCV NRMCVCHGDF SRCLRMVLLR AEQSCREDAH
CPEYATCNEN LCVCNDGYKM IQGKCIQRNL LKQKVTMKGT KTVKDIAMKI TKPSGPGAIC
GESHHCAMGS VCFHGYCVCG LESVPRNGSC VSRSGNIGFG EKCSEHYKCR DGLSCVQDRC
ACLPDNLSCN ESEPVTSPPG GACTDSRICV GGSVCREGWC ICPDPTMIVQ RGICIQPGPR
PTLPPSTASI NSQVHQPYLP RTAYQNVPGA GMAAVNQYNI QQGRKIPPGA SCGPLDTCVG
GAMCVDGMCV CPSGMQASAQ GRCEKASTPA TSTAMILNPG SVHATEFTYQ HTHHLQSRPR
PSPTSSSSSS SYAAAAAATQ HVTHAPTPTL TNANTASEAD ECAAIGLYCR GNTVCRNLSC
QCPDDYVLHH DGCVPPDEVG RRKVLGKARH QGTPTYARPG QRCANGETCV GGSVCNEMMM
CACPPEKPIL QGDACVAQQY RKIATPGESC DENTECTKES SCHGGQCRCQ YGYIAVSGQC
VALPMPTTPA MKNVVLARPL DSCDNGEQCE GGSSCDQETG VCMCPPGYIV YGVQCQPPPQ
STAMQPAATP IPQAAPTFAV SMNTECADDT NCGANKVCVV GRCKCRPGFV DHDGVCEPLE
SVTLRKLKYG QTCNPAMDTC IQGSSCIDQV CKCGPGFALS PNGWCERFDF NMEGRTSPSY
SSFQENYYTT TTQQSIVIHT VPDEHPLFET TASQSSSRSS STGPKVIRHR FLGTKCRDND
VCINGGECRE GVCRCPERLY ERDGRCLRAD QLPRAAPSES CAKGEMCTGG SICDNDSKTC
ICAADHLIVD GICRSKDATP FAAPGQSCSR GEDCSGGSYC ADGICQCDSN HFAEDGYCRH
IASRSSEIKY VAGNGLRFSS KAFAPRIPST PCNETTCRLP DCFCSLTGRK PPGGLAPNTI
PQFVVLTFDD AVNGRTLPDY RELFETVKYR NPNGCPVKAT FFISHEWTNY DAVQWLFQQG
MELASNSISH VSLEGTNANR WLNEMDGQRR IIAKFANANE EEIVGMRAPQ LALGGDEQFE
MMARAGFLYD NSMSANPGVN GDPFWPQTLD HSVPWDCYDA NCPKSSFPGI WTVPLNQFYG
TYLPQIDSFR RSSMVRAAVD LNTTVDQLTN MLFSNFDRSY TASRAPYVLA LNADLLQLNG
RNTGMQALQR FLEEVLYRKD VYVVTLKQLI QWMKNPVPLS QITQSEAVKC SQGPLSQYPA
ISQRSCSKPN KCMYRTPGLG SQEHQFLTCS PCPDQYPWLD NPIGNMTP
//