GenomeNet

Database: UniProt
Entry: A0A016TA31_9BILA
LinkDB: A0A016TA31_9BILA
Original site: A0A016TA31_9BILA 
ID   A0A016TA31_9BILA        Unreviewed;      3228 AA.
AC   A0A016TA31;
DT   11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT   11-JUN-2014, sequence version 1.
DT   27-MAR-2024, entry version 31.
DE   RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN   Name=Acey_s0122.g1086 {ECO:0000313|EMBL:EYB99506.1};
GN   Synonyms=Acey-lgx-1 {ECO:0000313|EMBL:EYB99506.1};
GN   ORFNames=Y032_0122g1086 {ECO:0000313|EMBL:EYB99506.1};
OS   Ancylostoma ceylanicum.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC   Ancylostomatinae; Ancylostoma.
OX   NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB99506.1, ECO:0000313|Proteomes:UP000024635};
RN   [1] {ECO:0000313|Proteomes:UP000024635}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX   PubMed=25730766; DOI=10.1038/ng.3237;
RA   Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA   Aroian R.V.;
RT   "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT   ceylanicum identify infection-specific gene families.";
RL   Nat. Genet. 47:416-422(2015).
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EYB99506.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JARK01001458; EYB99506.1; -; Genomic_DNA.
DR   STRING; 53326.A0A016TA31; -.
DR   Proteomes; UP000024635; Unassembled WGS sequence.
DR   GO; GO:0016810; F:hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds; IEA:InterPro.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   Gene3D; 3.20.20.370; Glycoside hydrolase/deacetylase; 1.
DR   InterPro; IPR006150; Cys_repeat_1.
DR   InterPro; IPR006149; EB_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR011330; Glyco_hydro/deAcase_b/a-brl.
DR   InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR   InterPro; IPR002509; NODB_dom.
DR   PANTHER; PTHR45985; -; 1.
DR   PANTHER; PTHR45985:SF11; LIN-12 AND GLP-1 X-HYBRIDIZING; 1.
DR   Pfam; PF01683; EB; 17.
DR   Pfam; PF01522; Polysacc_deac_1; 1.
DR   SMART; SM00181; EGF; 31.
DR   SMART; SM00289; WR1; 23.
DR   SUPFAM; SSF81995; beta-sandwich domain of Sec23/24; 1.
DR   SUPFAM; SSF88713; Glycoside hydrolase/deacetylase; 1.
DR   SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50026; EGF_3; 2.
PE   4: Predicted;
KW   EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT   DOMAIN          1791..1828
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          2479..2516
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   REGION          254..278
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          498..524
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2270..2312
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3228 AA;  346841 MW;  EC6CDBD08580742D CRC64;
     MLVEMLTLRT LVNKWYQRSS SGVRPLCRSF YTLERATTFG GSRTGSFLTS FLIRVGRFRS
     LDSISITFEI VIRNRIMREL LAAAFVFTFV IASDVGLNLP CNREMDGLLA ADPDGNPSAF
     LSCQSNGAGS TGYWERRVCP DEMVFDFINQ RCQQRKHRKQ PMLNIAILNN SCAHGETCIG
     GTVCDLERLR CLCPYGTVPQ LETLSCIKPH SSYGGFGNFE KSPSTPAGFL PNPSQTNPPP
     FTFNFNPLFG KETYPNYGNQ NSNNKYGNAN GASGNEFSSL PSGNNKYGAN GQWNGAASGS
     GNNAGGYNGV NSFSELLPKP GVESQPFVFK PNFFASTSAP VPTKKVPTLA RPGQSCRDNE
     MCIGGSLCTL PIALCLCPGE LEEKDGECVL PASATIQIEK VGIGALCSDL AECDHGSTCV
     MGRCACVAPL IQHEGRCVLR QERKEVGPGE LCDNGEVCVR GSVCDTVIPV CVCPPNTDLS
     NGDCVHISSI RPAIAAPAPT YTPPTAPAPQ LPQLPATQLP PQPQPVFLPS TTPAPVFVPS
     TTQLPQQPMQ PSPFGGHYGQ SQNVYQEKPV TAPIYPAQPS HVTYSNYQST MPPPTVRPTS
     KPNLMKISLG GSKQSGVGVP CSLNTDCMIG AYCNGNTNPP SCQCLSTHVN IEGRCERVVY
     PGQVGCRSDL QCSAAYTGTK CVDRICVCPE GYKEVDQTCV PEQANPNERC GYVHGHPECS
     KGFSCINQIC VCPTTHTITN GFCSSNTTDH PLTQCDEACE PPRKCLNNRC VCPDGVSCAS
     AEVSRRRRRG VVEQSMVCWP GASQCSAGNG VCIDNVCHCT HGFVEVNGVC APEIVRIGEN
     CDPNGISPRC PENAVCKDGI CQCATPGGCD RDTFMAGSRL MDGRCTMDRE CPDGQCVAGR
     CQCNDGFALQ NGACTSITGA FKNINGQCTA DDRCSGGSTC RGNVCQCVDG SSELHGRCRQ
     SPGGRCSYGQ TCDGGSSCEF GLCRCPDGHI IDAGKCVMGK AEPGKSCQHG QKCVHGSVCR
     FGMCMCIAKY MASKGRCVRR ENVLVPTSPA PSSTTSSATT SVRVGAVKGP GFECHEKDLC
     SGGSRCRDGF CVCNEFEVII NEQCVGSHEQ ANEIIDKLLV SAPGQPCDAR TNCTGGSICI
     NRTCTCEQGN IDNSGTCTEA KKEYGTNAAP PKKSPDQFQP GFTCTLTIEC PYRTECLRGV
     CRCKKGETIV DNTCRKAIHQ VLPGGKCDPR KGYDCVGEAH CFYGVCTCTR HLVNNGKECA
     TIEEVEMVTP GKRCGLGQTC SGGARCLDGF CRCPEDEVPD VNKKCVKKSQ VYPVFNKYPT
     TSEATSIYYP NPVTATLPSA SANSALTLDK AKELEAFEAL LKANPSFSEI DAKIAQTIYG
     HICRSKDECP ANSFCFQQLC RCMLGYRATG GYCEPITDGC AISNNPGKPC GTIAHPGEDC
     TRNQVCSYNS YCGLFSGVCE CPSGMATMNG RCERTTVAPG LGCVTSKNCH SSSYCDNGLC
     LCKTGYQLIN NFCVLPESAG GAAAGAALKQ SNPNSQEGSI LPRTIHSYPS GMIPLDFTKP
     PFNFASKPEI KTVYANVNSQ QATSSSYTPS QVPRFQSFPV PFAKGASTKP GDVESERTKS
     TSRLRIAMPG DYCGDDSICI GNSLCQKQFC RCPPNTFAEN GICSIRRRLS PGPKHSNDHE
     EFVDLNSQED SAESRQFAAP LENCQNFEFC TGGSECLSIQ GMGLVCQCPT NTIFLEDECV
     DAPRNAELAG IGESCQDGEI CLGGSRCIQN ICMCDDDKHD ILGICVTTAR PGDDCSDGQI
     CVDGAVCAAS VKTCVCPPGR MSKMGRCVEN GQPQDSTSRM AIPGEPCGSQ SSCADNSFCS
     GERICVCMPR FANLNGRCVP SNMVRSPGEE CHLDNMCTGG STCSGGACTC PLGQLLMDAR
     CVHVSDEIRR RPSQNECEVD GDCAEHYQCV NRMCVCHGDF SRCLRMVLLR AEQSCREDAH
     CPEYATCNEN LCVCNDGYKM IQGKCIQRNL LKQKVTMKGT KTVKDIAMKI TKPSGPGAIC
     GESHHCAMGS VCFHGYCVCG LESVPRNGSC VSRSGNIGFG EKCSEHYKCR DGLSCVQDRC
     ACLPDNLSCN ESEPVTSPPG GACTDSRICV GGSVCREGWC ICPDPTMIVQ RGICIQPGPR
     PTLPPSTASI NSQVHQPYLP RTAYQNVPGA GMAAVNQYNI QQGRKIPPGA SCGPLDTCVG
     GAMCVDGMCV CPSGMQASAQ GRCEKASTPA TSTAMILNPG SVHATEFTYQ HTHHLQSRPR
     PSPTSSSSSS SYAAAAAATQ HVTHAPTPTL TNANTASEAD ECAAIGLYCR GNTVCRNLSC
     QCPDDYVLHH DGCVPPDEVG RRKVLGKARH QGTPTYARPG QRCANGETCV GGSVCNEMMM
     CACPPEKPIL QGDACVAQQY RKIATPGESC DENTECTKES SCHGGQCRCQ YGYIAVSGQC
     VALPMPTTPA MKNVVLARPL DSCDNGEQCE GGSSCDQETG VCMCPPGYIV YGVQCQPPPQ
     STAMQPAATP IPQAAPTFAV SMNTECADDT NCGANKVCVV GRCKCRPGFV DHDGVCEPLE
     SVTLRKLKYG QTCNPAMDTC IQGSSCIDQV CKCGPGFALS PNGWCERFDF NMEGRTSPSY
     SSFQENYYTT TTQQSIVIHT VPDEHPLFET TASQSSSRSS STGPKVIRHR FLGTKCRDND
     VCINGGECRE GVCRCPERLY ERDGRCLRAD QLPRAAPSES CAKGEMCTGG SICDNDSKTC
     ICAADHLIVD GICRSKDATP FAAPGQSCSR GEDCSGGSYC ADGICQCDSN HFAEDGYCRH
     IASRSSEIKY VAGNGLRFSS KAFAPRIPST PCNETTCRLP DCFCSLTGRK PPGGLAPNTI
     PQFVVLTFDD AVNGRTLPDY RELFETVKYR NPNGCPVKAT FFISHEWTNY DAVQWLFQQG
     MELASNSISH VSLEGTNANR WLNEMDGQRR IIAKFANANE EEIVGMRAPQ LALGGDEQFE
     MMARAGFLYD NSMSANPGVN GDPFWPQTLD HSVPWDCYDA NCPKSSFPGI WTVPLNQFYG
     TYLPQIDSFR RSSMVRAAVD LNTTVDQLTN MLFSNFDRSY TASRAPYVLA LNADLLQLNG
     RNTGMQALQR FLEEVLYRKD VYVVTLKQLI QWMKNPVPLS QITQSEAVKC SQGPLSQYPA
     ISQRSCSKPN KCMYRTPGLG SQEHQFLTCS PCPDQYPWLD NPIGNMTP
//
DBGET integrated database retrieval system