ID T1FQR7_HELRO Unreviewed; 994 AA.
AC T1FQR7;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ESN96316.1, ECO:0000313|EnsemblMetazoa:HelroP189192};
GN Name=20211164 {ECO:0000313|EnsemblMetazoa:HelroP189192};
GN ORFNames=HELRODRAFT_189192 {ECO:0000313|EMBL:ESN96316.1};
OS Helobdella robusta (Californian leech).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Clitellata;
OC Hirudinea; Rhynchobdellida; Glossiphoniidae; Helobdella.
OX NCBI_TaxID=6412 {ECO:0000313|EnsemblMetazoa:HelroP189192, ECO:0000313|Proteomes:UP000015101};
RN [1] {ECO:0000313|Proteomes:UP000015101}
RP NUCLEOTIDE SEQUENCE.
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ESN96316.1, ECO:0000313|Proteomes:UP000015101}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:HelroP189192}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQM01001381; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB097495; ESN96316.1; -; Genomic_DNA.
DR RefSeq; XP_009025498.1; XM_009027250.1.
DR AlphaFoldDB; T1FQR7; -.
DR STRING; 6412.T1FQR7; -.
DR EnsemblMetazoa; HelroT189192; HelroP189192; HelroG189192.
DR GeneID; 20211164; -.
DR KEGG; hro:HELRODRAFT_189192; -.
DR CTD; 20211164; -.
DR eggNOG; KOG1217; Eukaryota.
DR HOGENOM; CLU_300965_0_0_1; -.
DR InParanoid; T1FQR7; -.
DR OMA; WICEDIN; -.
DR OrthoDB; 3876300at2759; -.
DR Proteomes; UP000015101; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 5.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.10.25.10; Laminin; 8.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR002889; WSC_carb-bd.
DR PANTHER; PTHR24039; FIBRILLIN-RELATED; 1.
DR PANTHER; PTHR24039:SF28; FIBULIN-1; 1.
DR Pfam; PF07645; EGF_CA; 8.
DR Pfam; PF01822; WSC; 1.
DR SMART; SM00181; EGF; 10.
DR SMART; SM00179; EGF_CA; 8.
DR SMART; SM00321; WSC; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS01186; EGF_2; 7.
DR PROSITE; PS50026; EGF_3; 6.
DR PROSITE; PS01187; EGF_CA; 4.
DR PROSITE; PS51212; WSC; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000015101};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..994
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010980891"
FT TRANSMEM 958..981
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 375..466
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT DOMAIN 599..641
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 642..683
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 707..749
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 823..864
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 865..907
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 908..950
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
SQ SEQUENCE 994 AA; 109764 MW; 7B6B6FCBA1E85325 CRC64;
MALKILMVLA LSSCCIIDIN ASVVNVSIRK PAYMTNEATG HSAKLGVDND ITTSTETLSS
QTTPKWFVVD LVNAYQVKYI ILYNEHSCGS CDSDMNYFNV GLTNSFNPSG TAEIRGTYDQ
CGWWPAPVTA ADTQMSVICE DGTKFSRFII IQQSLEVENN GASSSLQLGE LMVYANDVFP
QYENVAYKKK AYLSTEQVSA SNCVDGNTTS FCSLGSEVYG PYMNMSHEYD WFIVDLARPY
QIVYVTLHAK ATDTLMNNFV VGQLSSVNPT YPTYLRGRYS VCGYGPPSYT IDAQPMRVDC
NDRTSYFRYV IVQQEASQAY DGENNADKYV GCFRNFQIKL TYQESSLDTC IYICRQMNVP
YASVKDSQCF CAPLNVGYLG CFIDGSLDLS ADYESIPSRT IETCITYCLS RNYDYAGLQR
GDTCKCGNSY GAYGKDSDSA CNTNCAGNTN EECGDSLKHY LVCSKLSTSK HCTGVYHCST
GNNFTPDMTC QSHSCQPGWT GGACDKRDCQ TNNGGCGIHP CSSFQIGTTT YTECVCRLGY
TKSFYNDDCV LGEFFQMYVQ FVLTCNTTNS ICVAGSTDYS CNCKSGYQHP ADNNTFCYDI
DKCAASTNPC RTPSETCVNT DGSKKCPCSP GYMLSAASVC EDIDECFENK NNCTKPLSTC
LNTYGSFICI CPSGYVQVNN DCSDNVLYFT SSKIFFKRLS KNKFFLDIDE CLYNPNACDN
ETTTCVNRVG TYSCKCLSGF YAKNPWTCED IDECARNKHN CTNSTETCVN TFGSFVCQYI
NECKVETSSA CNIATSTCFN MIDTYNCTCL DGFFNKDQWT CEDIDECALN QHNCSSPTET
CINTAGSFAC QCSSGFQRFN NICSDINECL DKTSPCDPQT STCVNQIGSY NCSCFSGFQN
KGQWICEDIN ECVKNANACN TAISTCVNNV GTFSCACFSG YVNKDQWNCE VSGERNVLFA
FIGVFAAVVL TLMGVFAYWA ASYQFKPARA NFSE
//