ID R7UQI9_CAPTE Unreviewed; 1608 AA.
AC R7UQI9;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 60.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ELU05671.1, ECO:0000313|EnsemblMetazoa:CapteP229021};
GN ORFNames=CAPTEDRAFT_229021 {ECO:0000313|EMBL:ELU05671.1};
OS Capitella teleta (Polychaete worm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC Sedentaria; Scolecida; Capitellidae; Capitella.
OX NCBI_TaxID=283909 {ECO:0000313|EMBL:ELU05671.1};
RN [1] {ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ELU05671.1, ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELU05671.1,
RC ECO:0000313|Proteomes:UP000014760};
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:CapteP229021}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQN01001324; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB301364; ELU05671.1; -; Genomic_DNA.
DR STRING; 283909.R7UQI9; -.
DR EnsemblMetazoa; CapteT229021; CapteP229021; CapteG229021.
DR HOGENOM; CLU_003792_0_0_1; -.
DR OMA; FYLNTHT; -.
DR Proteomes; UP000014760; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 3.
DR CDD; cd01472; vWA_collagen; 3.
DR CDD; cd01450; vWFA_subfamily_ECM; 3.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 7.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF12661; hEGF; 1.
DR Pfam; PF00092; VWA; 7.
DR PRINTS; PR01217; PRICHEXTENSN.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00327; VWA; 7.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF53300; vWA-like; 7.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS01187; EGF_CA; 3.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS50234; VWFA; 7.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000014760};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1608
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008788249"
FT DOMAIN 52..238
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 256..432
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 453..631
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 754..933
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 936..972
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 980..1165
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1168..1205
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1213..1390
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1397..1433
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1441..1608
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 643..746
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 962..971
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1195..1204
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1423..1432
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1608 AA; 176658 MW; 02B7E5DBE60A6307 CRC64;
MAQRRKMCYV VLAVFLGCLI PSCSSQGVYG NETDIDAKCI CVYVLECKRK IDIAFVIDQS
GSITDKNPPN GAYDNWELIK EFVIHTVEYF NVAYDETRVA AVTFGNQGIV RFYLTNYTSK
TDIERAVRSI PAGIGQTNTY AGLQKMRREV FNRQNGDRPE VPNVGIVITD GESNINDWLT
IDEAERARSE GTKLFSVGVT NDVNVQELKD ISSYPHIENR NYWKTPNFTS LNTIIESLQR
ETCEETPVVP GCEQADIALV IDSSGSIQEE GDNWARMKTF TQELVNRLDI ASNRVRVGAV
KYSGNAYVEF YLNTYSKKSD IVTSLGEMQF LGTKTYTGLG LRYMRNEIFK QDRGDRPDVP
NIAIVITDGR SNMNEQETKF EADRAKESGI RIIAVGITDK INPTELRNIA SDNQSLITVD
NFDVLMGQLD KIISITCQKP TVEPPKACAK EADIALILDQ STSIVFEGQG KWRNEMLGFA
VSVVESFPIA ANLTRVGVVK FSDTANVVIP LNQYYDEESL KTAITDLDHV GGETNIADGI
RKARVNFFRA DRGDRDGIPN IAILVTDGRA NVDTDNTELE AEMAKAEGIV LFTIGITQDV
DERELKSIAS HPDYYIFVNA FFQLQGILTQ LIDTACLVIT TLPTTTPTTT PTTTPTTTPT
TTPSTTPSTT PSTTPSTTPS TTPSTTPSTT PSTTPSTTPS TTPSTTPSTT PSTKRSTTTR
STTTAGSTTE RIPPTEVTFT PTITPLPDVG SSVDLAIALD ASGSINDQNF KIMLDFVKDL
VKKLDVPSGK VQVSLLTFSN QPNIEFFLNE YTSRLQMMAA IDRIKYVRGT TNTAGTLSYV
RNTVFTRKYG DRDNNRNFLI LLTDGESDDN VATLSEAKLL REQGVHIMTV GIGSWLDVFE
LQAIASYPYQ ENMIMAGNFT DLQRFLTILI DAVCDNNQEC ANDPCRNGGS CRNGILHYQC
VCPAGYGGAD CENECKEAGD VIFALDGSGS IGKANFQVMT DFVTTIVRSL DISNRQSNTQ
GTRIGLLTFG DDPNFEFNLN DYLSSDTELL NAINVRFIDG TTNTADAIRY VRENMFTASA
GDRPLVPNYL VVMTDGKSNN PEDTWVQAML ARNQGITVIS VGIGTGFNQE ELEGMASSPL
TSNVITASNF NSLTDEMRLR MSEALCNNEN NCNPNPCQNG ATCTDLVGGY ECGPCPFGFT
GYNCERGCSG EIDLYFVLDS SGSIRVERYP QILKFVADII SQLDVHQDRT RVGLIYWSDN
AHMLFTLDQY TNRENAMQAV MRTPFLGEKT NTASALEMLY KQGFTVANGD RINANNIAIV
ITDGNSNINP KMTPKDAIEA RTAGIHLMVV AVGSTFVNYG ELEAIASTPT DLNILNVDHY
EDLDTIKDQL VLSTCDDVNE CATNPCQNNG RCIDGLRSYT CLCSNNYAGI NCERQCSRRL
DLVFVLDLSG SVEEDYRLVI NFARAVTYGL NIDSDLVRIG AVTYATEVQD EFSMNTYSGF
KLSVINAMNF YHEGGRTNTQ GALSVALNQF TPARGDRAGV QNVVVLVTDG YSNVDKPDTI
PKAVTLKNAG IDIYSIAIGE TPNSLELSEI SSDPDSEYLY RLRELSAE
//