ID H2KTR5_CLOSI Unreviewed; 1885 AA.
AC H2KTR5;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE SubName: Full=Fibropellin-1 {ECO:0000313|EMBL:GAA33950.2};
GN ORFNames=CLF_105899 {ECO:0000313|EMBL:GAA33950.2};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA33950.2, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA33950.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA33950.2};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF143932; GAA33950.2; -; Genomic_DNA.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR CDD; cd00110; LamG; 4.
DR Gene3D; 2.60.120.200; -; 4.
DR Gene3D; 2.10.25.10; Laminin; 14.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR003645; Fol_N.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR PANTHER; PTHR15036:SF85; SP2353, ISOFORM A; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF02210; Laminin_G_2; 4.
DR SMART; SM00181; EGF; 16.
DR SMART; SM00274; FOLN; 5.
DR SMART; SM00282; LamG; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 5.
DR SUPFAM; SSF57196; EGF/Laminin; 5.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS50026; EGF_3; 12.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT DOMAIN 480..662
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 722..904
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 931..1138
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1140..1177
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1179..1216
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1255..1293
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1331..1366
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1369..1407
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1408..1442
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1445..1483
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1521..1559
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1595..1632
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1633..1670
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1671..1708
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1709..1747
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 1144..1154
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1622..1631
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1885 AA; 210545 MW; 1C75CA8EEE023FDC CRC64;
MLNTREVELE SLTLTKNTSE QYHTIIRIGE KLRYLKIAAV ESFGDEPFLA PPSNNGEGES
KHIWMPDSSL YVVESDNVNG LVMSANEGLI RHFSILDPSQ CPWLMEACPK HQFAIQFDIQ
PIWQGRRDEK SNVLTIRTRQ SVENPTVWVV QFVMPNSTTL SVSFYPEDHM DRQCPIFPPQ
NLTLSPSGWT RVSALIRAKT GELMLDKQQT LDAPAPSDAK RASLLHRNWR SLASLLREIP
TNRLGKLEAM DFGKDSYVRY DFKNRLKKTA DREELSIEFE VPRGVENGLI WFVENNASKS
FVYLKDGRLH YTFLFVDRTR SAQLTLTEEI YLNTPLQSDR PHLLSLVRDR DNLTLTLNER
SHASKHIDGR IPLVPSDGNA YLGGSQNPAK DTAGKVTETF RGRITKAQVA REGDKKVDLL
QVATDRHWND SVVRSGGVRF EIKEPKVKLV IGPTASRPLQ SGFSAGSSPI GMALSREPVQ
ISFRGTQNSV VRFDTWDFVL YRSFEIEFVT YEPNGILFFV GPDREHTDFV CVELYDGNVY
FVYAVGDHFR HIQLNPDNMK VNTGMSNRVY VSRNEHHQFL VKFNDRVVDV DQGKTAHQAE
FATYTYIGGV DHSSRLPWHV WSRENFAGCI PSIRINDDKF LKASSRMSQH TDASRGIEFG
VCRVPDRRCT REICGGGQCA ERSYPYFEPL NFACDCSGSD KTFRDGVTDI RRSEACFRDA
PILEMDGEMV YLIDFDRQLH TMTTHTDDIS LQFRTQQSNA PLFYAVSPAD KSYFRVDLSG
GRIRIQTNIN HVDNRFNFEQ YSLASGPLDD NQWHTLRIRR RAEYMAMSID GIHDDIVTIP
LQSHLNTFQL IAQQIYLGSA GPSDYSRPPP PGSEQRFLPP PPRFVGEFRN FYWNEYDFIG
TASCSGKYGT DMLTPRAELP TFPQWPREPT YSITLTYKLN YTRLAKVFDM REAGDMWLIE
FKTEYDGVLL SAREPGSMDQ VHITLVLLRG RLHVVYNVHG RSGVHQITSG PTANNLNDGR
WHRIVIGLDR RKKELVAFLD SYPPQIIGIG LPVNRIQLLN FYFGGLPDTE WQSVWSLLRS
YAPGSLQSSD AEHGRQPAFT GCIGGFSMRS DKFASDLLRR HDSELTAYPP TEIVRGFCRE
QKRCTNDYCY HGRCEQVSER EIRCICTGTQ YEGPRCETLI VNCPPNYCNQ RGVCTIVNGQ
PKCDCSGSGY HGERCELSPC TAPGGYCYNG GVCSIRGGLP YCDCRGTGFY GDRCRDPICP
PNYCYNRGRC HVDANERPVC DCQGTGYIGP RCEQTACTPD TCDNGGKCVV NSYGVIECIC
TGTGFRGPQC RTPICTSDYC SNRGRCVVGP NEQPMCDCTG TGYTGERCHI SICQQGYCAN
GGRCVYGPGD QPRCECQGTG YTGERCHIKI CADGYCANGG RCTVNSQNQP VCDCSGTGFG
GSRCTDPICR PGFCQNGGRC YLNPSGQPTC ACEGTGFTGS MCQTPVCSNG YCLNGGRCSV
DANNQPVCNC GGTEHRGSRC ETPICSHGHC MNGGTCRVNE LGQPVCDCLL TNYQGPRCEI
PICPTGYCGG RGTCVVVQRK PTCNCHSGYR GSRCEIEVCP ENYCINGGYC RPGPDRTPIC
TCPPNYEGDR CELPKTCPSD YCFHGGRCTM VRGVPQCDCL GTDYTGSRCE TPQTCPVGFC
QNGGRCSVQQ GNYVCDCTGT GYRGIICNEP IACPPNYCLF GGICSVLPDK QYVCDCSRTA
RTGKHCEGSS NGIYVTYEKE GYLIYPLSPY VRTVEDNVTL GFRTYMQSGT LITFMTTDGR
HWAVKLSLVA GMDFLAAIRS ADLEDIQCSQ TAFDRKSKLS DLVGFRYTFN KQPVKNKTNA
SCRILKIGSR QFKNSGNKAT HSGRN
//