ID G7YUU2_CLOSI Unreviewed; 2041 AA.
AC G7YUU2;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=Laminin gamma 1 {ECO:0000313|EMBL:GAA56722.1};
DE Flags: Fragment;
GN ORFNames=CLF_111412 {ECO:0000313|EMBL:GAA56722.1};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA56722.1, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA56722.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA56722.1};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00460}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF144352; GAA56722.1; -; Genomic_DNA.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR CDD; cd00055; EGF_Lam; 9.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.10.25.10; Laminin; 9.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR008211; Laminin_N.
DR InterPro; IPR002049; LE_dom.
DR PANTHER; PTHR10574:SF435; LAMININ SUBUNIT GAMMA-1; 1.
DR PANTHER; PTHR10574; NETRIN/LAMININ-RELATED; 1.
DR Pfam; PF01391; Collagen; 8.
DR Pfam; PF00053; Laminin_EGF; 10.
DR Pfam; PF00055; Laminin_N; 1.
DR PRINTS; PR00011; EGFLAMININ.
DR SMART; SM00180; EGF_Lam; 11.
DR SMART; SM00136; LamNT; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 10.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01248; EGF_LAM_1; 5.
DR PROSITE; PS50027; EGF_LAM_2; 7.
DR PROSITE; PS51117; LAMININ_NTER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00460};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT DOMAIN 1..125
FT /note="Laminin N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51117"
FT DOMAIN 188..244
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 245..293
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 294..346
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 483..533
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 651..706
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 707..760
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 762..801
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT REGION 1645..1844
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1920..1947
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1962..2041
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1645..1662
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1818..1832
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2027..2041
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 216..225
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 228..242
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 267..276
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 317..326
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 501..510
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 679..688
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 707..719
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 709..726
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 728..737
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 776..785
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT NON_TER 2041
FT /evidence="ECO:0000313|EMBL:GAA56722.1"
SQ SEQUENCE 2041 AA; 220155 MW; 0B7B697D39F1B8BB CRC64;
MVIYKRFDDQ SDWTPWAYFS SNCYTYFGIP YQPVPTFSRP DEVICQEEYS TLQPLYGGEL
VFSVINGRPN YERFFEDAEL QRWSTASQIR VELKKMHTFG DERGAEKDTL LTYYFAIDKF
TVGGRCLCNG HGNECRPSTG PGQPDRLVCV CAPSHHTAGD NCEQCAPDHR DVPWQPATPE
NPNPCRPCKC NGNSQLCEFD LDLYDQTGSG SRCIGCGNNT EGINCERCKT GYFPDPVYPT
VCQPCSCDPV GTVDSQEDCA TTGQCRCKPG VGGPRCDRCL EDHYGFSAGG CLPCNCSTVG
GLDNRAVCDA QTGQCLCKQN VGGKQCEKCK LGHYGLMSND PLGCKPCACS SHSSECELDV
TKVAEAAGKP GEIVDLATLV DPAKVIINCP PDYHPCFACL QKDRQSVRIE CNGTSEGHLP
CLCVKDLNQC QYCASGWSEP QKDPRYEVCK CPPQYTGTSC ESCAFGYRRD PPDGLPTDRC
VACTCNNHST VCHPETGQCE CQDNTGGLFC DRCADGYYGN AFAPVGSADA CKPCPCPSGT
KCEQVHWPDQ TIKVVCTDCP DNRAGVRCER CAENYYGDPS KGIPCKPCDC SGNVDPREFG
NCDGITGECL KCIFGTTGKH CEKCLPGYRR NFKPATETGE ALVPARGCSP CHCNPVGTLA
SYGQSGIGVC NPETGQCPCK PGVGGLRCDK CYPGFYGFRT GEGCKPCDCD SIGAIGEACD
DHTGSCSCRP YVTGRQCDQC LPGYFNLTST HGCMACNCHP YGAEDRQCDK TGQCKCKPFA
VGKKCDQCQE NHYNLEVGCL PCPACYNLVQ ARVSKLWGML ESVFGPLRPD SKPGVQISPD
DKDLYAEMKK LNETIVSLYR DVLQIGGSII IIIMDSMTSV LNTDASLPYN HDLFESLIVK
KKNKDGRGRD VVLPYYNQSE LLLIKFRSSE QSLIDITRLS PLLLGILLYI TPFVSAVICL
GKIYPSDNSG TPHSSFNVFW SSKQPALTYT SQYDYYERLR CQQVCTPDKC TCIGPKGMVG
SPGPPGPPGE PGPVGDPGLV GFQGTKGEKG YSGQLGRQGF KGERPIKFLC IKIYRKHLYG
LPSHESRTAL SSSRKTILNT LRCSQLPPND DYHMRPTIFH VSPTLVHSGY PGLMGEKGEK
GDIGCYGDPG ENGIPGPPGS FGFKGLPGPM GRQGPKGQPG QVIYFTPVRN PYEPFRRLPC
ELDSTANCLP NLFYRVCTDH LDQLGKQAIW EKKDQGVTPD RSDPLLRASK ASKAEWVRRV
VQDRQSRLKV RKAIKVNPVH VVQTATRVFW MLIHSKDHQA CQVLWDHQEN EGLLALLESP
DCRVWMDLRD LGEKKAYLDV MATEAHRVML ETPDCLVHRE PEVLMGNLVV QGFLERKGTK
VFPELMDFLV SVAPKENLGA NASALGQKED KEKKVFRVLW VHLDFPVQWE LSESREKKDF
QVQSVSPVCL VAIARKDSLA YLAPKEKQVS QDLQVQLEGL VLKENLDHEV HLETLLSKFY
PGLQDIQVLP VHLGAREPRV IWVSLVRRAF EELEYVVQPA KGDYQVTQGR MEGLEFLDYL
DFLDQKDGVT SHVWPVLMVN LADVVNPEFK EIQGFLDPLD SLERMVTLDG LATPVAMGKP
GFDGSFGEKG DRGEKGLVRI VQEKIIPGER GEDGRPGEKG ESGDRGIPGS FGYIGDPGPP
GRRGFPGPHG PVGAPGEDGS PGFRGDPGVD GRSVDGMPGE PGAPGYPGLK GRQGIPGAKG
YCDRVRPQIT KGYQGEPGFR GDPGPAGEPG SRGERGIKGG SGFPGLRGTD GENGTAGMPG
MKGSPGYPGS AGPPGYPGSV GMPGPPGPPG DMGRPGESGY IGQKGLPGDI GEPGLMAQLL
RGEPGFRGLP GSMGIPGISG IPGLKGFPGR AGLPGMNGTM GPKGFQGPKG VKGVRGQPSF
AILPGEPGRK GQKGLPGTWG LKGEPGRPGE CYRPGVPGDI GPKGLPGYPG PIGEEGLPGD
RGEPGIDGQA FAPGPKGDRG EDGYPGSIGR TGIKGEPGRP GPAGAKGIPG PMGPPGQPGW
P
//