ID A0A0R3W133_TAEAS Unreviewed; 1712 AA.
AC A0A0R3W133;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE SubName: Full=Collagen IV NC1 domain-containing protein {ECO:0000313|WBParaSite:TASK_0000342101-mRNA-1};
GN ORFNames=TASK_LOCUS3422 {ECO:0000313|EMBL:VDK31787.1};
OS Taenia asiatica (Asian tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Taenia.
OX NCBI_TaxID=60517 {ECO:0000313|Proteomes:UP000046400, ECO:0000313|WBParaSite:TASK_0000342101-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:TASK_0000342101-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (FEB-2017) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDK31787.1, ECO:0000313|Proteomes:UP000282613}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYRS01018298; VDK31787.1; -; Genomic_DNA.
DR STRING; 60517.A0A0R3W133; -.
DR WBParaSite; TASK_0000342101-mRNA-1; TASK_0000342101-mRNA-1; TASK_0000342101.
DR Proteomes; UP000046400; Unplaced.
DR Proteomes; UP000282613; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 20.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000282613};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1479..1705
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 668..1477
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..109
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 335..349
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 858..872
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 928..963
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1016..1059
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1130..1150
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1291..1322
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1403..1420
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1712 AA; 169711 MW; 57361444219D331A CRC64;
MGLPGSIGPR GSRGPPGPRG QQGLPGYPGE PGRKGIMGDK GQPGMPGTDG LKGMPGIKGP
KGFPGDPGCC GIKGLPGFPG LPGPDGYDGL QGPRGPKGEK GEKGDTIIGD GRAVEGPPGM
KGSRGPQGPV GPRGPDGERG RPGVQGPKGF MGPAGDHGEQ GPKGPKGKTP SPSIQYAQRG
DKGEKGYKGA SYEKPPVLKG IQVGPKGYKG IPGERGPPGD KGLKGERGIP GFPGVPGQPG
PQGDPGPPGK KGKTGPSGAS GASGRPGIDG SIGPPGPEGP RGVKGEKGYP GRTGPKGEQG
PKGDTITTEG TLPGPDGERG EMGPPGPPGD EGPMGEKGLE GERGDPGEPG PKGEPGPGYG
TIVPGEPGPK GERGLPGVDG KRGKTGPSGP RGVKGVKGLP GKDLIGPKGE RGPPGVPGTQ
GPVGERGPEG LQGPPGDPGE DGPSIDGVKG PPGVKGLPGS PGLRGRPGPR GPTGPQGDCA
YCPDGLPGPR GPKGFTGRPG RPGPDGADGA PGAKGLPGFQ GPDGPMGPAG RKGAKGEPGP
MGERGPPGEP GRIEFINQTI TEAPRGQDGP PGRLGSRGPT GPKGFPGQRG PRGPQGLKGE
PGAKGLPGFS GPQGVQGPPG DPGPDGRVIR AQKGEPGLPG RDGYPGPKGE KGERGPDRII
EQLYEHLKGE PGPKGEKGKP GGPGFPGAQG APGEKGLRGT DGKDGKKGYP GDVGPPGLPG
EKGRRGPKGQ PGLPGGRGPD GRQGPQGPMG PKGQRGPEGE RGEIGLPGPD GGSEPGLRGP
RGEPGEKGRK GEPGAPGLQG PPGEKGPKGI AGVAGRPGQP GPKGVKGEPG APGEVLYGPK
GPKGARGPPG LPGPRGPKGD KGPDDFISEG MKGDKGQKGQ AGQIGPPGID GPPGPKGKQG
PPGPIGKDGI QGEVGDQGEM GPVGSPGRQG PKGPPGLPGE TGPEGPEGPP GPAGPPGPKG
DPGPKGYPGE KGDIGPPGSG SPGIKGEKGQ PGRMGIPGEP GYDGIPGLKG EPGPPGQSIQ
GPPGPRGPKG EQGPQGPPGT DKPPARKGAP GPPGPKGPRG LKGEPGIPGP EGRPGPRGPI
GPVGDEGPQG RPGLPGPAGP KGPKGQPGVQ YGEPEMGEKG ERGPSGPPGD VGVSGPPGPK
GEPGPMGLPG IPGPQGFKGL PGFQGKRGAR GPEGPRGPDG LPGPRGSHGP RGPRGPQGPV
GPKGERGLQG EPGDSMVGQP GAKGELGDPG PMGSPGVKGA TGVPGTRGRP GERGDPGPDL
IGAKGQKGEL GAIGRPGPQG PRGPKGPKGI SYPQPGEPGP DGPPGPKGVK GESGPPGPDG
QPGPKGERGL PGKPGLRGIP GPDGRPGLDG RDGQPGPQGF KGEKGFRGPK GAQGPMGLPG
PKGEQGDRLS GEKGQKGQPG PDGPPGEMGP RGPQGQPGPQ GPTGEPGQKG EPAPMGPKGF
EGPLGEPGPR GPTGPKGDIG PHGEPGLPGR VGDTRGNTGF LFAKHSQTMR IPECPSSTRK
LWEGYSFLSM SGSERPHVND LASPGSCLQM FNPIPFMFCE KQENCYFAQR NDRSYWLSTE
DEPMMWNPVP VNESQRHISR CVVCEAPSTL YAFHSQTLEI PKCPMGWQDM WVGSSFLMNT
GYGAQGGGQQ LASPGSCLPI FRSHMFIECT AKGLCGFFEE HKNFWLRVMP SSMDDHMFGM
VMGQVIKVRQ SQDSVGKCIV CMRTQPLVTY LV
//