ID A0A075A5Z5_9TREM Unreviewed; 1779 AA.
AC A0A075A5Z5;
DT 01-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
GN ORFNames=T265_02610 {ECO:0000313|EMBL:KER31045.1};
OS Opisthorchis viverrini.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Opisthorchis.
OX NCBI_TaxID=6198 {ECO:0000313|EMBL:KER31045.1, ECO:0000313|Proteomes:UP000054324};
RN [1] {ECO:0000313|EMBL:KER31045.1, ECO:0000313|Proteomes:UP000054324}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Young N.D., Nagarajan N., Lin S.J., Korhonen P.K., Jex A.R., Hall R.S.,
RA Safavi-Hemami H., Kaewkong W., Bertrand D., Gao S., Seet Q., Wongkham S.,
RA Teh B.T., Wongkham C., Intapan P.M., Maleewong W., Yang X., Hu M., Wang Z.,
RA Hofmann A., Sternberg P.W., Tan P., Wang J., Gasser R.B.;
RT "Opisthorchis viverrini - life in the bile duct.";
RL Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL596651; KER31045.1; -; Genomic_DNA.
DR RefSeq; XP_009165172.1; XM_009166908.1.
DR STRING; 6198.A0A075A5Z5; -.
DR GeneID; 20316798; -.
DR KEGG; ovi:T265_02610; -.
DR CTD; 20316798; -.
DR OrthoDB; 2882192at2759; -.
DR Proteomes; UP000054324; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 18.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000054324};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..35
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 36..1779
FT /note="Collagen IV NC1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001704507"
FT DOMAIN 1544..1770
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 148..261
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 285..537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 550..618
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 648..720
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 734..1539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..397
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 593..609
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 884..911
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1331..1347
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1490..1506
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1779 AA; 176725 MW; C793FB5F10A54168 CRC64;
MKPHKSSRVS RTSMVSFAIL LLWVLMSVLI PVTQTAVTCI PSPNCTECTC TGLPGPRGDM
GVRGHAGRPG PVGPRGPPGL RGPDGIRGIP GIPGDPGPKG ISGDKGVPGV PGVDGLKGAP
GIKGPKGYPG EEGCCGAKGP KGLKGLPGSY GYDGEMGEKG LPGPKGEKGE PAVPLAPGLD
GTGGPKGERG PPGGRGYPGT PGEKGDMGFP GEPGFKGQKG ERGDPGPKGP KGYTPPPIIL
PYAEQGLKGD KGEKGTSYDD STLIKGRVIG MKGQPGPVGL PGPIGDPGVK GRQGPSGFKG
VIGSPGEIGD PGLTGKFGKP GAPGLPGVPG RPGQDGLPGP IGPPGFRGQT GDKGYSGSRG
PKGEKGEKGE TTDVPPVPGP DGAVGEPGEP GPPGIPGQMG PKGFEGEPGS RGEPGPKGPR
GAGYGNVIPG EPGRKGEKGL QGIEGKRGKQ GRPGISGRKG EKGFPGRDVY GVKGEKGPRG
FPGPIGETGE RGDTGLPGIK GEPGEDGPTI DGPAGPKGVK GMPGLAGLRG RDGDPGPIGP
KGDCVVCPDG RPGIPGPKGF PGQPGPAGLP GLDGAPGQKG LVGPQGKDGQ PGFPGEKGEK
GERGSDGPRG MKGEPGPVEF VNATVPEAPR GEIGMPGRMG ILGEAGPKGY PGPKGVPGPK
GFRGEPGPAG LPGFHGTPGK PGQRGEPGIP GRVFDARKGS PGLPGRDGPK GAKGEPGEDG
EHIYLDQLYE YVKGEPGPKG YPGEKGLPGF PGEQGREGEK GLPGMLGKQG KQGYPGPVGL
PGPDGDPGYA GPKGVKGQPG PTGVIGPKGT PGPQGMKGFP GPPGEPGERG TPGPAGSFGK
GPPGHQGEPG MPGPPGPKGY PGLKGTKGER GMDGIPGFAG KKGETGDRGP RGDPGEAGDV
LQGRKGEKGQ KGQPGDPGPE GLRGEKGLDV VPGLELKGPK GLPGPKGVDG LPGQEGPRGK
LGIPGPPGLP GTQGQIGPKG EPGLPGRIGP KGVTGPPGDD GAPGPRGDTG DPGPAGPPGF
RGSKGIPGKP GPKGSPGQPG RIGESLKGEK GFPGVKGIKG QPGKQGDPGI MGAKGVPGPR
GFGTKGHKGE KGFKGEPGIQ GPKGVAGLDG KQGRPGDMGP PGPVGEKGDP GLPGSVGRPG
RNGAVGEPGE PGPRGPKGLS GPSGEKGPKG MPGITYAEPI PGTKGAKGEP GFVGQKGPKG
VPGPRGNDGV PGIRGPSGPA GLKGIKGFPG LKGLKGSPGP RGPDGSPGLK GNRGPPGEKG
NLGLEGLPGD RGEPGDRGQS VQSPVGEKGE AGLPGPMGPQ GTKGEAGRPG DRGDTGPQGD
PGADLPGAPG EKGLPGPPGP PGIRGPPGPK GLKGSFPGRP GATGLPGVEG PKGFQGEPGP
KGYPGSIGRP GMKGMQGEKG LRGLDGKPGR PGKPGEKGMR GQPGRPGEKG REGQKGDMGP
VGLPGPEGIP GIGESGLEGE KGYPGPAGRP GEPGRPGDEG SRGRQGPPGP PGDEGEPAPK
GPKGYPGPQG PKGIAGPQGA PGDIGPAGEP GIGGVVGDTR GNTGHLFTVH SQSTRIPYCP
SGTHKLWEGY SFLSMTGSDR AHVNDLASPG SCLQLFNPIP FMFCEKQENC YYAQRNDRSY
WLSTEEEPMM WNPFPANESQ RHISRCVVCE APSKLYAFHA QTLEIPKCPD GWSTLWPGSS
FLMNTGYGAQ GGGQQLASPG SCIPQFRPHM FIECTAKGLC GFFEEHKNFW LRVMPSSMDE
HMFGMVMGQV IKVRSGPDSV SKCIVCMRTQ PMTTSFYIY
//