ID A0A074YYI0_9TREM Unreviewed; 1754 AA.
AC A0A074YYI0;
DT 01-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2014, sequence version 1.
DT 22-FEB-2023, entry version 40.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
GN ORFNames=T265_11470 {ECO:0000313|EMBL:KER19856.1};
OS Opisthorchis viverrini.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Opisthorchis.
OX NCBI_TaxID=6198 {ECO:0000313|EMBL:KER19856.1, ECO:0000313|Proteomes:UP000054324};
RN [1] {ECO:0000313|EMBL:KER19856.1, ECO:0000313|Proteomes:UP000054324}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Young N.D., Nagarajan N., Lin S.J., Korhonen P.K., Jex A.R., Hall R.S.,
RA Safavi-Hemami H., Kaewkong W., Bertrand D., Gao S., Seet Q., Wongkham S.,
RA Teh B.T., Wongkham C., Intapan P.M., Maleewong W., Yang X., Hu M., Wang Z.,
RA Hofmann A., Sternberg P.W., Tan P., Wang J., Gasser R.B.;
RT "Opisthorchis viverrini - life in the bile duct.";
RL Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL597127; KER19856.1; -; Genomic_DNA.
DR RefSeq; XP_009176398.1; XM_009178134.1.
DR STRING; 6198.A0A074YYI0; -.
DR GeneID; 20325638; -.
DR KEGG; ovi:T265_11470; -.
DR CTD; 20325638; -.
DR OrthoDB; 2882192at2759; -.
DR Proteomes; UP000054324; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1079; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 13.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000054324};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..1754
FT /note="Collagen IV NC1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001703772"
FT DOMAIN 1521..1747
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 183..907
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 923..1514
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 267..281
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 442..477
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 598..638
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 825..839
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 987..1002
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1754 AA; 174775 MW; B55E32F1BF430B17 CRC64;
MRRRVRRWSA APFELSPLLL GILLYITPFV SAQYDYYERL RCQQVCTPDK CTCIGPKGMV
GAPGPPGPPG EPGPVGDPGF VGFQGTKGEK GYSGQLGRQG FKGERGPQGP PGYHGQHGLS
GYPGLMGEKG EKGDIGCYGD PGENGIPGPP GNFGFKGIPG PMGRQGPKGQ PGQVIYFTPG
EKGVYGPPGP TGQTGDMGEK GPRGDPGPIG PSVKGFKGVQ GRMGPPGRPG SAITIEGQKG
DKGQPGPRGP DGYPCVLDAH SQQGPPGVPG PMGPPGERGP SGPPGESGLP GLDGPPGPRG
EKGLPGRHGD RGPPGDAGNP GLPGAPGARG FDGEPGRPGV PGEKGDQGIP GADGLPGFRG
PKGKPGGKRF CAGPKGRQGE KGFPGSIGAP GFPGPMGPIG IAGEKGFPGP VGQPGLPGRY
CTEGQPGLPG PKGEIGLPGS PGPVGGPGPE GKPGPRGPPG DTPDQVLPGP PGHPGPPGSF
GRKGAKGNMG EPGEKGFRGT GIRGPAGERG LPGDPGSDGR PGIPGLPGLP GPKGRSNLTC
LACPDGEPGR RGEPGIQGDP GFYGSPGLAG GDGDIGWPGD PGRDGMPGMP GRKGKPGFDG
SFGEKGDRGE KGLVRIVQEK IIPGERGEDG RPGEKGESGD RGIPGSFGYI GDPGPPGRRG
FPGPHGPVGA PGEDGSPGFR GDPGVDGRSV DGMPGEPGTP GYPGVKGRQG LPGAKGYCDR
VRPQVTKGYQ GEPGFRGDPG PAGEPGSRGE MGIKGKQSQI QGEPGFRGDP GPAGEPGSRG
EMGIKGGSGF PGLRGTDGEN GTAGMPGMKG SPGYPGSAGP PGYPGSVGMP GPPGPPGDIG
RPGESGYIGQ KGLPGMNGTM GPKGFQGPKG VKGGRGQPSF AVLPGEPGRK GQKGLPGTWG
LKGEPGRPGE CYRPGVPGDI GPKGLPGYPG PTGEEGLPGD RGEPGVDGQA FAPGLKGDRG
EDGYPGSIGR IGIKGEPGRP GPAGIKGIPG PIGPPGQPGW PGRPGLDGPS GIPGEPGRSV
ISRPGIRGEP GIPGPPGSPG EKGIAGMDGK VGKPGESGAS STGRPGSRGM PGPPGLRGEH
GFPGSLGQPG PKGLPGSFGF PGLKGDRGLP GRNGAEGMDG RPGPRGQTGQ MGPPGDIGMR
GDMGPRGEPG QSPLLHPGSS GPPGYRGPPG YPGPIGQQGD KGFPGSVGPP GEVGSAGQPG
IPGYPGIKGR DGQPGFPGTP GMPGPVGPGG ARGIPGPSGP TGPKGIRGMK GQRGTTPYCP
VGIRGLKGRP GPPGPPGAPG PQGDKGTLGM PGPKGLPGAP GLAFPGPKGD IGPVGPEGYP
GAPGERGVTG IPGQLGFPGA KGEPGFPGRP GIPGDRGDVG PSGRDGLPGR RGPKGAAGFP
GADGLPGIDG HPGDVGEPGI MGPSGRPGLK GFSGERGFPG APGLPGRLTS ALKGIRGDPG
PPGEYGEKGV PGLPGQPGKQ GEPGEKGVKG RPNPSLGPRG PPGDIGPRGE PGYAGPKGYP
GEKGYRGQPG QNGTIGLGDG GYLFTVHSQD TRPPQCPVYT IELYTGYSLV TLLGDDDSVT
MDLGSPGSCL RKFSIMPYAT CFAQTTGSCH TNKRNGRSYW LSTLEQYMTS PTKVDRIQPY
ISRCVVCQSR TNLVAFHSQR PDAGQNCPTG WENAWNGFSF PMTVSGPTGG AQPLESPGSC
LQHFRALPFI ECNNQDGNCF HWQDGRSYWL SSIPQNQQFI KPIGTPFRSE TGVLQHISRC
SVCRKSMEAL LMEY
//