GenomeNet

Database: UniProt
Entry: A0A074YYI0_9TREM
LinkDB: A0A074YYI0_9TREM
Original site: A0A074YYI0_9TREM 
ID   A0A074YYI0_9TREM        Unreviewed;      1754 AA.
AC   A0A074YYI0;
DT   01-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   01-OCT-2014, sequence version 1.
DT   22-FEB-2023, entry version 40.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
GN   ORFNames=T265_11470 {ECO:0000313|EMBL:KER19856.1};
OS   Opisthorchis viverrini.
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Opisthorchis.
OX   NCBI_TaxID=6198 {ECO:0000313|EMBL:KER19856.1, ECO:0000313|Proteomes:UP000054324};
RN   [1] {ECO:0000313|EMBL:KER19856.1, ECO:0000313|Proteomes:UP000054324}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Young N.D., Nagarajan N., Lin S.J., Korhonen P.K., Jex A.R., Hall R.S.,
RA   Safavi-Hemami H., Kaewkong W., Bertrand D., Gao S., Seet Q., Wongkham S.,
RA   Teh B.T., Wongkham C., Intapan P.M., Maleewong W., Yang X., Hu M., Wang Z.,
RA   Hofmann A., Sternberg P.W., Tan P., Wang J., Gasser R.B.;
RT   "Opisthorchis viverrini - life in the bile duct.";
RL   Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL597127; KER19856.1; -; Genomic_DNA.
DR   RefSeq; XP_009176398.1; XM_009178134.1.
DR   STRING; 6198.A0A074YYI0; -.
DR   GeneID; 20325638; -.
DR   KEGG; ovi:T265_11470; -.
DR   CTD; 20325638; -.
DR   OrthoDB; 2882192at2759; -.
DR   Proteomes; UP000054324; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1079; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 13.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000054324};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..32
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           33..1754
FT                   /note="Collagen IV NC1 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001703772"
FT   DOMAIN          1521..1747
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          183..907
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          923..1514
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        267..281
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        442..477
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        598..638
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        825..839
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        987..1002
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1754 AA;  174775 MW;  B55E32F1BF430B17 CRC64;
     MRRRVRRWSA APFELSPLLL GILLYITPFV SAQYDYYERL RCQQVCTPDK CTCIGPKGMV
     GAPGPPGPPG EPGPVGDPGF VGFQGTKGEK GYSGQLGRQG FKGERGPQGP PGYHGQHGLS
     GYPGLMGEKG EKGDIGCYGD PGENGIPGPP GNFGFKGIPG PMGRQGPKGQ PGQVIYFTPG
     EKGVYGPPGP TGQTGDMGEK GPRGDPGPIG PSVKGFKGVQ GRMGPPGRPG SAITIEGQKG
     DKGQPGPRGP DGYPCVLDAH SQQGPPGVPG PMGPPGERGP SGPPGESGLP GLDGPPGPRG
     EKGLPGRHGD RGPPGDAGNP GLPGAPGARG FDGEPGRPGV PGEKGDQGIP GADGLPGFRG
     PKGKPGGKRF CAGPKGRQGE KGFPGSIGAP GFPGPMGPIG IAGEKGFPGP VGQPGLPGRY
     CTEGQPGLPG PKGEIGLPGS PGPVGGPGPE GKPGPRGPPG DTPDQVLPGP PGHPGPPGSF
     GRKGAKGNMG EPGEKGFRGT GIRGPAGERG LPGDPGSDGR PGIPGLPGLP GPKGRSNLTC
     LACPDGEPGR RGEPGIQGDP GFYGSPGLAG GDGDIGWPGD PGRDGMPGMP GRKGKPGFDG
     SFGEKGDRGE KGLVRIVQEK IIPGERGEDG RPGEKGESGD RGIPGSFGYI GDPGPPGRRG
     FPGPHGPVGA PGEDGSPGFR GDPGVDGRSV DGMPGEPGTP GYPGVKGRQG LPGAKGYCDR
     VRPQVTKGYQ GEPGFRGDPG PAGEPGSRGE MGIKGKQSQI QGEPGFRGDP GPAGEPGSRG
     EMGIKGGSGF PGLRGTDGEN GTAGMPGMKG SPGYPGSAGP PGYPGSVGMP GPPGPPGDIG
     RPGESGYIGQ KGLPGMNGTM GPKGFQGPKG VKGGRGQPSF AVLPGEPGRK GQKGLPGTWG
     LKGEPGRPGE CYRPGVPGDI GPKGLPGYPG PTGEEGLPGD RGEPGVDGQA FAPGLKGDRG
     EDGYPGSIGR IGIKGEPGRP GPAGIKGIPG PIGPPGQPGW PGRPGLDGPS GIPGEPGRSV
     ISRPGIRGEP GIPGPPGSPG EKGIAGMDGK VGKPGESGAS STGRPGSRGM PGPPGLRGEH
     GFPGSLGQPG PKGLPGSFGF PGLKGDRGLP GRNGAEGMDG RPGPRGQTGQ MGPPGDIGMR
     GDMGPRGEPG QSPLLHPGSS GPPGYRGPPG YPGPIGQQGD KGFPGSVGPP GEVGSAGQPG
     IPGYPGIKGR DGQPGFPGTP GMPGPVGPGG ARGIPGPSGP TGPKGIRGMK GQRGTTPYCP
     VGIRGLKGRP GPPGPPGAPG PQGDKGTLGM PGPKGLPGAP GLAFPGPKGD IGPVGPEGYP
     GAPGERGVTG IPGQLGFPGA KGEPGFPGRP GIPGDRGDVG PSGRDGLPGR RGPKGAAGFP
     GADGLPGIDG HPGDVGEPGI MGPSGRPGLK GFSGERGFPG APGLPGRLTS ALKGIRGDPG
     PPGEYGEKGV PGLPGQPGKQ GEPGEKGVKG RPNPSLGPRG PPGDIGPRGE PGYAGPKGYP
     GEKGYRGQPG QNGTIGLGDG GYLFTVHSQD TRPPQCPVYT IELYTGYSLV TLLGDDDSVT
     MDLGSPGSCL RKFSIMPYAT CFAQTTGSCH TNKRNGRSYW LSTLEQYMTS PTKVDRIQPY
     ISRCVVCQSR TNLVAFHSQR PDAGQNCPTG WENAWNGFSF PMTVSGPTGG AQPLESPGSC
     LQHFRALPFI ECNNQDGNCF HWQDGRSYWL SSIPQNQQFI KPIGTPFRSE TGVLQHISRC
     SVCRKSMEAL LMEY
//
DBGET integrated database retrieval system