ID A0A0V0TE84_9BILA Unreviewed; 1585 AA.
AC A0A0V0TE84;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Collagen alpha-1(IV) chain {ECO:0000313|EMBL:KRX36835.1};
GN Name=emb-9 {ECO:0000313|EMBL:KRX36835.1};
GN ORFNames=T05_8592 {ECO:0000313|EMBL:KRX36835.1};
OS Trichinella murrelli.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=144512 {ECO:0000313|EMBL:KRX36835.1, ECO:0000313|Proteomes:UP000055048};
RN [1] {ECO:0000313|EMBL:KRX36835.1, ECO:0000313|Proteomes:UP000055048}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS417 {ECO:0000313|EMBL:KRX36835.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX36835.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDJ01000341; KRX36835.1; -; Genomic_DNA.
DR Proteomes; UP000055048; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 17.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KRX36835.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000055048};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1358..1582
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..239
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 289..342
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 366..573
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 651..923
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 955..977
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1061..1125
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1149..1190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1229..1341
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1585 AA; 153355 MW; 6DDCA8959A31B54D CRC64;
MPGRDGVKGE KGDIGYVGYP GPPGPPGPKG AMGLFRAGQK GEKGAAGLPG SPGPPGQLTA
LRLGEMDILQ GPEGNPGPKG EKGDYGTVGP PGMPGAVGEP GYPGLKGMKG EPGPPGALGK
RGKDGTRGAA GEKGSTGDAG GPGRPGRPGL KGEMGDEGTP GRRGPQGDPG PPGLPGVGRG
LKGARGIEGP DGAPGPKGLP GKDGLPGLSG KRGPIGLPGP PGEVGVDGRP GYSEKGMKGN
DGTIIPLKIC LHRYPGLRGE PGYPGMPGLQ GPPGEIGFPG STIYGPPGRD GYPGLDGIPG
PPGERGDPGK PGLKGMPGTG KPLVGPPGQI GFPGLQGDSG RVGVPGRVGA PGLKGLPGDD
CGKCPDGTPG AKGTRGDAGL PGYAGIRGPE GSPGLPGPKG KTGFPGRDGE PGLQGYRGAP
GYPGRPGDKG DAGDAIGTPN AGRPGAKGEP GIPGFPGQRG PMGDAGRDGS PGLPGAPGLP
GERGLPGSRA KNGRMGKAGF PGPKGEPGDS FPGPPGIPGM KGIRGDPGIP GFAGPQGAAG
PAAPDMILKG EPGLPGRPGI PGQKGEFGYP GLPGERGDPG YAGQPGPAGM IGETGRSIPG
IPGPKGYPGM KGSNGLPGFP GIPGLPGKDG IPGVSGTKGV KGDAIVGPIG PAGPQGTRGV
PGEPGVPGLP GQDGAEGDIG QKGYPGLKGN AGSAGQPGLM GLKGAQGLDG LPGLPGSDGR
PGIVGSPGEP GRPGLRGNDG IPGAPGKRGP IGSAGVPGAA GLPGMAGTPG LKGESGEPGF
PGAQGPQGDS GVPGRPGIDG QKGESGLPGI MGPKGMVGEV GLAGLPGRSG LAGRKGEQGD
HGYAGTKGTR GDPGYVAAQG PPGDMGDPGE VGPAGNPGVA GLQGVRGEKG APGDSFPGAS
GNAGERGDPG LPGLAAKPGF PGPPGDAGVV GMKGIPGNAG TPGLPGMPGQ PGIPGSKGER
GNAGIPGVRG SDGAKGEDGL PGQKGYPGLA GNNGLEGMPG LSGMPGVKGE TGLSGIPGLP
GVGGLKGNKG ESGLPGYPGP KGDIGVPGKA GYPGVKGEVG IGGIPGKKGR DGIPGIPGRK
GDAGNPGLPG SPGFGVKGDP GFPGLPGMEG TGSPGAKGER GEAGLPGMMG SAGASGDPGY
PGQIGEKGVP GIPGKRGKKG ASGLPGPRGD AGYPGMKGQG GRPGAEGRPG DFGVQGQPGP
AGDAGFPGRK GESGVPGLVG VPGFPGAKGD RGDPGSFGLP GFPGLKGDTG DPGPVGPMHP
YSEQRHSPPG MRGRPGEPGI QGPAGLPGSR GMPGPPGVPG ARGDKGLAGT PGRPGVPGPK
GSSGNFGKAG FPGNVGPPGN PGFPGVPGIR GGIAPSRGFY FARHSQTTAV PNCPAGTTPM
WTGYSLLYIQ GDGKSSGQDL GLPGSCLRKF STMPFMPCNL NNECHIASRS DYSYWLATEE
PMTASMAPVS GFGIRPYISR CVVCELPTQV VALHSQTNDI PRCPRGWTGL WTGYSFIMHT
AAGAEGTGQN LQSPGSCLES FRTLPFIECH GRGTCNHYAT NHAFWLAVID RDMMFKKPYS
ETLKAGGLKQ RVSRCQVCMK NPPVY
//