ID A0A158QM15_HAEPC Unreviewed; 1781 AA.
AC A0A158QM15;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=Collagen IV NC1 domain-containing protein {ECO:0000313|WBParaSite:HPLM_0000744301-mRNA-1};
GN ORFNames=HPLM_LOCUS7435 {ECO:0000313|EMBL:VDO31877.1};
OS Haemonchus placei (Barber's pole worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Trichostrongylidae; Haemonchus.
OX NCBI_TaxID=6290 {ECO:0000313|Proteomes:UP000038042, ECO:0000313|WBParaSite:HPLM_0000744301-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:HPLM_0000744301-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (APR-2016) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDO31877.1, ECO:0000313|Proteomes:UP000268014}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MHpl1 {ECO:0000313|EMBL:VDO31877.1,
RC ECO:0000313|Proteomes:UP000268014};
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UZAF01016659; VDO31877.1; -; Genomic_DNA.
DR STRING; 6290.A0A158QM15; -.
DR WBParaSite; HPLM_0000744301-mRNA-1; HPLM_0000744301-mRNA-1; HPLM_0000744301.
DR OMA; SNNESCG; -.
DR Proteomes; UP000038042; Unplaced.
DR Proteomes; UP000268014; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1096; COLLAGEN ALPHA-2(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 19.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000268014};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1554..1777
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..501
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 514..863
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 914..1548
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 87..101
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 321..375
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 544..565
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1781 AA; 170300 MW; 0B7115DF9A4A05C4 CRC64;
MGSPGPQGPP GLQGIRGFPG PEGLAGPKGQ KGAQGPPGAG GPKGDRGPIG VPGFPGNDGA
NGRPGEPGPP GAPGWDGCNG TDGAPGIPGR PGPPGMPGFP GPPGIAGAKG EPAIGYDGAP
GEKGDGGIPG MPGLPGPPGR DGYPGEKGDR GDIGPVGPRG PPGEAGIPGN PGIGSIGPKG
DPGDIGHQGP PGPPGPREFT GSGSIGDVGA QGPRGPPGPI ASTMAKGTIV GPKGDGEPGE
AGPRGYPGIA GIPGQPGLPG MKGEKGLSGP AGPRGKEGRP GNPGPPGFKG DRGLDGVPGF
PGMPGQKGEA GYSGRDGPKG NTGPPGPPGG GSFTDGPPGP PGLPGRPGNP GPPGTDGFPG
QPGPAGPPGQ PGGPGAPGLP GLEGLPGPKG DKGDSGIPGA PGVTGSPGQF GPPGPKGEPG
ARGIPGQSIP GLPGKDGRPG FDGASGRKGE QGLPGVRGPP GDSLHGLPGP PGARGPVGPK
GYDGRDGVPG LPGVTGPKGD RGGACSICAP GMKGEKGNSG YPGQPGPQGD RGLPGMPGPN
GDPGDDGIPG PPGRPGAPGP PGLDGVPGVP GQKGEPTQLT LRPGPPGYPG MKGETGYPGQ
PGQDGLPGSP GLVGAPGQNG IPGEKGEPGL PGIPGKPGKD GLPGLPGLKG EPGYGFPGQP
GIPGQKGEPG PIGPAGLPGI QGPPGLAAPK SMIKDGTPGQ PGMPGLPGLK GDAGSPGRPG
DVGSPGLPGV NGRKGESGLP GPPGQPGVPG VPGEKGFPGL NGLPGVPGPK GEPGSSGLPG
MPGQKGEHGA SVSGPPGLPG FPGLKGDAGL PGTPGFAGLE GQRGLPGVPG LKGDTGSTGQ
PGQPGYPGAK GEPGLPGSAD AGLQSNCFSS GDVSELTYTV AASEKKIILL FSCAQAQMNK
QNFHGFQKKY SGKEGLPGMS GMDGLPGLPG QKGEEGLPGI AGAEGQKGDT GLSGAPGQPG
LAGPPGYPGQ KGMNGIPGVP GSKGDSGLPG LPGPHGAKGN AGLPGVPGLP GMKGSAGSPG
APGKDGYPGS PGIKGDRGFN GIPGEKGEPG PAARDGPKGD QGLPGQPGLR GPQGSPGLPG
LPGIKGESGL PGYGQPGSIG EKGLAGVSGK MGRPGAQGPP GQDGLPGFPG LKGEPGYPGH
HGASGKDGMP GLPGIKGDIG VPGEPGKAGS PGQPGATGAP GIRGDKGQAG LPGLPGDRGL
DGVPGSKGNN GYPGQPGLQG VVGMKGNTGA PGFPGLKGNS GNPGQDGLPG LPGMKGETGF
PGQPGRDGVD GVPGEKGLSG LPGLPGPPGQ SLGGSQGPPG KPGLPGKDGL PGLPGPKGDS
GQPGYPGAPG LKGESGLAGF PGQKGEPGKS GFPGKRGSDG YPGAPGKDGL PGLPGSKGEM
GLLGPPGPPG TPGLPGVKGD AGLPGFPGQK GENGLPGLPG QPGSPGTKGD TGFPGQPGRE
GQPGMDGPQG PPGVPGPSSV TIPGQKGEPG LPGVPGMRGE KGLPGLDGPP GVDGPPGTVG
SRGSDGFPGQ PGLPGEKGVA GLPGIPGLDG APGGPGAPGY PGAPGPAGPA YKDGFLLVKH
SQTTDIPRCP EGQTKLWDGY SLLYIEGNEK SHNQDLGHAG SCLQRFSTMP FLFCDFNNVC
NYASRNDKSY WLSTTAPIPM MPVSEGEIER YISRCSVCEA PANVIAVHSQ TIQIPNCPSG
WNSLWIGYSF AMHTGAGAEG GGQSLSSPGS CLEDFRATPF IECNGARGTC HYFANKFSFW
LATIDNDQEF KIPESQTLKS GSLRTRVSRC QVCLKSTEDR P
//