ID A0A0N4XV54_NIPBR Unreviewed; 1486 AA.
AC A0A0N4XV54;
DT 09-DEC-2015, integrated into UniProtKB/TrEMBL.
DT 09-DEC-2015, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=Collagen alpha-2(IV) chain (inferred by orthology to a C. elegans protein) {ECO:0000313|WBParaSite:NBR_0000665101-mRNA-1};
GN ORFNames=NBR_LOCUS6652 {ECO:0000313|EMBL:VDL70241.1};
OS Nippostrongylus brasiliensis (Rat hookworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Heligmosomidae; Nippostrongylus.
OX NCBI_TaxID=27835 {ECO:0000313|Proteomes:UP000038043, ECO:0000313|WBParaSite:NBR_0000665101-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:NBR_0000665101-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (FEB-2017) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDL70241.1, ECO:0000313|Proteomes:UP000271162}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYSL01019819; VDL70241.1; -; Genomic_DNA.
DR STRING; 27835.A0A0N4XV54; -.
DR WBParaSite; NBR_0000665101-mRNA-1; NBR_0000665101-mRNA-1; NBR_0000665101.
DR OMA; SNNESCG; -.
DR Proteomes; UP000038043; Unplaced.
DR Proteomes; UP000271162; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 14.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000271162};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1259..1482
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 60..1257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 145..159
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 371..415
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 585..606
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1486 AA; 144172 MW; B7A2E23A054986D2 CRC64;
MRQRVPLCKR RVEIARTEDV SAWERRDPWL PIQENIILKE EQCKFILSTL YINTFSSFQG
APGPQGPPGS QGIRGFPGPE GLAGPKGQKG AQGPPGPQGP KGDRGPIGVP GFPGNDGANG
RPGEPGPPGA PGWDGCNGTD GAPGIPGRPG PPGMPGFPGP PGMDGAKGEP AIGYDGAPGE
KGDGGMPGMP GLPGPPGRDG YPGEKGDRGD IGPVGPRGPP GEAGIPGNPG IGSIGPKGDP
GDIGQQGPPG PPGPREFTGS GSIVGPRGNV GEKGFKGEPG EAGPRGYTGN AGLPGQPGLP
GMKGEKGLSG PAGPRGKEGR PGNPGPPGFK GDRGLDGVPG FPGMPGQKGE AGYSGREGAK
GNTGPPGPPG GGSFTDGPPG PPGLPGRPGN PGPPGTDGFP GQPGPPGPPG QAGGPGAPGL
PGLEGLPGPK GDKGDSGIPG APGVPGPPGQ FGAPGPKGEP GARGIPGQSI PGLPGKDGRP
GLDGAPGRKG EQGLPGVRGP PGDSLNGLPG PPGARGPVGP KGYDGRDGIP GLPGVPGTKG
DRGGTCSICS PGMKGEKGNS GYPGQPGPQG DRGLPGMPGP NGDPGDDGIP GPPGRPGAPG
PPGLDGLPGL PGQKGEPTQL VLRPGPPGYP GMKGESGFPG QPGQDGLPGP PGIVGAAGQP
GLPGEKGEPG MPGMPGKPGK DGLPGLPGLK GEAGYGQPGQ PGFPGQKGEQ GPAGAAGLPG
IQGPPGLPAP KSLIKDGLPG QPGIPGLPGL KGEAGFPGRP GDVGSPGLPG VNGRKGESGL
PGPPGQPGVP GVPGEKGFPG LNGLPGMPGP KGEAGHPGLP GIPGQKGEMG ASVTGPAGPP
GFPGLKGDAG LSGAPGMPGQ DGQRGLPGVP GLKGDAGLPG QPGQPGYPGA KGEPGLPGIP
GKEGLPGVPG VDGLPGLPGQ KGDDGFPGLP GVEGQKGDAG LPGAPGQPGL PGAPGYPGQK
GMNGIPGVPG MKGDAGLPGL PGQHGAKGNA GLPGMPGLPG MKGNAGEPGA PGQDGYPGSP
GMKGDRGFNG MPGEKGEPGP AARDGPKGDQ GLPGQPGLRG PQGPPGLPGL PGMKGDSGLP
GYGQPGLNGE KGLPGVPGKM GRAGAPGPPG QDGLPGFPGL KGEPGYPGQP GAQGKDGLPG
QPGKEGQPGM DGPQGPPGLP GPSSITIPGQ KGEPGLPGVP GIRGEKGLPG LDGPPGLDGP
PGAPGSRGSD GFPGQPGLPG EKGMAGLPGL PGLDGAPGGP GQPGYPGAPG PAGPAYRDGF
LLVKHSQTTE IPRCPEGQTK LWDGYSLLYI EGNEKSHNQD LGHAGSCLQR FSTMPFLFCD
FNNVCNYASR NDKSYWLSTT APIPMMPVSE GEIEGYISRC AVCEAPANVI AVHSQTIQIP
NCPAGWNSLW IGYSFAMHTG AGAEGGGQSL SSPGSCLEDF RATPFIECNG ARGTCHYFAN
KFSFWLTTID NDQEFKIPES QTLKSGSLRT RVSRCQVCIK STEGRN
//