ID B0WDA5_CULQU Unreviewed; 1785 AA.
AC B0WDA5;
DT 08-APR-2008, integrated into UniProtKB/TrEMBL.
DT 08-APR-2008, sequence version 1.
DT 27-MAR-2024, entry version 96.
DE SubName: Full=Collagen alpha-2(IV) chain {ECO:0000313|EMBL:EDS44340.1};
GN Name=6036647 {ECO:0000313|EnsemblMetazoa:CPIJ005295-PA};
GN ORFNames=CpipJ_CPIJ005295 {ECO:0000313|EMBL:EDS44340.1};
OS Culex quinquefasciatus (Southern house mosquito) (Culex pungens).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Culicinae; Culicini; Culex; Culex.
OX NCBI_TaxID=7176;
RN [1] {ECO:0000313|EMBL:EDS44340.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JHB {ECO:0000313|EMBL:EDS44340.1};
RG The Broad Institute Genome Sequencing Platform;
RA Atkinson P.W., Hemingway J., Christensen B.M., Higgs S., Kodira C.,
RA Hannick L., Megy K., O'Leary S., Pearson M., Haas B.J., Mauceli E.,
RA Wortman J.R., Lee N.H., Guigo R., Stanke M., Alvarado L., Amedeo P.,
RA Antoine C.H., Arensburger P., Bidwell S.L., Crawford M., Camaro F.,
RA Devon K., Engels R., Hammond M., Howarth C., Koehrsen M., Lawson D.,
RA Montgomery P., Nene V., Nusbaum C., Puiu D., Romero-Severson J.,
RA Severson D.W., Shumway M., Sisk P., Stolte C., Zeng Q., Eisenstadt E.,
RA Fraser-Liggett C., Strausberg R., Galagan J., Birren B., Collins F.H.;
RT "Annotation of Culex pipiens quinquefasciatus.";
RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:CPIJ005295-PA}
RP IDENTIFICATION.
RC STRAIN=JHB {ECO:0000313|EnsemblMetazoa:CPIJ005295-PA};
RG EnsemblMetazoa;
RL Submitted (FEB-2021) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS231895; EDS44340.1; -; Genomic_DNA.
DR RefSeq; XP_001846673.1; XM_001846621.1.
DR STRING; 7176.B0WDA5; -.
DR EnsemblMetazoa; CPIJ005295-RA; CPIJ005295-PA; CPIJ005295.
DR KEGG; cqu:CpipJ_CPIJ005295; -.
DR VEuPathDB; VectorBase:CPIJ005295; -.
DR VEuPathDB; VectorBase:CQUJHB016258; -.
DR eggNOG; KOG3544; Eukaryota.
DR HOGENOM; CLU_002023_1_0_1; -.
DR InParanoid; B0WDA5; -.
DR OMA; QSRDLMK; -.
DR OrthoDB; 2882192at2759; -.
DR Proteomes; UP000002320; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 16.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EDS44340.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000002320};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1509..1732
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 79..118
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 136..184
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 241..469
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 549..590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 630..705
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 873..958
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 990..1028
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1057..1115
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1178..1287
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1408..1439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1753..1785
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 273..287
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1076..1092
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1757..1772
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1785 AA; 180938 MW; 24688EF55FD56C5A CRC64;
MSFAGRGKVD FTRLTDARML LDFVILAVCN RTECDCKGLK GAPGQIGPHG VPGAAGLPGD
IGFDGPPGYS GERGVRGEFG STGDKGYRGD TGPRGPVGYP GIPGLPGDDG PRGPRGIDGC
NGTDGAMGIP GMPGWPGDRG QPGLQGPIGF RGDSGEGGIN SKGTKGSIGE SGRPGITGAD
GFKGNRGLPG IDGINGIDGA VGPKGELGDE GEPSIPFCPG EPGEPGEPLY YFSSKNSTVN
KGISGPKGDN GADGLPGIPG RKGDTGGYGE RGLKGFKGEE GNVGDRGKQG KDGPPGPSGE
KGEKGAPGFA GRDGELGDKG EPGDDGRSGL PGTQGVRGPP GLYDPNLAPI VIGPVGPQGD
VGAAGNQGLG GIPGNPGRRG LMGLQGPPGD PGIDGPRGKK GTTIAGQLGD DGESGPRGQV
GPRGVSGPAG TKGPKGYPGR SILGEKGESG LPGLNGEYGD KGDQGEIGIT GDKGLPGIGY
NITGPPGPDG IPGMQGPPGE AGWNGYDGSK GDRGFRGEDC GICPPGPRGI KGEGGDIGRP
GWEGFDGNRG LPGPRGLKGG PGKPGPQGIP GLQGNPGENG EPGRPGMKGA KGEVIRVGNM
VHFPGDKGEM GMFGLQGQQG DVGLAGEPGE RGFPGEQGLF GDDGSPGLRG RDGDPGLPGR
DGVHGRNAVY NAYNMWGLKG SQGPPGDDGT PGERGDQGDR GERGATAEFD YLITGDKGVR
GDFGDEGPFG EPGYRGEIGD EGFRGAPGVQ GSVGDSIQGP IGYKGYFGII GDHGLQGEDG
FPGMPGVDGP PGLPGLKGQR GDPGRAVLFG EKGEEGEAGY FGEFGDKGFK GMEGYRGAPG
ITGPKGEKGD DGVMGRIGLP GNKGMVGDFI YGDRGAPGAD GTPGRTAPYG DKGERGEPGI
EGPSGPKGEI GDTGRDGLPG FAGDEGEPGE PGLRGIHGYM GGEGDQGERG EVGDQGNIGL
TGRRGLVGLR GEKGEIGDLG DIGFPGRFGR NGTKGERGDE GFAGVRGPKG SGSFSGHKGE
PGIFGLRGPP GLDGMPGLFG MKGEEGEAGN VIDGTPGLKG PKGAPGFNGR SGLPGLKGER
GDEGRFGSKG IQGDKGRDGY PGIPGRRGKD GLRGLPGLRG LSGSRGEQGE EGDFGYVGFV
GDKGERGDIG RIGLPGIEGM LGDPGFKGLP GELIFRSPAK GDRGDSGFPG PEGMPGIQGV
KGRIGYPGTK GEPGLRGEPG FAGRNGLDGL KGQQGDRGYK GPRGTLDIRP EQGDQGESGY
DGFPGRLGLR GSKGAPGDYG DDGPPGPRGE VGMVSGAFKG IKGEVGFEGA PGLEGLPGLP
GPQGMSGAPG EKGLSGSIGI SLRGLKGVAG DDGLIGLDGM PGPLGFVGEV GLPGRPGMRG
FPGIKGEFGD VGEDGYYGRL GLKGVKGERG DPIPLDDWRP NQPGERGVPG NKGAPGDEGD
MGLPGYPGLI GLKGERGLQG LDGEEGQMGF KGERGFQGAP GRDGLDGYPG IRGEIGDPAP
PPPPPKSRGY IFTKHSQTVH IPDCPLNTIK LWDGYSLVSV IGSSRSVGQD LGSAGSCMRK
FSTMPYLFCD INNVCNYATN NDDSIWLASP EPMPMSMAPM KSREVAQYIS RCSVCETTTR
VIAVHSQTMA IPDCPGGWEE LWIGYSYVMH TTDNSGGFGM DLTSPGSCLE EFRAQPVIEC
HGHGTCNFYD GITSFWLTII EDGEEFNQPK QQTLKADQTS KISRCIVCRR KAGFLRVLSD
GGISASALRR PDIATVQKPY YPPPPPQPPR RRIRPGSRQR NRNSG
//