GenomeNet

Database: UniProt/TrEMBL
Entry: B0WDA5_CULQU
LinkDB: B0WDA5_CULQU
Original site: B0WDA5_CULQU 
ID   B0WDA5_CULQU            Unreviewed;      1785 AA.
AC   B0WDA5;
DT   08-APR-2008, integrated into UniProtKB/TrEMBL.
DT   08-APR-2008, sequence version 1.
DT   27-MAR-2024, entry version 96.
DE   SubName: Full=Collagen alpha-2(IV) chain {ECO:0000313|EMBL:EDS44340.1};
GN   Name=6036647 {ECO:0000313|EnsemblMetazoa:CPIJ005295-PA};
GN   ORFNames=CpipJ_CPIJ005295 {ECO:0000313|EMBL:EDS44340.1};
OS   Culex quinquefasciatus (Southern house mosquito) (Culex pungens).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Culicinae; Culicini; Culex; Culex.
OX   NCBI_TaxID=7176;
RN   [1] {ECO:0000313|EMBL:EDS44340.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JHB {ECO:0000313|EMBL:EDS44340.1};
RG   The Broad Institute Genome Sequencing Platform;
RA   Atkinson P.W., Hemingway J., Christensen B.M., Higgs S., Kodira C.,
RA   Hannick L., Megy K., O'Leary S., Pearson M., Haas B.J., Mauceli E.,
RA   Wortman J.R., Lee N.H., Guigo R., Stanke M., Alvarado L., Amedeo P.,
RA   Antoine C.H., Arensburger P., Bidwell S.L., Crawford M., Camaro F.,
RA   Devon K., Engels R., Hammond M., Howarth C., Koehrsen M., Lawson D.,
RA   Montgomery P., Nene V., Nusbaum C., Puiu D., Romero-Severson J.,
RA   Severson D.W., Shumway M., Sisk P., Stolte C., Zeng Q., Eisenstadt E.,
RA   Fraser-Liggett C., Strausberg R., Galagan J., Birren B., Collins F.H.;
RT   "Annotation of Culex pipiens quinquefasciatus.";
RL   Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:CPIJ005295-PA}
RP   IDENTIFICATION.
RC   STRAIN=JHB {ECO:0000313|EnsemblMetazoa:CPIJ005295-PA};
RG   EnsemblMetazoa;
RL   Submitted (FEB-2021) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DS231895; EDS44340.1; -; Genomic_DNA.
DR   RefSeq; XP_001846673.1; XM_001846621.1.
DR   STRING; 7176.B0WDA5; -.
DR   EnsemblMetazoa; CPIJ005295-RA; CPIJ005295-PA; CPIJ005295.
DR   KEGG; cqu:CpipJ_CPIJ005295; -.
DR   VEuPathDB; VectorBase:CPIJ005295; -.
DR   VEuPathDB; VectorBase:CQUJHB016258; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   HOGENOM; CLU_002023_1_0_1; -.
DR   InParanoid; B0WDA5; -.
DR   OMA; QSRDLMK; -.
DR   OrthoDB; 2882192at2759; -.
DR   Proteomes; UP000002320; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 16.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EDS44340.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002320};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          1509..1732
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          79..118
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          136..184
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          241..469
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          549..590
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          630..705
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          873..958
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          990..1028
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1057..1115
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1178..1287
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1408..1439
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1753..1785
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        273..287
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1076..1092
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1757..1772
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1785 AA;  180938 MW;  24688EF55FD56C5A CRC64;
     MSFAGRGKVD FTRLTDARML LDFVILAVCN RTECDCKGLK GAPGQIGPHG VPGAAGLPGD
     IGFDGPPGYS GERGVRGEFG STGDKGYRGD TGPRGPVGYP GIPGLPGDDG PRGPRGIDGC
     NGTDGAMGIP GMPGWPGDRG QPGLQGPIGF RGDSGEGGIN SKGTKGSIGE SGRPGITGAD
     GFKGNRGLPG IDGINGIDGA VGPKGELGDE GEPSIPFCPG EPGEPGEPLY YFSSKNSTVN
     KGISGPKGDN GADGLPGIPG RKGDTGGYGE RGLKGFKGEE GNVGDRGKQG KDGPPGPSGE
     KGEKGAPGFA GRDGELGDKG EPGDDGRSGL PGTQGVRGPP GLYDPNLAPI VIGPVGPQGD
     VGAAGNQGLG GIPGNPGRRG LMGLQGPPGD PGIDGPRGKK GTTIAGQLGD DGESGPRGQV
     GPRGVSGPAG TKGPKGYPGR SILGEKGESG LPGLNGEYGD KGDQGEIGIT GDKGLPGIGY
     NITGPPGPDG IPGMQGPPGE AGWNGYDGSK GDRGFRGEDC GICPPGPRGI KGEGGDIGRP
     GWEGFDGNRG LPGPRGLKGG PGKPGPQGIP GLQGNPGENG EPGRPGMKGA KGEVIRVGNM
     VHFPGDKGEM GMFGLQGQQG DVGLAGEPGE RGFPGEQGLF GDDGSPGLRG RDGDPGLPGR
     DGVHGRNAVY NAYNMWGLKG SQGPPGDDGT PGERGDQGDR GERGATAEFD YLITGDKGVR
     GDFGDEGPFG EPGYRGEIGD EGFRGAPGVQ GSVGDSIQGP IGYKGYFGII GDHGLQGEDG
     FPGMPGVDGP PGLPGLKGQR GDPGRAVLFG EKGEEGEAGY FGEFGDKGFK GMEGYRGAPG
     ITGPKGEKGD DGVMGRIGLP GNKGMVGDFI YGDRGAPGAD GTPGRTAPYG DKGERGEPGI
     EGPSGPKGEI GDTGRDGLPG FAGDEGEPGE PGLRGIHGYM GGEGDQGERG EVGDQGNIGL
     TGRRGLVGLR GEKGEIGDLG DIGFPGRFGR NGTKGERGDE GFAGVRGPKG SGSFSGHKGE
     PGIFGLRGPP GLDGMPGLFG MKGEEGEAGN VIDGTPGLKG PKGAPGFNGR SGLPGLKGER
     GDEGRFGSKG IQGDKGRDGY PGIPGRRGKD GLRGLPGLRG LSGSRGEQGE EGDFGYVGFV
     GDKGERGDIG RIGLPGIEGM LGDPGFKGLP GELIFRSPAK GDRGDSGFPG PEGMPGIQGV
     KGRIGYPGTK GEPGLRGEPG FAGRNGLDGL KGQQGDRGYK GPRGTLDIRP EQGDQGESGY
     DGFPGRLGLR GSKGAPGDYG DDGPPGPRGE VGMVSGAFKG IKGEVGFEGA PGLEGLPGLP
     GPQGMSGAPG EKGLSGSIGI SLRGLKGVAG DDGLIGLDGM PGPLGFVGEV GLPGRPGMRG
     FPGIKGEFGD VGEDGYYGRL GLKGVKGERG DPIPLDDWRP NQPGERGVPG NKGAPGDEGD
     MGLPGYPGLI GLKGERGLQG LDGEEGQMGF KGERGFQGAP GRDGLDGYPG IRGEIGDPAP
     PPPPPKSRGY IFTKHSQTVH IPDCPLNTIK LWDGYSLVSV IGSSRSVGQD LGSAGSCMRK
     FSTMPYLFCD INNVCNYATN NDDSIWLASP EPMPMSMAPM KSREVAQYIS RCSVCETTTR
     VIAVHSQTMA IPDCPGGWEE LWIGYSYVMH TTDNSGGFGM DLTSPGSCLE EFRAQPVIEC
     HGHGTCNFYD GITSFWLTII EDGEEFNQPK QQTLKADQTS KISRCIVCRR KAGFLRVLSD
     GGISASALRR PDIATVQKPY YPPPPPQPPR RRIRPGSRQR NRNSG
//
DBGET integrated database retrieval system