ID A0A0L7R9S1_9HYME Unreviewed; 1978 AA.
AC A0A0L7R9S1;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE SubName: Full=Collagen alpha-1(IV) chain {ECO:0000313|EMBL:KOC67617.1};
GN ORFNames=WH47_10392 {ECO:0000313|EMBL:KOC67617.1};
OS Habropoda laboriosa.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea;
OC Anthophila; Apidae; Habropoda.
OX NCBI_TaxID=597456 {ECO:0000313|EMBL:KOC67617.1, ECO:0000313|Proteomes:UP000053825};
RN [1] {ECO:0000313|EMBL:KOC67617.1, ECO:0000313|Proteomes:UP000053825}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=0110345459 {ECO:0000313|EMBL:KOC67617.1};
RA Pan H., Kapheim K.;
RT "The genome of Habropoda laboriosa.";
RL Submitted (JUL-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ414621; KOC67617.1; -; Genomic_DNA.
DR STRING; 597456.A0A0L7R9S1; -.
DR Proteomes; UP000053825; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 24.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KOC67617.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000053825};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1741..1978
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 25..218
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 256..1263
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1314..1339
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1390..1576
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1642..1731
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 64..79
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 113..131
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 558..587
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 933..947
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1418..1432
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1978 AA; 195976 MW; 10D5DF3342463B18 CRC64;
MKHFYIKSKY LNNFLSFQQY NHTSWSSNPS PVVPLGRGDI GKPNSTDDDY NISPRWRGIF
PSSEGSEEAD RRNVDPYDVG RNYDDGYNRY PDGGEYGHRT DGTGGYQSGY DGGDRGTYSQ
RQGQGSLDPN NQGAVDPYGR SPGYEDTYGR TGGGASRGEG AGVVGRGGQD SRDPYGDGYG
ENRGSGDLYG RVDPYSRGST GQTGRSQGQG GYGADGGSEN YPSSYSVVPV PGLIPPAKCN
GSACCVPKCF AEKGSRGPPG TIGPQGPKGQ RGFPGTEGLL GPKGEKGDPG LQGPHGPKGD
RGKMGMPGYH GVNGVPGVQG PPGPSGFPGR DGCNGTDGAP GLPGYPGEIG PRGFRGVPGS
KGDKGQSAFV GPFSVGQKGE PGIDGVRGPP GPPGPQGDRG LPGVKGEPGP YGVHGTPGPK
GEKGNMGLGF EGLKGDKGKK GDPGPPGTNG PLVPFVGVPK TVTGEPGETG EQGPMGPEGE
KGAAGPMGDH GTPGNPGPKG EKGLIGPPGI RGRDGFSGPP GPPGRKGDRG YDGLDGLPGR
PGLKGDAGRD GSMGAPGLRG PPGPPGGDKG TPGPPGPKGP PGYPGPPGNR GSDGFPGNPG
PRGPIGPSGG PGSQGIPGPE GLPGEKGGKG EPGITGFPGP TGPRGFDGPP GPTGPRGFAG
ETGLSIMGAK GMDGSPGIDG EKGQKGERGY GGPRGFPGDS LDGIPGLPGE SGLPGEPGTS
GKDGTPGYPG APGEKGEVGG RCQDCVPGSQ GEKGDRGYDG TPGHPGARGP QGERGFPGES
GSDGLPGPIG PPGLPGKDGI PGAAGPQGEP GTPATITRSM IKAQKGDKGA RGEVGRPGPL
GPQGPPGEKG DTGFGGFPGP KGIAGDRGFP GNDGIPGRPG IPGTKGESGL SVKGEPGEPG
NRGQDGQKGE PGLYGEKGDD GVCPSPLELC PGRKGDRGPR GDKGEPGPPG REGLPGDRGL
QGFEGPPGEP GISGAPGPVG PRGLPGPRGD KGNMGPLGFP GEPGHQGPRG FPGVPAPKGD
RGEPGISVKG SPGLPGLPGE KGERGLPGPP GKEGLPGMSG LPGYTGDKGD EGIPGISGLP
GAPGEKGDTG IKGPPGPPGV AGVPGIDGIK GEPGLPGADG RPGLPGFPGT KGDTGEPGID
GPKGHPGRRG PPGLKGRAGA EGVPGLKGER GEKGSDGFPG MPGMEGKPGR AGLPGPKGDE
GPPGPEGSPG LIGYDGPKGD RGIPGMPGLK GDAGQVSEKG QKGEPGMPGI RGPTGPPGRD
GIDGAKGEVG LPCVGVDGLP GPKGEPGIPG RDGLPGLQGQ KGDAGVRGFL GHKGDLGPPG
APGEPGTPGI DGLPGERGDT GPLGPRGFPG LVGEMGAPGH PGFDGAKGDR GLAGAVGYPG
NPGLMGEKGD RGPPGPAIVT KGDKGEPGIP GLPGIDGEKG DRGLEGLVGY DGEKGDRGSP
GPVGNPGPAG YPGIKGDMGP NGPPGIPGAT IKGEKGLQCL PGKHGRQGTP GRPGEKGEQG
FPGLPGHKGD QGPHGPIGPH GEKGDMGLMG PPGLPGNDGI PGPKGSPGFL GERGDKGDSG
PEGLPGIPGQ KGEPGFPEYK SLFHELIGQK GLAGLTGEKG DRGWDGAPGL VGAVGEKGDR
GYPGQVGLIG PVGSFGQKGD KGDDCIDSPL GPKGDRGFPG IEGPRGVPGE KGPPGPPGFP
GLNGLKGAQG PMGPPGPAGT PGLSGLTGPP GLPGLQGPIG VPGPAGEPGK PCDAPSDYLT
GILLVKHSQS QSVPRCDAGH IKLWEGYSLL HTDGEERAHS QDLGNINSAD LINNHGTGYA
GSCVRKFSTM PFLFCDVNNV CQYASRNDRS YWLSTNSPIP MSPVQETAIE EYISRCVVCE
VPANVLAVHS QSLNIPECPN GWTGLWIGYS FIMHTAAGSQ GGGQSLSSTG SCLEDFRATP
FIECNGAKGH CHYYANEFSF WMATIEDRQQ FQRPEKQTLK PGNLRSRISR CQVCIKNT
//