ID A0A182K3K3_9DIPT Unreviewed; 2038 AA.
AC A0A182K3K3;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS Anopheles christyi.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43041 {ECO:0000313|EnsemblMetazoa:ACHR005338-PA, ECO:0000313|Proteomes:UP000075881};
RN [1] {ECO:0000313|Proteomes:UP000075881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ACHKN1017 {ECO:0000313|Proteomes:UP000075881};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles christyi ACHKN1017.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ACHR005338-PA}
RP IDENTIFICATION.
RC STRAIN=ACHKN1017 {ECO:0000313|EnsemblMetazoa:ACHR005338-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004141}; Multi-
CC pass membrane protein {ECO:0000256|ARBA:ARBA00004141}. Secreted,
CC extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 43041.A0A182K3K3; -.
DR EnsemblMetazoa; ACHR005338-RA; ACHR005338-PA; ACHR005338.
DR VEuPathDB; VectorBase:ACHR005338; -.
DR OrthoDB; 2882192at2759; -.
DR Proteomes; UP000075881; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0016409; F:palmitoyltransferase activity; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001594; Palmitoyltrfase_DHHC.
DR PANTHER; PTHR37456:SF5; -; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 15.
DR Pfam; PF01529; DHHC; 1.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS50216; DHHC; 1.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1766..1788
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1800..1823
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1895..1920
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1951..1972
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1533..1756
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 20..51
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 87..620
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 666..910
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 958..1094
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1149..1179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1274..1478
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1496..1524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..51
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 228..252
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1151..1172
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2038 AA; 205818 MW; 37A779D0067CA7FE CRC64;
MQQFWADNGN IGGIGLFKRQ EQEHPRQLQP NIDPSYSIID TASGPQGPPS KNCTSGGCCL
PKCFAEKGNR GLPGPMGLKG GKGVRGFPGA EGLPGEKGTK GEPGPVGLQG PKGDRGRDGL
PGYPGIPGTN GVPGVPGAPG LSGRDGCNGT DGLPGLSGLP GNPGPRGYAG IPGSKGEKGE
PARHPENYNK GQKGEPGNDG IEGPPGPQGE VGPRGFSGRP GEKGDPGTPG ARGERGDKGV
CIKGDKGQKG AKGEEVYGAT GTTTTTGPKG EKGDRGEAGE PGRSGDKGQA GDRGQVGERG
HKGEKGLPGQ PGPRGRDGNF GPVGLPGQKG DRGSEGLHGL KGQSGPKGEP GRDGIPGQPG
IAGPAGAPGG GEGRPGAPGP KGPRGYEGPQ GPKGMDGFDG EKGERGQMGP KGGQGVPGRP
GPEGMPGDKG DKGESGSVGM PGPQGPRGYP GQPGPEGLRG EPGQPGYGIP GQKGNAGMAG
FPGLKGQKGE RGFKGVMGTP GDAKEGRPGA TGLPGRDGEK GEPGRPGLPG GKGERGLKGE
LGGRCTDCRP GMKGDKGERG YAGEPGRPGA SGVPGERGYP GMPGEDGTPG LRGEPGPKGE
PGMLGPPGPS GEPGRDAEIP MDQLKPIKGD KGEAGEKGLM GIKGEKGFPG PVGPEGKMGL
RGMKGDKGRP GESGMDGTPG LPGKDGQPGR HGQTIKGEPG LKGNVGYSGD KGDKGYSGLK
GEPGKCASIP PNLEEAIRGP PGTQGEKGAP GVQGIRGDKG EMGEQGRTGM QGNAGPPGAP
GPVGPRGLTG LRGEKGNSGP LGPPGAPGRD GMAGVPGLPG SKGVKGDPGL SMVGPPGPKG
NPGLRGPKGD RGGMGDRGDP GLPGSVGYPG EKGDMGIAGQ PGFPGEVGPK GEPGPKGPAG
HPGAPGRPGM DGVKGLPGLK GDIGAPGVIG LPGQKGDIGQ AGNDGLKGFQ GRKGMMGAPG
IQGVRGPQGA KGEPGEKGDR GDIGMKGLMG QTGQPGMLGP KGDKGLGGLP GPACLPGLSG
EKGDKGYTGP EGPPGEAGAA SEKGQKGEPG VPGLRGNDGL PGLEGPSGPK GDAGVPGYGR
PGPQGEKGDV GLTGINGLPG LNGVKGDMGV PGFPGVKGDK GTTGLPGVPG APCMDGLPGA
EGPVGPRGYD GEKGFKGEPG RVGERGEQGE KGDHGLTGPV GLMGRKGDRG VPGSPGLPAT
VAAIKGDKGE SGFPGAIGRP GKVGAPGLPG DMGAKGEMGI QGLPGLPGPA GLNGLQGMKG
DMGLMGEKGD SCPVVKGEKG LPGRPGKMGR DGPSGLIGEK GDKGLSGLPG PMGPPGPPGP
LGRQGEKGDR GDSGLMGRPG NDGLPGPQGQ RGLPGPQGEK GDQGPPGFIG PKGERGERGR
DGLNGLNGVQ GLKGDRGMPG LEGVAGLPGM VGEKGDRGLP GMAGLNGVSG EKGQKGETPQ
LPPQRKGPPG PPGFNGPKGD KGMPGLAGPA GIPGAPGAPG EMGLRGFEGA RGLQGLRGDV
GPEGRVGRDG SPGLPGPKGE PGRDCESAPY YTGILLVRHS QSDEVPICEP GHLKLWDGYS
LLYVDGNDYP HNQDLGSAGS CVRKFSTLPV LACGQNNVCN YASRNDRTFW LSTSAPIPMM
PVTENEMRPY ISRCTVCEAP TNVIAVHSQT LHIPECPNGW DGLWIGYSFL MHTAVGHGGG
GQSLSAPGSC LEDFRATPFI ECNGGKGHCH YYETQTSFWL VSLEDHQQFQ RPEQQTLKAG
NLLSRLSLVS TMRIRKHVLP RTLQDAVATA FMAGIIPITF WFEVYVVIPG IHGPDSVYNW
VHFVPAVLLL FNVTAHMLAT MLCDTSCSTE LIQLPANVSS AASGLGSKSW HLCATCEFIA
PPRSWHCTSC RTCILKRDHH CVFTGCCIGH KNHRYFILFV VYLFISTLYA SVLNNYFIWF
VRGEEFRNWT SLVKIVFPLA MLLIDTSTKQ YYLVIYLINM VGVMFTGVLI IYHGRLILSG
AVVHERKAPE YDMGRAENVR MVLGNRWYVA WLSPFVSSEL PHNGINWETL QKQTVKSK
//