ID A0A182Y9U2_ANOST Unreviewed; 1994 AA.
AC A0A182Y9U2;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:ASTEI05228-PA};
OS Anopheles stephensi (Indo-Pakistan malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=30069 {ECO:0000313|EnsemblMetazoa:ASTEI05228-PA, ECO:0000313|Proteomes:UP000076408};
RN [1] {ECO:0000313|Proteomes:UP000076408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Indian {ECO:0000313|Proteomes:UP000076408};
RX PubMed=25244985; DOI=10.1186/preaccept-1262842421127991;
RA Jiang X., Peery A., Hall A.B., Sharma A., Chen X.G., Waterhouse R.M.,
RA Komissarov A., Riehle M.M., Shouche Y., Sharakhova M.V., Lawson D.,
RA Pakpour N., Arensburger P., Davidson V.L., Eiglmeier K., Emrich S.,
RA George P., Kennedy R.C., Mane S.P., Maslen G., Oringanje C., Qi Y.,
RA Settlage R., Tojo M., Tubio J.M., Unger M.F., Wang B., Vernick K.D.,
RA Ribeiro J.M., James A.A., Michel K., Riehle M.A., Luckhart S.,
RA Sharakhov I.V., Tu Z.;
RT "Genome analysis of a major urban malaria vector mosquito, Anopheles
RT stephensi.";
RL Genome Biol. 15:459-459(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASTEI05228-PA}
RP IDENTIFICATION.
RC STRAIN=Indian {ECO:0000313|EnsemblMetazoa:ASTEI05228-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004141}; Multi-
CC pass membrane protein {ECO:0000256|ARBA:ARBA00004141}. Secreted,
CC extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 30069.A0A182Y9U2; -.
DR EnsemblMetazoa; ASTEI05228-RA; ASTEI05228-PA; ASTEI05228.
DR VEuPathDB; VectorBase:ASTE001486; -.
DR VEuPathDB; VectorBase:ASTEI05228; -.
DR VEuPathDB; VectorBase:ASTEI20_038473; -.
DR OMA; SNNESCG; -.
DR Proteomes; UP000076408; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0016409; F:palmitoyltransferase activity; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001594; Palmitoyltrfase_DHHC.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 17.
DR Pfam; PF01529; DHHC; 1.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS50216; DHHC; 1.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000076408};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
SQ SEQUENCE 1994 AA; 200586 MW; CD4FE1B53ED1A1D5 CRC64;
MQPQIDPSYN IIDTASGPQG PPSKNCTSGG CCLPKCFAEK GNRGFPGPQG LKGVKGVRGF
PGTEGLPGDK GTKGEPGPVG MQGPKGDRGR DGLPGYPGIP GTNGVPGLSG APGLPGRDGC
NGTDGLPGLS GLPGNPGPRG YPGIAGSKGE KGEPARHPEN YNKGQKGEPG NDGLEGLPGP
QGVEGPRGYA GRPGEKGLPG MPGARGERGD KGVCIKGLKG QKGAKGEEVY GSTGTTTTMG
PEGAKGDRGE PGEPGRPGEK GQAGDRGQFG ERGHKGEKGL PGQPGPRGRD GNFGPVGLPG
QKGDRGSEGL HGLKGQSGPK GEPGRDGTPG QPGISGPPGA PGGGEGRPGA PGPKGPRGYE
GPQGPKGMDG FDGEKGERGQ MGPKGGQGVP GRPGPEGMPG DKGDKGESGA VGLPGPQGPR
GYPGQPGPEG LRGEPGQPGY GIPGQKGNAG MAGFPGLKGQ KGERGFKGVM GTPGDAKEGR
PGAPGLPGRD GEKGEPGRPG LSGTKGERGM KGEIGGRCTD CRPGMKGDKG ERGYAGEPGR
PGASGVPGER GYPGMPGEDG TPGLRGEPGP KGEPGLLGPP GPSGEPGRDA EIPMDQLKPI
KGDKGEPGEK GLVGIKGEKG FPGLVGPEGK MGVRGMKGDK GRQGEAGLDG APGAPGKDGL
AGRDGVTIKG EPGLKGNVGY TGDKGDKGYS GLKGEPGKCA SIPPNLEEAI RGPQGLQGEK
GAPGIQGIRG DKGEMGEQGR TGAQGNAGPP GAPGPVGPRG LTGHRGEKGN SGPVGPPGAP
GRDGMPGAPG LPGSKGVKGD PGLSMVGPPG PKGNPGLRGP KGERGGMGDR GDPGLPGSLG
YPGEKGDLGT PGPPGYPGDV GPKGEPGPKG PAGHPGAPGR PGVDGVKGLP GLKGDIGAPG
VIGLPGQKGD MGQAGNDGLK GFQGRKGMMG APGIQGVRGP QGVKGEPGEK GDRGEIGVKG
LMGQSGPPGM IGLKGDKGLA GLPGPSCLPG MSGEKGDKGY TGPEGPPGEP GAASEKGQKG
EPGVPGLRGN DGIPGLEGPS GPKGDAGVPG YGRPGPQGEK GDIGLTGVNG LPGLNGVKGD
MGVPGFPGVK GDKGTTGLPG IPGPPCVDGL PGAAGPVGPR GYDGEKGFKG EPGRIGERGL
MGEKGDMGLT GPVGLSGRKG DRGVPGSPGL PATVAAIKGD KGEPGFPGAI GRPGKVGVPG
LPGEAGAKGE MGIQGLPGLP GPAGLNGLPG MKGDMGPLGE KGDACPVVKG EKGLPGRPGK
TGRDGPPGLT GEKGERGLAG LEGPPGPPGP PGPLGRQGEK GDRGDSGLMG RPGNDGLPGP
QGQRGLPGPQ GEKGDQGPPG FIGPKGDKGE RGRDGLNGLN GPQGMKGDRG MPGLEGVAGL
PGMVGEKGDR GLPGMSGLNG APGEKGQKGE TPQLPPQRKG PPGPPGFNGP KGDKGLPGLA
GPAGIPGAPG APGEMGLRGF EGARGLQGLR GDVGPEGRPG RDGAPGLPGP KGEPGRDCEA
APYYTGILLV RHSQSDEVPV CEPGHLKLWD GYSLLYVDGN DYPHNQDLGS AGSCVRKFST
LPILACGQNN VCNYASRNDR TFWLSTSAPI PMMPVTENEM RPYISRCTVC EAPTNVIAVH
SQTLHIPECP NGWDGLWIGY SFLMHTAVGH GGGGQSLSGP GSCLEDFRAT PFIECNGGKG
HCHYYETQTS FWLVSLEDHQ QFQRPEQQTL KAGNLLSRDA VATTFMAGII PITFWFEVYV
AIPGIHGPDS LYNWVHFVPA VLLLFNVTAH MLATILCDTS CSTELIQLPP ANVSNAGTSG
LGSKSWHLCA TCEFIAPPRS WHCTSCRTCI LKRDHHCVFT GCCIGHKNHR FFILFVAYLF
VSTLYASVLN NYFIWFVRGE EFRNWTSLVK IVFPLAMLMI DISTKQYYLV IYLINMVGVM
FTGVLLVYHG RLILSGAVVH ERKAPEYDMG RVENIRMVLG TRWYIAWVSP FVKSELPHNG
VNWETLQKQT IKSK
//