ID Q17A79_AEDAE Unreviewed; 1746 AA.
AC Q17A79;
DT 25-JUL-2006, integrated into UniProtKB/TrEMBL.
DT 25-JUL-2006, sequence version 1.
DT 27-MAR-2024, entry version 120.
DE SubName: Full=AAEL005386-PA {ECO:0000313|EMBL:EAT43147.1};
GN ORFNames=AAEL005386 {ECO:0000313|EMBL:EAT43147.1};
OS Aedes aegypti (Yellowfever mosquito) (Culex aegypti).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Culicinae; Aedini; Aedes; Stegomyia.
OX NCBI_TaxID=7159 {ECO:0000313|EMBL:EAT43147.1, ECO:0000313|Proteomes:UP000682892};
RN [1] {ECO:0000313|EMBL:EAT43147.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Liverpool {ECO:0000313|EMBL:EAT43147.1};
RA Loftus B.J., Nene V.M., Hannick L.I., Bidwell S., Haas B., Amedeo P.,
RA Orvis J., Wortman J.R., White O.R., Salzberg S., Shumway M., Koo H.,
RA Zhao Y., Holmes M., Miller J., Schatz M., Pop M., Pai G., Utterback T.,
RA Rogers Y.-H., Kravitz S., Fraser C.M.;
RL Submitted (OCT-2005) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EAT43147.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Liverpool {ECO:0000313|EMBL:EAT43147.1};
RX PubMed=17510324; DOI=10.1126/science.1138878;
RA Nene V., Wortman J.R., Lawson D., Haas B.J., Kodira C.D., Tu Z.J.,
RA Loftus B.J., Xi Z., Megy K., Grabherr M., Ren Q., Zdobnov E.M., Lobo N.F.,
RA Campbell K.S., Brown S.E., Bonaldo M.F., Zhu J., Sinkins S.P.,
RA Hogenkamp D.G., Amedeo P., Arensburger P., Atkinson P.W., Bidwell S.L.,
RA Biedler J., Birney E., Bruggner R.V., Costas J., Coy M.R., Crabtree J.,
RA Crawford M., DeBruyn B., DeCaprio D., Eiglmeier K., Eisenstadt E.,
RA El-Dorry H., Gelbart W.M., Gomes S.L., Hammond M., Hannick L.I.,
RA Hogan J.R., Holmes M.H., Jaffe D., Johnston S.J., Kennedy R.C., Koo H.,
RA Kravitz S., Kriventseva E.V., Kulp D., Labutti K., Lee E., Li S.,
RA Lovin D.D., Mao C., Mauceli E., Menck C.F., Miller J.R., Montgomery P.,
RA Mori A., Nascimento A.L., Naveira H.F., Nusbaum C., O'Leary S.B., Orvis J.,
RA Pertea M., Quesneville H., Reidenbach K.R., Rogers Y.-H.C., Roth C.W.,
RA Schneider J.R., Schatz M., Shumway M., Stanke M., Stinson E.O.,
RA Tubio J.M.C., Vanzee J.P., Verjovski-Almeida S., Werner D., White O.R.,
RA Wyder S., Zeng Q., Zhao Q., Zhao Y., Hill C.A., Raikhel A.S., Soares M.B.,
RA Knudson D.L., Lee N.H., Galagan J., Salzberg S.L., Paulsen I.T.,
RA Dimopoulos G., Collins F.H., Bruce B., Fraser-Liggett C.M., Severson D.W.;
RT "Genome sequence of Aedes aegypti, a major arbovirus vector.";
RL Science 316:1718-1723(2007).
RN [3] {ECO:0000313|EMBL:EAT43147.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Liverpool {ECO:0000313|EMBL:EAT43147.1};
RG VectorBase;
RL Submitted (SEP-2012) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH477339; EAT43147.1; -; Genomic_DNA.
DR RefSeq; XP_001650820.1; XM_001650770.1.
DR PaxDb; 7159-AAEL005386-PA; -.
DR VEuPathDB; VectorBase:AAEL005386; -.
DR eggNOG; KOG3544; Eukaryota.
DR HOGENOM; CLU_001074_2_0_1; -.
DR OMA; TQGFQGK; -.
DR PhylomeDB; Q17A79; -.
DR Proteomes; UP000682892; Unassembled WGS sequence.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 8.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
FT DOMAIN 1521..1746
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 197..227
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 282..411
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 443..1459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 358..372
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 486..501
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 609..624
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 679..708
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 892..909
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 981..995
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1354..1370
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1443..1459
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1746 AA; 177634 MW; C498227DCAABC151 CRC64;
MCNETRNDYQ TYPASAYNLN QDTVLSIGTT QVFPNGFPSD FSILVVLKAT PNLVRVPLFT
VYSSDSEEVL MLMVGMEVAL YYQDTDGNPE EESLISFGVS IDDERWHRLG ISVKGDSVTL
IKDCHEQVTR RLRRQAGSVI ATDGLILTGV QLNEDEGFFT GDLQLFMFAD TPDEAFHICT
KYAPDCLGAG HGSATVTSSM HSFSSQSTSG QNGSTSSTFH QTQDFSNSRD QVNKFVISSS
GSSGNNFEGH REFVDITGEA DGLEIIGEDD EYYNNLELEG EDRKVEQHQF STSTQRSNTE
WHRVEINRTR PGIVDQTGRR PILPLPDPEY DSSASDNNRP DGVQPSVEHE QDPDDTSVSF
PAPPTGNYSI TGFRTIIGPR GPPGEPGPKG EPGRDGLPGQ GGPPGQPGHV FMIPLSQTAN
EKGPDSQAEA FRAMLSQHMM AMRGADGPMG LTGVPGPVGP PGPEGTKGEP GDNGEPGPRG
LRGAHGPPGR EGRRGRAGRD GERGVVGLQG SKGEPGPVGL PGMVGEKGER GRHGNIGEKG
AQGHEGIQGE DGPPGLPGLP GELGPRGFAG PRGFPGPSGN PGIPGTEGLP GVKGNPGPQG
QPGAPGQTGP AGPVGPPGPQ GNMGPPGIAG PHGKPGIPGL PGADGAPGRE GSPGIVGPKG
DIGSQGVQGS IGFPGNRGPK GDDGERGGPG DKGDKGERGH DGEKGDMGPP GERGLIGLQG
TAGIEGPEGP KGFEGPRGET GHMGLPGEKG KQGVQGFAGY PGPTGDKGDK GIVGIPGLAG
DKGERGNTGL QGERGEVGPR GFRGARGRRG ADGTPGPKGD TGQPGSSGPQ GEMGPQGMEG
PRGFIGLPGP QGNNGKDGPQ GTPGERGPPG ENGNPGPTGH PGVIGPQGPT GEPGPVGEPG
IPGLPGMPGE PGVPGDSGKE GQVGPPGPPG KNGPPGGQGL PGFPGERGMM GMPGLPGLKG
EIGVPGLVGP VGDKGQPGEP GKEGPPGPEG RRGPPGPTGP SGLKGERGEM GLVGPIGRDG
LPGQRGLTGP PGPQGSPGKD GDKGDIGPPG EKGYKRSQGE AGPVGSAGPQ GSRGEPGAIG
PPGEKGPPGE MGRAGSKGED GPTGLPGPPG PAGAQGMPGP SGIKGDRGDD GLVGPAGSIG
PPGEPGRRGP RGAEGAPGAT GPAGPQGLIG EPGPQGPPGP DGPEGKDGEK GSVGPKGEEG
KSGPPGPPGK RGPAGPEGPK GVVGSPGFPG NPGEPGVIGP KGDTGKDGED GKQGEPGQTG
DPGPPGIPGE VGPPGKMGPE GPAGLQGSVG PSGEKGDRGL PGPIGNPGLV GPQGMPGAQG
LPGLRGSPGA SGEIGPVGKP GESGPPGKIG ETGSKGQKGD RGRRGLKGHR GELGMVGLKG
DSGEKGDKGV RGNSGPDGPK GPPGPVGPLG LKGNDGPQGP PGNEGPLGPK GSEGVAGLKG
EAGPPGPPGP PGPPAEAPLI PPELLFRMSE FTMTKGEDRM KREAEEIPEE EDDFDEELLK
IEPKKKKKKK KIKDEMGPKF LDMYSSIYSM RQELDRIRKP VGTRENPGRT CRDLHYGHPQ
FKDGWYWIDP NLGMADDAVY VFCNMTAEGE TCVFPDLHSS QMPNIPWRKE NDKTDWYSNL
RGGFRISYET VGVVQMTFLR LLSEGAYQNF TYTCMNSVAW YSNQEENHDD AIRFLGDNEI
DIGYENSKLR PNVLVDGCKS GKSKSETIFE IRTTKLQYLP IIDFYPVDYG QPQQAFGFQV
GPVCFK
//