ID   W5J528_ANODA            Unreviewed;       921 AA.
AC   W5J528;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   28-JAN-2026, entry version 63.
DE   SubName: Full=Collagen alpha 1(Xviii) chain {ECO:0000313|EMBL:ETN59071.1};
GN   ORFNames=AND_009337 {ECO:0000313|EMBL:ETN59071.1};
OS   Anopheles darlingi (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN59071.1};
RN   [1] {ECO:0000313|EMBL:ETN59071.1, ECO:0000313|Proteomes:UP000000673}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA   Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT   "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT   the genome of the newly sequenced Anopheles darlingi.";
RL   BMC Genomics 11:529-529(2010).
RN   [2] {ECO:0000313|EMBL:ETN59071.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL   Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:ETN59071.1}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=23761445;
RA   Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA   Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA   Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA   Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA   de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA   Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA   Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA   Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA   Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA   de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA   Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA   Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA   Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA   Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA   Camargo E.P., de Vasconcelos A.T.;
RT   "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL   Nucleic Acids Res. 41:7387-7400(2013).
RN   [4] {ECO:0000313|EnsemblMetazoa:ADAC009337-PA}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADMH02002104; ETN59071.1; -; Genomic_DNA.
DR   AlphaFoldDB; W5J528; -.
DR   FunCoup; W5J528; 18.
DR   STRING; 43151.W5J528; -.
DR   EnsemblMetazoa; ADAC009337-RA; ADAC009337-PA; ADAC009337.
DR   VEuPathDB; VectorBase:ADAC009337; -.
DR   VEuPathDB; VectorBase:ADAR2_003121; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; KOG3546; Eukaryota.
DR   HOGENOM; CLU_014222_0_0_1; -.
DR   OMA; YSHERPY; -.
DR   Proteomes; UP000000673; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:ETN59071.1};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        45..69
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          636..684
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          722..887
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          103..125
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          142..206
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          247..590
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        143..161
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        184..193
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        274..289
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        422..431
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        447..456
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        508..520
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        545..558
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   921 AA;  97283 MW;  6BFDCC2074C35ABF CRC64;
     MSPLKCIAEV YWPLLLALEG LENEEWQCKV VVVTMAMVVS GRAKAVIAAF VCFFLLGIVL
     VTGSTKGWFV PTRYSERVAA RVQGQFMFLK IFGNPEVVNV ECSKTPRINP GPTEEEPSIE
     DISNASDISV IEGSSSNDDE LDAVLASSSG SGDDSFGISS AEDPDYSQPP MTPPPPPYNG
     YGLKGDKGEK GVKGDNIPGR AAFEGEGSGD ELLYDALPNR KHAGQCSCNA TLIIEELKMD
     SKLREYLRGP QGMPGKEGKT GSPGLTGVSG PQGERGERGD KGDRGERGEQ GATGGEGIQG
     EKGEPGLDGL PGPAGPPGLP AENYDGHQGS HGQPGPKGPS GTPGIPGLPG QTGATGPKGD
     RGAKGEIGPA GPPGPVTMAH DRNGSCECQP GPPGPRGPAG IDGAPGLPGE TGLPGHPGLP
     GDKGDRGEKG PEFIINENAA FNSSRANKGE KGDKGQRGRK GRTGSPGPIG PPGKPGTMSD
     SWPGREGPKG NPGQKGEKGD SITLMGPKGD KGDRGMDGRD GLPGPPGLPA ASGGDVGGGV
     QYIPMPGPPG PPGPPGPPGL SIIGEKGEPG MDSRSPFYSE SQHFYGRPGR SSLDELKALR
     ELKHHKDYDD STLGPPGPSD EIRNSYGPNV RIVPGAVTFQ NAETMAKMSA HTPVGTLAYI
     IDEEALLVRV KKGWQYIALG TFVPIATPAP PTTTGLPPQR SEQLQVSNLI KNHPHHEEDS
     TLRMAALNEP YSGDMQGIRG ADFACYRQAR RAGLLGTFRA FLSSRIQNLD SIVRVADREL
     PVVNNRGEVL FNSWNNIFSG HGGFFSQTPR IYSFSGKNVL TDITWPQKLV WHGSSALGER
     AIETYCDAWH SPSPDKVGLA SSLLGNKLLD QERYSCDNRF IVLCVEAVPQ DRRRKRRDTR
     SQHEFANEEE YSQYLQSIDA L
//