ID W5J528_ANODA Unreviewed; 921 AA.
AC W5J528;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 28-JAN-2026, entry version 63.
DE SubName: Full=Collagen alpha 1(Xviii) chain {ECO:0000313|EMBL:ETN59071.1};
GN ORFNames=AND_009337 {ECO:0000313|EMBL:ETN59071.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN59071.1};
RN [1] {ECO:0000313|EMBL:ETN59071.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN59071.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN59071.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC009337-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02002104; ETN59071.1; -; Genomic_DNA.
DR AlphaFoldDB; W5J528; -.
DR FunCoup; W5J528; 18.
DR STRING; 43151.W5J528; -.
DR EnsemblMetazoa; ADAC009337-RA; ADAC009337-PA; ADAC009337.
DR VEuPathDB; VectorBase:ADAC009337; -.
DR VEuPathDB; VectorBase:ADAR2_003121; -.
DR eggNOG; KOG3544; Eukaryota.
DR eggNOG; KOG3546; Eukaryota.
DR HOGENOM; CLU_014222_0_0_1; -.
DR OMA; YSHERPY; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:ETN59071.1};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 45..69
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 636..684
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 722..887
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 103..125
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 142..206
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 247..590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..161
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 184..193
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..289
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 422..431
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 447..456
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 508..520
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 545..558
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 921 AA; 97283 MW; 6BFDCC2074C35ABF CRC64;
MSPLKCIAEV YWPLLLALEG LENEEWQCKV VVVTMAMVVS GRAKAVIAAF VCFFLLGIVL
VTGSTKGWFV PTRYSERVAA RVQGQFMFLK IFGNPEVVNV ECSKTPRINP GPTEEEPSIE
DISNASDISV IEGSSSNDDE LDAVLASSSG SGDDSFGISS AEDPDYSQPP MTPPPPPYNG
YGLKGDKGEK GVKGDNIPGR AAFEGEGSGD ELLYDALPNR KHAGQCSCNA TLIIEELKMD
SKLREYLRGP QGMPGKEGKT GSPGLTGVSG PQGERGERGD KGDRGERGEQ GATGGEGIQG
EKGEPGLDGL PGPAGPPGLP AENYDGHQGS HGQPGPKGPS GTPGIPGLPG QTGATGPKGD
RGAKGEIGPA GPPGPVTMAH DRNGSCECQP GPPGPRGPAG IDGAPGLPGE TGLPGHPGLP
GDKGDRGEKG PEFIINENAA FNSSRANKGE KGDKGQRGRK GRTGSPGPIG PPGKPGTMSD
SWPGREGPKG NPGQKGEKGD SITLMGPKGD KGDRGMDGRD GLPGPPGLPA ASGGDVGGGV
QYIPMPGPPG PPGPPGPPGL SIIGEKGEPG MDSRSPFYSE SQHFYGRPGR SSLDELKALR
ELKHHKDYDD STLGPPGPSD EIRNSYGPNV RIVPGAVTFQ NAETMAKMSA HTPVGTLAYI
IDEEALLVRV KKGWQYIALG TFVPIATPAP PTTTGLPPQR SEQLQVSNLI KNHPHHEEDS
TLRMAALNEP YSGDMQGIRG ADFACYRQAR RAGLLGTFRA FLSSRIQNLD SIVRVADREL
PVVNNRGEVL FNSWNNIFSG HGGFFSQTPR IYSFSGKNVL TDITWPQKLV WHGSSALGER
AIETYCDAWH SPSPDKVGLA SSLLGNKLLD QERYSCDNRF IVLCVEAVPQ DRRRKRRDTR
SQHEFANEEE YSQYLQSIDA L
//