GenomeNet

Database: UniProt
Entry: A0A212EPX4_DANPL
LinkDB: A0A212EPX4_DANPL
Original site: A0A212EPX4_DANPL 
ID   A0A212EPX4_DANPL        Unreviewed;       919 AA.
AC   A0A212EPX4;
DT   27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT   27-SEP-2017, sequence version 1.
DT   28-JAN-2026, entry version 31.
DE   SubName: Full=Collagen alpha-1 {ECO:0000313|EMBL:OWR43526.1};
GN   ORFNames=KGM_207250 {ECO:0000313|EMBL:OWR43526.1};
OS   Danaus plexippus plexippus.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC   Nymphalidae; Danainae; Danaini; Danaina; Danaus; Danaus.
OX   NCBI_TaxID=278856 {ECO:0000313|EMBL:OWR43526.1, ECO:0000313|Proteomes:UP000007151};
RN   [1] {ECO:0000313|EMBL:OWR43526.1, ECO:0000313|Proteomes:UP000007151}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=F-2 {ECO:0000313|EMBL:OWR43526.1};
RX   PubMed=22118469; DOI=10.1016/j.cell.2011.09.052;
RA   Zhan S., Merlin C., Boore J.L., Reppert S.M.;
RT   "The monarch butterfly genome yields insights into long-distance
RT   migration.";
RL   Cell 147:1171-1185(2011).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OWR43526.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGBW02013361; OWR43526.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A212EPX4; -.
DR   KEGG; dpl:KGM_207250; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; KOG3546; Eukaryota.
DR   InParanoid; A0A212EPX4; -.
DR   Proteomes; UP000007151; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:OWR43526.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007151};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..919
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5013120876"
FT   DOMAIN          621..667
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          711..876
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          68..110
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          139..337
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          352..571
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        83..93
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        96..105
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        174..183
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        206..215
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        278..296
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        314..323
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        355..368
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        379..388
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        553..565
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   919 AA;  94857 MW;  E9A3C8CB9F26385A CRC64;
     MKWIWLRVVI LLVVQGAFQE LKLYGSPSQA EVQCVNTYEV DQEEGDSEGS GRYGTIPPFP
     PPPPGMDVII YSQGPPGESI RGPPGPPGPP GPPGVNTVSE TSGSGDDQIF GENYASLGHC
     GCNSSVLLSL LEIAPELQGP PGPPGITGAD GLTGAPGIPG QPGMPGERGS IGQRGEKGDR
     GDSGPRGSEG QPGPKGEPGV DGRPGSPGPP GPPGTPGSSD YNNFDESLLG SYGGAIGRPG
     APGPKGDAGQ PGPIGLQGER GFPGPKGERG QIGQTGAKGD RGHPGHKGDR GVKGDRGNPG
     LDGRSGLPGA NGRFGEKGEK GERGIPGPPG PPSLPIGVVA SEEPEFLATG LRHLGPAEKG
     EKGEKGSRGN DGTSGFPGKD GKPGERGDIG PSGLPGIAGP PGSPGLKGDR GERGPPGPVS
     LTSAGSDILT IKGEKGEPGL RGRRGRPGPP GPRGAQGLQG LVGPTGKPGE KGDIGLPGWM
     GRPGTLGPPG IPGPVGPKGE KGDPGVNILD VSMGEKGDRG LEGISGPKGE QGPIGPPGPP
     GPGSRSEAVQ YIPGPPGPPG PPGQPGTPGI SIVGPKGEPG VSYLEEYPVH GSTKYFGRPA
     SPEYRPHQDE MNANKNVPGA LVFHTTEEML RLASTSHLGA LAYVIEEQSL FVKVNSGWQY
     VLLGSLVTQS ALHTTTTSAP APPPLLPAAS LVHAPLSNMV DTPLAPMGPS LRLAALNEPL
     SGDMHGIRRA DYACYRQARR AGLKGTFRAF LTSRIQNLDS TVRYADRHLP VINTQGDVLF
     QSFSDIFDGN GGVIAGSPRI YSFSGKNIML DSNWPQKLIW HGSHASGERA LETFCEEWQS
     ADPSSRGMAA SLHSHRLLSQ ERYSCNNHFA VLCIEATSHL SVRRKREIAR YNMSSVNDEY
     HPYNAEEYQD LLNEIFGQP
//
DBGET integrated database retrieval system