ID A0A212EPX4_DANPL Unreviewed; 919 AA.
AC A0A212EPX4;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 28-JAN-2026, entry version 31.
DE SubName: Full=Collagen alpha-1 {ECO:0000313|EMBL:OWR43526.1};
GN ORFNames=KGM_207250 {ECO:0000313|EMBL:OWR43526.1};
OS Danaus plexippus plexippus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Nymphalidae; Danainae; Danaini; Danaina; Danaus; Danaus.
OX NCBI_TaxID=278856 {ECO:0000313|EMBL:OWR43526.1, ECO:0000313|Proteomes:UP000007151};
RN [1] {ECO:0000313|EMBL:OWR43526.1, ECO:0000313|Proteomes:UP000007151}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR43526.1};
RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052;
RA Zhan S., Merlin C., Boore J.L., Reppert S.M.;
RT "The monarch butterfly genome yields insights into long-distance
RT migration.";
RL Cell 147:1171-1185(2011).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OWR43526.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGBW02013361; OWR43526.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A212EPX4; -.
DR KEGG; dpl:KGM_207250; -.
DR eggNOG; KOG3544; Eukaryota.
DR eggNOG; KOG3546; Eukaryota.
DR InParanoid; A0A212EPX4; -.
DR Proteomes; UP000007151; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:OWR43526.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000007151};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..919
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013120876"
FT DOMAIN 621..667
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 711..876
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 68..110
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 139..337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 352..571
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..93
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 174..183
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 206..215
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 278..296
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..323
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..368
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 379..388
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 553..565
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 919 AA; 94857 MW; E9A3C8CB9F26385A CRC64;
MKWIWLRVVI LLVVQGAFQE LKLYGSPSQA EVQCVNTYEV DQEEGDSEGS GRYGTIPPFP
PPPPGMDVII YSQGPPGESI RGPPGPPGPP GPPGVNTVSE TSGSGDDQIF GENYASLGHC
GCNSSVLLSL LEIAPELQGP PGPPGITGAD GLTGAPGIPG QPGMPGERGS IGQRGEKGDR
GDSGPRGSEG QPGPKGEPGV DGRPGSPGPP GPPGTPGSSD YNNFDESLLG SYGGAIGRPG
APGPKGDAGQ PGPIGLQGER GFPGPKGERG QIGQTGAKGD RGHPGHKGDR GVKGDRGNPG
LDGRSGLPGA NGRFGEKGEK GERGIPGPPG PPSLPIGVVA SEEPEFLATG LRHLGPAEKG
EKGEKGSRGN DGTSGFPGKD GKPGERGDIG PSGLPGIAGP PGSPGLKGDR GERGPPGPVS
LTSAGSDILT IKGEKGEPGL RGRRGRPGPP GPRGAQGLQG LVGPTGKPGE KGDIGLPGWM
GRPGTLGPPG IPGPVGPKGE KGDPGVNILD VSMGEKGDRG LEGISGPKGE QGPIGPPGPP
GPGSRSEAVQ YIPGPPGPPG PPGQPGTPGI SIVGPKGEPG VSYLEEYPVH GSTKYFGRPA
SPEYRPHQDE MNANKNVPGA LVFHTTEEML RLASTSHLGA LAYVIEEQSL FVKVNSGWQY
VLLGSLVTQS ALHTTTTSAP APPPLLPAAS LVHAPLSNMV DTPLAPMGPS LRLAALNEPL
SGDMHGIRRA DYACYRQARR AGLKGTFRAF LTSRIQNLDS TVRYADRHLP VINTQGDVLF
QSFSDIFDGN GGVIAGSPRI YSFSGKNIML DSNWPQKLIW HGSHASGERA LETFCEEWQS
ADPSSRGMAA SLHSHRLLSQ ERYSCNNHFA VLCIEATSHL SVRRKREIAR YNMSSVNDEY
HPYNAEEYQD LLNEIFGQP
//