ID A0A8J2QJ54_9NEOP Unreviewed; 898 AA.
AC A0A8J2QJ54;
DT 25-MAY-2022, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 1.
DT 28-JAN-2026, entry version 14.
DE SubName: Full=(African queen) hypothetical protein {ECO:0000313|EMBL:CAG9561016.1};
GN ORFNames=DCHRY22_LOCUS2596 {ECO:0000313|EMBL:CAG9561016.1};
OS Danaus chrysippus (African queen).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Nymphalidae; Danainae; Danaini; Danaina; Danaus; Anosia.
OX NCBI_TaxID=151541 {ECO:0000313|EMBL:CAG9561016.1, ECO:0000313|Proteomes:UP000789524};
RN [1] {ECO:0000313|EMBL:CAG9561016.1}
RP NUCLEOTIDE SEQUENCE.
RA Martin H S.;
RL Submitted (SEP-2021) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAG9561016.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAKASE010000046; CAG9561016.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A8J2QJ54; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000789524; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000789524}.
FT DOMAIN 699..739
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 742..896
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 260..321
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 337..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 462..665
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 270..296
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..359
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 419..430
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 503..517
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 592..616
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 619..629
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 641..655
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 898 AA; 96007 MW; 62D08893EF16CDA4 CRC64;
MLTISFLAVN NSDLATNYLA QQVLETRKRW FGVFSPTVIA EPDSYDILSL VRANVMTDFI
DIVKGTDVYG AIKLVKNELI TIKLDQFPDP IDHLATPFEI YALVKLDLDV TSCLFQIVSN
KENKLSLCFT PEGEDLIRIT LNGSDLPESG ISFHYLIEGR NAFVKIILAV NDKNVEFYSN
CEKIETQYFD SDYTIENINF DKDSILHFGK LTEESNLFEA PIQTLVIYPK PDIIGRRSIC
SDDKLPASFI DFDSSEKIPT DSLFDSSEES VVKGEKGDKG DKGEKGDRGD KGERGESVTG
ERGPIGPEGA PGTPGAMGKE GSCKCSEAVV SELLLKMPEM RGPPGEYGMK GDRGEKGLKG
DSGLPGKDGR DGSKGDPGIQ GPPGTPGIVR KEIVETKVPV VGEKGERGPI GPPGTPGRDG
FRGEKGDKGE PGLMGLPAKL SSILDEDIDP IEEKAIVEKF RGYKGASGPE GPKGEKGDTG
AMGPRGETGR DGIQGPPGKH GHKGETGKDG TKGDKGEPGT PGPPGTVPSS QISLMKGPKG
ERGAPGQIGP RGPSGHHGKV GPIGPPGKSH KGEPGKPGPM GPKGEKGATG PKGEKGEGLS
PNEIERLKGH KGDRGEIGLP GAAGKPGLPG TCGECVHKSI PGPPGPPGPP GPSGPPGVSI
IGPKGEPGGL LTKNSFFAFN DIHHESTDED DDFYTAATVI FKTTTGLLKR TTETPLGTLA
YILQEKILLM RVENGWQYVV IRLVALNQAY AGNILMANNR TGRNAADQEC YRQAYIHNFK
STFAAFLATR VEDLRFIVKR KRDRYVPVVN LYGQVLFDSW ASMFNGSGAL FAKSSIYSFN
GKNVQIDTTW PSKAVWHGSN SFGTVLSRAN CNEWTSDSPL NVGAASLLYT HRLLEEEQ
//