GenomeNet

Database: UniProt
Entry: A0A8J2QJ54_9NEOP
ID   A0A8J2QJ54_9NEOP        Unreviewed;       898 AA.
AC   A0A8J2QJ54;
DT   25-MAY-2022, integrated into UniProtKB/TrEMBL.
DT   25-MAY-2022, sequence version 1.
DT   28-JAN-2026, entry version 14.
DE   SubName: Full=(African queen) hypothetical protein {ECO:0000313|EMBL:CAG9561016.1};
GN   ORFNames=DCHRY22_LOCUS2596 {ECO:0000313|EMBL:CAG9561016.1};
OS   Danaus chrysippus (African queen).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC   Nymphalidae; Danainae; Danaini; Danaina; Danaus; Anosia.
OX   NCBI_TaxID=151541 {ECO:0000313|EMBL:CAG9561016.1, ECO:0000313|Proteomes:UP000789524};
RN   [1] {ECO:0000313|EMBL:CAG9561016.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Martin H S.;
RL   Submitted (SEP-2021) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CAG9561016.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAKASE010000046; CAG9561016.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A8J2QJ54; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000789524; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000789524}.
FT   DOMAIN          699..739
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          742..896
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          260..321
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          337..435
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          462..665
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        270..296
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        349..359
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        419..430
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        503..517
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        592..616
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        619..629
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        641..655
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   898 AA;  96007 MW;  62D08893EF16CDA4 CRC64;
     MLTISFLAVN NSDLATNYLA QQVLETRKRW FGVFSPTVIA EPDSYDILSL VRANVMTDFI
     DIVKGTDVYG AIKLVKNELI TIKLDQFPDP IDHLATPFEI YALVKLDLDV TSCLFQIVSN
     KENKLSLCFT PEGEDLIRIT LNGSDLPESG ISFHYLIEGR NAFVKIILAV NDKNVEFYSN
     CEKIETQYFD SDYTIENINF DKDSILHFGK LTEESNLFEA PIQTLVIYPK PDIIGRRSIC
     SDDKLPASFI DFDSSEKIPT DSLFDSSEES VVKGEKGDKG DKGEKGDRGD KGERGESVTG
     ERGPIGPEGA PGTPGAMGKE GSCKCSEAVV SELLLKMPEM RGPPGEYGMK GDRGEKGLKG
     DSGLPGKDGR DGSKGDPGIQ GPPGTPGIVR KEIVETKVPV VGEKGERGPI GPPGTPGRDG
     FRGEKGDKGE PGLMGLPAKL SSILDEDIDP IEEKAIVEKF RGYKGASGPE GPKGEKGDTG
     AMGPRGETGR DGIQGPPGKH GHKGETGKDG TKGDKGEPGT PGPPGTVPSS QISLMKGPKG
     ERGAPGQIGP RGPSGHHGKV GPIGPPGKSH KGEPGKPGPM GPKGEKGATG PKGEKGEGLS
     PNEIERLKGH KGDRGEIGLP GAAGKPGLPG TCGECVHKSI PGPPGPPGPP GPSGPPGVSI
     IGPKGEPGGL LTKNSFFAFN DIHHESTDED DDFYTAATVI FKTTTGLLKR TTETPLGTLA
     YILQEKILLM RVENGWQYVV IRLVALNQAY AGNILMANNR TGRNAADQEC YRQAYIHNFK
     STFAAFLATR VEDLRFIVKR KRDRYVPVVN LYGQVLFDSW ASMFNGSGAL FAKSSIYSFN
     GKNVQIDTTW PSKAVWHGSN SFGTVLSRAN CNEWTSDSPL NVGAASLLYT HRLLEEEQ
//
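
The FT lines above give feature coordinates as 1-based, inclusive start..end ranges, so a feature spans end - start + 1 residues. The following is a minimal sketch of reading those ranges and deriving feature lengths, assuming the simple "start..end" layout seen in this entry. The names FT_EXCERPT and parse_features are hypothetical and introduced only for illustration; the excerpt copies three features from the record and omits the /evidence qualifiers.

```python
import re

# Excerpt of FT lines copied from the entry above (evidence qualifiers omitted).
FT_EXCERPT = """\
FT   DOMAIN          699..739
FT                   /note="Collagen type XV/XVIII trimerization"
FT   DOMAIN          742..896
FT                   /note="Collagenase NC10/endostatin"
FT   REGION          462..665
FT                   /note="Disordered"
"""

# A feature line: "FT", three spaces, the feature key, then "start..end".
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# A qualifier continuation line carrying the /note text.
NOTE_RE = re.compile(r'^FT\s+/note="(.*)"\s*$')


def parse_features(text):
    """Return a list of dicts with key, start, end, length and note."""
    features = []
    for line in text.splitlines():
        feature = FEATURE_RE.match(line)
        if feature:
            start, end = int(feature.group(2)), int(feature.group(3))
            features.append({
                "key": feature.group(1),
                "start": start,
                "end": end,
                # Coordinates are inclusive on both ends.
                "length": end - start + 1,
                "note": None,
            })
            continue
        note = NOTE_RE.match(line)
        if note and features:
            # Attach the note to the most recently seen feature.
            features[-1]["note"] = note.group(1)
    return features


if __name__ == "__main__":
    for f in parse_features(FT_EXCERPT):
        print(f["key"], f["start"], f["end"], f["length"], f["note"])
```

This sketch does not handle uncertain or open-ended positions (such as coordinates prefixed with "<" or "?"), which do not occur in this entry.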