GenomeNet

Database: UniProt
Entry: A0A6A4JK61_APOLU
LinkDB: A0A6A4JK61_APOLU
Original site: A0A6A4JK61_APOLU 
ID   A0A6A4JK61_APOLU        Unreviewed;       918 AA.
AC   A0A6A4JK61;
DT   17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 1.
DT   28-JAN-2026, entry version 19.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KAF6202683.1};
GN   ORFNames=GE061_003084 {ECO:0000313|EMBL:KAF6202683.1};
OS   Apolygus lucorum (Small green plant bug) (Lygocoris lucorum).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Paraneoptera; Hemiptera; Heteroptera; Panheteroptera;
OC   Cimicomorpha; Miridae; Mirini; Apolygus.
OX   NCBI_TaxID=248454 {ECO:0000313|EMBL:KAF6202683.1, ECO:0000313|Proteomes:UP000466442};
RN   [1] {ECO:0000313|EMBL:KAF6202683.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=12Hb {ECO:0000313|EMBL:KAF6202683.1};
RX   PubMed=32939994;
RA   Liu Y., Liu H., Wang H., Huang T., Liu B., Yang B., Yin L., Li B.,
RA   Zhang Y., Zhang S., Jiang F., Zhang X., Ren Y., Wang B., Wang S., Lu Y.,
RA   Wu K., Fan W., Wang G.;
RT   "Apolygus lucorum genome provides insights into omnivorousness and
RT   mesophyll feeding.";
RL   Mol. Ecol. Resour. 21:287-300(2021).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAF6202683.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; WIXP02000011; KAF6202683.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A6A4JK61; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000466442; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000466442};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..918
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5044005908"
FT   DOMAIN          637..685
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          721..885
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          87..143
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          182..268
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          283..397
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          418..517
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        96..107
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        115..125
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        198..211
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        214..229
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        253..262
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        319..336
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        352..365
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        420..439
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   918 AA;  94719 MW;  6DDA24A1D76F0BCB CRC64;
     MSLLDLLVTF VFLKGVIQEL KVYGKAHLAD VHCRVFKEDG TLEDEEFFKL NDVFSDKSDA
     EGSGEGEWNP PTVNPLLPPI IAPDHAPASI TNGMLRGEKG EKGDRGPPGE TIRGPPGPPG
     PPGPPFSGTI TTDYGSGDGS YKPGIKNIGS FDYGPGYGVP GPAGPPGQCV CNMTRLLDGF
     MMPKMIQGPP GTPGVNGQTG APGPAGLTGP PGERGPHGAQ GDKGERGDQG AKGPEGLQGP
     KGEPGLDGAP GIPGNPGPPG PPGTSEFSNF DVSAKIKEAI MGSGFDSSMG RPGSPGPKGE
     PGEVGPQGPK GDKGIQGSKG ERGEAGLKGP KGDRGHQGPP GLQGFKGDRG EPGQNGVPGV
     PGQNGRPAVK GDKGDIGTQG PPGPPGPPGV TYHLKGAEDL PPNLTELAEF ESLASGVRGL
     KGERGDPGFK GEKGAEGERG PPGLPGANGI PGAPGEKGDV GLLGPAGPKG ATGPPGPSGG
     PKGERGKRGR RGKPGPPGPP GKLVESGPPG IGASGGASIP GGYYGSAGLP GPKGQKGEAG
     EVKTDMFGDP LSFKGEKGEP GKDAVSVQYI PVPGPPGPPG APGQPGMPGL SVVGAKGEPG
     IPGPTHFRPV TYAEPSFTSS RSARTPDFDG PRIVPGAVTF QNLDAMARMS SASPVGTLAY
     IADEEAMLVR VNKGWQYIAL GSLVPIPTDL PATSSERPKP MFESSNFLSG QPLPHMDGPV
     LHMAALNEPQ SGGMKGQRGA DYSCYRQARR AGHKGTFRAF LTSRVQNLDS LVRPSDRKTP
     VVNIKGDVLF NSWQDIFNGS FFAQQPRIYS FSGKNIVTDL NWPQKYVWHG ALANGERDVD
     RYCDNWESEA RESIGMASSL EGGKLLGQEL YSCFHRFAVL CIEVSSTTHN RRRRRIEPLL
     SPTEYAEFVD AIARGYFP
//
DBGET integrated database retrieval system