ID A0A6A4JK61_APOLU Unreviewed; 918 AA.
AC A0A6A4JK61;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 28-JAN-2026, entry version 19.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KAF6202683.1};
GN ORFNames=GE061_003084 {ECO:0000313|EMBL:KAF6202683.1};
OS Apolygus lucorum (Small green plant bug) (Lygocoris lucorum).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Hemiptera; Heteroptera; Panheteroptera;
OC Cimicomorpha; Miridae; Mirini; Apolygus.
OX NCBI_TaxID=248454 {ECO:0000313|EMBL:KAF6202683.1, ECO:0000313|Proteomes:UP000466442};
RN [1] {ECO:0000313|EMBL:KAF6202683.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=12Hb {ECO:0000313|EMBL:KAF6202683.1};
RX PubMed=32939994;
RA Liu Y., Liu H., Wang H., Huang T., Liu B., Yang B., Yin L., Li B.,
RA Zhang Y., Zhang S., Jiang F., Zhang X., Ren Y., Wang B., Wang S., Lu Y.,
RA Wu K., Fan W., Wang G.;
RT "Apolygus lucorum genome provides insights into omnivorousness and
RT mesophyll feeding.";
RL Mol. Ecol. Resour. 21:287-300(2021).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAF6202683.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; WIXP02000011; KAF6202683.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A6A4JK61; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000466442; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000466442};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..918
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5044005908"
FT DOMAIN 637..685
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 721..885
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 87..143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 182..268
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 283..397
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 418..517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..107
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 115..125
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 198..211
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 214..229
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 253..262
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 319..336
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 352..365
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..439
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 918 AA; 94719 MW; 6DDA24A1D76F0BCB CRC64;
MSLLDLLVTF VFLKGVIQEL KVYGKAHLAD VHCRVFKEDG TLEDEEFFKL NDVFSDKSDA
EGSGEGEWNP PTVNPLLPPI IAPDHAPASI TNGMLRGEKG EKGDRGPPGE TIRGPPGPPG
PPGPPFSGTI TTDYGSGDGS YKPGIKNIGS FDYGPGYGVP GPAGPPGQCV CNMTRLLDGF
MMPKMIQGPP GTPGVNGQTG APGPAGLTGP PGERGPHGAQ GDKGERGDQG AKGPEGLQGP
KGEPGLDGAP GIPGNPGPPG PPGTSEFSNF DVSAKIKEAI MGSGFDSSMG RPGSPGPKGE
PGEVGPQGPK GDKGIQGSKG ERGEAGLKGP KGDRGHQGPP GLQGFKGDRG EPGQNGVPGV
PGQNGRPAVK GDKGDIGTQG PPGPPGPPGV TYHLKGAEDL PPNLTELAEF ESLASGVRGL
KGERGDPGFK GEKGAEGERG PPGLPGANGI PGAPGEKGDV GLLGPAGPKG ATGPPGPSGG
PKGERGKRGR RGKPGPPGPP GKLVESGPPG IGASGGASIP GGYYGSAGLP GPKGQKGEAG
EVKTDMFGDP LSFKGEKGEP GKDAVSVQYI PVPGPPGPPG APGQPGMPGL SVVGAKGEPG
IPGPTHFRPV TYAEPSFTSS RSARTPDFDG PRIVPGAVTF QNLDAMARMS SASPVGTLAY
IADEEAMLVR VNKGWQYIAL GSLVPIPTDL PATSSERPKP MFESSNFLSG QPLPHMDGPV
LHMAALNEPQ SGGMKGQRGA DYSCYRQARR AGHKGTFRAF LTSRVQNLDS LVRPSDRKTP
VVNIKGDVLF NSWQDIFNGS FFAQQPRIYS FSGKNIVTDL NWPQKYVWHG ALANGERDVD
RYCDNWESEA RESIGMASSL EGGKLLGQEL YSCFHRFAVL CIEVSSTTHN RRRRRIEPLL
SPTEYAEFVD AIARGYFP
//