ID A0A9P0IKW8_APHGO Unreviewed; 1035 AA.
AC A0A9P0IKW8;
DT 13-SEP-2023, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 1.
DT 28-JAN-2026, entry version 9.
DE RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
GN ORFNames=APHIGO_LOCUS1059 {ECO:0000313|EMBL:CAH1710098.1};
OS Aphis gossypii (Cotton aphid).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidomorpha;
OC Aphidoidea; Aphididae; Aphidini; Aphis; Aphis.
OX NCBI_TaxID=80765 {ECO:0000313|EMBL:CAH1710098.1, ECO:0000313|Proteomes:UP001154329};
RN [1] {ECO:0000313|EMBL:CAH1710098.1}
RP NUCLEOTIDE SEQUENCE.
RA King R.;
RL Submitted (FEB-2022) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:CAH1710098.1}
RP NUCLEOTIDE SEQUENCE.
RG ENA_rothamsted_submissions;
RG culmorum;
RA King R.;
RL Submitted (OCT-2022) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; OU899034; CAH1710098.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A9P0IKW8; -.
DR Proteomes; UP001154329; Chromosome 1.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24637; COLLAGEN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP001154329};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..1035
FT /note="Thrombospondin-like N-terminal domain-containing
FT protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5040289760"
FT DOMAIN 31..222
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 234..309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 343..717
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 234..245
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 256..265
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 277..289
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 293..302
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 374..389
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 555..568
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 656..675
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 688..700
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1035 AA; 109028 MW; 506875C3C33ED935 CRC64;
MTWTFIVVCA TLCRFSAAVD ELADEKFVPD EHDLQTAIKV PFEDPQLYFD SGEDGFPAFG
IKPGSDIKSP YRLFLPEKLY SEFSIVVNFK LNSMDGGFLF AVVNPLENVV QLGLQVVPSS
SNAMNVSFLY TDVNKYSSSS NVLATFSVPW KIRKYIRLSL KVTREYVRLF GRCLEPQTVM
VVRDPVELLF DLASTLYIGQ AGPLIKGPFD GAIQEMKIYS SPDFADIQCT ELLQPDDDKE
PENPEDLTNG GYSDRPPAPP PPPPSNENHG YQTPNIKGDK GDAGQKGESI RGPPGPPGPP
GSPFSGDFAT DEELVKSGVS GPRGPPGICS CNLTTLFAPG KIPELIQGPP GNPGTDGKMG
MTGLPGAVGL PGERGLEGIK GDKGERGDVG PRGNEGIQGP KGDSGIDGER GLQGPPGPPG
PPGGSDFSNN DDIGFESNVG RPGPPGPKGD PGVDGAPGLK GAKGIQGNKG VRGEMGSKGI
KGDKGHAGSI GPQGFKGERG IRGFDGTPGV PGENARPAPK GEKGDSGPPG PPGPPGLTQT
GVKIDKTDTA VVKTVKGDKG TKGDHGEKGA VGNLGPGGNP GPPGLTGPKG ERGEPGLPAP
LISTTDLGMI KGDKGEMGRR GRRGKPGPTG PPGPPGDIGL PGLREDGTPG KAGGQKGEKG
DRGESVKGDK GEPGKDGLPG SGGTYVPVPG PAGPPGPPGI PGVAIEGQKG EPGDPGFSSA
PLRSEINEHI IVPGALTFPN KKSMINVTDR TQMGTIAFII EEEALLVRVT RGWQYISLGS
LITTGIDPAP TSVPMPTRVP LESSNLVHNH PVKDNTMWHP KMLRIAALNE PYTGNMHGVQ
SVDYSCYRQS QRAGLHGAFK AFLSSRLNNL KTIVHESDRD LPVVNIKGDV LFNSWKDIFS
ENGAFISQQP RIYSFSGKNV LTDFTWPQKT IWHGSDISGD SAVDGNCDAW NSESADKRGL
GSSLMQNADR QAKLLDQDSA YDCRNFFIVL CVEITPHSGA MFSRKRRSGV GGLHDEPQLP
MSRQEYEKFI ENIRA
//