GenomeNet

Database: UniProt
Entry: A0A8R2NNF6_ACYPI
ID   A0A8R2NNF6_ACYPI        Unreviewed;      1033 AA.
AC   A0A8R2NNF6;
DT   12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT   12-OCT-2022, sequence version 1.
DT   28-JAN-2026, entry version 14.
DE   RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS   Acyrthosiphon pisum (Pea aphid).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidomorpha;
OC   Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon.
OX   NCBI_TaxID=7029 {ECO:0000313|EnsemblMetazoa:XP_029344257.1, ECO:0000313|Proteomes:UP000007819};
RN   [1] {ECO:0000313|Proteomes:UP000007819}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=LSR1 {ECO:0000313|Proteomes:UP000007819};
RA   Jiang H., Abraham K., Ali S., Alsbrooks S.L., Anim B.N., Anosike U.S.,
RA   Attaway T., Bandaranaike D.P., Battles P.K., Bell S.N., Bell A.V.,
RA   Beltran B., Bickham C., Bustamante Y., Caleb T., Canada A., Cardenas V.,
RA   Carter K., Chacko J., Chandrabose M.N., Chavez D., Chavez A., Chen L.,
RA   Chu H.-S., Claassen K.J., Cockrell R., Collins M., Cooper J.A., Cree A.,
RA   Curry S.M., Da Y., Dao M.D., Das B., Davila M.-L., Davy-Carroll L.,
RA   Denson S., Dinh H., Ebong V.E., Edwards J.R., Egan A., El-Daye J.,
RA   Escobedo L., Fernandez S., Fernando P.R., Flagg N., Forbes L.D.,
RA   Fowler R.G., Fu Q., Gabisi R.A., Ganer J., Garbino Pronczuk A.,
RA   Garcia R.M., Garner T., Garrett T.E., Gonzalez D.A., Hamid H.,
RA   Hawkins E.S., Hirani K., Hogues M.E., Hollins B., Hsiao C.-H., Jabil R.,
RA   James M.L., Jhangiani S.N., Johnson B., Johnson Q., Joshi V., Kalu J.B.,
RA   Kam C., Kashfia A., Keebler J., Kisamo H., Kovar C.L., Lago L.A.,
RA   Lai C.-Y., Laidlaw J., Lara F., Le T.-K., Lee S.L., Legall F.H.,
RA   Lemon S.J., Lewis L.R., Li B., Liu Y., Liu Y.-S., Lopez J., Lozado R.J.,
RA   Lu J., Madu R.C., Maheshwari M., Maheshwari R., Malloy K., Martinez E.,
RA   Mathew T., Mercado I.C., Mercado C., Meyer B., Montgomery K., Morgan M.B.,
RA   Munidasa M., Nazareth L.V., Nelson J., Ng B.M., Nguyen N.B., Nguyen P.Q.,
RA   Nguyen T., Obregon M., Okwuonu G.O., Onwere C.G., Orozco G., Parra A.,
RA   Patel S., Patil S., Perez A., Perez Y., Pham C., Primus E.L., Pu L.-L.,
RA   Puazo M., Qin X., Quiroz J.B., Reese J., Richards S., Rives C.M.,
RA   Robberts R., Ruiz S.J., Ruiz M.J., Santibanez J., Schneider B.W.,
RA   Sisson I., Smith M., Sodergren E., Song X.-Z., Song B.B., Summersgill H.,
RA   Thelus R., Thornton R.D., Trejos Z.Y., Usmani K., Vattathil S.,
RA   Villasana D., Walker D.L., Wang S., Wang K., White C.S., Williams A.C.,
RA   Williamson J., Wilson K., Woghiren I.O., Woodworth J.R., Worley K.C.,
RA   Wright R.A., Wu W., Young L., Zhang L., Zhang J., Zhu Y., Muzny D.M.,
RA   Weinstock G., Gibbs R.A.;
RL   Submitted (JUN-2010) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:XP_029344257.1}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2022) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_029344257.1; XM_029488397.1.
DR   AlphaFoldDB; A0A8R2NNF6; -.
DR   EnsemblMetazoa; XM_029488397.1; XP_029344257.1; LOC100573836.
DR   GeneID; 100573836; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000007819; Chromosome A1.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF914; OTOLIN-1; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007819};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..1033
FT                   /note="Thrombospondin-like N-terminal domain-containing
FT                   protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5035837468"
FT   DOMAIN          31..222
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          234..309
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          347..721
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          942..965
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1001..1033
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        234..245
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        256..265
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        266..275
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        277..289
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        293..302
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        374..389
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        555..568
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        657..676
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        689..698
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1022..1033
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1033 AA;  108111 MW;  E0EAFD4EC8350A3F CRC64;
     MSWTSIVVCA TLCGFGAATD EFSGGKFLPD EHDLQTAIKV PFEDPQLYFD SGEDGFPAFG
     IKPGSDIKSP YRLFLPEKLY SEFSIVVNFK LNSMDGGFLF AVVNPLENVV QLGVQVVPSS
     SSAMNVSFLY TDVNKYSSSS NVLATFSVPW KIRKYIRLSL KVTREYVRLF GRCLEPQTVM
     VVRDPVELLF DSASTLYIGQ AGPLIKGPFD GAIQEMKIYA SPDFADIQCT ELLQPDDDKE
     PENPEDLANG GYSDRPPAPP PPPPSNENHS YQTPNIKGDK GDAGQKGESI RGPPGPPGPP
     GSPFSGDFAT DEELVKSGVS GPRGPPGICS CNLTTLFAPG NIPELIQGPP GNPGIDGKMG
     LTGLPGAVGL PGDRGLEGIK GDKGDRGDVG PRGNEGIQGP KGDSGIDGER GLQGPPGPPG
     PPGGSDFSNN DDIGFESNVG RPGPPGPKGD PGVDGAPGLK GAKGIQGNKG VRGELGSKGV
     KGDKGHAGSP GPQGFKGERG IRGFDGTPGM PGENSRPAPK GEKGDSGTPG PPGPPGPAQS
     GVKIDKTDTA VVKTVKGDKG TKGDHGEKGS VGNLGPGGNP GPPGLTGPKG ERGEPGLPAP
     SLSTTDLGMS IKGDKGEMGR RGRRGKPGPI GPPGPPGEIG LPGLRDDGSP GKPGGLKGDK
     GDPGVSVKGD KGDPGKDGIP GAGGTYVPVP GPPGPPGIPG VAIEGQKGEP GDPGFSSSPL
     RSEINEPVIV PGAMTFPNKK SMINVTDRTQ MGTIAFIIEE EALLVRVTRG WQYISLGSLV
     TTGIDPVPTS VPIPTRVPLE SSNLVHNHPV KDNTMWHPKM LRIAALNEPY TGNMHGVQSV
     DYSCYRQSQR AGLHGAFKAF LSSRINNLKT IVHESDRDLP VVNIKGDVLF NSWKDIFSEN
     GAFISQQPRI YSFSGKNVLT DFTWPQKTIW HGSDMSGDSA VDGNCDAWNS ESSDKRGLGS
     SLTPKSDRQA KLLDQDSAFD CRNFFVVLCV EITPHSGAMF SRKRRSGGGG FHDEPQPPMS
     RQEYEKLMED IGA
//