Database: UniProt
Entry: A0A8R2NNH0_ACYPI
ID   A0A8R2NNH0_ACYPI        Unreviewed;      1041 AA.
AC   A0A8R2NNH0;
DT   12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT   12-OCT-2022, sequence version 1.
DT   28-JAN-2026, entry version 13.
DE   RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS   Acyrthosiphon pisum (Pea aphid).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidomorpha;
OC   Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon.
OX   NCBI_TaxID=7029 {ECO:0000313|EnsemblMetazoa:XP_029344256.1, ECO:0000313|Proteomes:UP000007819};
RN   [1] {ECO:0000313|Proteomes:UP000007819}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=LSR1 {ECO:0000313|Proteomes:UP000007819};
RA   Jiang H., Abraham K., Ali S., Alsbrooks S.L., Anim B.N., Anosike U.S.,
RA   Attaway T., Bandaranaike D.P., Battles P.K., Bell S.N., Bell A.V.,
RA   Beltran B., Bickham C., Bustamante Y., Caleb T., Canada A., Cardenas V.,
RA   Carter K., Chacko J., Chandrabose M.N., Chavez D., Chavez A., Chen L.,
RA   Chu H.-S., Claassen K.J., Cockrell R., Collins M., Cooper J.A., Cree A.,
RA   Curry S.M., Da Y., Dao M.D., Das B., Davila M.-L., Davy-Carroll L.,
RA   Denson S., Dinh H., Ebong V.E., Edwards J.R., Egan A., El-Daye J.,
RA   Escobedo L., Fernandez S., Fernando P.R., Flagg N., Forbes L.D.,
RA   Fowler R.G., Fu Q., Gabisi R.A., Ganer J., Garbino Pronczuk A.,
RA   Garcia R.M., Garner T., Garrett T.E., Gonzalez D.A., Hamid H.,
RA   Hawkins E.S., Hirani K., Hogues M.E., Hollins B., Hsiao C.-H., Jabil R.,
RA   James M.L., Jhangiani S.N., Johnson B., Johnson Q., Joshi V., Kalu J.B.,
RA   Kam C., Kashfia A., Keebler J., Kisamo H., Kovar C.L., Lago L.A.,
RA   Lai C.-Y., Laidlaw J., Lara F., Le T.-K., Lee S.L., Legall F.H.,
RA   Lemon S.J., Lewis L.R., Li B., Liu Y., Liu Y.-S., Lopez J., Lozado R.J.,
RA   Lu J., Madu R.C., Maheshwari M., Maheshwari R., Malloy K., Martinez E.,
RA   Mathew T., Mercado I.C., Mercado C., Meyer B., Montgomery K., Morgan M.B.,
RA   Munidasa M., Nazareth L.V., Nelson J., Ng B.M., Nguyen N.B., Nguyen P.Q.,
RA   Nguyen T., Obregon M., Okwuonu G.O., Onwere C.G., Orozco G., Parra A.,
RA   Patel S., Patil S., Perez A., Perez Y., Pham C., Primus E.L., Pu L.-L.,
RA   Puazo M., Qin X., Quiroz J.B., Reese J., Richards S., Rives C.M.,
RA   Robberts R., Ruiz S.J., Ruiz M.J., Santibanez J., Schneider B.W.,
RA   Sisson I., Smith M., Sodergren E., Song X.-Z., Song B.B., Summersgill H.,
RA   Thelus R., Thornton R.D., Trejos Z.Y., Usmani K., Vattathil S.,
RA   Villasana D., Walker D.L., Wang S., Wang K., White C.S., Williams A.C.,
RA   Williamson J., Wilson K., Woghiren I.O., Woodworth J.R., Worley K.C.,
RA   Wright R.A., Wu W., Young L., Zhang L., Zhang J., Zhu Y., Muzny D.M.,
RA   Weinstock G., Gibbs R.A.;
RL   Submitted (JUN-2010) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:XP_029344256.1}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2022) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_029344256.1; XM_029488396.1.
DR   AlphaFoldDB; A0A8R2NNH0; -.
DR   EnsemblMetazoa; XM_029488396.1; XP_029344256.1; LOC100573836.
DR   GeneID; 100573836; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000007819; Chromosome A1.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF914; OTOLIN-1; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007819};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..1041
FT                   /note="Thrombospondin-like N-terminal domain-containing
FT                   protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5035934713"
FT   DOMAIN          31..222
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          234..309
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          363..729
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          950..973
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1009..1041
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        234..245
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        256..265
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        266..275
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        277..289
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        293..302
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        374..389
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        565..578
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        648..658
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        665..684
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        697..706
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1030..1041
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1041 AA;  109135 MW;  DA2CAE7E5444E3B1 CRC64;
     MSWTSIVVCA TLCGFGAATD EFSGGKFLPD EHDLQTAIKV PFEDPQLYFD SGEDGFPAFG
     IKPGSDIKSP YRLFLPEKLY SEFSIVVNFK LNSMDGGFLF AVVNPLENVV QLGVQVVPSS
     SSAMNVSFLY TDVNKYSSSS NVLATFSVPW KIRKYIRLSL KVTREYVRLF GRCLEPQTVM
     VVRDPVELLF DSASTLYIGQ AGPLIKGPFD GAIQEMKIYA SPDFADIQCT ELLQPDDDKE
     PENPEDLANG GYSDRPPAPP PPPPSNENHS YQTPNIKGDK GDAGQKGESI RGPPGPPGPP
     GSPFSGDFAT DEELVKSGVS GPRGPPGICS CNLTTLFAPG NIPELIQGPP GNPGIDGKMG
     LTGLPGAVGL PGDRGLEGIK GDKGDRGDVG PRGNEGIQGP KGDSGIDGER GLQGPPGPPG
     PPGGSDFSNN DPSWKPRPIY KDIGFESNVG RPGPPGPKGD PGVDGAPGLK GAKGIQGNKG
     VRGELGSKGV KGDKGHAGSP GPQGFKGERG IRGFDGTPGM PGENSRPAPK GEKGDSGTPG
     PPGPPGPAQS GVKIDKTDTA VVKTVKGDKG TKGDHGEKGS VGNLGPGGNP GPPGLTGPKG
     ERGEPGLPAP SLSTTDLGMS IKGDKGEMGR RGRRGKPGPI GPPGPPGEIG LPGLRGSPGK
     PGGLKGDKGD PGVSVKGDKG DPGKDGIPGA GGTYVPVPGP PGPPGIPGVA IEGQKGEPGD
     PGFSSSPLRS EINEPVIVPG AMTFPNKKSM INVTDRTQMG TIAFIIEEEA LLVRVTRGWQ
     YISLGSLVTT GIDPVPTSVP IPTRVPLESS NLVHNHPVKD NTMWHPKMLR IAALNEPYTG
     NMHGVQSVDY SCYRQSQRAG LHGAFKAFLS SRINNLKTIV HESDRDLPVV NIKGDVLFNS
     WKDIFSENGA FISQQPRIYS FSGKNVLTDF TWPQKTIWHG SDMSGDSAVD GNCDAWNSES
     SDKRGLGSSL TPKSDRQAKL LDQDSAFDCR NFFVVLCVEI TPHSGAMFSR KRRSGGGGFH
     DEPQPPMSRQ EYEKLMEDIG A
//