Database: UniProt
Entry: A0A8R2NNM4_ACYPI
ID   A0A8R2NNM4_ACYPI        Unreviewed;       685 AA.
AC   A0A8R2NNM4;
DT   12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT   12-OCT-2022, sequence version 1.
DT   28-JAN-2026, entry version 13.
DE   RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
OS   Acyrthosiphon pisum (Pea aphid).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidomorpha;
OC   Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon.
OX   NCBI_TaxID=7029 {ECO:0000313|EnsemblMetazoa:XP_029344259.1, ECO:0000313|Proteomes:UP000007819};
RN   [1] {ECO:0000313|Proteomes:UP000007819}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=LSR1 {ECO:0000313|Proteomes:UP000007819};
RA   Jiang H., Abraham K., Ali S., Alsbrooks S.L., Anim B.N., Anosike U.S.,
RA   Attaway T., Bandaranaike D.P., Battles P.K., Bell S.N., Bell A.V.,
RA   Beltran B., Bickham C., Bustamante Y., Caleb T., Canada A., Cardenas V.,
RA   Carter K., Chacko J., Chandrabose M.N., Chavez D., Chavez A., Chen L.,
RA   Chu H.-S., Claassen K.J., Cockrell R., Collins M., Cooper J.A., Cree A.,
RA   Curry S.M., Da Y., Dao M.D., Das B., Davila M.-L., Davy-Carroll L.,
RA   Denson S., Dinh H., Ebong V.E., Edwards J.R., Egan A., El-Daye J.,
RA   Escobedo L., Fernandez S., Fernando P.R., Flagg N., Forbes L.D.,
RA   Fowler R.G., Fu Q., Gabisi R.A., Ganer J., Garbino Pronczuk A.,
RA   Garcia R.M., Garner T., Garrett T.E., Gonzalez D.A., Hamid H.,
RA   Hawkins E.S., Hirani K., Hogues M.E., Hollins B., Hsiao C.-H., Jabil R.,
RA   James M.L., Jhangiani S.N., Johnson B., Johnson Q., Joshi V., Kalu J.B.,
RA   Kam C., Kashfia A., Keebler J., Kisamo H., Kovar C.L., Lago L.A.,
RA   Lai C.-Y., Laidlaw J., Lara F., Le T.-K., Lee S.L., Legall F.H.,
RA   Lemon S.J., Lewis L.R., Li B., Liu Y., Liu Y.-S., Lopez J., Lozado R.J.,
RA   Lu J., Madu R.C., Maheshwari M., Maheshwari R., Malloy K., Martinez E.,
RA   Mathew T., Mercado I.C., Mercado C., Meyer B., Montgomery K., Morgan M.B.,
RA   Munidasa M., Nazareth L.V., Nelson J., Ng B.M., Nguyen N.B., Nguyen P.Q.,
RA   Nguyen T., Obregon M., Okwuonu G.O., Onwere C.G., Orozco G., Parra A.,
RA   Patel S., Patil S., Perez A., Perez Y., Pham C., Primus E.L., Pu L.-L.,
RA   Puazo M., Qin X., Quiroz J.B., Reese J., Richards S., Rives C.M.,
RA   Robberts R., Ruiz S.J., Ruiz M.J., Santibanez J., Schneider B.W.,
RA   Sisson I., Smith M., Sodergren E., Song X.-Z., Song B.B., Summersgill H.,
RA   Thelus R., Thornton R.D., Trejos Z.Y., Usmani K., Vattathil S.,
RA   Villasana D., Walker D.L., Wang S., Wang K., White C.S., Williams A.C.,
RA   Williamson J., Wilson K., Woghiren I.O., Woodworth J.R., Worley K.C.,
RA   Wright R.A., Wu W., Young L., Zhang L., Zhang J., Zhu Y., Muzny D.M.,
RA   Weinstock G., Gibbs R.A.;
RL   Submitted (JUN-2010) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:XP_029344259.1}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2022) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_029344259.1; XM_029488399.1.
DR   AlphaFoldDB; A0A8R2NNM4; -.
DR   EnsemblMetazoa; XM_029488399.1; XP_029344259.1; LOC100573836.
DR   GeneID; 100573836; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000007819; Chromosome A1.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007819}.
FT   DOMAIN          385..432
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          473..643
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..373
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          594..617
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          653..685
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        16..31
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        207..220
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        309..328
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        341..350
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        674..685
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   685 AA;  70896 MW;  2AFA404CFE2B3DAB CRC64;
     MGLTGLPGAV GLPGDRGLEG IKGDKGDRGD VGPRGNEGIQ GPKGDSGIDG ERGLQGPPGP
     PGPPGGSDFS NNDPSWKPRP IYKDIGFESN VGRPGPPGPK GDPGVDGAPG LKGAKGIQGN
     KGVRGELGSK GVKGDKGHAG SPGPQGFKGE RGIRGFDGTP GMPGENSRPA PKGEKGDSGT
     PGPPGPPGPA QSGVKIDKTD TAVVKTVKGD KGTKGDHGEK GSVGNLGPGG NPGPPGLTGP
     KGERGEPGLP APSLSTTDLG MSIKGDKGEM GRRGRRGKPG PIGPPGPPGE IGLPGLRDDG
     SPGKPGGLKG DKGDPGVSVK GDKGDPGKDG IPGAGGTYVP VPGPPGPPGI PGVAIEGQKG
     EPGDPGFSSS PLRSEINEPV IVPGAMTFPN KKSMINVTDR TQMGTIAFII EEEALLVRVT
     RGWQYISLGS LVTTGIDPVP TSVPIPTRVP LESSNLVHNH PVKDNTMWHP KMLRIAALNE
     PYTGNMHGVQ SVDYSCYRQS QRAGLHGAFK AFLSSRINNL KTIVHESDRD LPVVNIKGDV
     LFNSWKDIFS ENGAFISQQP RIYSFSGKNV LTDFTWPQKT IWHGSDMSGD SAVDGNCDAW
     NSESSDKRGL GSSLTPKSDR QAKLLDQDSA FDCRNFFVVL CVEITPHSGA MFSRKRRSGG
     GGFHDEPQPP MSRQEYEKLM EDIGA
//