ID A0A6G0TGF4_APHGL Unreviewed; 789 AA.
AC A0A6G0TGF4;
DT 12-AUG-2020, integrated into UniProtKB/TrEMBL.
DT 12-AUG-2020, sequence version 1.
DT 28-JAN-2026, entry version 16.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KAE9531463.1};
GN ORFNames=AGLY_010669 {ECO:0000313|EMBL:KAE9531463.1};
OS Aphis glycines (Soybean aphid).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidomorpha;
OC Aphidoidea; Aphididae; Aphidini; Aphis; Aphis.
OX NCBI_TaxID=307491 {ECO:0000313|EMBL:KAE9531463.1, ECO:0000313|Proteomes:UP000475862};
RN [1] {ECO:0000313|EMBL:KAE9531463.1, ECO:0000313|Proteomes:UP000475862}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Whole aphids {ECO:0000313|EMBL:KAE9531463.1};
RA Giordano R., Donthu R.K., Hernandez A.G., Wright C.L., Zimin A.V.;
RT "The genome of the soybean aphid Biotype 1, its phylome, world population
RT structure and adaptation to the North American continent.";
RL Submitted (AUG-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAE9531463.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; VYZN01000041; KAE9531463.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A6G0TGF4; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000475862; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000475862}.
FT DOMAIN 490..537
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 577..747
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 16..55
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 88..475
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 23..35
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 39..48
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 120..135
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 311..324
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 410..429
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 442..454
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 789 AA; 81759 MW; 8E3A8942B7CCDC92 CRC64;
MYTLLTDIVC ITENHGYQTP NIKGDKGDAG QKGESIRGPP GPPGPPGSPF SGDFATDEEL
VKSGVSGPRG PPGICSCNLT TLFAPGKIPE LIQGPPGNPG TDGKMGMTGL PGAVGLPGER
GLEGIKGDKG ERGDVGPRGN EGIQGPKGDS GIDGERGLQG PPGPPGPPGG SDFSNNDPSW
KPRPIYKDIG FESNVGRPGP PGPKGDPGVD GAPGLKGAKG IQGNKGVRGE MGSKGIKGDK
GHAGSIGPQG FKGERGIRGF DGTPGMPGEN ARPAPKGEKG DSGPPGPPGP PGLSQTGVKI
DKTDTAVVKT VKGDKGTKGD HGEKGAVGNL GPGGNPGPPG LTGPKGERGE PGLPAPLISA
TDLGMIKGDK GEMGRRGRRG KPGPTGPPGP PGDIGLPGLR GTPGKAGGQK GEKGDRGESV
KGDKGEPGKD GLPGSGGTYV PVPGPAGPPG PPGIPGVAIE GQKGEPGDPG FSSAPLRSEI
NEHIIVPGAL TFPNKKSMIN VTDRTQMGTI AFIIEEEALL VRVTRGWQYI SLGSLITTGI
DPAPTSVPMP TRVPLESSNL VHNHPVKDNT MWHPKMLRIA ALNEPYTGNM HGVQSVDYSC
YRQSQRAGLH GAFKAFLSSR LNNLKTIVHE SDRDLPVVNI KGDVLFNSWK DIFSENGAFI
SQQPRIYSFS GKNVLTDFTW PQKTIWHGSD ISGDSAVDGN CDAWNSESAD KRGLGSSLMQ
NADRQAKLLD QDSAYDCRNF FIVLCVEITP HSGAMFSRKR RSGVGGLHDE PQLPMSRQEY
EKFIENIRA
//