ID A0A6G0TEN4_APHGL Unreviewed; 1995 AA.
AC A0A6G0TEN4;
DT 12-AUG-2020, integrated into UniProtKB/TrEMBL.
DT 12-AUG-2020, sequence version 1.
DT 27-MAR-2024, entry version 9.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KAE9530651.1};
GN ORFNames=AGLY_011113 {ECO:0000313|EMBL:KAE9530651.1};
OS Aphis glycines (Soybean aphid).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidomorpha;
OC Aphidoidea; Aphididae; Aphidini; Aphis; Aphis.
OX NCBI_TaxID=307491 {ECO:0000313|EMBL:KAE9530651.1, ECO:0000313|Proteomes:UP000475862};
RN [1] {ECO:0000313|EMBL:KAE9530651.1, ECO:0000313|Proteomes:UP000475862}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Whole aphids {ECO:0000313|EMBL:KAE9530651.1};
RA Giordano R., Donthu R.K., Hernandez A.G., Wright C.L., Zimin A.V.;
RT "The genome of the soybean aphid Biotype 1, its phylome, world population
RT structure and adaptation to the North American continent.";
RL Submitted (AUG-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAE9530651.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; VYZN01000042; KAE9530651.1; -; Genomic_DNA.
DR Proteomes; UP000475862; Unassembled WGS sequence.
DR CDD; cd14348; UBA_p47; 1.
DR CDD; cd01770; UBX_UBXN2; 1.
DR Gene3D; 1.10.8.10; DNA helicase RuvA subunit, C-terminal domain; 1.
DR Gene3D; 3.30.420.210; SEP domain; 1.
DR InterPro; IPR036241; NSFL1C_SEP_dom_sf.
DR InterPro; IPR012989; SEP_domain.
DR InterPro; IPR009060; UBA-like_sf.
DR InterPro; IPR029071; Ubiquitin-like_domsf.
DR InterPro; IPR001012; UBX_dom.
DR PANTHER; PTHR23333:SF20; NSFL1 COFACTOR P47; 1.
DR PANTHER; PTHR23333; UBX DOMAIN CONTAINING PROTEIN; 1.
DR Pfam; PF08059; SEP; 1.
DR Pfam; PF14555; UBA_4; 1.
DR Pfam; PF00789; UBX; 1.
DR SMART; SM00553; SEP; 1.
DR SMART; SM00166; UBX; 1.
DR SUPFAM; SSF102848; NSFL1 (p97 ATPase) cofactor p47, SEP domain; 1.
DR SUPFAM; SSF46934; UBA-like; 1.
DR SUPFAM; SSF54236; Ubiquitin-like; 1.
DR PROSITE; PS51399; SEP; 1.
DR PROSITE; PS50033; UBX; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000475862}.
FT DOMAIN 1765..1834
FT /note="SEP"
FT /evidence="ECO:0000259|PROSITE:PS51399"
FT DOMAIN 1886..1963
FT /note="UBX"
FT /evidence="ECO:0000259|PROSITE:PS50033"
FT REGION 417..440
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 452..488
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 531..562
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 633..673
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 890..930
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1240..1269
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1609..1650
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1714..1737
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 643..673
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 890..909
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1616..1637
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1995 AA; 223874 MW; E35B3B21B28E6B9A CRC64;
MKKICLMFRD TRYDNFKNFN AMEHDNFMIY NFSYGLNLKD YSISRAVSCK IKFKASGEVK
YILHIFKFNL FNPSNAVLNS LDQAKIYLQT QGTCKCGLEC PFQCEHVFNF DAKVMTKPTS
GLNTMTNLCN HKRKIMFNNS ENRFSKYSLE MAENRRKRKL SGVQQQYSFP PYECEKSGKE
NFSQVSNSHQ MQWPDQSAVS SQIRFSSQGT SAMSNPQSPQ PMLNTQGSEI MPQGMQHVVP
NPQIPPILNQ SSSVMSNSQG PILPNPQSAM MSNLQGPVMP NSQETIMPNP QGTMIPNTQS
QSMSLGNPRV PHYPNHSYQQ DLGYEQSYRN GYRYNSPQCR SHETTALQPD SSCQPHHQQY
QNDMTSQDYY QKNTHSMYQP EQMQYNNYDS LRPQGNCRFN CSSHCGHYCN PSDQNPSYGN
QINSYQQQPQ SQPVRQNEST TNHEYYPEMY QNQNNYTDSY PDNRNSTTQE YPSEYEYEPQ
SQSQCHHNID SNLKNNVCTV INNQSPVEVP LIRPQTSQVK QIQQLNTQNI QQSITKPSRP
QHRSTPPWQL NKHQQSSHQQ VQQQQIQQQQ QVQQQQIQQQ QIQQQQQVQQ QQIQQQQVQQ
QQIQQQVQQH QIHERQIQQQ QIHQQQIQQQ QIQQQQISQR ESSKQIVEDR VPPIHHHIPQ
ISRAEEKNRK PLKLSTKTVN FGRKSKSPSE EDSYPSFLDD PSGYLAQQTA LLNNTISTNS
FSPLSPPARP YRQEKPRFTT NVTTMASGRT TSSNTITSVL AGRTNTSVVT VNTSEDQVLP
LSQQQSIMSK TPLEMVQSVV SNIQVPTSSE KSSIQPSHIL LTSNGQFIMA STKIQSPNTS
QVLSTVQQQP TVLVNTLQGG QSTLLLQPGN VMTVDQVQVP QLAVATGNID NNGAFSPRGS
NLLSPPDSKR KTINSKKRKS PQISPNQNNS SVLLQPQQQN YNQPVVQTLI LPNKTTQYGN
QQLITNVIQP VSLVHNLPAI QQFIVPANLG GVVMADNSIL QDTMQLNVIT PFSNTGQNIL
PTGMVLRTPQ TQQRPQQSNQ FIVNSVGQLS PILANLSPNQ SQNQNRNQQQ NDYIHVVPCS
IQQNQENTTV VQQNTTIVQQ QMTMVSGQQG NEQGNLIINE KQGQNYIIAD NKQQGFILSP
KDKQQSGGTH FILNNMNSEK QTQSFIITSP TSGDKQLCGN FILEKTNSSG NFIIATTNSD
KNSKFSKHSV STQTAAGQQM LQISSTPALI VATNRNAYVG SPPDTTTLSP VSGQSPSAKS
EMPSSSVTAD LDAALSPSST LSDIQNRQPM VHCISSSNVV DWSDNSADER KTISNASDSP
NMYSIEHSFP RCKIQYHKQM YSTWKSNYKS NKLVVNLIFM FNELSLDKMS NNKPLNTFYL
CNFEIALYIL CFYTRKSRSY CKALIKSAGS LNDTNPKPRV FIVRRSRTTR AFWNEGYRLN
VRANNSSVTS LPKSPQNNRK SSENIIILDY LSYYAAHSAA ATALPPADTT TVLDVALTAS
AISLPGNTNL RALYFPCMHL LNHVHLDIHC QNLNDDDLHD RMIEDQCECH YYSGQLTILS
NNTMSDSEQS NKVNEFAGIA NVDTERAKFY LESAAWNMDA ALASFYDEGT DDEPLSDNAE
QSSSRPAAVS NRDVPMASVS SYKPAAKPKK WQPQSRIMTF SSLKNAEVED KDSDDEEGQR
FYAGGSITSG QQVIGPPRNN TDVITDMFQT AQKYASTSAP SGSSNSTQDS GASNFFGTGY
KLGQTENDTE VIPSHNSTTK RSSNQEEVVL KVWKEGFTIN DGELHSIDRP ENREFLLLVA
RGEEIPPLLL KEANVSSEDE LHVSVEDHRY EEYVPSKPKK KIFGGSGNLL GSPAPDVVGN
EVPKENSSDN GVANETNARA EVPLIPDEPT TSLQIRLVDG TRIVATFNHS HTIGDIRRYI
IAARSSFASR PFKLQSSYPP KTLDNNDQTL SEAGLLNTCK SVEGNCQESV EKVLYKSLWP
AIELWPSWKQ TDLLF
//