ID A0A6G0SUF4_APHGL Unreviewed; 1062 AA.
AC A0A6G0SUF4;
DT 12-AUG-2020, integrated into UniProtKB/TrEMBL.
DT 12-AUG-2020, sequence version 1.
DT 27-MAR-2024, entry version 10.
DE RecName: Full=PiggyBac transposable element-derived protein domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=AGLY_017673 {ECO:0000313|EMBL:KAE9521939.1};
OS Aphis glycines (Soybean aphid).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidomorpha;
OC Aphidoidea; Aphididae; Aphidini; Aphis; Aphis.
OX NCBI_TaxID=307491 {ECO:0000313|EMBL:KAE9521939.1, ECO:0000313|Proteomes:UP000475862};
RN [1] {ECO:0000313|EMBL:KAE9521939.1, ECO:0000313|Proteomes:UP000475862}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Whole aphids {ECO:0000313|EMBL:KAE9521939.1};
RA Giordano R., Donthu R.K., Hernandez A.G., Wright C.L., Zimin A.V.;
RT "The genome of the soybean aphid Biotype 1, its phylome, world population
RT structure and adaptation to the North American continent.";
RL Submitted (AUG-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAE9521939.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; VYZN01001834; KAE9521939.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A6G0SUF4; -.
DR Proteomes; UP000475862; Unassembled WGS sequence.
DR GO; GO:0090304; P:nucleic acid metabolic process; IEA:UniProt.
DR InterPro; IPR025398; DUF4371.
DR InterPro; IPR029526; PGBD.
DR InterPro; IPR012337; RNaseH-like_sf.
DR PANTHER; PTHR47272; DDE_TNP_1_7 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR47272:SF1; PIGGYBAC TRANSPOSABLE ELEMENT-DERIVED PROTEIN 3-LIKE; 1.
DR Pfam; PF13843; DDE_Tnp_1_7; 1.
DR Pfam; PF14291; DUF4371; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000475862}.
FT DOMAIN 214..577
FT /note="PiggyBac transposable element-derived protein"
FT /evidence="ECO:0000259|Pfam:PF13843"
FT DOMAIN 812..994
FT /note="DUF4371"
FT /evidence="ECO:0000259|Pfam:PF14291"
FT REGION 617..646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KAE9521939.1"
SQ SEQUENCE 1062 AA; 122582 MW; 0E7DE86178DBE9A0 CRC64;
ESNYIHLTKI TDSNNCLIEH LLENEFHLKN FDEKLTIVIN GRPCPEIPKL TSNHKEKNKL
KMSSKLTNEQ LLDMLDGMNS DLELSDDEDD DNVDDIDIAN EILENIYDTD DGEDDDNIDE
IDIVNETVER VYDTIDNEDD GDIFIPDQLP IQIPPTDEVS VPNIVSDTNT SPILKQFSTF
AKSSIKWLCK PMKQKNIVLR SLEQTEFPST IPPPISYFMK YFPEEAFSKM AIFTNIYAEQ
KSTNKWVQTT SAEMKVFVGI HLMMGVLNLP RVRMYWQKEF RIEIIASNMT RNRFFELRTH
FHVMNNEEIP QTNIDKFIKV RPLYNYMKHR FHQLPIERNI SIDEQMVPFK GKLAPKQYMR
GKPHPWGIKL FLMCGSSGIV YDFIMFQGSS TELDPVMQNL FGQGGAVVMQ FIERLEENRH
FVYFDNYFTS YNLLSVLADR KIYAAGTVRV NRFANPPLIT DKCLSKMGRG TSYEVSGIAQ
GQKSEIGLIK WFDNKGVNLG SNFITSGEPE TIKRWDKKHK KFVDVERPEV IGLYNKSMGG
VDVHDQLVSF YRTFIKSRKW TLRLIAHAFD MASVNSWLEY KKDVRHNTIL DRDTMDLLAF
KERLATTLIS LGRTKSFITP PRKRGRPSTS PSPTPEPEAQ IVRTKPRSVD STPYEETIKD
GFDHMPTFDN KQNSTRCKNA PCTFKTHVYC DKCNIHLCFI PGRECYLKKK FLLEIFNSGS
ERVNISYYLN TPWLTGSAKL NKLFCWPCVL FTREKNVWSH RGFVNLNNLT NGIQKHERSQ
TYISSVLKFK MFGKSRIYLQ LDEQRRSDVI RHNESVKKDR DVLKLFIDCV CFLAIHELPF
RGHNERVNSF NQGNFLGYLN LLSSYDSILN MHLETSKIFR GTSNRIQNDL ICSVSNIISM
SIESEIKNTN FVAILLDETS DITNLSQLST TLRYVQHETG EAHERFISFV DVSADRSADG
LLKHVIDIVN RYELKHKLAA QTYDGAAVMS GHVGGLQIKV KELYPKALFV HCFSHSLNLV
LSQLASNIKD CKIFFQTLTG MGSFFTKSSK RTLYLILQEK KF
//