ID A0A445F100_GLYSO Unreviewed; 1402 AA.
AC A0A445F100;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:RZB42467.1};
DE EC=1.1.1.35 {ECO:0000313|EMBL:RZB42467.1};
DE EC=2.7.7.7 {ECO:0000313|EMBL:RZB42467.1};
DE EC=4.2.1.17 {ECO:0000313|EMBL:RZB42467.1};
GN ORFNames=D0Y65_053167 {ECO:0000313|EMBL:RZB42467.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB42467.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZB42467.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB42467.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZB42467.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000020; RZB42467.1; -; Genomic_DNA.
DR Proteomes; UP000289340; Chromosome 20.
DR GO; GO:0003857; F:3-hydroxyacyl-CoA dehydrogenase activity; IEA:UniProtKB-EC.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0004300; F:enoyl-CoA hydratase activity; IEA:UniProtKB-EC.
DR GO; GO:0070403; F:NAD+ binding; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006631; P:fatty acid metabolic process; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 2.
DR Gene3D; 3.40.50.720; NAD(P)-binding Rossmann-like Domain; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR006176; 3-OHacyl-CoA_DH_NAD-bd.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR036291; NAD(P)-bd_dom_sf.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR42648:SF11; TRANSPOSON TY4-P GAG-POL POLYPROTEIN; 1.
DR Pfam; PF02737; 3HCDH_N; 1.
DR Pfam; PF07727; RVT_2; 2.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF51735; NAD(P)-binding Rossmann-fold domains; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 2.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils}; Lyase {ECO:0000313|EMBL:RZB42467.1};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotidyltransferase {ECO:0000313|EMBL:RZB42467.1};
KW Oxidoreductase {ECO:0000313|EMBL:RZB42467.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000289340};
KW Transferase {ECO:0000313|EMBL:RZB42467.1};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 106..121
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 155..246
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 842..856
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT COILED 917..944
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 1402 AA; 160513 MW; 0D90D1D95F475036 CRC64;
MDIDYAIRKD EPPAITDESS PADVALYEWW EQSNRLSVMF IKTKISAGIR GCVDQHEKEE
EWLVLEMGEN ALLTTAYGKN KAIKSQANQK GNGKIPPQAD IKKVAKCFFC KKKGHMKKNC
PGFQKWLEKK DTSFDLFYNS ECVGNGILSV GNGRYTENGQ APGPFAKFLQ EHGIVAQYTM
LGSPNQNGVA ERRNRTLLDM VRSMLSNSNL PKSLWVETLK TVVYILNRVP TKAVPKTHFE
LFKGWKPSLK HMRVWGCPSE VRIYNPQEKK LDPSTISGCF IGYAERSKVD QVDHQIHEND
EQQVKQHDPQ ENVDATLKRS TRIRKSTIHS DYIVYLQESD YNTGAENDPE TFDQAMSCKE
SNLWYDAMKD EMNSMQSNKV WNLVKMPNGA KAIRCKWVFK TKRDSLGNIE RYKARLVVEG
FTQKEGIDYT KTFSPVSKKD SLRIILALVA HFDLELQQMD VKTTFLNGDL EEEVYMKQLK
DFSSNSGEHL VCKLNKSIYG LKQASRQWYL KFHGIISSFG FEENPMDQCI YHKVSGSKTC
FLVLYVDDIL LAANDRGLLH EVKQFLSKNF DMKDMGDASM SSALRFIEID LEGDRFNLNQ
FPKNDFEREQ MKNIPYALVV GSLMYAQVCT RPDIAFAVGM LGKYQSNPGI DHWRAAKKVP
RYLQGTKDYM LMYRQTNNLD VIGYSDSDFA GCVDSRRSTS GYIFMMTGGA ISWRSVKQSL
TTTSTMEAEF VSCFEATSQG VWLKSFISGL KIVDTISRPL RIFCDNSAVV FMAKNNKSGS
RSKHIDIKYL AIRERVKDKK VRMGDCLHEL GMEDLKLLEE EMDKAAKVVR ERKIVFTPGQ
DKCFLCGQMG HMAANCEGKA KRKLLIGACG RKPKEAWCGR KPTVNHFRIL GCIAYADISN
KKRSKFDDKE RFWENNIDEA KQILANFDED NEDEELQTRE LEEQRIPAII VEYERPQRAR
RRHAWMSDYE VTGIEDPVTH FALFSDCDPT TFESVVKEEK WRKAMDDEID SIERNDTWEL
CDLPNGHNTI GVKWIFKTKQ KENGEVDKYK ARLVAKGYKQ QYGVDYTEVF SPVARHDTIR
NCTLMFDEFK KSMMNEFGMI DLGMMHYFLG IEIVQSDVGI FLSQKKYSAA ISWSSWKQPI
VTLSTTEAEF IAASTCACQA IWLRNILEEV HFKQQGPTLI YFDNSSTIKL SKNPIMHGRS
KHIDTQTLIS RIDELHGKQL PRLVGLTKGL EMILAPNLQH PLVCIDVIEA RIVVGPRVGL
WKEVEAFEGL TPRVTDRGLV PRQVKKVAII GGGLLGSGVA TALILSNYHV ILKEVNEKFL
DAASQTISFF YIQCDFPELK GDNYKIWKER ILLQLRWIDI DYAIRKDEPP AITDESSPAD
VALYERWKRS NRLSVMFIKT KI
//