ID A0A2K3MWZ8_TRIPR Unreviewed; 1433 AA.
AC A0A2K3MWZ8;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE SubName: Full=Retroelement pol polyprotein-like {ECO:0000313|EMBL:PNX95321.1};
GN ORFNames=L195_g018511 {ECO:0000313|EMBL:PNX95321.1};
OS Trifolium pratense (Red clover).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; Hologalegina; IRL clade; Trifolieae; Trifolium.
OX NCBI_TaxID=57577 {ECO:0000313|EMBL:PNX95321.1, ECO:0000313|Proteomes:UP000236291};
RN [1] {ECO:0000313|EMBL:PNX95321.1, ECO:0000313|Proteomes:UP000236291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tatra {ECO:0000313|Proteomes:UP000236291};
RC TISSUE=Young leaves {ECO:0000313|EMBL:PNX95321.1};
RX PubMed=24500806; DOI=10.3732/ajb.1300340;
RA Istvanek J., Jaros M., Krenek A., Repkova J.;
RT "Genome assembly and annotation for red clover (Trifolium pratense;
RT Fabaceae).";
RL Am. J. Bot. 101:327-337(2014).
RN [2] {ECO:0000313|EMBL:PNX95321.1, ECO:0000313|Proteomes:UP000236291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tatra {ECO:0000313|Proteomes:UP000236291};
RC TISSUE=Young leaves {ECO:0000313|EMBL:PNX95321.1};
RX PubMed=28382043; DOI=10.3389/fpls.2017.00367;
RA Istvanek J., Dluhosova J., Dluhos P., Patkova L., Nedelnik J., Repkova J.;
RT "Gene Classification and Mining of Molecular Markers Useful in Red Clover
RT (Trifolium pratense) Breeding.";
RL Front. Plant Sci. 8:367-367(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PNX95321.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASHM01013347; PNX95321.1; -; Genomic_DNA.
DR Proteomes; UP000236291; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR029472; Copia-like_N.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF14244; Retrotran_gag_3; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000236291}.
FT DOMAIN 546..712
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..53
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 255..284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 312..341
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 312..336
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1433 AA; 163212 MW; 4837E6120B72E2EE CRC64;
MASEQVSDSD SESNSNVNSA AEKNDTSSTT KSTSAVTNTS PYYLGPSDNP GTPLVATTLK
GENHRNWARS MRTALRAKSK LGFIDGSIKK PAKTNPDFYN WEKADSMVMA WIINAVDPVL
HGSISHASTA RDIWEDLEER FAQTNAPRIH QLWRMLCLME HESDMTVTEY YTKFKSLLDE
LGELQPLPEC TCGASKEILQ REGDQQVHLF LGSLNNERFE HVKAAVLNTE PLPSLRRVFN
HVLREEARIM GEKERMATTK RESGGSAFYA SNQNRQRRKD GSNSKCDYCG KTGHIKAGCF
EIIGYPENWH TRRTQRRSRD DGGQPSAHHT HATEETVQGR ALHGSRVMKH DLCESGKNTS
CKDLEWVLDS GASHHMTPLL TLMRGVTKIE KPFYVTVPTG NTVLVEMMGT IDLSKDITLQ
NVLLVPKFDC NLISICQLTR DLNCFVTYHP SYCMIQDFAT KKKIGLGDVH GGVYVLKQQV
QGSAFAAYHE DNTALWHARM GHPSPQVMQR ISQLVNFNFC SNKLRCCDIC HRSKQCRLPF
HISYNKAENP FDLIHCDLWG KYRTSSHSGC HYFLTIVDDY SRGTWVYLLK EKTEVLRILT
NFINMVNTQF NVKIKRLRSD NGTEFTNHAF QGILQREGIL HETSCVGTPQ QNARVERKHR
HILNVARALR FQANLPISFW GECVLTATYL INRTPSMIND GLTPYEKLLG KPSSYEHIKT
FGCLCYVKNS NKQQDKFDSR AEKCVFVGYP KGQKGWTVYN LKTQEIYVSR DVVFYEDIFP
YASQEKDSNG ENHSSTFSLD FCSVGEEIVP SNDGQQDQII ISNEQREEQV VETNSDVITE
VEEPQNKGES ETMINIDMGP RNRRPPKRLD DYYCYSAEIT HTCTSPKTST SSGIIYPISN
YVNYDEFSQK YQAYLAAVES IEEPQSFKQA IHKQEWREAM SQELKALEEN KTWEMSRLPK
GKKAVGCRWV YKVKYKSTGE VEKYKARLVA KGYTQVEGDD FNETFAPDAK MTTVRCLLTV
AVAKGWELHQ MDVSNAFLHG ELDEEVYMEV PPGYRVPDKE MVCRLRKSLY GLRQASRNWY
SKLSQALVKY GFHECEADHS LFTYSHGSIF IAVLIYVDDL VVTGNDAKSC EKFKQYLNKC
FHMKDLGELK YFLGLELARG SSGLFICQRK YALDILNECG MLACKPSSIP LEPNHKLALD
SSPLYENPSQ YRRLVGRLIY LTITRPELTY SVHILSQFMQ EPHQGHWDAA MHVLRYLKSS
PGQGIIIPRD NDLRLVAYCD SDYASCPLTR RSISGYVMKL GTAPISWKTK KQTTVSRSSS
EAEYRAMAHA TSEVIWLRRL LTHLQVRCDS PTVLHCDNRA AIHLASNPVF HERTKHIEVD
CHFIREFLVK GIISTVHIPT KSQQADIFTK SLGTKQFREL SDKLGSHHPQ SPT
//