ID A0A2K3PDY6_TRIPR Unreviewed; 791 AA.
AC A0A2K3PDY6;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE RecName: Full=Retrotransposon gag domain-containing protein {ECO:0000259|Pfam:PF03732};
GN ORFNames=L195_g010162 {ECO:0000313|EMBL:PNY13506.1};
OS Trifolium pratense (Red clover).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; Hologalegina; IRL clade; Trifolieae; Trifolium.
OX NCBI_TaxID=57577 {ECO:0000313|EMBL:PNY13506.1, ECO:0000313|Proteomes:UP000236291};
RN [1] {ECO:0000313|EMBL:PNY13506.1, ECO:0000313|Proteomes:UP000236291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tatra {ECO:0000313|Proteomes:UP000236291};
RC TISSUE=Young leaves {ECO:0000313|EMBL:PNY13506.1};
RX PubMed=24500806; DOI=10.3732/ajb.1300340;
RA Istvanek J., Jaros M., Krenek A., Repkova J.;
RT "Genome assembly and annotation for red clover (Trifolium pratense;
RT Fabaceae).";
RL Am. J. Bot. 101:327-337(2014).
RN [2] {ECO:0000313|EMBL:PNY13506.1, ECO:0000313|Proteomes:UP000236291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tatra {ECO:0000313|Proteomes:UP000236291};
RC TISSUE=Young leaves {ECO:0000313|EMBL:PNY13506.1};
RX PubMed=28382043; DOI=10.3389/fpls.2017.00367;
RA Istvanek J., Dluhosova J., Dluhos P., Patkova L., Nedelnik J., Repkova J.;
RT "Gene Classification and Mining of Molecular Markers Useful in Red Clover
RT (Trifolium pratense) Breeding.";
RL Front. Plant Sci. 8:367-367(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PNY13506.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASHM01006120; PNY13506.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2K3PDY6; -.
DR Proteomes; UP000236291; Unassembled WGS sequence.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR PANTHER; PTHR33067; ASP_PROTEASE DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR33067:SF38; TRANSCRIPTION FACTOR INTERACTOR AND REGULATOR CCHC(ZN) FAMILY; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000236291}.
FT DOMAIN 87..179
FT /note="Retrotransposon gag"
FT /evidence="ECO:0000259|Pfam:PF03732"
FT REGION 316..351
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 207..253
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 278..315
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 791 AA; 91399 MW; C3654D9BA2CE1676 CRC64;
MAGENSTRNV MGNYFKKTDT EEVTLGFQPA NPTTLEVKTV VMNELRNNQF KGDSSQDPWE
HLVKFNEICA LQKRPEHTTD DQKKLFMFAY SLTQQAKDWL YCLPTKTIQT WKELEGKFLD
RFFTEDQFKE RKAELMNFQQ HKKECLYQSH ERFKLLKRRC PNHQICAAEL MYIFINGMKQ
KQRMFLDASA GGTIQNKTPA EVEELIEKMC ENKYNKMEDE EETLQDQLEE RKRKEHIEKI
NKLQREEDSK KQQESYTTQI TNLESVIVQL AKNMGEHIQS SNAQIQHLIN QVSDLKDEQC
KAIELRNRLV DITERPKKSK KAKDNGTDHS QQEVPAEEAV TEEEREPGTE PEVRIEINQP
VFDNNQTPSQ STPILNTRLP EIFAPYPTKD GKKEKEKVQN RQFEGYLKQM EINIPLGDML
RISPPFHKYV KNVVAGKIKL QDKENIELTA ECSAIFQKTL PRKCKDPGSF TISCTIGGVE
IGRALCDLGA SVNLMPLSIM KKLNCGEAKP TRMTLILADR TRVYPHGILE DVLVRVDDTI
FPADFVIMDI EEDDEAPILL GRPFLTTFKA LIDMETREIK FRVDGNEVTF NINNMVPQKK
EKPECYKVDI VETLVKEQLE TPAPGIQRAI LQSLKAEEEG MEEETNLSVR WLNREAHCKY
PQKYKPLEFD PNEKPKPLEL KELPPYLKYI FLGEGKTKPA IISNSLPPER EERLIKMEEE
YKPVVQPQRR LNPVMNEVVR KEVNKLLAAG MIYPISDSPW VSPVHVVPKK GGITVMKNEK
NELIPTRNVT G
//