ID Q9LJ55_ARATH Unreviewed; 1250 AA.
AC Q9LJ55;
DT 01-OCT-2000, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2000, sequence version 1.
DT 22-FEB-2023, entry version 90.
DE SubName: Full=Retroelement pol polyprotein-like {ECO:0000313|EMBL:BAB02990.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000313|EMBL:BAB02990.1};
RN [1] {ECO:0000313|EMBL:BAB02990.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=10907853; DOI=10.1093/dnares/7.3.217;
RA Nakamura Y.;
RT "Structural analysis of Arabidopsis thaliana chromosome 3. II. Sequence
RT features of the 4,251,695 bp regions covered by 90 P1, TAC and BAC
RT clones.";
RL DNA Res. 7:217-221(2000).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130713; DOI=10.1038/35048706;
RG European Union Chromosome 3 Arabidopsis Sequencing Consortium;
RG Institute for Genomic Research;
RG Kazusa DNA Research Institute;
RA Salanoubat M., Lemcke K., Rieger M., Ansorge W., Unseld M., Fartmann B.,
RA Valle G., Blocker H., Perez-Alonso M., Obermaier B., Delseny M., Boutry M.,
RA Grivell L.A., Mache R., Puigdomenech P., De Simone V., Choisne N.,
RA Artiguenave F., Robert C., Brottier P., Wincker P., Cattolico L.,
RA Weissenbach J., Saurin W., Quetier F., Schafer M., Muller-Auer S.,
RA Gabel C., Fuchs M., Benes V., Wurmbach E., Drzonek H., Erfle H., Jordan N.,
RA Bangert S., Wiedelmann R., Kranz H., Voss H., Holland R., Brandt P.,
RA Nyakatura G., Vezzi A., D'Angelo M., Pallavicini A., Toppo S.,
RA Simionati B., Conrad A., Hornischer K., Kauer G., Lohnert T.H.,
RA Nordsiek G., Reichelt J., Scharfe M., Schon O., Bargues M., Terol J.,
RA Climent J., Navarro P., Collado C., Perez-Perez A., Ottenwalder B.,
RA Duchemin D., Cooke R., Laudie M., Berger-Llauro C., Purnelle B., Masuy D.,
RA de Haan M., Maarse A.C., Alcaraz J.P., Cottet A., Casacuberta E.,
RA Monfort A., Argiriou A., Flores M., Liguori R., Vitale D., Mannhaupt G.,
RA Haase D., Schoof H., Rudd S., Zaccaria P., Mewes H.W., Mayer K.F., Kaul S.,
RA Town C.D., Koo H.L., Tallon L.J., Jenkins J., Rooney T., Rizzo M.,
RA Walts A., Utterback T., Fujii C.Y., Shea T.P., Creasy T.H., Haas B.,
RA Maiti R., Wu D., Peterson J., Van Aken S., Pai G., Militscher J.,
RA Sellers P., Gill J.E., Feldblyum T.V., Preuss D., Lin X., Nierman W.C.,
RA Salzberg S.L., White O., Venter J.C., Fraser C.M., Kaneko T., Nakamura Y.,
RA Sato S., Kato T., Asamizu E., Sasamoto S., Kimura T., Idesawa K.,
RA Kawashima K., Kishida Y., Kiyokawa C., Kohara M., Matsumoto M., Matsuno A.,
RA Muraki A., Nakayama S., Nakazaki N., Shinpo S., Takeuchi C., Wada T.,
RA Watanabe A., Yamada M., Yasuda M., Tabata S.;
RT "Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana.";
RL Nature 408:820-822(2000).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AP000736; BAB02990.1; -; Genomic_DNA.
DR EMBL; AP002064; BAB02990.1; JOINED; Genomic_DNA.
DR AlphaFoldDB; Q9LJ55; -.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR11439; GAG-POL-RELATED RETROTRANSPOSON; 1.
DR PANTHER; PTHR11439:SF436; RETROTRANSPOSON PROTEIN-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 189..204
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 146..169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..680
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 625..656
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 663..680
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1250 AA; 142186 MW; 6F70344C7BBE45E3 CRC64;
MGDIVLVTTG SKLKETSSAS ISCPLLSATN YNVWTMRMRL LLKVHKVWDT VEPGSEDVDK
NDIARALIFQ SVPESLTLQI GELETAKEVW ESIKTKNVGA ERVKEARLQT LMAEFERAKM
KETDTIEDFA GRLSEITTKL QHMYANHDSQ SQQGRGRGQG DRFYQRGRGR GRFGYQGYKQ
ERDTSKVVCY RCDKLGHYAS SCPDRLLKLQ KAKEEEDNDT QEAEELMVHE VVYLNEKNVK
PSEFETSSDA SNVWYLDNGA SNHMKVRFGD DSRIDIKGKG SVLFISKNKE KKILADVYFI
PDLKSNIISL GQATESGCEV RMKDDLLIMH DKDGKLLVKA NRSKNRLYKV LMEIEPPKCL
QAMVLSNSAK WHSRLGHIGV ETLKTMVKKD LVIGMPQMEV DKETCASCLL GKQVSKSFPQ
ASSYRATQNL ELIHGDLCGP ITPPTSARNR YIFVLIDDHS RYMWSILLKE KNEPFEKSKR
FKTRVEQESG VTIKTCRTDR AVRHATYVIN RVATRVLTNQ TPYEAYKGRK PNVEHIRVFG
CVGYARIESP HLKKLDDISR SLVHLGTEPG SKAYCLLDLT THKIVVSRDV VFDETKSWKW
NDLRSESTED SGNFVLGFEQ FGNNGLRRER EEHGSETSEK NTEDEDTSRV TEATETEEPI
QEEGQPQENT QPTLRRSQRQ VSMPKYLEDY VLLAEEESEY LLSVINEEPW DYAEAKETQE
WREACEDEIA SIEKNKTWDL VELPQGAKPI GLKWVFKLKK NAEGNINKYK ARLVAKGYVQ
RHGIDFDEVF APVARIETVR FIIALAASNG WEVHHLDVKT AFLHGELKEI VFVSQPEGFT
EKGSEGKVYK LNKALYGLRQ APRAWNNKLN KILGELKFVK CSKEPSLYRK QEKDDLLLVE
VYVDDLLISG SSLKLINDFK KGMASKFEMS DLGLLTYYLG IEVIQYNGGI MLKQGRYAEK
ILDETKMSDC NAVHIPMNSG LKLSKAGTEK GSHDSETEKN IEPKEYRRNI GCLRYLLHTR
PDLSYCVGVL SRYMQEPKEG HGVAMKQILR YLRGTTSYGL SFKRGDKSGL IGFSDSSHNV
DEDDGRSTTG HIFYLDGSLI TWCTQKQETV ALSSCEAEFM AATEGAKQAI WLQELLGEVT
GEACKKVRLL IDNKSAIALA KNPVFHGRSK HIHKRYHFIR ECVENEQIEV EHVPGEEQKA
NLLTKALGRI KFKEMRELVG VQELSKCEFK LKGVNVDKLE VSLRNKLTKS
//