ID W9S674_9ROSA Unreviewed; 890 AA.
AC W9S674;
DT 14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT 14-MAY-2014, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Putative AC transposase {ECO:0000313|EMBL:EXC28050.1};
GN ORFNames=L484_022285 {ECO:0000313|EMBL:EXC28050.1};
OS Morus notabilis.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Rosales; Moraceae; Moreae; Morus.
OX NCBI_TaxID=981085 {ECO:0000313|EMBL:EXC28050.1, ECO:0000313|Proteomes:UP000030645};
RN [1] {ECO:0000313|Proteomes:UP000030645}
RP NUCLEOTIDE SEQUENCE.
RA He N., Zhao S.;
RT "Draft Genome Sequence of a Mulberry Tree, Morus notabilis C.K. Schneid.";
RL Submitted (JAN-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBUNIT: Homodimer. {ECO:0000256|ARBA:ARBA00011738}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KE346160; EXC28050.1; -; Genomic_DNA.
DR RefSeq; XP_010110785.1; XM_010112483.1.
DR AlphaFoldDB; W9S674; -.
DR STRING; 981085.W9S674; -.
DR eggNOG; KOG1121; Eukaryota.
DR Proteomes; UP000030645; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0009791; P:post-embryonic development; IEA:UniProt.
DR CDD; cd00293; USP_Like; 1.
DR Gene3D; 3.40.50.620; HUPs; 1.
DR InterPro; IPR025525; hAT-like_transposase_RNase-H.
DR InterPro; IPR008906; HATC_C_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR014729; Rossmann-like_a/b/a_fold.
DR InterPro; IPR006016; UspA.
DR InterPro; IPR003656; Znf_BED.
DR PANTHER; PTHR46481:SF4; ZINC FINGER BED DOMAIN-CONTAINING PROTEIN 39; 1.
DR PANTHER; PTHR46481; ZINC FINGER BED DOMAIN-CONTAINING PROTEIN 4; 1.
DR Pfam; PF05699; Dimer_Tnp_hAT; 1.
DR Pfam; PF14372; hAT-like_RNase-H; 1.
DR Pfam; PF00582; Usp; 1.
DR Pfam; PF02892; zf-BED; 1.
DR SMART; SM00614; ZnF_BED; 1.
DR SUPFAM; SSF52402; Adenine nucleotide alpha hydrolases-like; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000030645};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 4..138
FT /note="UspA"
FT /evidence="ECO:0000259|Pfam:PF00582"
FT DOMAIN 263..304
FT /note="BED-type"
FT /evidence="ECO:0000259|Pfam:PF02892"
FT DOMAIN 633..726
FT /note="hAT-like transposase RNase-H fold"
FT /evidence="ECO:0000259|Pfam:PF14372"
FT DOMAIN 770..850
FT /note="HAT C-terminal dimerisation"
FT /evidence="ECO:0000259|Pfam:PF05699"
SQ SEQUENCE 890 AA; 99847 MW; EE7AE86265206ABD CRC64;
MDERKIVVVV EEVEVARTAL QWALHNLLRY GDTITLLHVF SSSSSSRSKS KKKNRLLRLK
GFQLALSFQD ICNTFSNTKV EIVVSEGDQE GRKISAIVRE IGASVLVVGL HDHSFLYKLA
MVHNDVASSL SCRVLAIKQS PSSPSGTKTK TSAAGLALNS STNMDFSQIE IAGLQWGSDM
FCLLCSSSLD TFGMAAIVGG KTTTQMEWGV NNNTFKTFKD MEPKSMMDMA VIPIDQVDIG
LGSSEKPNVV SSVKPRKKTM TSVYLKFFET APDGKSRRCK FCGQSYSIAT ATGNLGRHLS
NRHPGYDKSG DTVTNSTPQP VAVTVAKKPQ SQAKTSQVDY DHLNWLLVKW LIVAALPPST
LEERWLANSY KFLNPLIQLW PGDKYKAVFH EVFRSMQEDI RASLVHVSSR ISITLDFWTS
YEQIYYMSVT CQWIDENWSF QKVLLDICYV PYPCGGAEIY HSLVKILKMY NIENRVLSCT
HDNSQSAIHA CHSLKEDLDT QKLGSFCYIP CAARSLNLII EDGLRTMKPI ISKIREFVLG
LNASPEISED FIQLAAACQE GSWKFPLDAS ARWSGNYQML DIVKKASKSM DAVIRKYEET
LGSRMLLSSA EKNAISVVHE YLEPFYKTTN NICTNKVPTI GLVLFFMDHI SEMIAACREA
RHYPDWLKNA AEDMAKKARS YNNQVCNIFT YMTAILDPRI KGELIPENLS NENFLEEARS
HFIRNYSTSH FPSMTSGYGT QDIEDGGSVS FAEEIARKKR RASMSSATDE LTQYLSESPA
PIPTDVLDWW KVNSTRYPRL SMMARDFLAM QPTSLVPEEI FCGKGDEIDK QRLCVPHDST
QALLCVRSWI LAGMKLKFKS TEIDYERLME LATAAATDNS RLAQIGNKSK
//