ID K7F172_PELSI Unreviewed; 552 AA.
AC K7F172;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 55.
DE SubName: Full=Tigger transposable element-derived protein 1-like {ECO:0000313|Ensembl:ENSPSIP00000001782.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000001782.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000001782.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00320}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01200537; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_006138987.1; XM_006138925.2.
DR AlphaFoldDB; K7F172; -.
DR Ensembl; ENSPSIT00000001789.1; ENSPSIP00000001782.1; ENSPSIG00000001789.1.
DR GeneID; 102458430; -.
DR KEGG; pss:102458430; -.
DR eggNOG; KOG3105; Eukaryota.
DR GeneTree; ENSGT00940000163154; -.
DR HOGENOM; CLU_018294_1_4_1; -.
DR OMA; NIARHTK; -.
DR OrthoDB; 2967227at2759; -.
DR TreeFam; TF101131; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR004875; DDE_SF_endonuclease_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR006600; HTH_CenpB_DNA-bd_dom.
DR InterPro; IPR007889; HTH_Psq.
DR PANTHER; PTHR19303:SF61; HTH CENPB-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR19303; TRANSPOSON; 1.
DR Pfam; PF04218; CENP-B_N; 1.
DR Pfam; PF03184; DDE_1; 1.
DR Pfam; PF03221; HTH_Tnp_Tc5; 1.
DR SMART; SM00674; CENPB; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51253; HTH_CENPB; 1.
DR PROSITE; PS50960; HTH_PSQ; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00320};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00320};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267}.
FT DOMAIN 12..63
FT /note="HTH psq-type"
FT /evidence="ECO:0000259|PROSITE:PS50960"
FT DOMAIN 77..157
FT /note="HTH CENPB-type"
FT /evidence="ECO:0000259|PROSITE:PS51253"
FT DNA_BIND 39..59
FT /note="H-T-H motif"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00320"
SQ SEQUENCE 552 AA; 63038 MW; A17A5962D4EFCC7C CRC64;
MADKQTSKGS DTPERRKRKA INFEMKLDIL KRAEKGETQS QIGYAFGLNR STIATIIKDK
ARILEHVKGS ALMHSTIITK KRSGIISDME KLLILWLEDQ HQRKVPVSVM LIQEKALSLY
KDLKKNLGEN ARDVEPFVAS RGWFNRFKAR ANLHNIKVSG EAASADEKAA SAFLETFAAT
IKEGNYSAHQ VFNVEETELF WKKMPERTYI SKEEKTMPGF KAAKDHLILL LGGNAAGDFK
IKPLLVYHSE TPRAFKGVTK AALPVVWKSS HKAWVTLVIF EDWFFHHFVP EVKLYCAKNN
IPFRILLVLN HAPGHPVSLD DFHPDIKVVF IPPNTSSLLQ PMDQGATRLF KAYYTRRIFA
QAIKTIKGEE ASTFKDFWRG YNMYAAVKNI GESWHEVMQT SINRIWKKLY PQFVNDFKGF
EETLKNVIEN IIEMGKELDF DLEVYDVEEL LDSHAEDLSN DDLVHLEAQK VAEEEAAALW
ADTPPSKRFT VKKMSEAFQM IEGAMALFEE QDPNSFRFAS VSRAVRNALT CYREIYLEKK
VPDFKGTSAI SL
//