ID G3WYE9_SARHA Unreviewed; 339 AA.
AC G3WYE9;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 54.
DE SubName: Full=Transcription factor 7 like 2 {ECO:0000313|Ensembl:ENSSHAP00000020454.2};
GN Name=TCF7L2 {ECO:0000313|Ensembl:ENSSHAP00000020454.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000020454.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000020454.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000020454.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the TCF/LEF family.
CC {ECO:0000256|ARBA:ARBA00006569}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WYE9; -.
DR Ensembl; ENSSHAT00000020617.2; ENSSHAP00000020454.2; ENSSHAG00000022847.1.
DR GeneTree; ENSGT00940000155535; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0016055; P:Wnt signaling pathway; IEA:UniProtKB-KW.
DR CDD; cd21996; HMG-box_TCF7-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR013558; CTNNB1-bd_N.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR024940; TCF/LEF.
DR PANTHER; PTHR10373; TRANSCRIPTION FACTOR 7 FAMILY MEMBER; 1.
DR PANTHER; PTHR10373:SF32; TRANSCRIPTION FACTOR 7-LIKE 2; 1.
DR Pfam; PF08347; CTNNB1_binding; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM01366; c-clamp; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Wnt signaling pathway {ECO:0000256|ARBA:ARBA00022687}.
FT DOMAIN 207..275
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 207..275
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 71..104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 175..207
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 277..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..100
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 175..189
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..207
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 339 AA; 38091 MW; 20D5FE32E506DE8A CRC64;
MYPRDAILVG MEDLSGKGAA VASGFSLFQQ LGCWHLNLNC WSNKVPVVQH PHHVHPLTPL
ITYSNEHFTP GNPPPHLPAD VDPKTGIPRP PHPPDISPYY PLSPGTVGQI PHPLGWLVPQ
QGQPVYPITT GGFRHPYPTA LTVNASMSRF PPHMVPPHHT LHTTGIPHPA IVTPTVKQES
SQSDVASLHS SKHQDSKKEE EKKKPHIKKP LNAFMLYMKE MRAKVVAECT LKESAAINQI
LGRRWHALSR EEQAKYYELA RKERQLHMQL YPGWSARDNY GKKKKRKRDK QPGETNEHSE
CFLNPCLSLP PITDLSAPKK CRARFGLDQQ NNWCGPCSL
//