GenomeNet

Database: UniProt
Entry: A0A445G5T3_GLYSO
LinkDB: A0A445G5T3_GLYSO
Original site: A0A445G5T3_GLYSO 
ID   A0A445G5T3_GLYSO        Unreviewed;      1344 AA.
AC   A0A445G5T3;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   24-JAN-2024, entry version 17.
DE   RecName: Full=Transcription initiation factor TFIID subunit 2 {ECO:0000256|ARBA:ARBA00017363};
GN   ORFNames=D0Y65_045610 {ECO:0000313|EMBL:RZB56541.1};
OS   Glycine soja (Wild soybean).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC   Glycine subgen. Soja.
OX   NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB56541.1, ECO:0000313|Proteomes:UP000289340};
RN   [1] {ECO:0000313|EMBL:RZB56541.1, ECO:0000313|Proteomes:UP000289340}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC   TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB56541.1};
RA   Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT   "A high-quality reference genome of wild soybean provides a powerful tool
RT   to mine soybean genomes.";
RL   Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the TAF2 family.
CC       {ECO:0000256|ARBA:ARBA00010937}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:RZB56541.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; QZWG01000017; RZB56541.1; -; Genomic_DNA.
DR   Proteomes; UP000289340; Chromosome 17.
DR   GO; GO:0005669; C:transcription factor TFIID complex; IEA:InterPro.
DR   GO; GO:0008237; F:metallopeptidase activity; IEA:InterPro.
DR   GO; GO:0003743; F:translation initiation factor activity; IEA:UniProtKB-KW.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   Gene3D; 1.10.390.10; Neutral Protease Domain 2; 1.
DR   Gene3D; 2.60.40.1730; tricorn interacting facor f3 domain; 1.
DR   InterPro; IPR042097; Aminopeptidase_N-like_N_sf.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR014782; Peptidase_M1_dom.
DR   InterPro; IPR027268; Peptidase_M4/M1_CTD_sf.
DR   InterPro; IPR037813; TAF2.
DR   PANTHER; PTHR15137; TRANSCRIPTION INITIATION FACTOR TFIID; 1.
DR   PANTHER; PTHR15137:SF9; TRANSCRIPTION INITIATION FACTOR TFIID SUBUNIT 2; 1.
DR   Pfam; PF01433; Peptidase_M1; 1.
DR   SUPFAM; SSF48371; ARM repeat; 1.
DR   SUPFAM; SSF63737; Leukotriene A4 hydrolase N-terminal domain; 1.
DR   SUPFAM; SSF55486; Metalloproteases ('zincins'), catalytic domain; 1.
PE   3: Inferred from homology;
KW   Initiation factor {ECO:0000313|EMBL:RZB56541.1};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Protein biosynthesis {ECO:0000313|EMBL:RZB56541.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000289340};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   DOMAIN          248..437
FT                   /note="Peptidase M1 membrane alanine aminopeptidase"
FT                   /evidence="ECO:0000259|Pfam:PF01433"
FT   REGION          132..151
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1044..1114
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1170..1191
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1206..1291
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1312..1344
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1076..1094
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1213..1227
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1240..1255
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1263..1291
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1344 AA;  151505 MW;  49D91706654BC3BC CRC64;
     MAKPRKPKNN EDPKPENSGA VVHHQKLCLS IDIDKRQVHG YTELEIAVPE IGIVGLHAEN
     LGIESVCVDG EPTEFEYYPH HQQQGEDEKR FSSVCSPSSA ADAAVSVYMS ALEKELVPNL
     LINCCKPSKA ESEQQQEQQP TSENGFHSSA EPKQNVRTVR IDYWIEKAET GIHFRNNLLH
     TDNQIRRARC WFPCIDDNSQ RCCYDLEFTV AYNLVAVSTG SLLYQVLSKD NPPQKTYVYK
     LDVPVAASCY KDYLSVDFPF DSYTQVFIEP EMAVSSLSLG ASMSVFSSQV LFDEKVIDQT
     IDTRVKLAYA LARQWFGVYI TPEAPNDEWL LDGLAGFLTD FFIKKHLGNN EARYRRYKAN
     CAVCKVDNGG ATALSCSASC KDLYGTQCIG LYGKIRSWKS VAALQMLEKQ MGPESFRRIL
     QTIVSRAQDK TRSIKTLSTK EFRHFANKVG NLERPFLKDF FPRWVGSCGC PVLRMGFSYN
     KRKNMVELAV LRGCTALQTS NTSILDINPD TETRDGDTGW PGMMSIRVYE LDGMYDHPIL
     PMAGDAWQLL EIQCHSKLAA RRFQKPKKGL KLDGSDDNGD VPSMDMRSNT ESPLLWIRAD
     PDMEYLAEVH FNQPVQMWIN QLEKDKDVIA QAQAIAALEA SPQLSFSIVN ALNNFLSDSK
     AFWRVRIEAA FALANSASEE TDFSGLLHLM KFYKSRRFDT DIGLPKPNDF HDFAEYFVLE
     AIPHAVAMVR AADKKSPREA IEFVLQLLKY NDNNGNPYSD VFWLAALVQS VGELEFGQQS
     ILLLSSLLKR IDRLLQFDSL MPSYNGILTI SCIRTLTQIA LKLSGFIPLD RVYGLVKPFR
     DIKALWQVRI EASRALLDLE FHCKGMDSAL LLFIKYIEEE HSLRGQLKLA THVMRLCQMR
     DGLNSNDEIT SQTLVSMLNL LEGRIAFNNA FLRHYLFCIL QILARRHPTL HGIPRENRML
     HMSLTEASNY QKNMLALDSE SKPLDLPSSI DDLTQNLGPT MEGLRDALDE APKDQPCEAP
     TQVHLEALKE ASLEKPKEVF TEFPQEAPIE APNEISKEAD TVSNSHERKR PIKIKVKQSS
     ATSRADTDNQ VVERSLGGRN EMDHGASSSV SVDAPQRNFA ETVSISNHNI DEVNSWHDRG
     SRMTASIGSA KFLSDGDELV KELQCTADSS IVYSQPQPED PSSSSIIQDN NIDADARRYA
     SLQTLSVARF DPDGEPLGKE ISARGKEKHK SKEKKRKRES NKGHHDDPEY LERKRLKKEK
     KRREKELAKL QSDEAKRSSI DMSSKKEEPV VDVVARQVTS VEPTGYDSKL EIKKIDTTKP
     EPSEGTSGAP KIRIKIKNRM LSKP
//
DBGET integrated database retrieval system