ID G3VB09_SARHA Unreviewed; 3854 AA.
AC G3VB09;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 69.
DE RecName: Full=Transformation/transcription domain associated protein {ECO:0008006|Google:ProtNLM};
GN Name=TRRAP {ECO:0000313|Ensembl:ENSSHAP00000000363.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000000363.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000000363.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000000363.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the PI3/PI4-kinase family. TRA1 subfamily.
CC {ECO:0000256|ARBA:ARBA00007234}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9305.ENSSHAP00000000363; -.
DR Ensembl; ENSSHAT00000000368.2; ENSSHAP00000000363.2; ENSSHAG00000000321.2.
DR eggNOG; KOG0889; Eukaryota.
DR GeneTree; ENSGT00390000017961; -.
DR HOGENOM; CLU_000129_1_1_1; -.
DR InParanoid; G3VB09; -.
DR TreeFam; TF106414; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005794; C:Golgi apparatus; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR CDD; cd05163; PIKK_TRRAP; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003152; FATC_dom.
DR InterPro; IPR011009; Kinase-like_dom_sf.
DR InterPro; IPR000403; PI3/4_kinase_cat_dom.
DR InterPro; IPR003151; PIK-rel_kinase_FAT.
DR InterPro; IPR014009; PIK_FAT.
DR InterPro; IPR046807; Tra1_central.
DR InterPro; IPR046805; Tra1_ring.
DR PANTHER; PTHR11139; ATAXIA TELANGIECTASIA MUTATED ATM -RELATED; 1.
DR PANTHER; PTHR11139:SF1; TRANSFORMATION_TRANSCRIPTION DOMAIN-ASSOCIATED PROTEIN; 1.
DR Pfam; PF02259; FAT; 1.
DR Pfam; PF00454; PI3_PI4_kinase; 1.
DR Pfam; PF20175; Tra1_central; 1.
DR Pfam; PF20206; Tra1_ring; 1.
DR SMART; SM01343; FATC; 1.
DR SMART; SM00146; PI3Kc; 1.
DR SUPFAM; SSF48371; ARM repeat; 3.
DR SUPFAM; SSF56112; Protein kinase-like (PK-like); 1.
DR PROSITE; PS51189; FAT; 1.
DR PROSITE; PS51190; FATC; 1.
DR PROSITE; PS50290; PI3_4_KINASE_3; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 2669..3252
FT /note="FAT"
FT /evidence="ECO:0000259|PROSITE:PS51189"
FT DOMAIN 3495..3818
FT /note="PI3K/PI4K catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50290"
FT DOMAIN 3822..3854
FT /note="FATC"
FT /evidence="ECO:0000259|PROSITE:PS51190"
FT REGION 483..506
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2519..2555
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3262..3284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..506
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2519..2533
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3264..3283
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3854 AA; 436984 MW; 0FFD86C62F9D564E CRC64;
MAFVATPGAT VVDQTTLMKK YLQFVAALTD VNTPDETKLK MMQEVSENFE NVTSSPQYST
FLEHIIPRFL TFLQDGEVQF LQEKPAQQLR KLVLEIIHRI PTNEHLRPHT KNVLSVMFRF
LETENEENVL ICLRIIIELH KQFRPTITQE IHHFLDFVKQ IYKELPKVVN RYFENPQVIP
DNTVPSPEMV GMITTLAVKV NPERDDSETR THSIIPRGSL SLKVLAELPI IVVLMYQLYK
LNIHNVVAEF VPLIMNTIII QVSAQARQHK LYNKELYADF IAAQIKTLSF LAYIIRIYQD
LVAKYSQQMV KGMLQLLSNC PAETAHLRKE LLIAAKHILT TDLRSQFIPC MDKLFDESIL
IGSGYTARET LRPLAYSTLA DLVHHVRQHL PLNDLSLAVQ LFAKNIDDES LPSSIQTMSC
KLLLNLVDCI RSKSEQENGN GRDILMRMLE VFVLKFHTIA RYQLSAIFKK CKPQSELGAA
EAALPGVPTG PTAPAPAPSP APTTPVAPAP VPVFEKQGEK DKEDKQTFQV TDCRSLVKTL
VCGVKTITWG ITSCKAPGEA QFIPNKQLQP KETQIYIKLV KYAMQALDIY QVQIAGNGQT
YIRVANCQTV RMKEEKEVLE HFAGVFTMMN PLTFKEIFQT TVPYMVERIS KNYALQIVAN
SFLANPTTSA LFATILVEYL LDRLPEMGSN VELSNLYLKL FKLVFGSVSL FAAENEQMLK
PHLHKIVNSS MELAQTAKEP YNYFLLLRAL FRSIGGGSHD LLYQEFLPLL PNLLQGLNML
QSGLHKQHMK DLFVELCLTV PVRLSSLLPY LPMLMDPLVS ALNGSQTLVS QGLRTLELCV
DNLQPDFLYD HIQPVRAELM QALWRTLRNP ADSISHVAYR VLGKFGGSNR KMLKESQKLQ
YVVTEIQGPS ITVEFSDCKA SIQLPMEKAI ETALDCLKSA NTEPYYRRQA WEVIKCFLVA
MMNLDDNKHA LYQLLAHPNF TEKSIPSVII SHRYKAQDTP ARKTFEQALT GAFMSAVIKD
LRPSALPFVA SLIRHYTMVA VAQQCGPFLL QCYQVGSQPS TAMFHSEENG SKGMDPLVLI
DAIAICMAYE EKELCKIGEV ALAVIFDVAS IILGSKERAC QLPLFSYIVE RLCACCYEQA
WYAKLGGVVS IKFLMERLPL IWVLQNQQTF LKALLFVMMD LTGEVSNGAV AMAKTTLEQL
LIRCATPLKD EEKSEEILSA QEKSFHHVTH DLVREVTSPN STVRKQAMHS LQVLAQVTGK
SVTVIMEPHK EVLQDMVPPK KHLLRHQPAN AQIGLMEGNT FCTTLQPRLF TMDLNVVEHK
VFYTELLNLC EAEDVALMKL PCYKSLPSLV PLRIAALNAL AACNYLPQSR EKIIAALFKA
LNSTNNELQE AGEACMRKFL EGATIEVDQI HTHMRPLLMM LGDYRSLTLN VVNRLTSVTR
LFPNSFNDKF CDQMMQHLRK WMEVVVITHK GGQRSDGNPA MEGVEEMKIC SAIINLFHLI
PAAPQTLVKP LLEVVMKTER AMLIEAGSPF REPLIKFLTR HPSQTVELFM MEATLNDPQW
SRMFMSFLKH KDAKPLRDVL AANPNRFIAL LLPGGTQAAV RPGSPSTSTL KLDLQFQAIK
IISIIVKNDE SWLANQHSLV SQLRRVWISE TFQERHRKEN MAATNWKEPK LLAYCLLNYC
KRNYGDIELL FQLLRAFTGR FLCNMTFLKE YMEEEIPKNY SISQKRALFF RFVDFNDPNF
GDELKAKVLQ HILNPAFLYS FEKAEGEQLL GPPNPEGDNP ESITSVFITK VLDPEKQTDM
LDSLRIYLLQ FATLLVEHAP HHIHDNNKNR NSKLRRLMTF AWPCLLSKAC VDPACKYSGH
LLLAHIIAKF AIHKKIVLQV FHSLLKAHAM EARAIVRQAM AILTPAVPAR MEDGHQMLTH
WTRKIIVEEG HTVPQLVHIL HLIVQHFKVY YPVRHHLVQH MVSAMQRLGF TPSVTIEQRK
LAVDLAEVVI KWELQRIKDQ QPDSDMDPNS SGEGASCASS AIKRGLSVDS GQEVKRFRTT
TGAMSAVFGR SQSLPGADAL LAKPIDKQHT DTVVNFLIRI ACQVNDNSNT AGSPGELLSR
RCVNLLKTAL RPDMWPKSEL KLQWFDKLLM TVEQPNQANF ANICTGLEVL SFLLTVLQSP
AILSSFKPLQ RGVAACMTCG NTKVLRAVHS LLSRLMSIFP TEPSTSSVAS KYEELECLYA
AVGKVIYEGL TNYEKATNAN PSQLFGTLMI LKSACSNNPS YIDRLISVFM RSLQKMVREH
LNPQAASGTA EANTAGTSEL VMLSLDLVKT RLAVMSMEMR KNFIQAILTS LIEKSTDAKI
LRAVVKTVEE WVKNNSPMAA NQTPTLREKS ILLVKMMTYI EKRFPEDLEL NAQFLDLVNY
VYRDENLSGS ELTAKLEPAF LSGLRCAQPL IRAKFFEVFD NSMKRRVYER LLYVTCSQNW
EAMGNHFWIK QCIELLLAVC ERNTTIGTSC QGAMLPSITN VINLADSHDR AAFAMVTHVK
QEPRERENSE SKEEDVEIDI ELAPGDQTST PKTKELSEKD IGNQLHMLTN RHDKFLDSLR
EVKTGALLSA FVQLCHISTT LAEKTWIQLF ARLWKILSDR QQHALAGEIS PFLCSGSHQV
QRDCQPSALN CFVEAMSQCV PPIPIRPCVL KYLGKTHNLW FRSTLMLEHQ AFEKGLSLQI
KPKQTTEFYE QESITPPQQE ILDSLAELYS LLQEEDMWAG LWQKRCKFPE TATAIAYEQH
GFFEQAQESY EKAMEKAKKE HERNNASPAI FPEYQLWEDH WIRCSKELNQ WEALTEYGQS
KGHINPYLVL ECAWRVSNWT AMKEALVQVE LSCPKEMAWK VNMYRGYLAI CHPEEQQLNF
IERLVEMASS LAIREWRRLP HVVSHVHTPL LQAAQQIIEL QEAAQINAGL QPTNLGRNNS
LHDMKTVVKT WRNRLPIVSD DLSHWSSIFM WRQHHYQGKP TWSGMHSSSI VTAYENSSQH
DPSSNNAMLG VHASASAIIQ YGKIARKQGL VNVALDILSR IHTIPTVPIV DCFQKIRQQV
KCYLQLAGVM GKNECMQGLE VIESTNLKYF TKEMTAEFYA LKGMFLAQIN KSEEANKAFS
AAVQMHDVLV KAWAMWGDYL ENIFVKERQL HLGVSAITCY LHACRHQNES KSRKYLAKVL
WLLSFDDDKN TLADAVDKYC IGVPPIQWLA WIPQLLTCLV GSEGKLLLNL ISQVGRVYPQ
AVYFPIRTLY LTLKIEQRER YKSDSGQQQP SSVGNQSHSA SDPGPIRATA PMWRCSRIMH
MQRELHPTLL SSLEGIVDQM VWFRENWHEE VLRQLQQGLA KCYSVAFEKS GAVSDAKITP
HTLNFVKKLV STFGVGLENV SNVSTMFSSA ASESLARRAQ ATAQDPVFQK LKGQFTTDFD
FSVPGSMKLH NLISKLKKWI KILEAKTKQL PKFFLIEEKC RFLSNFSAQT AEVEIPGEFL
MPKPTHYYIK IARFMPRVEI VQKHNTAARR LYIRGHNGKI YPYLVMNDAC LTESRREERV
LQLLRLLNPC LEKRKETTKR HLFFTVPRVV AVSPQMRLVE DNPSSLSLVE IYKQRCAKKG
IEHDNPISRY YDRLATVQAR GTQASHQVLR DILKEVQSNM VPRSMLREWA LHTFPNATDY
WTFRKMFTIQ LALIGFAEFV FHLNRLNPEM LQIAQDTGKL NVAYFRFDIN DATGDLDANR
PVPFRLTPNI SEFLTTIGVS GPLTASMIAV ARCFAQPNFK VDGILKTVLR DEIIAWHKKT
QEDTSSPLSA AGQPENMDSQ QLVSLVQKAV TAIMTRLHNL AQFEGGESKV NTLVAAANSL
DNLCRMDPAW HPWL
//