ID A0A1W4WH12_AGRPL Unreviewed; 937 AA.
AC A0A1W4WH12;
DT 07-JUN-2017, integrated into UniProtKB/TrEMBL.
DT 07-JUN-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Activating transcription factor 7-interacting protein 1 isoform X4 {ECO:0000313|RefSeq:XP_018319305.1};
GN Name=LOC108732823 {ECO:0000313|RefSeq:XP_018319305.1};
OS Agrilus planipennis (Emerald ash borer) (Agrilus marcopoli).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Elateriformia;
OC Buprestoidea; Buprestidae; Agrilinae; Agrilus.
OX NCBI_TaxID=224129 {ECO:0000313|Proteomes:UP000192223, ECO:0000313|RefSeq:XP_018319305.1};
RN [1] {ECO:0000313|RefSeq:XP_018319305.1}
RP IDENTIFICATION.
RC TISSUE=Entire body {ECO:0000313|RefSeq:XP_018319305.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_018319305.1; XM_018463803.1.
DR AlphaFoldDB; A0A1W4WH12; -.
DR EnsemblMetazoa; XM_018463803.1; XP_018319305.1; LOC108732823.
DR GeneID; 108732823; -.
DR OrthoDB; 316761at2759; -.
DR Proteomes; UP000192223; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006396; P:RNA processing; IEA:InterPro.
DR Gene3D; 1.10.10.790; Surp module; 1.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR040169; SUGP1/2.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23340; ARGININE/SERINE RICH SPLICING FACTOR SF4/14; 1.
DR PANTHER; PTHR23340:SF0; SURP AND G-PATCH DOMAIN-CONTAINING PROTEIN 1 ISOFORM X1; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF01805; Surp; 1.
DR SMART; SM00443; G_patch; 1.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 1.
DR PROSITE; PS50174; G_PATCH; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000192223}.
FT DOMAIN 904..937
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 1..74
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 93..135
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 359..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 668..719
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..61
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 93..111
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 377..392
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 393..433
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 686..714
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 937 AA; 103594 MW; 2EAF15CB9E982EE1 CRC64;
MSRKGIVDVF QTKTTRNDRY AQMSHQEKLI EQKKREIQAK VEQKRKSGNS EKSEKNKDTK
SYTKSYNQFS NDGSFMDQFK QMKDKKIDQK LKSFKFSQEK SHSEDRNRNN RWSHRRRSPS
PISSRHKFSD KFQNNEPKIS ISTSYTNFIS QANFNVQTSQ SIITPENAVG QPLLKNDQSS
SNVTFPPPTQ TSTTVISALP LMLNAPPPPI IQQNAVLMQQ PALPSTAVIT SEIRTPTIIT
SLSLPPPTSA IGPQNAPDLT GATLPVLGPE NALNIPTLRS MTSVELASIP SPNPLQLQSI
PQPEPINTMN IPPPAPLQVQ NIPPPSSIQL NEIPKPKPID IINIPTPTEV ATNLENSMSD
PDFIKNIPPP NKSIPPPSIQ DTTISANISV PPPDISNPGP TCMPPPNVSS QNIPPPPSLA
LLPPSPQSVP PPLPQSVSSL IISENVGQSV MTPQTIIVHS VPPPSQNIQS LQLLSPSSTI
QNVSSQLPLQ TVSQTLPPTP VIPVTLTLQS GTEVSNATET NVIISQQLLQ GNPGQISVAT
PNLNIHYPPP IVNNNIQGLQ QTFITQPPPI TNHTMPPMNI PPPASIPNSA PPQEIKSLQA
VYSSGTAEYE AMVSLGRMVA QCGPGIEDIV RQRKQQDPHL WFLFHKESAP YRQYQQLVEQ
FSVETEREKK EFRNAQQNEN NLMKGENSDL NEKDNDQDSG AGKRRRRSRW GDKDQKMTLP
TMMMVGNSNA SVTPTSIPIN IAINPVLNVL PPVPVVCPQK SQGPVLLTSL TRNDPAFIQY
VKQTYGSVDL TEEDWKKAED NFKVSLLYQE MLKKKQEAER LAKTGKHKYD YDSDEETDGG
TWEHKLREKE MQKTQTWAVE LTKNAEGKHH IGDFLPPEEL QKFLDKKGKG PNDGSDYKDF
KIKEDNIGFK MLQKLGWSEG QGLGATASGI VEPVRLS
//