ID A0A2A4K248_HELVI Unreviewed; 1161 AA.
AC A0A2A4K248;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE RecName: Full=Pre-mRNA-processing factor 6 {ECO:0000256|ARBA:ARBA00020235};
DE AltName: Full=PRP6 homolog {ECO:0000256|ARBA:ARBA00032140};
DE AltName: Full=U5 snRNP-associated 102 kDa protein {ECO:0000256|ARBA:ARBA00031070};
GN ORFNames=B5V51_6498 {ECO:0000313|EMBL:PCG77712.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG77712.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG77712.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG77712.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG77712.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG77712.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01000290; PCG77712.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2A4K248; -.
DR STRING; 7102.A0A2A4K248; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 6.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR010491; PRP1_N.
DR InterPro; IPR045075; Syf1-like.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR11246; PRE-MRNA SPLICING FACTOR; 1.
DR PANTHER; PTHR11246:SF1; PRE-MRNA-PROCESSING FACTOR 6; 1.
DR Pfam; PF06424; PRP1_N; 1.
DR Pfam; PF13432; TPR_16; 4.
DR Pfam; PF14559; TPR_19; 2.
DR SMART; SM00386; HAT; 20.
DR SMART; SM00028; TPR; 8.
DR SUPFAM; SSF81901; HCP-like; 1.
DR SUPFAM; SSF48452; TPR-like; 5.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 21..171
FT /note="PRP1 splicing factor N-terminal"
FT /evidence="ECO:0000259|Pfam:PF06424"
FT REGION 34..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 164..207
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 107..150
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 46..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 164..179
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..205
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1161 AA; 130865 MW; 26DDDDB7FE4B521F CRC64;
MSVPPQAFVN KNKKHFLGIP APLGYVAGVG RGATGFTTRS DIGPARDAND VSDDRHAPPA
AKRKKTEEED DDEDLNDSNY DEFSGYSGSL FSKDPYDKDD AEADAIYESI DKRMDEKRKE
YREKRLKEDL ERYRQERPKI QQQFSDLKRE LKSVSEDEWS AIPEVGDARN RKQRNPRAEK
FTPLPDSVLS RNLGGESSSS IDPGSGLASM MPGVMTPGML TPSGDLDLRK IGQARNTLMT
VKLSQVSDSV TGQTVVDPKG YLTDLQSMIP TYGGDINDIK KARLLLKSVR ETNPNHPPAW
IASARLEEVT GKVQSARNLI MKGCEVNPSS EELWLEAARL QPRDTARAVI AHAARNLPHS
VRIWVKAADL EQETKAKRRV YRKALEHIPN SVRLWKAAVE LENPEDARIL LSRAVECCPT
SVELWLALAR LETYENARKV LNKARENIPT DRQIWVTAAK LEEAHGNTHM VEKIIDRAIT
SLSANGVEIN REHWFKEAME AEKSGAVHTC QAIIRAVIGH GIEPEDQKHT WMEDAEACAN
EGAYECARAV YGYALSVFPS KKSIWLRAAY LEKQHGTRAS LEALLQRAVA HCPKSEVLWL
MGAKSKWLAG DVPAARGILS LAFQANPNSE EIWLAAVKLE SENKEYDRAR RLLSKARASA
PTPRVMIKSA KLEWALNNLD VALKLLEEAI KVFADYAKLH MMKGQIEEQM EKDEEARNTY
SQGLKKCATS VPMWILLARL EEKLKNVTKA RSVLEKARLR NPKTAELWLE SVRLERRNGS
AEIANAVMAK ALQECPSAGR LWAEAIFMES RPQRKTKSRA VAHCPKSEVL WLMGAKSKWL
AGDVPAARGI LSLAFQANPN SEEIWLAAVK LESENKEYDR ARRLLSKARA SAPTPRVMIK
SAKLEWALNN LDVALKLLEE AIKVFADYAK LHMMKGQIEE QMEKDEEARN TYSQGLKKCA
TSVPMWILLA RLEEKLKNVT KARSVLEKAR LRNPKTAELW LESVRLERRN GSAEIANAVM
AKALQECPSA GRLWAEAIFM ESRPQRKTKS VDALKKCEHD AFVLLAVSQL FWTERKLNKC
REWFNRTVKI DPDLGDAWAY FYKFELLHGN EQQQEDVKTR CKTAEPHHGE AWCKVSKDIA
NWCFSTEQIL LLVAKNLPVP T
//