ID H0ZPG4_TAEGU Unreviewed; 850 AA.
AC H0ZPG4;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Pre-mRNA processing factor 40 homolog A {ECO:0000313|Ensembl:ENSTGUP00000012496.2};
GN Name=PRPF40A {ECO:0000313|Ensembl:ENSTGUP00000012496.2};
OS Taeniopygia guttata (Zebra finch) (Poephila guttata).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Estrildidae;
OC Estrildinae; Taeniopygia.
OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000012496.2, ECO:0000313|Proteomes:UP000007754};
RN [1] {ECO:0000313|Ensembl:ENSTGUP00000012496.2, ECO:0000313|Proteomes:UP000007754}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20360741; DOI=10.1038/nature08819;
RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W.,
RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A.,
RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P.,
RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., London S.E.,
RA Li Y., Lin Y.C., George J., Sweedler J., Southey B., Gunaratne P.,
RA Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., Itoh Y., Whitney O.,
RA Pfenning A.R., Howard J., Volker M., Skinner B.M., Griffin D.K., Ye L.,
RA McLaren W.M., Flicek P., Quesada V., Velasco G., Lopez-Otin C.,
RA Puente X.S., Olender T., Lancet D., Smit A.F., Hubley R., Konkel M.K.,
RA Walker J.A., Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z.,
RA Eichler E.E., Stapley J., Slate J., Ekblom R., Birkhead T., Burke T.,
RA Burt D., Scharff C., Adam I., Richard H., Sultan M., Soldatov A.,
RA Lehrach H., Edwards S.V., Yang S.P., Li X., Graves T., Fulton L.,
RA Nelson J., Chinwalla A., Hou S., Mardis E.R., Wilson R.K.;
RT "The genome of a songbird.";
RL Nature 464:757-762(2010).
RN [2] {ECO:0000313|Ensembl:ENSTGUP00000012496.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H0ZPG4; -.
DR STRING; 59729.ENSTGUP00000028288; -.
DR Ensembl; ENSTGUT00000012635.2; ENSTGUP00000012496.2; ENSTGUG00000012118.2.
DR GeneTree; ENSGT00930000150980; -.
DR HOGENOM; CLU_005825_0_0_1; -.
DR TreeFam; TF318732; -.
DR Proteomes; UP000007754; Chromosome 7.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 1.10.10.440; FF domain; 5.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR11864:SF20; PRE-MRNA-PROCESSING FACTOR 40 HOMOLOG A; 1.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR Pfam; PF01846; FF; 4.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00441; FF; 5.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF81698; FF domain; 5.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS51676; FF; 6.
DR PROSITE; PS01159; WW_DOMAIN_1; 2.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000007754}.
FT DOMAIN 76..109
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 122..150
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 285..339
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 352..406
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 419..479
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 499..559
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 564..619
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 634..691
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 36..94
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 181..282
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 696..850
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 338..365
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 470..502
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 44..76
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..252
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 260..282
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 699..719
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 720..758
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 759..773
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 774..805
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 830..850
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 850 AA; 96736 MW; D7EEE117ABCDED8F CRC64;
MESKKLKMPG MMQSVMPGMM MSHMSQAAMQ PTVPPGVNSM DAQVGVTPPG TQTTHPVVST
VQQSSTSSSS ASEEHSKQKS TWTEHKSPDG RTYYYNTETK QSTWEKPDDL KTPAEQLLSK
CPWKEYKSDS GKPYYYNSQT KESRWAKPKE LEDLEAMIKA EENSTKPEDV AAASAVAVAG
DAPSGAGAAP AADGAAATTA APAGAAPEGD PAPAATTGDT DGAGTATAEE QGQGGAAPGT
QDGGTDTAAA TTDEAAKQEG AGDAAAKKED EDAQPVKKTY TWNTKEEAKQ AFKELLKEKR
VPSNASWEQA MKMIINDPRY SALAKLSEKK QAFNAYKVQT EKEEKEEARS KYKEAKESFQ
RFLENHEKMT STTRYKKAEQ MFGEMEVWNA ISERDRLEIY EDVLFFLSKK EKEQAKQLRK
RNWEALKNIL DNMANVTYCT TWSEAQQYLM DNPTFAEDEE LQNMDKEDAL ICFEEHIRAL
EKEEEEEKQK SLLRERRRQR KNRESFQLFL DELHEHGQLH SMSSWMELYP AISSDIRFTS
MLGQPGSTAL DLFKFYVEDL KARYHDEKKI IKDILKDKGF VVEVNTSFED FVTVISSTKR
ATTLDAGNIK LAFNSLLEKA EAREREREKE EARKMKRKES AFKSMLKQAT PPIELDAVWE
DIRDRFVKEP AFEDITLESE RKRIFKDFLH VLEHECQHHH SKTKKHSKKS KKHHRKRSRS
RSGSESEDDD SHSKKKRQRS ESRSVSERSS SAESERSYKK SKKHKKKSKK RRHKSDSPES
DVEREKDKKE RERDSEKERA RQRSESKHKS PTKKRPGKDS GNWDTSGSEL SEGELEKQRR
TLLEQLDEDQ
//