ID F6R8H3_HORSE Unreviewed; 1885 AA.
AC F6R8H3;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 74.
DE SubName: Full=Zinc finger protein 638 {ECO:0000313|Ensembl:ENSECAP00000022839.3};
GN Name=ZNF638 {ECO:0000313|Ensembl:ENSECAP00000022839.3,
GN ECO:0000313|VGNC:VGNC:25365};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000022839.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000022839.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022839.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000022839.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022839.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSECAT00000028898.4; ENSECAP00000022839.3; ENSECAG00000007715.5.
DR VGNC; VGNC:25365; ZNF638.
DR GeneTree; ENSGT00940000153322; -.
DR HOGENOM; CLU_002180_0_0_1; -.
DR TreeFam; TF333921; -.
DR Proteomes; UP000002281; Chromosome 15.
DR Bgee; ENSECAG00000007715; Expressed in brainstem and 23 other cell types or tissues.
DR ExpressionAtlas; F6R8H3; baseline.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008380; P:RNA splicing; IEA:InterPro.
DR CDD; cd12716; RRM1_2_NP220; 1.
DR Gene3D; 3.30.70.330; -; 3.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR033096; ZNF638_RRM1/2.
DR PANTHER; PTHR15592; MATRIN 3/NUCLEAR PROTEIN 220-RELATED; 1.
DR PANTHER; PTHR15592:SF1; ZINC FINGER PROTEIN 638; 1.
DR SMART; SM00360; RRM; 2.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 3.
DR PROSITE; PS50102; RRM; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176}.
FT DOMAIN 673..748
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT REGION 1..48
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 91..137
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 464..516
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 533..651
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 746..769
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1082..1126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1291..1328
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1353..1416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1431..1531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1552..1587
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1780..1816
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..107
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 113..137
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 471..488
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..516
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 557..578
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 606..635
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 636..651
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 748..769
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1106..1120
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1294..1328
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1361..1396
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1446..1460
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1476..1502
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1503..1521
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1559..1587
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1789..1816
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1885 AA; 209690 MW; 9995B61A5F543897 CRC64;
MSRPRFNPRG NFPLQRPRAP NPSGMRPPGP FMRPGSMGLP RFYPAGRARG IPHRFAGHES
YQNMGPQRMN VQVTQHRTDP RLTKEKLDFH EAQQKKGKPH GSRWDDEPHI SASVGVKQSS
VTQVTDQSPK VQSRYTKESA SSILASFGLS NEDLEELSRY PDEQLTPENM PLILRDIRMR
KMGRRLPNLP SQSRNKETLD SEAVSSNVID YGHASKYGYT EDPLEVRIYD PEIPTDEVKN
EFRSQQNISA SVPTPNVICN SMFPIEDVFR QMDFPNESSN NQSFFPVESG TKMSGLHISG
GQSVLEPVKS ASQSISQTVN QTMSQSLIPP SMNQQPFSSE LISAVNQQER IPREPVINSS
NVHRSGGNKK NYQSEADMPI QSPFGIVKAS WLPKFSHADA QKMKRLPTPS MMNDYYAASP
RIFPHLCSLC NVECSHLKDW IQHQNTSTHI ESCRQLRQQY PDWNPEILPS RRNEGNRKEN
ETPRRRSHSP SPRRSRRSSS SHRFRRSRSP IRYIYRPRSR SPRICHRFVS RYRSRSRSPY
RVRNPFRGSP KYYRSVSPER TSRRSVRSSD RKKALEDVVQ RSGLASEFNK QKHLEAVDKG
HSPAQKLKTG SGSKPSVKPT SSTKSDSNLG GHSTRYKSKN PEDDTLPECK QVSDKAVSLQ
RKLRKDQSSH YDSVLLISEL PEDGCTEEDI RKIFQPFGKV NDVLIVPYRK EAYLEMKFKE
AITAVMKYIE TTPLLIKGKN VKVSVPGKKK SQNKEVKKKT TDSKKTSAST LKKDTDICKT
VETVTSASAT KTGQAKTSTA KVNKSSGKSA GSVKSVVTVA AKGNKASIKT AKSSGKKSLE
VKKAGIVKNK DTSKPVIVSE NSEIKTSVEV KTTENSAKET IPEAALEATE DESVNKETEE
MCVVLISNLP NKGYSIEEVY NLVKPFGGLK DILILSSHNK AFIEINRKSA DSMVKFYTCF
PISMDGNQLS ISMAPENMNI KDEEAIFTTL TKENDPELQA DIDKIYNRFV HLDNLPEDGL
QCVLCIGLQF GKVDHHIFIS NKNKAILQLD SPESAQSMYS FLKQNPQNIG DHVLTCTLSP
KMDSSEAQAE TDPGRGNESP DLKNSPVDES EVQTAAGSPS VKPSDIEEEF IPSIQTETSV
LQEEPCEEEP EKVPCDSDFA METLEVETQG EEVKVEIPLV ASTPTSIELF TENLEESVLN
QQMYTSDFEK EEAEIINPET ELSTSDSTFI EERNIKGILE ESPSETEDFF SGITESLIEA
VAEEDKHESV SEIVPSACIV ALVPGISTGD EKAVSKKGIS EKSNMDEKEE NEFNTKESKM
DLQIGTEKAE KNDARMVAEK LEKIVAAMKE KPAENAVTKV YPNKGVSQAN KPDETSKTSV
LVASNAPSSK SSIKAGIVSS PKAKATASKS ENQKSFLKSV LRDQINAEKK LSAKELGLLK
PTSARPGLAE SSSKFKPTQS GIPRGGSGRI SALQGKDSKL DYRDITKQSQ EAEARPSFMK
RDDSNNKTFA GQNTKNPKST TGRSSKSKEE PLFPFNLDEF VTVDEVIEEV NPFQAKQAPP
KGKRKEALKN TPSSELNLKK KRGKTSAPRV VEGELSFVTL DEIGEEEDAA THLAQALVTV
DEVIDEEEIN MEEMVKNSNS LLTLDELIDQ DDCISHSEPK DVTVLSVAEE QDLLKQERLV
TVDEIGEVEE LPLNESTDIS FATLNTKGDE GNTGRDSIGF ISSQMPEDPS TLVTVDEIQD
DSSDLHLVTL DEVTEEDEDS LADFNNLKEE LNFVTVDEVG EEEDGDNDLK VELAQSKNDH
PTDKRGDRKK RAVDTKKTKL EALSQVGLVN ETVLEEDLKT MIERHLAAKV PTKRVRIGKT
PPSEKAVVTE PVKGEEAFQI NEAKC
//