ID F6YBS7_HORSE Unreviewed; 1259 AA.
AC F6YBS7;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 24-JAN-2024, entry version 76.
DE SubName: Full=Diaphanous related formin 1 {ECO:0000313|Ensembl:ENSECAP00000019206.3};
GN Name=DIAPH1 {ECO:0000313|Ensembl:ENSECAP00000019206.3,
GN ECO:0000313|VGNC:VGNC:17183};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000019206.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000019206.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000019206.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000019206.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000019206.3};
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the formin homology family. Diaphanous
CC subfamily. {ECO:0000256|ARBA:ARBA00008214}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSECAT00000023203.3; ENSECAP00000019206.3; ENSECAG00000020822.4.
DR VGNC; VGNC:17183; DIAPH1.
DR GeneTree; ENSGT00940000157822; -.
DR TreeFam; TF315383; -.
DR Proteomes; UP000002281; Chromosome 14.
DR Bgee; ENSECAG00000020822; Expressed in blood and 23 other cell types or tissues.
DR GO; GO:0003779; F:actin binding; IEA:InterPro.
DR GO; GO:0031267; F:small GTPase binding; IEA:InterPro.
DR GO; GO:0030036; P:actin cytoskeleton organization; IEA:InterPro.
DR Gene3D; 1.20.58.630; -; 1.
DR Gene3D; 6.10.30.30; -; 1.
DR Gene3D; 1.10.20.40; Formin, diaphanous GTPase-binding domain; 1.
DR Gene3D; 1.20.58.2220; Formin, FH2 domain; 1.
DR Gene3D; 1.10.238.150; Formin, FH3 diaphanous domain; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR014767; DAD_dom.
DR InterPro; IPR044933; DIA_GBD_sf.
DR InterPro; IPR015425; FH2_Formin.
DR InterPro; IPR042201; FH2_Formin_sf.
DR InterPro; IPR010472; FH3_dom.
DR InterPro; IPR014768; GBD/FH3_dom.
DR InterPro; IPR010473; GTPase-bd.
DR PANTHER; PTHR45691; PROTEIN DIAPHANOUS; 1.
DR PANTHER; PTHR45691:SF4; PROTEIN DIAPHANOUS HOMOLOG 1; 1.
DR Pfam; PF06346; Drf_FH1; 2.
DR Pfam; PF06367; Drf_FH3; 1.
DR Pfam; PF06371; Drf_GBD; 1.
DR Pfam; PF02181; FH2; 1.
DR SMART; SM01139; Drf_FH3; 1.
DR SMART; SM01140; Drf_GBD; 1.
DR SMART; SM00498; FH2; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF101447; Formin homology 2 domain (FH2 domain); 1.
DR PROSITE; PS51231; DAD; 1.
DR PROSITE; PS51444; FH2; 1.
DR PROSITE; PS51232; GBD_FH3; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 84..449
FT /note="GBD/FH3"
FT /evidence="ECO:0000259|PROSITE:PS51232"
FT DOMAIN 806..1208
FT /note="FH2"
FT /evidence="ECO:0000259|PROSITE:PS51444"
FT DOMAIN 1231..1259
FT /note="DAD"
FT /evidence="ECO:0000259|PROSITE:PS51231"
FT REGION 1..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 572..789
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1190..1221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 481..568
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1089..1151
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 36..67
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..84
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 599..626
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 640..789
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1259 AA; 139349 MW; EFD363131B363334 CRC64;
MEPPGGGLGP GRGTRDKKKG RSPDELPSAG GDGGKSKKFT LKRLMADELE RFTSMRIKKE
KEKPNSAHRN SSASYGDDPT AQSLQDVSDE QVLVLFEQML LDMNLNEEKQ QPLREKDIII
KREMVSQYLH TSKAGMSQKE SSRSAMMYIQ ELRSGLRDMP LLSCLESLRV SLNNNPVSWV
QTFGAEGLAS LLDILKRLHD EKEEIAGSYD SRNKHEIIRC LKAFMNNKFG IKTMLETEEG
ILLLVRAMDP AVPNMMIDAA KLLSALCILP QPEDMNERVL EAMTERAEMD EVERFQPLLD
GLKSGTSIAL KVGCLQLINA LITPAEELDF RVHIRSELMR LGLHQVLQEL REIENDDMRV
QLNVFDEQGE EDSYDLKGRL DDIRMEMDDF SEVFQILLNT VKDSKAEPHF LSILQHLLLV
RNDYEARPQY YKLIEECISQ IVLHKNGADP DFKCRHLQID IEGLIDQMID KTKVEISEAK
ATELEKKLDS ELTARHELQV EMKKMESDFE QKLQDLQGEK DALDSEKQQI ATEKQGLEAE
VSQLTGEVAK LSKELEDAKK EMASISAAVT AVAPPSSASV ASAPPLPGDS GTAKVGVSIS
PPPPLPGSDT VPSPPPPPPP PPLPGGSYTS SSGSPLPGDV CISTPPPLPE GTIPPPPPLP
EGTSIPPPPP LPVGTSIPPP PPLPGSASIP PPPPLPGSAS IPPPPPLPGS ASIPPPPPLP
GGACIPLPPP LPGGACIPPP PPPLPGGPGM PPPPPPLPGG AGIPPPPPFP GGPGIPPPPP
GMGMPPPPPF GFGVPAAPVL PFGLTPKKLY KPEVQLRRPN WSKFVAEDLS QDCFWTKVKE
DRFENNELFA KLTLTFSAQT KTSKAKKDQE GGEEKKSVQK KKVKELKVLD SKTAQNLSIF
LGSFRMPYQE IKNVILEVNE AVLTESMIQN LIKQMPEPEQ LKMLSELKDE YDDLAESEQF
GVVMGTVPRL RPRLNAILFK LQFSEQVENI KPEIVSVTAA CEELRKSENF SSLLEITLLV
GNYMNAGSRN AGAFGFSISF LCKLRDTKST DQKMTLLHFL AELCETDHPD VLKFPDELAH
VEKASRVSAE NMQKNLDQMK KQISDVERDV QNFPAATDEK DKFVEKMTSF VKDAQEQYNK
LRMMHSNMET LYKELGEYFL FDPKKVPVEE FFMDLHNFRN MFVQAVKENQ KRRETEEKMR
RAKLAKEKAE KERLEKQQKR EQLIDMNAEG DETGVMDSLL EALQSGAAFR RKRGPRQGK
//