ID W5K3F8_ASTMX Unreviewed; 468 AA.
AC W5K3F8;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 52.
DE SubName: Full=Myosin-binding protein H-like {ECO:0000313|Ensembl:ENSAMXP00000002119.2};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000002119.2, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000002119.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; W5K3F8; -.
DR Ensembl; ENSAMXT00000002119.2; ENSAMXP00000002119.2; ENSAMXG00000002079.2.
DR eggNOG; ENOG502QVIQ; Eukaryota.
DR GeneTree; ENSGT00940000158040; -.
DR HOGENOM; CLU_037185_0_0_1; -.
DR OrthoDB; 4232090at2759; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000002079; Expressed in muscle tissue and 7 other cell types or tissues.
DR CDD; cd00063; FN3; 2.
DR CDD; cd00096; Ig; 1.
DR CDD; cd05748; Ig_Titin_like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 4.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR PANTHER; PTHR14340:SF11; IG-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR14340; MICROFIBRIL-ASSOCIATED GLYCOPROTEIN 3; 1.
DR Pfam; PF00041; fn3; 2.
DR Pfam; PF07679; I-set; 2.
DR PRINTS; PR00014; FNTYPEIII.
DR SMART; SM00060; FN3; 2.
DR SMART; SM00409; IG; 2.
DR SMART; SM00408; IGc2; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 2.
DR PROSITE; PS50853; FN3; 2.
DR PROSITE; PS50835; IG_LIKE; 2.
PE 4: Predicted;
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 58..153
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 157..245
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 254..349
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 367..451
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT REGION 1..64
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..55
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 468 AA; 51033 MW; C09ACE41A9AC42D8 CRC64;
MPVNPATPPE AETALPPADT VPSTPEAPAA ETTAPEVAPA PAAPAASPEA SPKAPLSPPQ
NLTVHDVTNT SLTLKWSTPE SAVDSGLDGY CVEYCKEGAS EYVAANTELI TSNLFVIKNQ
TTGDSLNVRV VAVRAGERSE PASLPNPITI REVPDHPRIR LPHLLRSRYI KQAGDQINLV
IPFSGQPKPM ITWTKNGQPL DVKKVNVRNS DKDSILFIRK AERDDSGTYQ MTVKIDSLED
KATIVIQIVE LPGPPTSVKL VDSWGFNAAL EWTPPKDNGN TDITGYTVQK ADKKTGEWFT
VLEHFHRLNA TITDLVMGNS YSFRVFSENV VGRSETAAVT KEVAKIQKTG TVYKPPHYKE
HDFSEAPKFT ASLSDRAATV GYTTRLLCAV RGSPKPKIEW MKNQMIIGDD PKFRQNSNLG
VCSLEIRKPS SFDGGVYTCR AKNPYGEATV ACKLEVKQVV VPTEEAKK
//