ID A0A2I2USC5_FELCA Unreviewed; 447 AA.
AC A0A2I2USC5;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 10-OCT-2018, sequence version 2.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=EMI domain containing 1 {ECO:0000313|Ensembl:ENSFCAP00000035442.2};
GN Name=EMID1 {ECO:0000313|Ensembl:ENSFCAP00000035442.2,
GN ECO:0000313|VGNC:VGNC:61844};
OS Felis catus (Cat) (Felis silvestris catus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Feliformia; Felidae; Felinae; Felis.
OX NCBI_TaxID=9685 {ECO:0000313|Ensembl:ENSFCAP00000035442.2, ECO:0000313|Proteomes:UP000011712};
RN [1] {ECO:0000313|Ensembl:ENSFCAP00000035442.2, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000035442.2,
RC ECO:0000313|Proteomes:UP000011712};
RX PubMed=17975172; DOI=10.1101/gr.6380007;
RA Pontius J.U., Mullikin J.C., Smith D.R., Lindblad-Toh K., Gnerre S.,
RA Clamp M., Chang J., Stephens R., Neelam B., Volfovsky N., Schaffer A.A.,
RA Agarwala R., Narfstrom K., Murphy W.J., Giger U., Roca A.L., Antunes A.,
RA Menotti-Raymond M., Yuhki N., Pecon-Slattery J., Johnson W.E., Bourque G.,
RA Tesler G., O'Brien S.J.;
RT "Initial sequence and comparative analysis of the cat genome.";
RL Genome Res. 17:1675-1689(2007).
RN [2] {ECO:0000313|Ensembl:ENSFCAP00000035442.2, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000035442.2,
RC ECO:0000313|Proteomes:UP000011712};
RA Hillier L.W., Warren W., Obrien S., Wilson R.K.;
RT "Sequence assembly of the Felis catus genome version 6.2.";
RL Submitted (SEP-2011) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSFCAP00000035442.2}
RP IDENTIFICATION.
RC STRAIN=breed Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000035442.2};
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AANG04000189; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A2I2USC5; -.
DR Ensembl; ENSFCAT00000047308.3; ENSFCAP00000035442.2; ENSFCAG00000004241.6.
DR VGNC; VGNC:61844; EMID1.
DR GeneTree; ENSGT00940000161542; -.
DR Proteomes; UP000011712; Chromosome D3.
DR Bgee; ENSFCAG00000004241; Expressed in uterus and 10 other cell types or tissues.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR011489; EMI_domain.
DR PANTHER; PTHR15427:SF23; EMI DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF07546; EMI; 1.
DR PROSITE; PS51041; EMI; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000011712};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..447
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5016273778"
FT DOMAIN 33..106
FT /note="EMI"
FT /evidence="ECO:0000259|PROSITE:PS51041"
FT REGION 168..376
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 409..447
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 245..273
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 278..294
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 297..329
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 447 AA; 45936 MW; B6BB71F17F145C0F CRC64;
MGGPRAWALL CLGLLLPGGS AAWNVGGAQF SGRRNWCSYV VTRTISCHVQ NGTYLQRVLQ
NCPWPMSCPG NSYRTVVRPT YKVMYKTVTT REWRCCPGHS GVNCEEVSGS AGFLEPGWSG
NTMRRMALRP TAFTGCLNCS KVSELTERLK VLEAKVAVLT VTERAVLPTP VAPGDPVPLW
GSPAARGSPG DGSLQDRVGG QGLPGPTGPK GDTGSRGPTG MRGPPGPQGP PGSPGQAGAV
GTPGERGPPG PPGPPGPPGP PGPPAPVGPP HAWNPQYGDP LLSNTFTETS SHWPQGPVGL
PGPPGPMGPP GLPGPMGIPG SPGHMGPPGP TGPKGISGHP GEKGERGLRG EPGPQGSMGQ
RGEPGPKGDP GEKSHWAPSL QSFLQQQAQL ELLARRVTLL EAIIWPEPEG SGAGPAGTGT
PSLLRGKRGG HAANYRIVAP RSRNERG
//