ID A0A452RAV3_URSAM Unreviewed; 495 AA.
AC A0A452RAV3;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=EMI domain containing 1 {ECO:0000313|Ensembl:ENSUAMP00000015764.1};
GN Name=EMID1 {ECO:0000313|Ensembl:ENSUAMP00000015764.1};
OS Ursus americanus (American black bear) (Euarctos americanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ursus.
OX NCBI_TaxID=9643 {ECO:0000313|Ensembl:ENSUAMP00000015764.1, ECO:0000313|Proteomes:UP000291022};
RN [1] {ECO:0000313|Proteomes:UP000291022}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Korstanje R., Srivastava A., Sarsani V.K., Sheehan S.M., Seger R.L.,
RA Barter M.E., Lindqvist C., Brody L.C., Mullikin J.C.;
RT "De novo assembly and RNA-Seq shows season-dependent expression and editing
RT in black bear kidneys.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSUAMP00000015764.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A452RAV3; -.
DR STRING; 9643.ENSUAMP00000015764; -.
DR Ensembl; ENSUAMT00000017675.1; ENSUAMP00000015764.1; ENSUAMG00000012608.1.
DR GeneTree; ENSGT00940000161542; -.
DR OMA; CEEXVAV; -.
DR Proteomes; UP000291022; Unassembled WGS sequence.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR011489; EMI_domain.
DR PANTHER; PTHR15427:SF23; EMI DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF07546; EMI; 1.
DR PROSITE; PS51041; EMI; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000291022};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..495
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019061549"
FT DOMAIN 33..106
FT /note="EMI"
FT /evidence="ECO:0000259|PROSITE:PS51041"
FT REGION 174..448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..287
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 294..311
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..346
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 495 AA; 50557 MW; B82295488CA57209 CRC64;
MGGPRAWALL CLGLLLPGGS AAWSVGGAPF SGRRNWCAYM VTRTISCHVQ NGTYLQRVLQ
NCPWPMSCPV SSYRTVVRPS YKVVYKTVTA REWRCCPGHS GVSCEEVAGP SGFVDPRWSG
NAMRRMALRP TAFSGCLNCS KVSELTERLK ALEAKVAVLT VTERVVLPTP AAPGDPFPLW
GSPAAQGSPG DGGLRGLPGA RENVRTPLLP RDDRVGGQGL PGPTGPKGDT GSRGPTGMRG
PPGPQGPPGS PGQAGAVGTP GERGPPGPPG PPGPPGPPGP PAPVGPPHTR NLQYGDPLLS
NTFTETSSHW PQGPAGLPGP PGPMGPPGLP GPMGIPGSPG HMGPPGPTGP KGISGHPGEK
GERGLRGEPG PQGSTGQRGE PGPKGDPGEK SHWNQRGRGQ ALPAQAPPAS CGARGEDTRP
TTGSWPPEAA TREAEGGGGP RGRPGQASLP SLDLANCLQG PPIHIYSCPQ GPFCYLGLWD
GHVLVLTPRA SLIGS
//