ID A0A1A6HY23_NEOLE Unreviewed; 492 AA.
AC A0A1A6HY23;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=Mucin-1 {ECO:0000256|ARBA:ARBA00014269};
GN ORFNames=A6R68_22902 {ECO:0000313|EMBL:OBS83124.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS83124.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS83124.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS83124.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS83124.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Apical cell membrane
CC {ECO:0000256|ARBA:ARBA00004247}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004247}. Cell membrane
CC {ECO:0000256|ARBA:ARBA00004251}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004251}. Cytoplasm
CC {ECO:0000256|ARBA:ARBA00004496}. Membrane
CC {ECO:0000256|ARBA:ARBA00004479}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004479}. Nucleus
CC {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS83124.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01007959; OBS83124.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6HY23; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0016324; C:apical plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 6.10.140.600; -; 1.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR036364; SEA_dom_sf.
DR PANTHER; PTHR10006:SF19; MUCIN-1; 1.
DR PANTHER; PTHR10006; MUCIN-1-RELATED; 1.
DR Pfam; PF01390; SEA; 1.
DR SMART; SM00200; SEA; 1.
DR SUPFAM; SSF82671; SEA domain; 1.
DR PROSITE; PS50024; SEA; 1.
PE 4: Predicted;
KW Autocatalytic cleavage {ECO:0000256|ARBA:ARBA00022813};
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Lipoprotein {ECO:0000256|ARBA:ARBA00023288};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Palmitate {ECO:0000256|ARBA:ARBA00023139};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..492
FT /note="Mucin-1"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008346729"
FT TRANSMEM 397..422
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 279..384
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT REGION 44..244
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 492 AA; 51059 MW; EFE9DE60A3576A9E CRC64;
MTPGIRAPFF LMLLLATDKH SVTLSQDTNS SSTLTTISAS APATSSTVDS ATTPVHTGSS
APATSSTVDL ATTPVHSGSS APATSSTVDS ATTPVHTGSS APATSSTVDS TTTPVHTGSS
IQTTEAMSGS ATTPIHNGSL VPTTSSTLGS TTSPAHSGAS SATNSSDSDL ATTPVYGGTS
VSTTKATSGS AITPXHNGSL VPTTSSVLGS STTPIHNNIS TATTTPVGNG TQSSVPSQHP
VTPTTLAISR NSTVAYSTYY STVLSSTFSS DSAPQVSVGV SFFFISFHIW NHQFNSSLED
PSSNYYQELK RNISGLFLQI FNQDFLGIST IQFRSGSVVV ESTVIFREGA VSASEVESQL
LQHEKEAEDY NLAISEVNVN EMQFPPSAQS WPGVPGWGIA LLVLVCILVA LAIVYLIALG
VCQCRRKNYG HLDIFPTQDT YHPMSEYPTY HTHGRYVPPS STKRNPYEVS AGNGGSSLSY
TNSAVATTSA NL
//