ID A0A218U781_9PASE Unreviewed; 2986 AA.
AC A0A218U781;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=Mucin-17 {ECO:0000313|EMBL:OWK49583.1};
GN Name=MUC17 {ECO:0000313|EMBL:OWK49583.1};
GN ORFNames=RLOC_00004029 {ECO:0000313|EMBL:OWK49583.1};
OS Lonchura striata domestica (Bengalese finch).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Estrildidae;
OC Estrildinae; Lonchura.
OX NCBI_TaxID=299123 {ECO:0000313|EMBL:OWK49583.1, ECO:0000313|Proteomes:UP000197619};
RN [1] {ECO:0000313|EMBL:OWK49583.1, ECO:0000313|Proteomes:UP000197619}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=White83orange57 {ECO:0000313|EMBL:OWK49583.1};
RA Colquitt B.M., Brainard M.S.;
RT "Genome of assembly of the Bengalese finch, Lonchura striata domestica.";
RL Submitted (MAY-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OWK49583.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MUZQ01000823; OWK49583.1; -; Genomic_DNA.
DR STRING; 299123.ENSLSDP00000003493; -.
DR Proteomes; UP000197619; Unassembled WGS sequence.
DR GO; GO:0071944; C:cell periphery; IEA:UniProt.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR CDD; cd00054; EGF_CA; 4.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 3.30.70.960; SEA domain; 2.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR036364; SEA_dom_sf.
DR PANTHER; PTHR37999; MUCIN-17; 1.
DR PANTHER; PTHR37999:SF2; MUCIN-17; 1.
DR Pfam; PF01390; SEA; 3.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00200; SEA; 3.
DR SUPFAM; SSF82671; SEA domain; 3.
DR PROSITE; PS00022; EGF_1; 5.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS50024; SEA; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000197619};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 203..227
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1355..1379
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2844..2867
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1..115
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 285..321
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 347..383
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 972..1013
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1152..1264
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 2643..2763
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT REGION 237..266
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 725..770
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 834..862
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1073..1108
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1385..1416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1432..1558
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1583..1654
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1908..1975
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1996..2016
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2103..2135
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2329..2429
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2466..2590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 752..770
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 845..862
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1075..1090
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1439..1558
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 311..320
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 373..382
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1003..1012
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2986 AA; 315467 MW; A7A97648CC1D7FA2 CRC64;
MATLDMKVTV TNFNYSEDLE DPTSKTFLFF QNHFRQEIKK IYGTIPGYEG VEITSLKSGS
IVVGHQVFFT MAQSSNTTEK FQETTDRLKE NLQEAVDRQG NCQYNTSVLC LRDDPVVGDM
REVLDLDELC HQRAPPGYGA SFSADISSGV LHCVTSCTPN RPQSLDCHHG RCQVTREGPR
CFCPDEELYW YAGAQCSGRV SKVATGLGVV VALLFLVCLV LLVVLLCRRH QRHWRRPQGG
DIDLDPETPT WTRRPQTQPG DPNPTPCASG GTAVGPGCVC PPGRSGRRCE TPDPGGACRN
GGTAVGTSCY CPPGFGGPRC QRRDPGGACR NGATAFGTGC VCPPGFRGDT CQEPEEIGSC
LNGGTLEKGT CRCPPAAWGP RCECARMGPA GKGPGVGDTA AAATAGGGTT DAPWVRWARN
VTVTPGEGNA TTVGRGEVVT ATNSGGVGRN GTVSAVVSAT MVTHGVAGTD HDVVTITKRD
TEVTDTMVTH GVVSTDHDVV TITKKDTEVT DTMVTHGVAG TDHDVVTITE KDTEVSATTV
THGVAGTDHD VVTITKKDTE VSATTVTHGV AGTDHDMVTI TEKDTEVTDT TVTHGVAGTD
HDVVTITKKD TEVSATTVSH GVAGTDHDVV TITKRDTEVT DTTVTHGVVS TDHDVVTITK
RDTEVTDTMV THRVAGTDHD VVTIAEVSAG TDHDVVTITE KDTEVTDTTV SHGVAGTDHD
VVTTTERDTR VRGSVPMATT TPMPPARRVR NVTASPTRSH TRATTMEGET TAVEGDTTVT
RSYLLEGDTT AMEGDTTAMD GETTVMEGDT EATPMEGDTE ATPVATTVES VTMTVPPGHL
GSHSRATTRA RNTTAVGNSS RATTLKGDTR VTTMEGTTRA VENQTWATIT ESDTRATTLK
SDTRATTLEN HTLSTFSVAP ADVTSTPAPP NATNTILDVT ESPPNTTEAT GALAHVSAGT
TGDFISSTAP TMATICLYLP HGTSPPAIVC RNGGVANRTR CLCPPGYSGP TCETPDPTDR
CAGGGTAVGD RCVCPAGRTG PRCASPDPAT ACRHGGTAVG TECYCPPGFS GPRCEDPSPT
TTTTPATRPP RRSRATPRST RRTTTTTTTV LTTRDPCLNG GFWMGTACLC PPNMDGPRCE
FGATTINLTA ELGPSVTMMA RVTNRDFSED MRDAASPGHR RFAEEFGRTM DGIYRNVSGY
RGINVLSLSR GSVVVNYRVR LRPLPGTASL EHRALELLAV ANAAPQPHNC STSADGLCFT
ATSARATRAS TLALNATELC RKHAPANFSR FYFPYRTANG LLCITNCTLN VPGSFDCHRG
LCRLTLDGPQ CFCPDLPWYL SAGDRCQTHI SKLGLGLGVG LGLGLTILIL LVLCIVLTVR
LARGRKKSPG PSAAAEDTWP DGGRNSRVTG IYHVNGRGGG AGKGPYGYNT YKPSSEVADP
PASGSSIYSA EATTSGTTRA TTTSGETTRP GTTTSETIRP ETTTPGTTTS GTTTPESTSD
PWVTTSFETI STSPQATNDP GTSSSFETIP TTPETTKTSD QWATTSPETV STIPDTTTTI
PETTSLEATT DPWMATSLET ISTTPDTTTT LPGTNTTPER TTFPETTTIP GITTTFPETT
NPETDNSSGT TTTFAETTVT PGTTTTFPGT TSPEITTTIP EVITIPEITT SIPEVTATIP
EVTTTIPEVT TSPEIMTTIP ETTTTMPETT TSPETTITIP ETTTTIPEIT TTIPETTTTI
PDVPTTIPET TTTIPDVPTT IPEVTTSPEI MTTIPETTTT MPETTTSPET TITIPETTTT
IPEVTTTIPE TTTNPDTTTT FPGTTIPEIT TSIPEVTTIP ETTATIPEVT TSIPEVTTTI
AETTATIPEI TTSIPEVTTT IPEVTTTIPE ITTSIPEVTA TILETTTNPD TTMTFPETTN
PEVTIPSGTT TTIPGTTTRT PGASTTPETS TFPETTTTPD TTTTFPGTTI PEVTTTTPEV
TTSPEITITI PEVPTTTPEV TTTIPETTTN PDTTTTFPGT TIPEITTSIP EVTTIPETTA
TIPEVTTSIP EVPTTIPEVI TIPEITTTIP EVPTTIPEVT TTIPEVTTSP EIMTTIPETT
TTMPEITTTS PETTATIPET TTNPDTTTTF PGTTIPEITT SPKIMTTIPE TTITIPEVPT
TIPETTTNPD TTMIFPGTTI PEITTTIPDV PTISETITIP ETTTTIPDVP TTIPETTATI
PDVPTTTPET TTTIPDVPTT IPETTATIPD VPTAIPEITT TTPETTATIP EITTTTPEVT
TTIPETTATI PEVTTTIPET TTNPDTTTTF LGTTIPEITT SIPEVTTTIP ETNSTAPETT
FTTPVTTTTP ETTSTPGTNP TTPVATTMPV TNTTTLETST TTPATSTTIP ETNPTTLGFT
TTTTPVSTAT PWTTTTTSGT TTTTSRTTTI TPETNTTTAV TNITIPGTTT TTTTPVTIIS
IPETNTTTPV SMATPGTNST TTPRTTTTTS GTTTTTPRTT TTTPGTTTTP GTTTTTPGTT
TITPRTTTTP RTTTTTPGTT TTTPRTTTTT PRTTTTTPGT TTTTRMTTTS NSTTTTTTTP
TPTTTSTTRG TTTITTTTIT TTTTPGICSN GGTWVNGHCQ CPLGFTGDRC DDISSTIETK
AETNGTVQLM LRVTNRDFTE DLKNSSSPAY IEFIQEFTKQ MDVVYASIEG YKGIRVNHLS
SGSVVVDHSI IVSLLVTSQS QEKLQNITAK VQEKIKVAAT QFNCTNGDLC FKSSEVNITS
TMLDFDGDAY CWSQAPEGYR EFFFPNLTIS GLSCMSNCTP DTTSTIDCNR GTCHITRRGP
QCFCDETHLY WYQDDRCTSR VSKLAMGLGL AVAILVTVVI LLSIFLFRAR RSGSLNTAEQ
KMMKNWYQNI SEEWSPQGNF TFRNQGAQLK DDHPVHLEAV DTSVPNLTII IVVIDIDINI
IIINMNNVKI FIIITTIIPI IPLTNPMEGG PPQHCRTNWT FSWGSP
//