ID G3TMX8_LOXAF Unreviewed; 2544 AA.
AC G3TMX8;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 60.
DE RecName: Full=Mucin 6, oligomeric mucus/gel-forming {ECO:0008006|Google:ProtNLM};
OS Loxodonta africana (African elephant).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000016519.2, ECO:0000313|Proteomes:UP000007646};
RN [1] {ECO:0000313|Ensembl:ENSLAFP00000016519.2, ECO:0000313|Proteomes:UP000007646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000016519.2,
RC ECO:0000313|Proteomes:UP000007646};
RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Loxodonta africana (African elephant).";
RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLAFP00000016519.2}
RP IDENTIFICATION.
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000016519.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9785.ENSLAFP00000016519; -.
DR Ensembl; ENSLAFT00000021621.2; ENSLAFP00000016519.2; ENSLAFG00000022740.2.
DR eggNOG; KOG1216; Eukaryota.
DR GeneTree; ENSGT00940000161708; -.
DR HOGENOM; CLU_000076_1_0_1; -.
DR InParanoid; G3TMX8; -.
DR OMA; GSNIEGC; -.
DR TreeFam; TF300299; -.
DR Proteomes; UP000007646; Unassembled WGS sequence.
DR CDD; cd19941; TIL; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF264; MUCIN-6; 1.
DR Pfam; PF08742; C8; 2.
DR Pfam; PF01826; TIL; 1.
DR Pfam; PF00094; VWD; 3.
DR SMART; SM00832; C8; 2.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS50853; FN3; 1.
DR PROSITE; PS51233; VWFD; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000007646};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 10..190
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 370..548
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 846..1028
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1579..1685
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2454..2543
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 218..240
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1116..1135
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1154..1388
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1437..1993
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2005..2095
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2115..2183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2261..2372
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1154..1233
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1240..1319
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1437..1454
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1455..1479
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1493..1688
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1697..1993
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2544 AA; 265703 MW; 68667E4921FE69E2 CRC64;
NTLSTDEQRG WCSTWGAGHF STFDHHVYNF SGTCNYVFAA ICKDPSPTFS VQLRRGSDGT
TARIIMELGS SVVTVQSGVV SVKDVGVVSL PYTSNGLQIT PFGQNVRLVA KQLELELEVL
WGPDGYVMVR TAGVLSGGGG RGPMWGRCQH CRCPQVLVER KFMGKMCGLC CPANQGVQGS
AAQPTSDPVP TKAQLPRLLH LWLLLPRRVR ASAARSLGLH SPNPQMNQEE GTAGNPTSSS
PCPAMRACGC VHGQKRRWGL SAPSVQSGHR HLLGSATHQG VVSWVWASWG RVPLRCPWQL
RMQHAAPGSP SPAGMVLDDL SKNQTCVPIT QCPCMFNGAV YAPGEVTSSS CRTCQCSEGL
WKCTEQPCPG RCSLEGGSFV TTFDARPYRF HGTCTYILLQ SHQLPEEGSL MAVYDKSGYS
HSETSLVAII YLSGQDKIMI SQDEVITNNR EVKWLPYKIG NITIFRQTST HLQMATDFGL
ELMIQLQPVF QAYVTVEPQF RGQTRAHPGL CGNYNGDTTD DFMTSMGITE GTASLFVDSW
RAGNCPTALE RETDPCSMSQ LNKVCAETHC SVLVKKGSVF EKCHSVVNPQ PFYKRCVYQA
CNYEETFPHI CAALGAYAHL CASHGVLLWD WRSSVDNCTI PCTGNTTFSY NSEACNRTCM
SLSDHTLECH PSAVPVDGCN CPEGTYLNHK AECVRKGPVS LPSWDSHEFI FGRDSPPLST
APPCYCINGR LTCSRKAQTL LATCTAPKTF QSCSQSSESK FGAACAPTCQ MLATGIACVP
TKCEPGCVCA EGLYEDANGQ CVPPSNCSCE FGGASYPTGA ELNTDCQTCT CKQGKWVCQQ
STSCASTCTL YGEGHMITFD GQRFVFDGNC EYTLTTDGCS TNDSQPTFKI VTENVICGKS
GVTCSRAIKL FLGALSIVLA DKTYTVSGHD PQVSFQVQPG SMHLVLEVYI AGKYNLTLTW
NKHMMVLIKV SRTSAQVTPV RAGGLDGNPK ADSGSRMGLT GWWTLVSKGK RSPPVADSGN
GLVAKVCMAH RVLGAGPAHT RLSNLDAHPQ VYHMPYYEAC VRDTCGCDTG GDCECLCDAV
AAYAKACLDK GVCVDWRSPD FCRECLGVDG IKRHQAVPSL PPQGPHPSHP RQASCPASAE
VSFPCSPAYH ASTTHHSDDA NLVPTAWPST EAQPTTPATP TPRTSGLLSS ARPSTSPGAT
PVGPTATASL PATSMASPQA PSSAQTDTVP TAPTKPAVSP GESPRSTTAI TPRVTSAPTP
TSTRRVTATR PTVTQATSHP TASHPTTATQ TTAESHRAHH SSYGTPTALE ETSSILPATH
QRRRAVPHHS RRSVPGAQRR TGPPGDAQAW PRPRHSPTNK LPPCLHPTAT GHGPHTSAVP
ASHGEYSVGS SSIVSCASAE ELDTGRASGG LRLETHELKI KGVENMGGFG CKFRKQSTKE
IREGDPKAVE TKQTRVHTTK TTGTERGILA NSEERTNRSD LGSGGPRRRQ TRVTPLLSPT
AASTSTAATG TTTGRQTRTR SPEITPPTRI PPLATSSVTP TSHRVTTPAA EATTSSPSSP
PPTGTSLSTT TGTKTKLPTP VPPDATSSVT PTSQQVTTLT PAAIKSSSSP LTTATSRHTT
AAPSTGTRVR ATGTPVPETT LPVSHSQPHT TFTTLSQPTV SASSSSRSTG PPTGTSFKTT
TTLPIPSSPK TTLPTPAPPV ATSSVTPTSQ QVTTLTPAAI KSSSSPLTTA TSRPTTAAPS
TGTQATGTPV PETTSPVSHS RPHTNFTTPS QPTVSASSSS RSTGPPTGTS FKTTTTLPTP
SSPETTLPTP VQPLATSSVT ATSHPVTTPT TETISSSPSS SPHTGTSRTT TAAPSTGTRS
ATGTPVPETT SPVSHSQPHT TSTTLSQPTV SASSSSRSTG PNGTSFKTTT TLPTPSSTKT
KLPTSVPPVA TSSVTPTSHQ VTTPTRSSSS PLTTATSRPT TAAPSTGTRS TPSTSPFTKT
TSATGSSHTS VSTVKTTISQ VSFAPSTAPA SSTSPPHTTI TPKPTSRATG TLVPETSLSA
THSQPHTSFI TLSPPTLSPS SSSHSTRPPL GTSFKTTTSF PIPSGPRTTV STRFPPFATS
SVTLTSHRVT TLTAEAITSS PSSAWPTGTS RPTTAAPSTG PRPTASTSPL TKTTSATGPP
HSSLSTAKTS PHVSHSSSLL PPLPTISHPS SSFTSLPSVS QIPFTTSVSL PHLPFSSTHF
PSTSSIPSVS TLATSTSSFS SGIPASFSSS TIASVSTYPS QSAVPTTQHS KQPSSAYHTP
SSPGQVVVSN STSSFTHSPH STTSLSSSVS PPFTSSPHSS TLPPTSTLHL SSPTSSVPVS
SSAIGPSTSS TTPSAVPSSP LSSQSTSLST PLTSLTATPG IVSSTLGATA SPSSPATMLT
TNQTSTTGLL TTLVLTSSIT AHGSSSIPVP ESFSTQGSSI SSVVTSLSSP TGICSVREYE
KEITYLGCTA NVTMTRCEGS CASSASFNIH TKQVDIQCGC CHPLRSYEKQ LVLPCPDPSA
PGDQLVLTVL VFSSCACSSQ ACRD
//