GenomeNet

Database: UniProt
Entry: G3TMX8_LOXAF
LinkDB: G3TMX8_LOXAF
Original site: G3TMX8_LOXAF 
ID   G3TMX8_LOXAF            Unreviewed;      2544 AA.
AC   G3TMX8;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   27-MAR-2024, entry version 60.
DE   RecName: Full=Mucin 6, oligomeric mucus/gel-forming {ECO:0008006|Google:ProtNLM};
OS   Loxodonta africana (African elephant).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX   NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000016519.2, ECO:0000313|Proteomes:UP000007646};
RN   [1] {ECO:0000313|Ensembl:ENSLAFP00000016519.2, ECO:0000313|Proteomes:UP000007646}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000016519.2,
RC   ECO:0000313|Proteomes:UP000007646};
RA   Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Loxodonta africana (African elephant).";
RL   Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLAFP00000016519.2}
RP   IDENTIFICATION.
RC   STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000016519.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 9785.ENSLAFP00000016519; -.
DR   Ensembl; ENSLAFT00000021621.2; ENSLAFP00000016519.2; ENSLAFG00000022740.2.
DR   eggNOG; KOG1216; Eukaryota.
DR   GeneTree; ENSGT00940000161708; -.
DR   HOGENOM; CLU_000076_1_0_1; -.
DR   InParanoid; G3TMX8; -.
DR   OMA; GSNIEGC; -.
DR   TreeFam; TF300299; -.
DR   Proteomes; UP000007646; Unassembled WGS sequence.
DR   CDD; cd19941; TIL; 2.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   InterPro; IPR006207; Cys_knot_C.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF264; MUCIN-6; 1.
DR   Pfam; PF08742; C8; 2.
DR   Pfam; PF01826; TIL; 1.
DR   Pfam; PF00094; VWD; 3.
DR   SMART; SM00832; C8; 2.
DR   SMART; SM00215; VWC_out; 2.
DR   SMART; SM00216; VWD; 3.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR   PROSITE; PS01225; CTCK_2; 1.
DR   PROSITE; PS50853; FN3; 1.
DR   PROSITE; PS51233; VWFD; 3.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007646};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          10..190
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          370..548
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          846..1028
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1579..1685
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2454..2543
FT                   /note="CTCK"
FT                   /evidence="ECO:0000259|PROSITE:PS01225"
FT   REGION          218..240
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1116..1135
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1154..1388
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1437..1993
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2005..2095
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2115..2183
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2261..2372
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1154..1233
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1240..1319
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1437..1454
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1455..1479
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1493..1688
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1697..1993
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2544 AA;  265703 MW;  68667E4921FE69E2 CRC64;
     NTLSTDEQRG WCSTWGAGHF STFDHHVYNF SGTCNYVFAA ICKDPSPTFS VQLRRGSDGT
     TARIIMELGS SVVTVQSGVV SVKDVGVVSL PYTSNGLQIT PFGQNVRLVA KQLELELEVL
     WGPDGYVMVR TAGVLSGGGG RGPMWGRCQH CRCPQVLVER KFMGKMCGLC CPANQGVQGS
     AAQPTSDPVP TKAQLPRLLH LWLLLPRRVR ASAARSLGLH SPNPQMNQEE GTAGNPTSSS
     PCPAMRACGC VHGQKRRWGL SAPSVQSGHR HLLGSATHQG VVSWVWASWG RVPLRCPWQL
     RMQHAAPGSP SPAGMVLDDL SKNQTCVPIT QCPCMFNGAV YAPGEVTSSS CRTCQCSEGL
     WKCTEQPCPG RCSLEGGSFV TTFDARPYRF HGTCTYILLQ SHQLPEEGSL MAVYDKSGYS
     HSETSLVAII YLSGQDKIMI SQDEVITNNR EVKWLPYKIG NITIFRQTST HLQMATDFGL
     ELMIQLQPVF QAYVTVEPQF RGQTRAHPGL CGNYNGDTTD DFMTSMGITE GTASLFVDSW
     RAGNCPTALE RETDPCSMSQ LNKVCAETHC SVLVKKGSVF EKCHSVVNPQ PFYKRCVYQA
     CNYEETFPHI CAALGAYAHL CASHGVLLWD WRSSVDNCTI PCTGNTTFSY NSEACNRTCM
     SLSDHTLECH PSAVPVDGCN CPEGTYLNHK AECVRKGPVS LPSWDSHEFI FGRDSPPLST
     APPCYCINGR LTCSRKAQTL LATCTAPKTF QSCSQSSESK FGAACAPTCQ MLATGIACVP
     TKCEPGCVCA EGLYEDANGQ CVPPSNCSCE FGGASYPTGA ELNTDCQTCT CKQGKWVCQQ
     STSCASTCTL YGEGHMITFD GQRFVFDGNC EYTLTTDGCS TNDSQPTFKI VTENVICGKS
     GVTCSRAIKL FLGALSIVLA DKTYTVSGHD PQVSFQVQPG SMHLVLEVYI AGKYNLTLTW
     NKHMMVLIKV SRTSAQVTPV RAGGLDGNPK ADSGSRMGLT GWWTLVSKGK RSPPVADSGN
     GLVAKVCMAH RVLGAGPAHT RLSNLDAHPQ VYHMPYYEAC VRDTCGCDTG GDCECLCDAV
     AAYAKACLDK GVCVDWRSPD FCRECLGVDG IKRHQAVPSL PPQGPHPSHP RQASCPASAE
     VSFPCSPAYH ASTTHHSDDA NLVPTAWPST EAQPTTPATP TPRTSGLLSS ARPSTSPGAT
     PVGPTATASL PATSMASPQA PSSAQTDTVP TAPTKPAVSP GESPRSTTAI TPRVTSAPTP
     TSTRRVTATR PTVTQATSHP TASHPTTATQ TTAESHRAHH SSYGTPTALE ETSSILPATH
     QRRRAVPHHS RRSVPGAQRR TGPPGDAQAW PRPRHSPTNK LPPCLHPTAT GHGPHTSAVP
     ASHGEYSVGS SSIVSCASAE ELDTGRASGG LRLETHELKI KGVENMGGFG CKFRKQSTKE
     IREGDPKAVE TKQTRVHTTK TTGTERGILA NSEERTNRSD LGSGGPRRRQ TRVTPLLSPT
     AASTSTAATG TTTGRQTRTR SPEITPPTRI PPLATSSVTP TSHRVTTPAA EATTSSPSSP
     PPTGTSLSTT TGTKTKLPTP VPPDATSSVT PTSQQVTTLT PAAIKSSSSP LTTATSRHTT
     AAPSTGTRVR ATGTPVPETT LPVSHSQPHT TFTTLSQPTV SASSSSRSTG PPTGTSFKTT
     TTLPIPSSPK TTLPTPAPPV ATSSVTPTSQ QVTTLTPAAI KSSSSPLTTA TSRPTTAAPS
     TGTQATGTPV PETTSPVSHS RPHTNFTTPS QPTVSASSSS RSTGPPTGTS FKTTTTLPTP
     SSPETTLPTP VQPLATSSVT ATSHPVTTPT TETISSSPSS SPHTGTSRTT TAAPSTGTRS
     ATGTPVPETT SPVSHSQPHT TSTTLSQPTV SASSSSRSTG PNGTSFKTTT TLPTPSSTKT
     KLPTSVPPVA TSSVTPTSHQ VTTPTRSSSS PLTTATSRPT TAAPSTGTRS TPSTSPFTKT
     TSATGSSHTS VSTVKTTISQ VSFAPSTAPA SSTSPPHTTI TPKPTSRATG TLVPETSLSA
     THSQPHTSFI TLSPPTLSPS SSSHSTRPPL GTSFKTTTSF PIPSGPRTTV STRFPPFATS
     SVTLTSHRVT TLTAEAITSS PSSAWPTGTS RPTTAAPSTG PRPTASTSPL TKTTSATGPP
     HSSLSTAKTS PHVSHSSSLL PPLPTISHPS SSFTSLPSVS QIPFTTSVSL PHLPFSSTHF
     PSTSSIPSVS TLATSTSSFS SGIPASFSSS TIASVSTYPS QSAVPTTQHS KQPSSAYHTP
     SSPGQVVVSN STSSFTHSPH STTSLSSSVS PPFTSSPHSS TLPPTSTLHL SSPTSSVPVS
     SSAIGPSTSS TTPSAVPSSP LSSQSTSLST PLTSLTATPG IVSSTLGATA SPSSPATMLT
     TNQTSTTGLL TTLVLTSSIT AHGSSSIPVP ESFSTQGSSI SSVVTSLSSP TGICSVREYE
     KEITYLGCTA NVTMTRCEGS CASSASFNIH TKQVDIQCGC CHPLRSYEKQ LVLPCPDPSA
     PGDQLVLTVL VFSSCACSSQ ACRD
//
DBGET integrated database retrieval system