ID A0A1A9VHI2_GLOAU Unreviewed; 2319 AA.
AC A0A1A9VHI2;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
OS Glossina austeni (Savannah tsetse fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Hippoboscoidea;
OC Glossinidae; Glossina.
OX NCBI_TaxID=7395 {ECO:0000313|EnsemblMetazoa:GAUT037571-PA, ECO:0000313|Proteomes:UP000078200};
RN [1] {ECO:0000313|Proteomes:UP000078200}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=TTRI {ECO:0000313|Proteomes:UP000078200};
RA Aksoy S., Warren W., Wilson R.K.;
RL Submitted (MAY-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:GAUT037571-PA}
RP IDENTIFICATION.
RC STRAIN=TTRI {ECO:0000313|EnsemblMetazoa:GAUT037571-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004167}; Single-
CC pass membrane protein {ECO:0000256|ARBA:ARBA00004167}.
CC -!- SIMILARITY: Belongs to the tenascin family. Teneurin subfamily.
CC {ECO:0000256|ARBA:ARBA00009385}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 7395.A0A1A9VHI2; -.
DR EnsemblMetazoa; GAUT037571-RA; GAUT037571-PA; GAUT037571.
DR VEuPathDB; VectorBase:GAUT037571; -.
DR OrthoDB; 5491728at2759; -.
DR Proteomes; UP000078200; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 2.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 2.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR028916; Tox-GHH_dom.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR01643; YD_repeat_2x; 1.
DR PANTHER; PTHR11219; TENEURIN AND N-ACETYLGLUCOSAMINE-1-PHOSPHODIESTER ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR11219:SF69; TENEURIN-M; 1.
DR Pfam; PF05593; RHS_repeat; 1.
DR Pfam; PF15636; Tox-GHH; 1.
DR SMART; SM00181; EGF; 8.
DR SUPFAM; SSF101898; NHL repeat; 1.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 3.
PE 3: Inferred from homology;
KW Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Membrane {ECO:0000256|ARBA:ARBA00022475}.
FT DOMAIN 80..116
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 118..150
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 282..318
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 2242..2319
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2242..2261
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2277..2294
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 106..115
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 140..149
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 286..296
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 308..317
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2319 AA; 261853 MW; E1B57F4BE9326886 CRC64;
MIIIIIIIIG DNVQLSITRE VTRYMEPGHW FVSLYNDDGD VQEVTFYAAI AEDMTQNCPN
GCSGNGQCLL GHCQCNPGFG GDDCSESVCP VLCSQHGEYN NGECICNPGW KGKECSLRHD
ECEVADCKGH GHCVSGKCQC MRGYKGKFCE EVDCPHPTCS SHGFCADGTC ICKKGWKGPD
CAIMDQDALQ CLPDCSGHGT FDLDSQTCTC ESKWSGDDCS KELCDLDCGQ HGRCIGDACT
CDDSWGGEYC NTKLCDLRCN EHGQCKNGTC LCVTGWNGKH CTIEGCPNSC SNHGQCRVSG
EGQWECRCYE GWDGPDCGIA LELNCGDSKD NDKDGLVDCE DPECCASHVC KTSQLCVSAP
KPIDVLLRKQ PPAITASFFE RMKFLIDESS LQNYAKLETF NESRSAVIRG RVVTSLGMGL
VGVRVSTTTL LEGFTLTRDD GWFDLMVNGG GAVTLQFGRS PFRPQSRIVQ VPWNEVIIID
MVVMSMSEEK SLPPSTHTCF SHDYDLMKPV VLASWKHGFQ GACPDRSAIL AESQVIQESL
QIPGTGLNLV YHSSRAAGYL STIKLQLTPE IIPSSLHLIH LRITIEGILF ERIFEADPGI
KFTYAWNRLN IYRQRVYGVT TAVVKVGYQY TDCKDIIWDI QTTKLSGHDM SISEVGGWNL
DIHHRYNFHE GILQKGDGSN IYLKNKPRVI LTTMGDGHQR PLECMDCEGL ALKQRLLAPV
ALAASPDGSL YVGDFNYIRR IMTDGTVRTV VKLNATRVSY RYHMALSPLD GTLYISDPES
HQIIRVRDTN DFTQPERNWE PTVGSGERCL PGDEAHCGDG ALAKDAKLAY PKGIAISSDN
ILYFADGTNI RMVDRDGIVS TLIGNHMHKS HWKPIPCEGT LKLEEMHLRW PTELAVSPLD
NTLHIIDDHM ILRMTPDGRV RVISGRPLHC ATTSSVYDTD LATHATLVMP QSIAFGPLGE
LYVAESDSQR INRVRVIGTN GRISPFAGAE SKCNCLERGC DCFEADHYLA TSAKFNTIAA
LAVTPDGHVH IADQANYRIR SVMSSIPEAS SSREYEIYAP DMQEIYIFNR FGQHVMTKNI
LTGETTYVFT YNVNTSNGKL STVTDAAGNK VFLLRDYTSQ VNSIENTKGQ KCRLRMTRMK
MLHELNTPDN YNVTFEYHGP TGLLKTKLDS TGRSYVYNYD EFGRLTSAVT PTGRVIDLAF
DLSVKGAQVK VSENAQKEIS MLIQGSSVVV RNGEAESKTM VEMDGSTTSI TAWGHMLQME
VVPYPVLAEI SPIIGESYPV PAKQRTEIAG DLANRFEWRY FVRRLQQGKQ SKGPRPFTQV
GRKLRVNGDN VLTLEYDRDT QSIVVMVDDK QELLNVTYDR TSRPVSFRPQ SGDYADVDLE
YDRFGRLVTW KWGNLQEAYT FDRNGRLNEI KYGDGSSMVY AFKDMFGSLP LKVTTPRRSD
YLLQYDDAGA LQSLTTPRGH IHSFSVQTSL GFFKYQYFSP INRHPFEILY NDEGQILAKI
HPHQSGKVAF VHDNAGRLET ILAGLSSTHY MYQDTTSLVK SVDVQEPGFE LRREFKYHAG
ILKDEKLRFG SKNSLASAHY KYAYDGNARL TGVEMTIDDK EMPTTRYKYS QNLGQLEVVQ
DLKITRNAFN RTVIQDSSKQ FFTIIDYDQH GRVKSVLMNI KSFDVFRLEL DYDLRNRIKS
QKTTFGRSTA FDKINYNADG HVIEVLGTNN WKYLYDENGN AVSVVDQGEK INLGYDIGDR
VIKVGDIEFN NYDARGFVVR RGEQKYRYNN RGQLIHAFER ERFQTWYYYD DRSRLMAWHD
SKGNVTQYYY ANPRTPALLT HMHYPKTGKT VRFFYDDRDM LIAIETTEQR YYVATDQNGS
PLAFFDLNGS IIKEIKRTPF GRIIKDTNPD FFVPVDFHGG MLDPLTKLVY IDGRQYDSTV
GQWMTPMWET LATEMSNPTD VFIYRYNNND PVNPSKQPNY MIELESWLQL FGYDLNNMQS
SKYTKTIQYN PQASIKSHTL APDFGVISGL ECIVEKTNQK FSDFDFVPKP LLKMEPKMRN
LLPRVSYRRG VFGEGVLLSR IGGRALVSVV DGSNSVVQDV VSSVFNNSYF LDVHFSIHDQ
DVFYFVKDNV LKLRDDNEEL RRLGGMFNIS THEITDHGGS AAKELRLHGP DAVVTIKYGV
DPEQERHRIL KHAHKRAVER AWELEKQLVA AGFQGRGDWT EEEKEELVSH GDVDGWVGVD
IHSIYKYPQL ADDPGNVAFQ RDAKRKRRKT GNQHRQQGKR RQQRLKDMST ANEPESESLY
DAEDEEENEN ENENFDSELL FQNKAKDFDS NSTEDDLYY
//