ID A0A2A4JA68_HELVI Unreviewed; 2619 AA.
AC A0A2A4JA68;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN ORFNames=B5V51_5361 {ECO:0000313|EMBL:PCG68322.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG68322.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG68322.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG68322.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG68322.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004167}; Single-
CC pass membrane protein {ECO:0000256|ARBA:ARBA00004167}.
CC -!- SIMILARITY: Belongs to the tenascin family. Teneurin subfamily.
CC {ECO:0000256|ARBA:ARBA00009385}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG68322.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01002427; PCG68322.1; -; Genomic_DNA.
DR STRING; 7102.A0A2A4JA68; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR CDD; cd00053; EGF; 1.
DR CDD; cd00054; EGF_CA; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 2.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 2.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR028916; Tox-GHH_dom.
DR PANTHER; PTHR11219; TENEURIN AND N-ACETYLGLUCOSAMINE-1-PHOSPHODIESTER ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR11219:SF73; TENEURIN-A; 1.
DR Pfam; PF15636; Tox-GHH; 1.
DR SMART; SM00181; EGF; 8.
DR SUPFAM; SSF101898; NHL repeat; 1.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 1.
PE 3: Inferred from homology;
KW Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 59..80
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 542..577
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 107..128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 546..556
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 567..576
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2619 AA; 291156 MW; BD9886A314E8FCD8 CRC64;
MSSGMGGGTL GSGHCQLANQ PLVMPVFPLR ASQPSPVPVY SPSRFHIDKR CQHRCTWKCL
AIAMICLCVI LAAMLAYFIA LSSIKTNIDN SNCILVQDVK AVTHAQGNRD ALSTSSPSDE
SISTSSLGGW STTAEAALES VVIPQQAQQA APPPWSPALE VREFDELHSA TIPAYQFWSS
EFRNKQPAYV SLNFTVPWGA NFAVYGRRNV APSVTQYDFV EFIRGGRVDH RLRRRRSPYR
DNPFQSSLND WDALLSTINP LHRWNHTISK RSTDMRVNVS ILQYLDTGRW FISVYNDELQ
PHNVEMIVSE AEGVTNSCPD DCSGHGSCYL GKCECMDGYE GHDCSKSVCP VLCSGHGAYA
GGVCHCSEGW KGLECDVPAH DCEPADCSGR GQCIAGHCHC KAGWKGQKCD EEDCLDPTCG
GHGSCVRGRC VCRAGWRGAS CSERDARVQR CLPACSQRGV YDLDAGRCVC DPLYTGDDCS
QVVCSLDCGP HGVCAEGVCR CDDGWTGSMC DQRPCDTRCH DHGQCKNGTC VCTQGWNGRH
CTLPGCPNGC TRHGQCLLED GVYRCSCADG WAGTDCSIAL ELSCNDNEDN DEDGMTDCSD
SECCSRPECA EHIMCLASND PVEVLLRKQP PSVTASFYQR VKFLIEENSV QSYAHMDEYS
ESRVSVMRGQ VVSPQGLGII GIRVSVDREA RFGFTLTRQG GWFDVLVNGG GAVTLQFQRS
PFKPLRKTVF VPWNQIVVLP PVQMELSDDN VKVPTPRAPP RIGWSPTPQW WESETPPCAA
HDHERLRPEV VAVPPLAAPA ASSATTTTVI YPEAQIVSES IGIPGSLVKL TYRSSQSAGY
LSCVHVQLTR RVVPASLIRV HVRVEIEGSL HTHTYEADPD LTHIFAWNKR NVYKQKVYGQ
AQAKISIGYE YTSCASIVWE TQTATLAGFD VDISDIGGWG LDIHHHYNFH EGILQKGDGA
LLHLKQYPRT VQVVMGTGLQ RSLECPDHCN GKAADARLLT PTALTSGPDG SLYVGDFNLV
RRITPEGVVT TVLQLQTTQV AYQYYICISP ADGYLYISDS ERHQVRRVIA LEKVRDPATN
SEAVVGNGDR CVPGDDSNCG DEGPAIKAKL AHPKGLAIAA DRTMYIADGT NIRAVDPNGI
IHTLVGHHGH HNHWSPVPCR GAIPPYEAQL QWPTGVALSP LDGSLYFIDD RIILKLTVDM
KIKVVAGQPS HCRIGNDGKP IAKPTNRTNT DIREDSSLGT ILAIAFAPSG ILYVAEADSK
KTNAIKSIDP SGKIMHFAGK VQENLKELSC ECNSSASATV VPTNSRDDAG CPCRLSVAAG
DEPPTSTETL LSSNAKFQTI SALAVTPDGV LNVVDQGSLH ILALKHYLPT HDENGEFRIP
YPPTSEIYVF NRYGQHITTK DLTSGKTRYS FLYSKNTSFG KLSTVTDASG NKIQLLRDYS
NIVSSIENTQ DHKLELKISG IGYLTRIAEK GSTEMEFDYD ASTGLLNSRS GAGDTVIYNY
DELGRVTKII MPSGEQVHIT SGLAKNYGLA VTVSNPSSTI PVGVAKKCEY VLHGQSFKQI
TINNGKQITE GRIYTNNTLL LETPWAGKFE SIAAAKHPLL EAALPIEAEM LHMWSHQTTT
FGDALVNNMY SLYTLVGDVR NPQQTLNREI WVNDSRVLII EFDQFKSRET FFNADRNPLF
TISYDVAGLP LSFNPHGAGA PLNISYDRFY RINGWKWGDS EESYSYDPHG MLSEITSPQD
GTKFIYYNEG NLVSKISLAS QRSFKYKYDN DGGLTHVILP SGSNHSFSVQ PSIGFLRVTY
TPPGSSKKYL QHYAHTGELL QTVFPGDGAR VVYRYFTTNK VSEVIHGDGQ TQIHYAETSG
LPSEILHVDR DVDYRWESTY VGGLLTEERL DYGAKTGLSN AKIIYEYDNN YRITSVQGRI
GGQTLIPHHI VYNSKTGAPE ILGQFTVSKQ KWNETSVYDG IAMFSRLLND QFLEKEVTVN
IHRMEVFRME FSYDRHGRIS QTRTHTRNVG VNTYTNVKNY TWDCDGQLTG VEAQEPWGFR
YDDNGNMLSL TYRGNTIPME YNDMDRIVKF GEGQYRYDSR GLVSQNAREE RFQYNSKGLL
IRATKRGRFD VRYFYDHLDR LSTRKDNFGN VTQFFYTNKD RPHEVSHIYS PRENRFMTLV
YDDRGHLIYT QVARHKYYIA TDQCGTPVMI FNQYGEGIRE IMRSPYGHIV YDSNPYLYLP
VDFCGGLLDQ VTSLVHMANG KVYDPLIGQW MSPLWENLIE RIHNPTQLHL YRFNGNDPIN
VRPQNKKPTD HLSWLKLLGY DTKSLAPQLY PDELPGGSVL PSVPRGRPVW GTAPADTSPG
LPSAMSLLPT VTIESGFLSH MSNKRIADFR SLSIPAMSAL KTDALNLAPK RIGSDSEPPF
GRGILVSRNS RGRAVVTTVP SANAIYRDVY TSVFNRSHLL PFSFVVHGDQ QDVFYFVKED
TWRAADDKQQ LKRMQGKLNV TFHEVAEGGR SYADVKIHGK TSIVNLRYGT SAERERQRLL
HHARTAAVRK AWHREREALR SGLGGSLDWS AAELDEIQKA GSAAGYEGEY VHDVARYPEL
AEDPYNIRFV KKRSDRAERR RRKREDCRGA WWLAWDELC
//