ID A0A194Q7A1_PAPXU Unreviewed; 2399 AA.
AC A0A194Q7A1;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Hemicentin-1 {ECO:0000313|EMBL:KPJ01408.1};
GN ORFNames=RR46_03180 {ECO:0000313|EMBL:KPJ01408.1};
OS Papilio xuthus (Asian swallowtail butterfly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Papilionidae; Papilioninae; Papilio.
OX NCBI_TaxID=66420 {ECO:0000313|EMBL:KPJ01408.1, ECO:0000313|Proteomes:UP000053268};
RN [1] {ECO:0000313|EMBL:KPJ01408.1, ECO:0000313|Proteomes:UP000053268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Ya'a_city_454_Px {ECO:0000313|EMBL:KPJ01408.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KPJ01408.1};
RX PubMed=26354079; DOI=10.1038/ncomms9212;
RA Li X., Fan D., Zhang W., Liu G., Zhang L., Zhao L., Fang X., Chen L.,
RA Dong Y., Chen Y., Ding Y., Zhao R., Feng M., Zhu Y., Feng Y., Jiang X.,
RA Zhu D., Xiang H., Feng X., Li S., Wang J., Zhang G., Kronforst M.R.,
RA Wang W.;
RT "Outbred genome sequencing and CRISPR/Cas9 gene editing in butterflies.";
RL Nat. Commun. 6:8212-8212(2015).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ459337; KPJ01408.1; -; Genomic_DNA.
DR STRING; 66420.A0A194Q7A1; -.
DR Proteomes; UP000053268; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00096; Ig; 4.
DR Gene3D; 2.60.40.10; Immunoglobulins; 16.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 2.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR013151; Immunoglobulin.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR45080; CONTACTIN 5; 1.
DR PANTHER; PTHR45080:SF8; WRAPPER_REGA-1_KLINGON HOMOLOG; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF07679; I-set; 9.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF13927; Ig_3; 5.
DR Pfam; PF00090; TSP_1; 2.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00409; IG; 16.
DR SMART; SM00408; IGc2; 16.
DR SMART; SM00209; TSP1; 2.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 16.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 2.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01187; EGF_CA; 2.
DR PROSITE; PS50835; IG_LIKE; 16.
DR PROSITE; PS50092; TSP1; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000053268};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 387..471
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 515..607
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 774..861
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 881..952
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 959..1050
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1055..1144
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1149..1233
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1242..1325
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1330..1418
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1423..1502
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1507..1586
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1591..1670
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1701..1767
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1774..1858
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1876..1964
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1968..2060
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2279..2321
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
SQ SEQUENCE 2399 AA; 265645 MW; E9BC325F14B2008C CRC64;
MQKIEKTPQD KDEGKSSLAF VFDTTGSMSN DLKQLREGAE MILKTALEES NIIADFVFVP
FHDPNIGPAT VTKSKKVFKA ALNIIHVKGG GDCPEKSLGG VLLALSVSRP RSYVYVFTDA
SASDHKIVGK VLDAVQRKQS QVVFVLTGHC NDVKRQTYKV YQQVATASSG QVFNLNKTNV
HNVLDFVRSS IRGRSVNLAS AVHPAGYNYT QAIPVDSSLG EVTVSVSGAK PTIKVVNPSG
HEVVGPPQLV TTLDLSEIMI VKVLEPEPGN WTITVGSEKD YSVKVSGLSN ITFDHGFSVE
KTNSMEEASY RPLQGTYNHM LISISEVPVQ IDFAEILTTD GKTLFEVPVK QKDDMFLADA
FIPPDDFFYI AINGRDQNGQ KFRRVGPTAV QAKPPEARSH DRVVLKCNVE SLVPVTVQWT
RDSPRMQRKT SSLQSTSIEY VIEDMSEAHV GTYRCSASNA AGQSKTSTQL SLLDVARRPE
REAGGSSSLP ALDDRSLSVQ AEGVGRCRCL SVEPPQVSVL PVNATLSAGE DLTISCAVFS
EAVLLKSQII FNNGKTVNVT NIKVGRNIDG FYSYNKTISK VTEKHRGKYT CLAANRGGSI
NQSTYIIIES EPTAKILGHH NVAGQINTNL KLLCQVENAN LAQWMSPNGT VAKEVAINGT
YNITLDVTLT EEGYWTCIGL RNSHRDRQVP STPSRWLAIH NRAVCAKLPA SHSSVVEWTV
LGGTATIKGP ARSEHIINAS AIPLVLLMSM GVVEHLGVPA APSDIVQVNI EIKPEVNIED
RNITVVVGSM QQLVCTVRAK PPPTVVWRRG TEDFNNTVVQ VKETVYRSVV MLDSDRYSVN
GTYFCIAENS VGKAQDTVTV NVRKNMRILE NFTDQQVELY SQIDLHCNVD THPPATITWQ
HNNTDIQTDD NIDISEDGTT LYIQKIDFND LGLYTCLMDN GYERIQINGS LEVVGLESPV
LSKEPSSIRA RRGDTAFISC RLLKGNPEPK ITWQYKINGT DEFKNLRNNT KVNEEGFIVT
INNVSLQDEG FYQCLAENPI GKDIYQVFLN ILYPPELKTL RKSEENNLEI KSGDKVLLNC
NVVGNPPPFV VWTKDEKTLP YSKNVYLTNT NELVISNVTV YHSGTYSCNA SSTLGSLTNN
YTLQVYLSPS ITTHSLDQVI QVLEGQLVEL PCAVSGSPTP AVTWSHNDVH VNEHRKYIDE
YGMRFVANLT DFGEYSCIAT NAYGYAMLNY TVYVWVAPFL EPPLLETKNV LLGSNVTLQC
SAIGFPVPII LWQFNNTLLA ENTTDLSFNE IGQMNITNVN YKHEGHYECV AENLAGFAAK
TIILNINEPP KIMDDNYTGP YVATVLDTFL ALACKATGKP TPYVVWIKDG YYLDKDSRYD
IDMEGTLTIK SPSEDLSGDY TCLAKNILGS ATRTVPVQIY AVPSVMVSEE SQSVMHVVEG
SNATIECPLR RGLNDAIKWY KDALLISNTS LHLTPVRRAH DARYACVVSN MAGGAHATLI
LDVQWPPHTN TYTQLLEISK GNNIELNCDV DAKPTAKTKW LFNSKLLLGE DKSHLKLTNI
QLRQTGVYKC VAANEHGTVV KEFIVDVLVP PFISDFDVLD VQLKEGTNAS LECNARGSPK
PDIKWSFNNT SWLIKNSTII NTNITTKSEG TYKCEARNKA GVTYLVYRVN VVKIAKIEDI
IVFNDAIGTN VIDILEVVVD SRIRIACKGT GKPTPYIQWI RHGNTLANNT PNINYADFII
HKVATSDAGL YVCVASNEGG VDERKIKLDV LEPPKIFQNL FQETSSTNII NLEVLSGQAF
YMHCHPYGNP LPEIYWFKDN IPLKLFDNTM IYTDYSEIIQ SPNALYEQSG NYTCVAKNKV
GVTSLIYLVD VLVPPPQPKE STKEVRTRIG KALNLTCPAE GAPKPYVTWI KHPYSEITQL
TPRVHLTEDN VTLIINVTLV ADSGIYSCIM TNKVGAIEVN FNVIIEKPSS IVGNVGNDTM
ENHVVSLRRS IVLKCDVDGH PPPKITWLKD IQRVSEEVHI QRVLGSSLLA VWSAAVRDAG
QYICVAENSA GIASRRYNLA VKVPGKWSAW SQWSYCNVTC GLGYQKRSRF CHYIDDNNTT
IDKTSMSDKI ILDESACKGS TIDRRKCRMP SCEEEESGWS SWSRWGACSA TCGAGTQARA
RRCRAHAQYQ CTGDNVQIRK CPNLSKCSPQ SRYNTNEVYS SQETNETDMS SYLPETVIEM
QPDDVDFRRS LDTEEIYVTT GKKGQMYFDV NVTENLDRSE RGPCNAGFTY ITDDDTCQDV
DECTIESNQC HATQLCTNTA GGYRCSCPAG YVALAAGLRC LDINECEQEV HGCEFACVNV
AGGYVCACPR HLRLHVDRHH CVLPPLYQKP LSFNENSVSD DFLNSDVDFP ASYTKYNRY
//