ID A0A212F6F9_DANPL Unreviewed; 1362 AA.
AC A0A212F6F9;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Hemicentin-2 {ECO:0000313|EMBL:OWR49308.1};
GN ORFNames=KGM_213249 {ECO:0000313|EMBL:OWR49308.1};
OS Danaus plexippus plexippus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Nymphalidae; Danainae; Danaini; Danaina; Danaus; Danaus.
OX NCBI_TaxID=278856 {ECO:0000313|EMBL:OWR49308.1, ECO:0000313|Proteomes:UP000007151};
RN [1] {ECO:0000313|EMBL:OWR49308.1, ECO:0000313|Proteomes:UP000007151}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR49308.1};
RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052;
RA Zhan S., Merlin C., Boore J.L., Reppert S.M.;
RT "The monarch butterfly genome yields insights into long-distance
RT migration.";
RL Cell 147:1171-1185(2011).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OWR49308.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGBW02010045; OWR49308.1; -; Genomic_DNA.
DR STRING; 278856.A0A212F6F9; -.
DR KEGG; dpl:KGM_213249; -.
DR eggNOG; KOG4475; Eukaryota.
DR InParanoid; A0A212F6F9; -.
DR Proteomes; UP000007151; Unassembled WGS sequence.
DR GO; GO:0016787; F:hydrolase activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR CDD; cd00096; Ig; 1.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 3.40.570.10; Extracellular Endonuclease, subunit A; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 7.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR001604; DNA/RNA_non-sp_Endonuclease.
DR InterPro; IPR044929; DNA/RNA_non-sp_Endonuclease_sf.
DR InterPro; IPR044925; His-Me_finger_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR13817:SF73; PROTEIN SIDEKICK HOMOLOG; 1.
DR PANTHER; PTHR13817; TITIN; 1.
DR Pfam; PF01223; Endonuclease_NS; 1.
DR Pfam; PF07679; I-set; 4.
DR Pfam; PF13927; Ig_3; 2.
DR Pfam; PF13519; VWA_2; 1.
DR SMART; SM00409; IG; 6.
DR SMART; SM00408; IGc2; 5.
DR SUPFAM; SSF54060; His-Me finger endonucleases; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 7.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50835; IG_LIKE; 6.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007151};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..1362
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011967681"
FT DOMAIN 22..197
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 411..499
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 598..680
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 687..770
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 774..864
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 872..965
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 970..1048
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
SQ SEQUENCE 1362 AA; 153567 MW; BF7651D2CCCBC463 CRC64;
MLLLFISMVL FGFTNGHNAK SSLTFVIDDT GSMWNDIDQV KEKTNEVFDA VLNSNASKID
DFVLVTFNDP DAKVCTVTRD RKEFKKALSD ITVDGGGDCP EYSMKGIQLA LEHSKPNSLF
YVFTDAASKD YEEYEKIKSL GLKKSIQVTF LLTGECTNTP EEAFTVYDKL AETTSGQVFH
LDKQDVSKII DYIIATIKNK KTTVAQKTFY NGYGNEFKFS IDSKLWDVMI SVSADDPRFH
LNGPDGESVD VKEFISTKKS SISKLDVKPG IYTMVLDNIG QTSVVITGST YVCFQHGFST
VMPSTLNETS TKPIEDTPSY LAIELDNVNR DVILDTVEIR DINDNILSAY PLDLLNKDSQ
FYVTKPILTP DSTFKIAING HTSTEEKITR IAPTAIEHQK PDLEGPKRKA PMVTILEGSI
TTVEYDSNLS LKCKVHAFPK PDIVWKDDSG MIWPSKVVPV DLPYDYMGIL DKDKINKNIT
LYCTAKNEIG EDKKSILVET KRNYFLEILE SPKDLVIEYG SSSVLNFKVN AYPAATIGWY
KKRKELFNDD DYEISADGST LKIKYMHQSL RGFYSVKAMN EEEKKIIYFK IDMSGEKPEI
DKTVSSYRIE KGSSANLTCR ILKGKPEPEI SWTFQNESPG SLKRLDVVGD LYIDKVGPEN
MGIYTCKARN EFGKDRHDID LFVGYVPTIK NVQTEVLVPE GQQVILTCIV DGSPYPFVRW
LLNDVEVTRT GKYSFNDNRL SFTGSIDDSG IYTCEASNSL GRTQKDYDVD IYIPVKMQVP
KDTTLKLDVG SSTTLPCVAE GYPKPNIRWT YYSKNPSIRP KTLKFDDTGS YNLEHIQIED
EGFYRCSASN VGGLSSVTYE VFVRAPVSIT NPDGVVFNAV KGDLALRIPC NAIGSPKPKV
TWMANGEHIA SGTDWYDIED DGTLNGELIA SDELYLSHVK FEDAGIYSCR VSTFLSAHTA
HKKVTVGYKP RFLSDEETVI EYSEGDFSYM DCNADGYPKP STQWIRNGDP VPINGSYLII
EMKLEDIGYY QCTVSNDLGS IRRTFKINSG ECLLRTKHDF NDQQPLLLTL SRDWPEFRTS
NEYVHIPIYK YFLLSCPGSS VYDVCLDHEQ KIPLFAKQTS NKGIALNAPP GDYTFVESKY
LPFHFGDMYD CDSQLRFISS SIGKSIKPVK DVECCFTKRQ LINPRDVLPG LSQVAVYSYL
NVIPHWSTCG TKNWDELELR VRYLGKYSSN ELTIFTGASD PMMLPGQTED AYVSLRDRLN
RRQPVPMYLW KIIQNPADNS SLAVIQLNIP NVTSAEAYSY MPCNDICPEV EWLRNNDWQD
VNKGFTFCCS ISDFNSRFGK LFDGCEKVFK TLPPLLPDFS LI
//