ID A0A3P9P691_POERE Unreviewed; 4080 AA.
AC A0A3P9P691;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Hemicentin 2 {ECO:0000313|Ensembl:ENSPREP00000017309.1};
OS Poecilia reticulata (Guppy) (Acanthophacelus reticulatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=8081 {ECO:0000313|Ensembl:ENSPREP00000017309.1, ECO:0000313|Proteomes:UP000242638};
RN [1] {ECO:0000313|Proteomes:UP000242638}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Guanapo {ECO:0000313|Proteomes:UP000242638};
RA Kuenstner A., Dreyer C.;
RT "The genomic landscape of the Guanapo guppy.";
RL Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPREP00000017309.1}
RP IDENTIFICATION.
RC STRAIN=Guanapo {ECO:0000313|Ensembl:ENSPREP00000017309.1};
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8081.ENSPREP00000017309; -.
DR Ensembl; ENSPRET00000017493.1; ENSPREP00000017309.1; ENSPREG00000011532.1.
DR GeneTree; ENSGT00940000164697; -.
DR OMA; RIYRVQP; -.
DR Proteomes; UP000242638; Unassembled WGS sequence.
DR Bgee; ENSPREG00000011532; Expressed in caudal fin.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0035122; P:embryonic medial fin morphogenesis; IEA:Ensembl.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:Ensembl.
DR GO; GO:0090497; P:mesenchymal cell migration; IEA:Ensembl.
DR GO; GO:0043589; P:skin morphogenesis; IEA:Ensembl.
DR CDD; cd00054; EGF_CA; 8.
DR CDD; cd00096; Ig; 7.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 2.40.155.10; Green fluorescent protein; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 32.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR006605; G2_nidogen/fibulin_G2F.
DR InterPro; IPR009017; GFP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR013106; Ig_V-set.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR45080; CONTACTIN 5; 1.
DR PANTHER; PTHR45080:SF8; WRAPPER_REGA-1_KLINGON HOMOLOG; 1.
DR Pfam; PF12662; cEGF; 1.
DR Pfam; PF07645; EGF_CA; 6.
DR Pfam; PF07474; G2F; 1.
DR Pfam; PF07679; I-set; 22.
DR Pfam; PF13927; Ig_3; 8.
DR Pfam; PF00090; TSP_1; 1.
DR PRINTS; PR00907; THRMBOMODULN.
DR SMART; SM00181; EGF; 8.
DR SMART; SM00179; EGF_CA; 8.
DR SMART; SM00682; G2F; 1.
DR SMART; SM00409; IG; 32.
DR SMART; SM00408; IGc2; 32.
DR SMART; SM00406; IGv; 11.
DR SMART; SM00209; TSP1; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF54511; GFP-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF48726; Immunoglobulin; 31.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 5.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50835; IG_LIKE; 31.
DR PROSITE; PS50993; NIDOGEN_G2; 1.
DR PROSITE; PS50092; TSP1; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Membrane {ECO:0000256|ARBA:ARBA00022989};
KW Reference proteome {ECO:0000313|Proteomes:UP000242638};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022989};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..4080
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018313914"
FT DOMAIN 424..510
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 512..594
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 604..682
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 686..769
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 775..932
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 934..1020
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1024..1108
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1119..1198
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1220..1279
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1283..1371
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1375..1464
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1467..1556
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1560..1653
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1656..1760
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1763..1854
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1858..1944
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1951..2035
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2038..2126
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2130..2217
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2223..2311
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2316..2404
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2408..2496
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2498..2586
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2589..2663
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2671..2756
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2762..2847
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2851..2930
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2938..3031
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 3034..3107
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 3112..3192
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 3198..3283
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 3345..3566
FT /note="Nidogen G2 beta-barrel"
FT /evidence="ECO:0000259|PROSITE:PS50993"
FT DOMAIN 3574..3613
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3657..3696
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3738..3773
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3780..3820
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3877..3913
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 3578..3588
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 3881..3891
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 4080 AA; 440777 MW; BFD30EE40E0FAB16 CRC64;
MFGSLLKCLV LALLVSVHRL LADSLVLPRF EEVDEPSDSA STLAFVFDVT GSMYDDLKQV
IEGASRILEK TLSRRTRPIK NFVLVPFHDP IGPVSITTDP KKFQQDLQEL FVQGGGDCPE
MSIGAIKKAL EVSLPGSFIY VFTDARAKDF RLKRDVLRLV QLRQSQVVFV LTGDCGDRSQ
PGYRAYEEIA ATSSGQIFHL DKQQVNEVLK WVEETVQAMK VHLLSSNHDS AQENKWEVPF
DPSLREVTVS LSGPAPQIEL SDPYQVVGAE QGLKELLNIP NSARVINLKF PRPGFWKLKV
SCSGRHTLRV TGVSSLDFRA GFSTLPVSEF NHTRERPIKH VMLKCTGLKP PGYLSQVELM
SASGRSLRTI PLSLPSDLGQ QGLWTLPEFR TPPQSFFIKA TGNDESGFRF QRLSSVSYTH
IIPPPAVRMP NAVRGFYMQP ATIGCSVQSD IPYRLRFTRS GITLGEEKHF NSAVASWEIA
HASGEDEGMY ECVAQSSAGQ GRALTQLTIQ PPPVLRPAVN VTSSVGGVAV LSCQVDGHMR
HNLTWHRAGR AVHARAGRAK LLADSTLQIS GVRPQDAGAY RCEATNAHGR SEITVWLLVP
AEAPSVEVRP QTQTFSRGDE IRFVCSASGS PPPQVFWSHG NVPLANQPGV LTIRGALPED
AGDYTCQATN EAGSVSRSAS LTYAVPRLTI TQQVVLVSVG SDATLACQAT GVPPPLVRWF
KELEVGSAPF VEQDVHSGTL QIRGVQEVDA GQYNCVASSS AGTSTGTVSL EVGGPLFSEA
PADVSANVGE NITLPCIARG FPQPTVSWRR KDVWLHDEGV YICEAKNQFG TVKAEARVSV
TGLPPVLAQA PPIITTGAGQ SLSIPCMLLD GIPLPERHWS SNGKVQANGR MFIRSDGSLY
IERTLPEDTG TYVCTAVNVA GSTNITVSLE VQPPEIRAGP YQYIANEGVA ITLACEASGV
PKPVVVWSKG RQPLPRDGSS LQPDSDGYLH IPRPASDHAG IYICTATSPV GYASREIQLS
VNLPKIMGLS GHDNIVKMAA EVGTEVVLPC EAQGSPSPLV TWSRNGHPIP PVTAFTVLPS
GSLRITDVRL IDSKLYTCTA ENPAGNVSLI YNLHIQAKPR IQPAPSILKA MLGQTVALPC
VVQGEPRPEI SWFHNGRPVG NKNSAPLRIH PVALADQGTY QCVAKNGAGQ ETLEITLEIL
PGDAILEKVV NSKVVISCSP HPRVRWFKNG LEIHPEQSEY SVARDGALVI STASASHSGD
FKCVATNEAG SVERKTRLKV NPPEIQDDSQ PLNLTVTLRQ PLTLGCDAFG IPSPLITWTK
DGHAQVDNPG VYLQNGNRLL RIYRVQQEHA GQFACIAQNS AGEARRQYNI VVQPPVISGT
SRLQELTVVQ GQEVEFQCRV SGRPAPRVEW SHDGVLSPDG DPHVEFLEDG QVLKVKSVRL
RDQGLYQCLA RNSAGTQMRQ FRLTVQPPVI KSTNETSEVS VVLGFPTVLS CNTAGSPTPS
ITWLKDNQPI VSTPQLTYTL GGQALRLGST QGDSSGIYTC RATNPAGTAT KHYSLSVLPP
QIEGDSSTST FSGQEEKVRI NGSLTLSCLA KGFPEPKVHW FKDGQLLTGS QQAGIQERGH
LLHIENAVLS HEGLYTCVVS NSAGEDKRDF HITVQPPIFQ RVTNTEAAWS LGHEGDNEDR
VQKRDIVLGH SVSLSCESNA IPPPKLSWYK DGRKLTSADG VMLQPGQVLQ IPRVQQDDGG
KYTCQAVNEA GEDHMHFELE VLPPVIMGAS EEFMEEMGAV VNSTVVLHCD ATGHPTPVVS
WLRDGQPVSA DSQHHISKDG TQLQLLSVQV SDMADYLCVA ENKVGTVEKH FNLAVQPPRI
TGPKEEEVSV IEGLMVSLLC DVQAYPPPEI TWTRDGQVLH FSTTVHILPG QMLQLPRARL
EDGGQYVCTA SNSAGQDQKS FLLSVYPSLK PRLDAETDSV TPQVGSSVVL LCDAHGVPEP
EVTWYKNGRQ LAGGNGLKIN GHQLEIIGVQ TTDGGMYTCK VSNVAGQVDR TFRLTVHPPV
LDGPQRESVS YTLGSHVALL CEATGVPVPS ITWLKDGTPI ESSLQWQWSI RGNRLELGPL
TLSHAGTYTC LVKNSEGQTK KDYTLTVLSP TILNSDQASD VSAPTGEELT LDCRANGIPT
PRLSWLKDGE PLEGSDTHHI LTSGGSTLTV RRLSPEDSGT YTCLAVSTAG QESKIYTLVV
LPPSISGETT VPREVQVTQD SVVTLECHAA GNPPPQISWL KNGRPLLLSP RTRLLSADSL
LIAPVQQSDS GVYTCVARSQ AGLAQLSYDV QVQVPPGVDR VEPVEPVTVV QGSLVTLTCE
ARGFPLPTLT WMKDGQPLSL HRNLLLDGQE TRLQLPDVTR SDEGLYSCVA SNQAGSSTKS
FNLTVLPPKI SRSSSPEELT VPVNSPLELE CSASGVPPPT ISWLKDGRPL EGAGIIQQDG
HAVRISKVQV EDAGLYACLA SSLAGEDGKS HWVRVQPPTL LGSSDVKLVT VPFNGHLTLE
CLADSDPPPD IEWYKDEAKL QLGGRIQRLA GGQYLEIQEV KPEDSGQYSC VVTNMAGSTS
LFFTVEIMPP VIKESSSLVT VHLSQDVVLP CEVEGDSLPA VIWRKDGFPV PRDNNFSLLT
EGSLRVRGAQ LNDAGRYYCT VSNPAGSDHG PSISPGPFNV TVTTGTRAVL SCETTGIPPP
KISWKRNGTP LDIHQQPGAY LLSSGSLVLL SPSDEDEGYF ECTAVNEVGE ERRVIEVILQ
PPSIEDDVTA VKAVKMSPVA LPCHVQGQPP PTVTWTKGGA KLGTRGGNYR VLPTVLEIPA
ALPSHAGRYT CSARNPAGVA HKHVSLTIQP PEIRPMAEEV QVLLNHGTVL PCEVQGFPRP
TIIWQREGVP IAAHRLAVLS NGALKFSRVT LGDAGTYQCL AKNEAGVTVA RTKLVLQPPV
LSVPRIDYTA VLGQPVSLEC AADGQPQPEV TWHKERRPVV DGPHMRIFAN GTLAIASTQR
SDAGIYTCTA KNLAGRASHD MRLVVQQTEL SVIQGFQALL PCAAQGLPEP RVLWEKDSVV
VPNLLGKFSI LRSGELIIER VEPGDAGIFT CVATNPAGSA RRDVRLSINS RPVFKELPGD
VTLNKGQSLA LSCHAQGTPP PSISWTINNK PYKAITDEAG RSSVLSENVT LNDAGTYVCL
AENSVGSIRA LSFVRIRPPV LKGEAHTSQT VVQGGIAMLD CPVHGDPSPV LRWLRNGKPL
HRLLRMQALH NGSLVIYSIT AADEGEYQCV AESEAGSAER TVTLKVQHGG YSSWGEWGPC
SSTCGQGFQE RNRLCNNPAP ANGGRTCGGP STDSRKCQTG LCPEVPRKTR GSLIGMVNDR
EFGVSFLEAN ITDDKEQGSS TLEARLDNVP PAVPLLQVLV SVFTPIYWTT VLQNGATRNG
FSFTQGQFRQ ESQLEFETEI LRLTHVARGL DSEGVLLIDI VINGFVPPSL SSSSHLGLQD
FDESYVQTGQ GQLYSWSSQT HQRAGSPMAL RCNHTIVYEG QDERQGPVLQ LLKVSGINSV
YNMFTLTLDF HITTSLLVPA CPKGFSLDTA SYCAEDECAL QSPCSHSCNN ILGGFTCACP
SGFTISAETN TCHIDECSQG SHTCHYNQQC VNTVGTYRCQ AKCGPGFKPS ATGTSCEVDE
CQESAVSPCH HKCLNTLGSY RCICHPGYQS SGHRCIDINE CMRNVCPAHQ QCRNTDGGYQ
CFDSCPAGMT TAENGACAID ECQDGSHMCR YSQICQNTVG GYGCVCPRGY RSQGVGLPCL
IDECSQTPNP CAHQCRNVPG SFRCLCPPGT LLLGDGRSCA GLERGQIFTN GTRVRARLRP
QLVSTLGRPI ISRSNGASRI TRQSCPVGYT SRNGACVVDE CLLKKPCQHE CRNTVGSFHC
LCPPGYKLLP NGRSIDECME QRIQCGHSQM CFNTRGGYQC LDTPCPTSYQ SGGRPTCFRP
CSLDCAAGGS PLLLQYKLLT LPSGIPPNHN VVRLSAFSES GVLQERTSFT ILEQESVTGV
TGRVFDIRDE AGRGIIFTRR VLDRPGLMRL KVQATTISEH GRITYQSIFI IYISISAYPY
//