ID A0A2R2MLY5_LINUN Unreviewed; 4177 AA.
AC A0A2R2MLY5;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Uncharacterized protein LOC106157441 isoform X16 {ECO:0000313|RefSeq:XP_023931243.1};
GN Name=LOC106157441 {ECO:0000313|RefSeq:XP_023931243.1};
OS Lingula unguis.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Brachiopoda; Linguliformea;
OC Lingulata; Lingulida; Linguloidea; Lingulidae; Lingula.
OX NCBI_TaxID=7574 {ECO:0000313|Proteomes:UP000085678, ECO:0000313|RefSeq:XP_023931243.1};
RN [1] {ECO:0000313|RefSeq:XP_023931243.1}
RP IDENTIFICATION.
RC TISSUE=Gonads {ECO:0000313|RefSeq:XP_023931243.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_023931243.1; XM_024075475.1.
DR EnsemblMetazoa; XM_024075475.1; XP_023931243.1; LOC106157441.
DR Proteomes; UP000085678; Unplaced.
DR CDD; cd00063; FN3; 8.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 48.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR PANTHER; PTHR22906:SF43; COMPLEMENT FACTOR PROPERDIN-RELATED; 1.
DR PANTHER; PTHR22906; PROPERDIN; 1.
DR Pfam; PF00041; fn3; 2.
DR Pfam; PF13385; Laminin_G_3; 1.
DR Pfam; PF00090; TSP_1; 37.
DR SMART; SM00060; FN3; 12.
DR SMART; SM00209; TSP1; 48.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 47.
DR PROSITE; PS50853; FN3; 9.
DR PROSITE; PS50092; TSP1; 48.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000085678};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..4177
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5015121893"
FT DOMAIN 162..251
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 252..341
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 342..431
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 521..609
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 610..698
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 699..793
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 796..890
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2639..2731
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2736..2828
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 3538..3573
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3711..3745
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3866..3896
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3538..3571
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3714..3733
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4177 AA; 458122 MW; C65B16E897AC8AB0 CRC64;
MHFSALRQTL ALAVVAVVVQ QTRGEAFHEC KFWNNYIRDT TNKCQYYQCV PSTYTNGNWQ
FKATLMKCAY PTSVPEDYDD PRESAFDAVY TNPCSGTNFG SCEISGTGPE PRGWTQWGTW
SACSVTCGSG SRSRVRSCPP GSNCAGASIE EDACNSGPCV VAPGKPTTFV NVLGTSMNVT
WIPADSRATA YYVQYRQPGQ FAWINSREEM FKNWILIPNL FLGAYEVRVV AKGVGGQETP
SDTKTVQILG SVPGKPMATI NVVGSSMNVS WIPADNLATS YYVEYRRTGD FNWQRSREER
FNTWLWLQNL PQGIYEVRVV AKNAQGQETR SDIYPVEIKG AAPGKPTVYL QAVNNGFNVS
WTAADNLASG FYAQYRRAGA FQWMSTREER RNSWIWVQNL GYGTYEVRVV ARNNGGQETI
SDVSRVELKQ AGPSAPVVYF NQQGNGVNVS WTADPMGTIY YVQYRRPGSF VWQRSEEEMF
NNWIFLRNLQ PGTYEMRVMV RGVNGQEVPS QVYTYEIQQQ GPSAPMVYFT QQGNGVNVSW
TADPMGTVYY VQYRRPGSFV WQKSQDVMTN NWIFLSNLQP GTYEMRVMVR GANGQEVPSQ
VYTYEIRQQG PSAPMVYFTQ QGNGVNVSWT ADPMGTVYYV QYRRPGSFVW QKSQEVMNNN
YIFLSNLQPG TYEMRVMVRG ANGQEVPSQV YTYEIQRQAP AKPTQVFVTK YGQSGMNISW
FQPQGDGVLT YYIQYRRSGD VQWMSSPEER GRNWIIVQNL PMGVRYDTRV VVRNAAGQET
YSDIYYMDMT SGGGDAPIKP TNVYITYTMS TMNVTWSYSP QQTNLVYYVQ YRPVGTTNWV
NSPQVMGDQR FIYISGLQPG TSYETRLVAM NMVTGAETAS DPQRVDIMIG GVVDLREWNA
WTACSAQCNG GTQFRYRDCF GVGCMGIQLR EQRPCNTQAC TSQTIGQWEA WTSCTAECGG
GIRYRIRTCT GPCTAVQRFE IQTCNTQHCA GDVTYGQWNA WEACSVTCGV GRRERHRSCT
GTGCALAETG QIQSCYFREC ASMDLHEWGV WTTCTKPCDG GVQYRYRECF GMGCDKVPQR
EDKPCNTAPC VNVQAWGEWG VCSKECDGGF QTRARTCTGL GCDMVQKTEQ RPCNTQSCMD
LHDWGAWGMC SAQCNGGRQF RFRECYGTGC NSAKLQEEKP CNTQPCLDIS QWGQWGTCSV
TCGGGVKTRA RQCFGVGCGG VLLTEQMPCN PNPCSTTSQW TQWGPCSVTC GAGEQARTRT
CTGAACAGQP LRETRICQLQ PCTDLTQWTA WGPCSVTCGD GRQFRTRSCT GPGCVGSSLT
ESRFCSMPQC RALSQWTQWG TCSVTCGDGT QKRMRTCTGA GCQGVSLEET RSCSQQACTT
LSQWTQWGSC SVTCGDGLQF RTRTCTGNGC QGYNLEERRV CRQQACTTLS QWTQWGSCTV
TCGDGLQFRT RTCTGTGCQS YNLEERRVCQ QQACTTLSQW TQWGSCSVTC GDGLQFRTRT
CTGNGCQGYN LEESRVCRQQ ACTTLSQWTQ WGSCTVTCGD GLQFRTRTCT GNGCQGYNLE
ESRVCRQQAC TTLSQWTQWG SCTVTCGDGL QFRSRTCTGN GCQSYNLEES RVCQRQACTT
LSQWTQWGSC SVTCGDGLQF RTRTCTGNGC QGYNLEESRV CRQQACTTLS QWTQWGSCTV
TCGNGLQFRS RTCTGDGCRS YNLEESRVCQ RQACTTLSQW TQWGSCTVTC GDGLQFRTRT
CTGNGCQGYN LEESRVCRQQ ACTTLSQWTQ WGSCSVTCGD GLQFRTRTCT GNGCQGYNLE
ESRVCQRQAC TTLSQWTQWG SCTVTCGDGL QFRTRTCTGN GCQGYNLEES RVCQRQACTT
LSQWTQWGSC TVTCGDGLQF RSRTCTGNGC QGYNLEESRV CRQQACTTLS QWTQWGSCSV
TCGDGLQFRT RTCTGNGCQG YNLEESRVCR QQACITLSQW TQWGSCTVTC GDGLQFRSRT
CTGNGCQSYN LEESRVCQRQ ACINLSQWSA WGTCSVTCGE GMEMRTRTCV GSNCANQQTQ
ETRPCTRPAC QVLGDWGQWT TCSATCGGGM QFRNRPCSGN ACQFLRTRES RSCNTDQCVV
QAQVSQWTQW GQCSAQCGTG QQERTRQCMA GDCTGMMLRE SQNCNTQPCV QYGQWTQWGT
CSVTCGSGVQ RRTRTCSGLQ CNLNPGALEE TRACTLQDCV SVSLSEWGQW GACNAECDGG
VQIRNRACFG NGCQGQRLQE TRSCNQQPCV SLSQWGQWSV CPVTCNGGTQ VRSRACFGSG
CADVQLQESR PCGQEPCVSE PQVTEWGQWT QCTATCNGGI QFRSRACFGA ACVGVQLQES
QACNQQPCVS VTEWSNWGQC SAECDGGLQV RQRTCTGGAV NCQNVQLQES RACNSQPCVT
LTQWSEWGTC SVTCGQGLQF RNRQCTGSRC AAVIKQESRP CTLQQCPSTN TGPVKPSVFV
SPLSTGQGVN VTWVPQTPNM GYTYVLQYRT PGAFSWTSGG METFRGWMVV PNLPLGTYEM
RVIVTAPNGQ ETVSDIIRAE VANQGPVLPN LIFSQSPTGL NVSWVPTDSG HNHIVQYKRS
TEFTWTTAGE ERFRNYLIIP NLTPGSYEFR VIVRNNLGVE TVTDVYRVEV RETSGGQTGP
ALPNFNIQQT TTGTGLNISW VPVPGLPRNT YIVQYRRPGE LTWTSGGQES FNNFIIIPSL
PTGTYEFRII VRNNLGQETI SQIFRVEVRQ ASGTTGPQMP NFNVASTGVG TGLNVSWVPI
PGAFGNTYIA QFRQPGAPTW TVGGQDTTNN YIIIPNLPVG TYEFRMIVRN NLGQETVSQI
FRVQVQAAGD TLGEWSPWGV FEMASGRFIP DQCTRPCGGG KQFRGRQCMG TEAFCRSQQL
EEEQDCNTQP CTNSTSGDSL GEWGPWGVVE MATGRFIEGE CTRPCDGGVQ FRGRPCTGSQ
SFCMGQQLEE VRDCNTQPCG PGNGTNTGGD SLGEWTPWGV VERTSGRFIE GECTRPCDGG
VQFRGRQCRG SQSFCMSQQL EEVRDCNTQP CGPGNGTNTG GDSLGEWTPW GVVEMASGRF
IEGECTRPCD GGVQFRGRQC TGSQSFCMSQ QLEEVRDCNT QPCGPGNGTN TGGDSLGEWT
PWGVVERTSG RFIEGECTRP CDGGVQFRGR QCRGSQSFCM SQQLEEVRDC NTQPCGPGNG
TNTGGDNLGE WTPWGVVEMA SGRFIEGECT RPCDGGVQFR GRQCTGSQSF CTSQQLEEVR
DCNTQPCGPG NGTSTGGDSL GEWTPWGVME MASGRFIEGE CTRPCDGGVQ FRGRQCTGSQ
SFCMSQQLEE VRDCNTQPCG PGNGTNTGGD SLGEWTPWGV VERTSGRFIE GECTRPCDGG
VQFRGRQCTG SQSFCMSQQL EEVRDCNTQP CGPGNGTNTG GDSLGEWTPW GVVEMASGRF
IEGECTRPCD GGVQFRGRQC RGSQSFCMSQ QLEEVRDCNT QPCGPGNGTN TGGDSLGEWT
PWGVVEMSSG RFIEGECTRP CDGGVQFRGR QCRGSQSFCM SQQLEEVRDC NTQPCGSSSG
TAGVTGPTGA GPLPTTNTGA AGGSGTSSPS IPGLGEWGPW GMVDTSTGQF VEDMCSRPCD
GGLQLRSRRC DQGNCQGVEL EEIRDCNTQP CSGSIGATGP VGQAGGSGTG STGGDNFQLG
EWSPWGVVEL ATGNFIPGQC TRTCGGGAQF RGRQCTGAGC NTVDPRQLEE SRECNTEPCQ
GTGGSTGPSG GAGGSTGPTG GDVTLGQWGE WGVVELATNN FIPNQCTTPC GGGIQFRRRE
CTGAPLACQA QQVEEVRTCN QQPCTDGSGT GGGAGGNYEW GPWGPWGVVE LSSGEFIVGE
CTLPCGGGFQ FRTRNCSRGE CPPPFPPDAS QSQRCNEDPC EGGQGGPAPG PLPGPQVPPQ
ISGFSQWTDW SPCPVQCGGG NHTRSRTCAS GDPSTCQGDL VQGKPCNTIK CNSIVSANMV
DAHIALNGQD SGFSFFCWIN QEFSNQCTFN SPGGTANAYY MDGNDDKSEV YIREDFTKND
FTLSIWARPV QVDDRILLAY WDWQFAWRQK FLFGIKDSQV EMKVRNEMEE DLYDLRGGQV
RAGMWNHIVV TYQKATNTMN IYLNGVRVGS QTITDPVKNK EIMPMYMDRF APDVFQLGWM
PNPQNERAYN GHMKDLLVLN RSLSDNEVRA LFQENLW
//