ID A0A3P8W2G2_CYNSE Unreviewed; 1651 AA.
AC A0A3P8W2G2;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Vitellogenin-2-like {ECO:0000313|Ensembl:ENSCSEP00000019771.1};
OS Cynoglossus semilaevis (Tongue sole).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Carangaria; Pleuronectiformes; Pleuronectoidei; Cynoglossidae;
OC Cynoglossinae; Cynoglossus.
OX NCBI_TaxID=244447 {ECO:0000313|Ensembl:ENSCSEP00000019771.1, ECO:0000313|Proteomes:UP000265120};
RN [1] {ECO:0000313|Ensembl:ENSCSEP00000019771.1, ECO:0000313|Proteomes:UP000265120}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=24487278;
RA Chen S., Zhang G., Shao C., Huang Q., Liu G., Zhang P., Song W., An N.,
RA Chalopin D., Volff J.N., Hong Y., Li Q., Sha Z., Zhou H., Xie M., Yu Q.,
RA Liu Y., Xiang H., Wang N., Wu K., Yang C., Zhou Q., Liao X., Yang L.,
RA Hu Q., Zhang J., Meng L., Jin L., Tian Y., Lian J., Yang J., Miao G.,
RA Liu S., Liang Z., Yan F., Li Y., Sun B., Zhang H., Zhang J., Zhu Y., Du M.,
RA Zhao Y., Schartl M., Tang Q., Wang J.;
RT "Whole-genome sequence of a flatfish provides insights into ZW sex
RT chromosome evolution and adaptation to a benthic lifestyle.";
RL Nat. Genet. 46:253-260(2014).
RN [2] {ECO:0000313|Ensembl:ENSCSEP00000019771.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00557}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 244447.ENSCSEP00000019771; -.
DR Ensembl; ENSCSET00000020009.1; ENSCSEP00000019771.1; ENSCSEG00000009586.1.
DR GeneTree; ENSGT00530000064273; -.
DR InParanoid; A0A3P8W2G2; -.
DR Proteomes; UP000265120; Chromosome 2.
DR GO; GO:0005319; F:lipid transporter activity; IEA:InterPro.
DR GO; GO:0045735; F:nutrient reservoir activity; IEA:UniProtKB-KW.
DR Gene3D; 2.20.80.10; Lipovitellin-phosvitin complex, chain A, domain 4; 1.
DR Gene3D; 2.20.50.20; Lipovitellin. Chain A, domain 3; 2.
DR Gene3D; 2.20.90.10; Vitellinogen, beta-sheet shell domain; 1.
DR Gene3D; 1.25.10.20; Vitellinogen, superhelical; 1.
DR InterPro; IPR015819; Lipid_transp_b-sht_shell.
DR InterPro; IPR011030; Lipovitellin_superhlx_dom.
DR InterPro; IPR015816; Vitellinogen_b-sht_N.
DR InterPro; IPR015258; Vitellinogen_b-sht_shell.
DR InterPro; IPR037088; Vitellinogen_b-sht_shell_sf.
DR InterPro; IPR015255; Vitellinogen_open_b-sht.
DR InterPro; IPR015817; Vitellinogen_open_b-sht_sub1.
DR InterPro; IPR001747; Vitellogenin_N.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR23345; VITELLOGENIN-RELATED; 1.
DR PANTHER; PTHR23345:SF9; VTG7 PROTEIN-RELATED; 1.
DR Pfam; PF09175; Vit_b-sht_shell; 1.
DR Pfam; PF09172; Vit_open_b-sht; 1.
DR Pfam; PF01347; Vitellogenin_N; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM01169; DUF1943; 1.
DR SMART; SM01170; DUF1944; 1.
DR SMART; SM00638; LPD_N; 1.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF48431; Lipovitellin-phosvitin complex, superhelical domain; 1.
DR PROSITE; PS51211; VITELLOGENIN; 1.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00557}; Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000265120};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Storage protein {ECO:0000256|ARBA:ARBA00022761}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..1651
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018204375"
FT DOMAIN 20..646
FT /note="Vitellogenin"
FT /evidence="ECO:0000259|PROSITE:PS51211"
FT DOMAIN 1385..1559
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT REGION 1067..1164
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1067..1103
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1115..1164
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 158..184
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00557"
FT DISULFID 200..203
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00557"
SQ SEQUENCE 1651 AA; 181153 MW; 906DDBF5F8B4EC12 CRC64;
MKVLVLALAV ALAGKTSPEF AVGKTYVYKY EAVLMGGLPE EGLARAGLKI LSEVMIHPVT
ADTFVLKLEK PELFEYSGIW PKDAFIPATK LTSALAAQLL TPIKFEYANG VVGKVFAPAG
ISTTVLNIYR GLINMFQLNI KKTQNVYELQ EPGTQGVCKT HYVISEDTKA ERIHLSKTKD
LNHCQERIIK DIGLAYVERC PECERGKILK GAAAFNYVMK PMPTGALILE ATTTELIQFS
PLNIMNGAAQ MEAKQVLKFV EILKTPVEPI GADYLDRGSL QYEFGSELLQ NPIQLLRISN
AEAQIVEILN HLVTNNEAKV HEDAPLKFIE LIQLLRVARF ETIEALWNQV KAKPDHRHWI
LNAVPAIGTH EALRFIKEKF LAGEVTIAEA AQALLAAVHM VTADMEAIKL AEGLVLNHKI
QENPVLLEIV WLANHACPAK LVRPIHELAF EALNKGNVEE ITIALKVLGN AGHPASLKPI
MKFLPGFGSA AANLPHRVHN DAILAMRNIA KKEPKMIQEI AVQLFMDKAL PAEHRMVAAI
VLFETKVPMA LVTTLANALL KESNMQIASF VYSYMKALTR TTAPDLASVA AACNVAVKIL
SPKFDRLSYR FSRALYIDAY HSPSLIGAAA TAFYINDAAT VLPKAMLAKV RTYLAGAYAD
VLEVGVRTEG IQEALLKMEE APANTDRLAK MRQVIRVLSE WRAHPNRPPL ASVNVKLFGQ
EIAFANVDKA IVDQLIEMVS GPSVHRFGRR ALETMLSGFA LHYAKPMLLA EVRRIIPTAV
GLPMELSFYT AAVAAASVEF HASVNPPLPA DFHATQLLKS DINMRAAIAP CVSMHTYAVM
GVNTALAQAS LLSRARVHTI VPAKIEARID MIKGNFKLQL LPVKGIDKIA SALIETVAVA
RNVEDLAAAR ITPVIPAHGV LHLSHHTSSH EASVSTEISA SSEILSALPR HVVNKMSIPK
AYEKKMCAAI ATFGIKACTE IESQNAAFIR DCPLYAIIGR HAVMVEVAPA EGPVIEKIEI
EIQVGDKAAE KIVRVINMSE EEEVVEDKNV LMRLRKILIP GLKNATAVSS SSSSSSRASK
SSSVSSASSS SSRSSSHSSQ RHSKVVDAAA PISKRSKRVS SSSSSSHSSR LSKISSSSRS
SRSSRSSRVA MSKNSLTSSS PRTTSISSAY SFEAIYKKAK YLTNTVAPAV TVLIRAVRAD
HKVQGYQIAA YLDRANARLQ VIFANLAEND HFRICADGVM LSTHKVMGKI AWGLECKQYE
TEITAETGMV DQLPALRLKL TWDKLPRSMK PYARDISDYI SRIAHEFGVK MAKNKNIPKQ
IKLTVAVASE HSVNVVLKTP TRTAYKLALG LPVSLPIGNT EAELQIYQDN WVDKIMHLIS
KANAAECSLV KDTIITFNKR KIRTQLPHSC YQVLAQDCTP ELKFIVILKR DQMQEKNQIN
LKIANMYDVV FLYLNLLMVN GVEVPLSNLP YHHPTAKIQI LQKEEGIALH ASGHGLQEVF
FSPTELKVKV ADWMRGQTCG ICGNADGEIR QEFVTPNQRV TKNAASFAYS WVLSGKSCRD
ATECHMKLES VKLEKQIMVH GQESKCYSVE PVLRCLPGCM PVRTTTVSVG YHCLPASMNR
PEKLSIISQK SIDVHETAEA HLACRCTPQC A
//