ID A0A4D9E704_9SAUR Unreviewed; 1303 AA.
AC A0A4D9E704;
DT 03-JUL-2019, integrated into UniProtKB/TrEMBL.
DT 03-JUL-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Transcription factor SOX-18 {ECO:0000313|EMBL:TFK06136.1};
GN ORFNames=DR999_PMT11156 {ECO:0000313|EMBL:TFK06136.1};
OS Platysternon megacephalum (big-headed turtle).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC Testudinoidea; Platysternidae; Platysternon.
OX NCBI_TaxID=55544 {ECO:0000313|EMBL:TFK06136.1, ECO:0000313|Proteomes:UP000297703};
RN [1] {ECO:0000313|EMBL:TFK06136.1, ECO:0000313|Proteomes:UP000297703}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DO16091913 {ECO:0000313|EMBL:TFK06136.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:TFK06136.1};
RA Gong S.;
RT "Draft genome of the big-headed turtle Platysternon megacephalum.";
RL Submitted (APR-2019) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:TFK06136.1, ECO:0000313|Proteomes:UP000297703}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DO16091913 {ECO:0000313|EMBL:TFK06136.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:TFK06136.1};
RA Gong S.;
RT "The genome sequence of big-headed turtle.";
RL Submitted (APR-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004141}; Multi-
CC pass membrane protein {ECO:0000256|ARBA:ARBA00004141}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TFK06136.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QXTE01000101; TFK06136.1; -; Genomic_DNA.
DR STRING; 55544.A0A4D9E704; -.
DR Proteomes; UP000297703; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0004930; F:G protein-coupled receptor activity; IEA:UniProtKB-KW.
DR GO; GO:0007166; P:cell surface receptor signaling pathway; IEA:InterPro.
DR CDD; cd16000; 7tmB2_GPR123; 1.
DR Gene3D; 2.60.220.50; -; 1.
DR Gene3D; 4.10.1240.10; GPCR, family 2, extracellular hormone receptor domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 1.20.1070.10; Rhodopsin 7-helix transmembrane proteins; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 2.
DR InterPro; IPR000483; Cys-rich_flank_reg_C.
DR InterPro; IPR046338; GAIN_dom_sf.
DR InterPro; IPR017981; GPCR_2-like_7TM.
DR InterPro; IPR036445; GPCR_2_extracell_dom_sf.
DR InterPro; IPR001879; GPCR_2_extracellular_dom.
DR InterPro; IPR000832; GPCR_2_secretin-like.
DR InterPro; IPR017983; GPCR_2_secretin-like_CS.
DR InterPro; IPR000203; GPS.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR003591; Leu-rich_rpt_typical-subtyp.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR45930:SF3; ADHESION G PROTEIN-COUPLED RECEPTOR A1; 1.
DR PANTHER; PTHR45930; G-PROTEIN COUPLED RECEPTOR 124-LIKE PROTEIN; 1.
DR Pfam; PF00002; 7tm_2; 1.
DR Pfam; PF01825; GPS; 1.
DR Pfam; PF07679; I-set; 1.
DR Pfam; PF13855; LRR_8; 1.
DR SMART; SM00303; GPS; 1.
DR SMART; SM00369; LRR_TYP; 4.
DR SMART; SM00082; LRRCT; 1.
DR SUPFAM; SSF111418; Hormone receptor domain; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR SUPFAM; SSF52058; L domain-like; 1.
DR PROSITE; PS00650; G_PROTEIN_RECEP_F2_2; 1.
DR PROSITE; PS50227; G_PROTEIN_RECEP_F2_3; 1.
DR PROSITE; PS50261; G_PROTEIN_RECEP_F2_4; 1.
DR PROSITE; PS50221; GPS; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS51450; LRR; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW G-protein coupled receptor {ECO:0000256|ARBA:ARBA00023040};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Leucine-rich repeat {ECO:0000256|ARBA:ARBA00022614};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000297703};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transducer {ECO:0000256|ARBA:ARBA00023224};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1303
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5020034349"
FT TRANSMEM 751..772
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 784..804
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 810..832
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 866..886
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 906..929
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 992..1013
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1019..1038
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 231..329
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 312..406
FT /note="G-protein coupled receptors family 2 profile 1"
FT /evidence="ECO:0000259|PROSITE:PS50227"
FT DOMAIN 748..1044
FT /note="G-protein coupled receptors family 2 profile 2"
FT /evidence="ECO:0000259|PROSITE:PS50261"
FT REGION 1187..1210
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1303 AA; 145016 MW; A588604EF8344ADE CRC64;
MHGLCRCLCL AAALAGLSWA SRNCPDLIVD SCLCTAERSK GPGRQTVRIK VVCSGGELVE
TLQPSLLPSR TVSLILSNNK ILWLKNGSFF GLRSLERLDL KNNLISTIEP GAFHGLSELK
RLDLSNNRIG CLTPEIFVGL NNLHKLNLSG NIFSSLMNGL FSELLALKAL HFNTDSLICD
CNLKWVLQWA RNASVRIAEE TVCAYPSTLR GLAFHNLKEN QLICAGPLEL PLFELIPSQR
QVVFHGDRLP FQCTATYVDN TTQVHWYHDG HLIETDEESG IFVEDSIIHD CCLITRELIL
SSIDTGATGD WECLINNSYG TSTKHVEIVV LETAAPYCPE ERIINNKGDF RWPKTLAGIT
AYHPCLQYSF SSISLHNGAE ESKAWRKCNQ TGRWADENYS ECPYSQEVTQ VLHAFSQMHV
NATNALEFSR QLTAYTRGAS NFADKMDIIY LAYIMEKLIV IVDEVKDLGD AIVEIASNIM
LVDDHVLWMA QNENKACTRI VQCVAHITNQ TLTSHTQVIS KISLNIALEA FMIKPSSFIG
MTCTAFQKIS ENSDRSLIHD VGSWEADYRP DQHLNFKCND GSLSGSFMNF STKNAVAVAS
VHLPQSVFSQ SSALQSVDNS TCKLQFIVFR NGKLFPSTGN SSNLADDGKR RTVTTPAVFA
KIDGCSFGNL TTPLTIGLRH FARGIDPIAA FWDFDLLDGH GGWCGEGCHI ISSAGNITII
QSTHFSNFAV LMDLKTVLSF PQYPGEFLHP VVYACTAVML LCLFASIITY IVHHSTIRIS
RKGWHMLLNF CFHTALTFAV FAGGINRIKY PIICQAVGIV LHYSTLSTML WIGVTARNIY
KQVTKKAQPC QNSDQPSYPK QPLLRFYLIS GGVPFIICGI TAATNINNYG IEGNAPYCWM
AWEPSLGAFY GPVAFIVLVT CIYFLCTYVQ LKRHPERKYE LKERTEDQQR LASTEVGHSH
ITDSGSVSQT TCSMISSSLL ENEHSFKAQL RAAAFTLFLF TATWTFGALA VSQGHFLDMI
FSCLYGAFCV TLGLFILIHH CAKRDDVWHC WWSCCPSRRS TYSVQVNVRP KVNVNGDTQV
HAPCLQESPC PNKSAVFNHP AANHCKLTNL QAVQNHVNCL SPVTPCCAKM HCDQLLDDEA
HIHVHNEGTF RPNMHIHRCL KSRTKPRYFS RHRSAGEREY AYHIPSSIDG SIHSSHTDSP
HSTHDSQSGH RRACCVKSDA YPTINQPESS DASTVIYSCA KIPESDTVHH TAHFEMHPRT
QSLPFNTTNH NGILKGNVHE AMIYSSDSTG NIKTGPWKNE TTV
//