ID A0A226NE15_CALSU Unreviewed; 1364 AA.
AC A0A226NE15;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Protein turtle homolog B {ECO:0008006|Google:ProtNLM};
GN ORFNames=ASZ78_004452 {ECO:0000313|EMBL:OXB65776.1};
OS Callipepla squamata (Scaled quail).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Odontophoridae;
OC Callipepla.
OX NCBI_TaxID=9009 {ECO:0000313|EMBL:OXB65776.1, ECO:0000313|Proteomes:UP000198323};
RN [1] {ECO:0000313|EMBL:OXB65776.1, ECO:0000313|Proteomes:UP000198323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texas {ECO:0000313|EMBL:OXB65776.1,
RC ECO:0000313|Proteomes:UP000198323};
RC TISSUE=Leg muscle {ECO:0000313|EMBL:OXB65776.1};
RA Oldeschulte D.L., Halley Y.A., Bhattarai E.K., Brashear W.A., Hill J.,
RA Metz R.P., Johnson C.D., Rollins D., Peterson M.J., Bickhart D.M.,
RA Decker J.E., Seabury C.M.;
RT "Disparate Historic Effective Population Sizes Predicted by Modern Levels
RT of Genome Diversity for the Scaled Quail (Callipepla squamata) and the
RT Northern Bobwhite (Colinus virginianus): Inferences from First and Second
RT Generation Draft Genome Assemblies for Sympatric New World Quail.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXB65776.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFN01000082; OXB65776.1; -; Genomic_DNA.
DR STRING; 9009.A0A226NE15; -.
DR Proteomes; UP000198323; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 7.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR013106; Ig_V-set.
DR PANTHER; PTHR44170; PROTEIN SIDEKICK; 1.
DR PANTHER; PTHR44170:SF50; PROTEIN TURTLE HOMOLOG B ISOFORM X1; 1.
DR Pfam; PF13927; Ig_3; 2.
DR Pfam; PF07686; V-set; 1.
DR SMART; SM00060; FN3; 3.
DR SMART; SM00409; IG; 4.
DR SMART; SM00408; IGc2; 4.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 4.
DR PROSITE; PS50853; FN3; 2.
DR PROSITE; PS50835; IG_LIKE; 4.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000198323};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..1364
FT /note="Protein turtle homolog B"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012556402"
FT TRANSMEM 684..709
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 38..115
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 139..226
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 228..320
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 324..465
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 470..565
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 575..669
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 719..775
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 944..1081
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1129..1234
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1269..1301
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 732..765
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 968..984
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1006..1020
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1058..1079
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1141..1202
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1285..1301
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1364 AA; 148889 MW; B2102046953A2407 CRC64;
MIWYVAALVA SVLGARGLSV QGALGSREEP EFVTARAGES VILGCDVIHP LTGQPPPYVV
EWFKFGVPIP IFIKFGFYPP HVDPGYAGRA SLHDKASLRI EQVRSEDQGW YECKVLMLDQ
QYDTFHNGSW VHLTVNAPPT FTETPPQYVE AKEGSSITLT CMAFGNPKPI VTWLREGELL
AAGPKYQVSD GSLTVLSVSR EDRGAYTCRA YSIQGEAVHT TRLLVQGPPF IVSPPENITV
NISQDALFTC QAEAYPGNLT YLWYWEEENV YFKNDLKLRV RILIDGTLII FRVKPEDAGK
YTCIPSNSLG RSPSASAYLT VQYPARVVNM PPVIYVPIGI HGYIRCPVEA EPPVTLVKWN
KDGRPLRIEK YSGWNLLEDG SIRIEEATED ALGTYTCVPY NALGTMGQSP PARLVLKVGK
PSRSKHNTLP SGTLQIRSLG KDDHGEWECI ATNVVASITA STHLTVVGTS PHAPTSVHVV
VAMTSANVSW EPGYDGGYEQ TFSVWMKRAQ FGPHDWLSLP VPAGSSWLLV DTLEPETAYQ
FSVLAQNKLG TSSFSEVVTV NTLAFPVTTP EPLVLVTPPR CLTANRTQQG VLLSWLPPAN
HSFPIDRYIM EFRVGERWEI LDDAILGTES EFFAKDLSQD TWYEFRVLAV MQDLISEPSN
VAGVSSTDVF PQPDLTDEGL ARPVLAGIVA TICFLAAAIL FSTLAACFVN KQRKRKLKRK
KDPPLSITHC RKSLESPLSS GKVSPESIRT LRPPSESSDD QGPQAKRMLS PTKEKELSLY
KKTKRAISSK KYSVSKAEAE AEATTPIELI SRGPDGRFVM DPAEMEPSLK TRRIEGFPFV
EETDMYPEFR QSDEENDDPV VPASVTALKA QLTPLSSSQE SYLQPPAYSP RFHRALEGPS
ALQATGQARP PAPRAFHHQF YGYLSSSSPG EVDPPPFYMP EVSPLSSVMS SPPLHPEGPF
GHPTIPEENG ENASNSTLPL AQTPTGGRSP EPWGLPFGSL EASPAATFPP QLQQCEAAAE
GSQPTGCLPR GPPPSSLQVV PASYPGILPL EAPKSWTGKS PSRGQTSVPT TTTKWQDKPM
QPMGCQGQLR HTSQGMGIPV LPYHEPSEPV GHSGTSTFSL DTRCSPELTA RARPRPGLIQ
QTEVSEITLQ PPAAVSFSRK STPSTGSPAP SSRGESPSYR PATAFTSLTT GFSGSQGSSM
PADTMDVFGD IPSPRRASEE ILRPEPTSTT VATSGLANAW AASLALGWRR KRPSPSWDAA
APERLEALKY QRIKKPKKSS KGSSKSRKQS NGSASQVQHL PSSQVLWPDE AVCLRKKKRH
PRQDPFARLS ALKDDLCHRQ LPEDQTAILN SVDHDDPSGH ATLL
//