ID G3VV90_SARHA Unreviewed; 1101 AA.
AC G3VV90;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=BOC cell adhesion associated, oncogene regulated {ECO:0000313|Ensembl:ENSSHAP00000007095.2};
GN Name=BOC {ECO:0000313|Ensembl:ENSSHAP00000007095.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000007095.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000007095.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000007095.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003763635.1; XM_003763587.1.
DR AlphaFoldDB; G3VV90; -.
DR STRING; 9305.ENSSHAP00000007095; -.
DR Ensembl; ENSSHAT00000007157.2; ENSSHAP00000007095.2; ENSSHAG00000006166.2.
DR GeneID; 100933034; -.
DR KEGG; shr:100933034; -.
DR CTD; 91653; -.
DR eggNOG; ENOG502QUNT; Eukaryota.
DR GeneTree; ENSGT00940000158810; -.
DR HOGENOM; CLU_008503_0_0_1; -.
DR InParanoid; G3VV90; -.
DR TreeFam; TF332268; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 3.
DR Gene3D; 2.60.40.10; Immunoglobulins; 7.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR PANTHER; PTHR44170:SF3; BROTHER OF CDO; 1.
DR PANTHER; PTHR44170; PROTEIN SIDEKICK; 1.
DR Pfam; PF00041; fn3; 3.
DR Pfam; PF07679; I-set; 2.
DR Pfam; PF13927; Ig_3; 2.
DR Pfam; PF16625; ISET-FN3_linker; 1.
DR SMART; SM00060; FN3; 3.
DR SMART; SM00409; IG; 4.
DR SMART; SM00408; IGc2; 4.
DR SUPFAM; SSF49265; Fibronectin type III; 2.
DR SUPFAM; SSF48726; Immunoglobulin; 4.
DR PROSITE; PS50853; FN3; 3.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1101
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5029493851"
FT TRANSMEM 845..868
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 32..119
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 135..211
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 231..300
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 319..403
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 464..561
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 598..693
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 702..802
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 418..441
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 555..604
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 803..834
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 963..984
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1070..1101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 574..593
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 811..830
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1101 AA; 120642 MW; 530927207B68FFBE CRC64;
MTMMSRRKRP AVIITCLILA AASCFGNLGE VPQVTVQPTS TVQKHGGPVI LGCVVEPPWV
NITWRLNGKE LNSSDDALGI LITRGTLVIT ALNNHTVGRY QCVARMPAGA VASVPATVTL
ANLQDFKFDG QHIIEVDEGN TAVIACDLPE SHPKAQVRYS VKQEWLEASR DNYLIMPSGN
LQIVNASQED EGMYKCAAYN PVTQEVKTSV SSDRLRVRRS TAEAARIIYP PEAQTIIVTK
GQSLILECVA SGIPPPRVTW AKDGSNIVGY NKTRFLLSNL LIDTTSEEDS GSYRCMADNG
VGEPGAAVIL YNVQVFEPPE VTMELSQQII PWGQSAKFTC EVRGNPQPSV LWLRNAVPLA
SSQRLRLSRK ALRVVSVGPE DDGVYQCMAE NEVGSAQAMA QLKTARPGTT LKPWREAKFG
SAQSPTPPAR PSSPDRTLLR PRPTVLPASL QCPTAKGQVS PAEAPIILSS PRTSKTDYYE
LVWRPRHESG NRAPILYYLV KHRKVTNSSD EWTISGIPAN QHRLTLTRLD PGSLYEVEMA
AYNCAGEGQT AMVTFRTGRR PKPEIVASKE QQIQRDDPGA STQSNNQLDN SRLSPPEAPD
RPTISMASET SVYVTWIPRG NGGFPIQSFR VEYKKLKKVG DWILATSAIP PSRLSVEITG
LEKGTSYKFR VRALNILGES EPSAASRPYV VSGYSNRVYE RPVTGPYITF TDAVNETTIM
LKWVYKSASN NNTPIHGFYI YYRPTDSDND SDYKKDVVEG DRYWHSISHL QPETSYDIKM
QCFNEGGESE FSNVMICETK ARKPSGQPGR LLPPTVAPPP ILPPDPGERP GGPGAMVARS
SDLPYLIVGV VLGSIVLIIV AFIPFCLWRA WSKQKQTADL AFPSSALLPS SCQYTMVPLR
GISAPRANGQ PYINGQPYVN GLHVKGACPS AGLGCPGVKT QEYSPGELQQ QDHNSSLLQG
KILGNGHEKS HQPMREPDSS PDENTFLYTL PDDSTHQLLQ PHDECYHLQE QPAAICQSGV
RNTSESPRRE GLWDSPFHPG PPCCLGLVPV EEVDSPHSLQ VRGGEWCPHH PPGTYLGQDP
GRRLSSSPPI HVSFETPPPT I
//