ID A0A1A6H116_NEOLE Unreviewed; 730 AA.
AC A0A1A6H116;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=Sema domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A6R68_00140 {ECO:0000313|EMBL:OBS71307.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS71307.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS71307.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS71307.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS71307.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the semaphorin family.
CC {ECO:0000256|ARBA:ARBA00009492}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00352}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS71307.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01058615; OBS71307.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6H116; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0030215; F:semaphorin receptor binding; IEA:InterPro.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 3.30.1680.10; ligand-binding face of the semaphorins, domain 2; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR001627; Semap_dom.
DR InterPro; IPR036352; Semap_dom_sf.
DR InterPro; IPR027231; Semaphorin.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR11036; SEMAPHORIN; 1.
DR PANTHER; PTHR11036:SF37; SEMAPHORIN-3B; 1.
DR Pfam; PF01403; Sema; 1.
DR SMART; SM00423; PSI; 1.
DR SMART; SM00630; Sema; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR SUPFAM; SSF103575; Plexin repeat; 1.
DR SUPFAM; SSF101912; Sema domain; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS51004; SEMA; 1.
PE 3: Inferred from homology;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..730
FT /note="Sema domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008346033"
FT DOMAIN 22..503
FT /note="Sema"
FT /evidence="ECO:0000259|PROSITE:PS51004"
FT DOMAIN 554..627
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT REGION 684..730
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 714..730
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 730 AA; 81242 MW; 05A2C66B707B69C3 CRC64;
MIPGLALLWA AGLGDAAPNL PRLRLSFQEL QAXHGVRTFR LERTXCYEAL LVDEERGRLF
VGAENHVASL SLDNISKRAK KLAWPAPVEW REECNWAGKD IGTECMNFVK LLHAYNHTHL
LACGTGAFHP TCAFVEVGHR LEEPMLRLDL KKLEDGKGKS PYDPRHRAAS VLVGEELYSG
VAADLMGRDF TIFRSLGQNP SLRTEPHDSR WLNEPKFVKV FWIPESENPD DDKIYFFFRE
SAVEAAPAMG RMTVSRVGQI CRNDLGGQRS LVNKWTTFLK ARLVCSVPGA EGDTHFDQLQ
DVFLLSSRDR QMPLLYAVFS TSSDIFQGSA VCVYSMNDVR RAFLGPFAHK EGPTHQWVSY
QGRVPYPRPG MCPSKTFGTF SSTKDFPDDV IQFARNHPLM YNSVLPMGGR PLFLQVGAGY
TFTQIAADRV AAADGHYDVL FIGTDVGTVL KVISVPKGGR PNSEGLLLEE LQVFEDSATI
TSMQISSKRL YIASPSAVAQ IALHRCTALG RACAECCLAR DPYCAWDGSA CTRFQPTAKR
RFRRQDIRND SSHPALLERK VLGVESGSAF LECEPRSLQA HVEWTFHRAG EVAQTQMLAE
ERVERTARGL LLRGLRRQXS GVYLCVAVEQ GFSQPLRRLV LHVLSAVQAE RLARAEEAAA
PAPPGPKLWY RDFLQLVEPG GGGGANSLRM CRPQPGPHSV AAESRRKGRN RRMHASELRA
ERGPRSAAHW
//