ID A0A1A6FWU0_NEOLE Unreviewed; 701 AA.
AC A0A1A6FWU0;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 13-SEP-2023, entry version 22.
DE RecName: Full=SEA domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=A6R68_11213 {ECO:0000313|EMBL:OBS57657.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS57657.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS57657.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS57657.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS57657.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix, interphotoreceptor matrix {ECO:0000256|ARBA:ARBA00004593}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS57657.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01117030; OBS57657.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6FWU0; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0042995; C:cell projection; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0033165; C:interphotoreceptor matrix; IEA:UniProtKB-SubCell.
DR GO; GO:0008201; F:heparin binding; IEA:UniProtKB-KW.
DR GO; GO:0007601; P:visual perception; IEA:InterPro.
DR Gene3D; 3.30.70.960; SEA domain; 1.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR039861; IMPG.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR036364; SEA_dom_sf.
DR PANTHER; PTHR12199; INTERPHOTORECEPTOR MATRIX PROTEOGLYCAN; 1.
DR PANTHER; PTHR12199:SF4; INTERPHOTORECEPTOR MATRIX PROTEOGLYCAN 2; 1.
DR Pfam; PF01390; SEA; 1.
DR SMART; SM00200; SEA; 1.
DR SUPFAM; SSF82671; SEA domain; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS50024; SEA; 1.
PE 4: Predicted;
KW Cell projection {ECO:0000256|ARBA:ARBA00023273};
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Heparin-binding {ECO:0000256|ARBA:ARBA00022674};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 521..634
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 634..675
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 45..69
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 152..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 357..395
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 369..384
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 643..660
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:OBS57657.1"
FT NON_TER 701
FT /evidence="ECO:0000313|EMBL:OBS57657.1"
SQ SEQUENCE 701 AA; 76089 MW; 91685CF3D6CD2F22 CRC64;
KGVLLPQTDD IIWSTQSSSL QVITPSILDT TLQAEWLSAD ESTTTNISPL DFSSGPPSTA
GRDLRSESAL SDLVSTPKLA SPSKVVLSSS XEVLEGSSLT LHSVTPAVLQ SGLPVASEGR
TSGSSVLDEG LVNTEEPEDA SVDVLPSSSL IQPVPKETVP PMEDSGMTLL TSSPHLTSSV
IVDLAKDITA TSGLDSLASK VTDQLAMSPW FPDTSEENEF SFESGLGSGS GKNVGLNIWP
WSETSVEKTT ETLSKSWPED AYALLPTEGI EKLHVDGKVD ATEQIIEPSE HSYSDRSIHF
TEEESLDEST IPVYAESATQ FTSLIFSKHT PDVPDTDSYS VTKAPFLLAA MATSTSTEKT
DEVSTPLKED TIQTESSSHK GLPSESSVVK PDMQPVGTIL PESDIVWART SSLGKLSRDT
LASTPASSDR LWLKAPMTQS TELPPTTSST QLEDEVIMGV QDISLELDRV GTDYYQPELT
QEQNGKVGSY VEMSTNVHYT EMPFVAQPTK GSDLSHTQTS GTLVVFFSLR VTNMMFSEDL
FNKNSLEYKA LEQRFLELLV PYLQSNLSGF QNLEILNFRN GSIVVNSRVK FTETVPPNVN
NAMYLILEDF CTTAYQTMNL DIDKYSLDVE SGDDANPCKF QACNEFSECL VNPWSGEAKC
KCYPGYLSVD ELPCQSLCDL QPDFCLNDGK CDIMPGHGAI C
//