ID A0A1A6HTX3_NEOLE Unreviewed; 1172 AA.
AC A0A1A6HTX3;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Sushi, nidogen and EGF-like domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=A6R68_20583 {ECO:0000313|EMBL:OBS81182.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS81182.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS81182.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS81182.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS81182.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS81182.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01017287; OBS81182.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6HTX3; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0031012; C:extracellular matrix; IEA:UniProt.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00033; CCP; 1.
DR CDD; cd00054; EGF_CA; 8.
DR CDD; cd00063; FN3; 3.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR Gene3D; 2.10.25.10; Laminin; 9.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003886; NIDO_dom.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR PANTHER; PTHR24049; CRUMBS FAMILY MEMBER; 1.
DR PANTHER; PTHR24049:SF22; DROSOPHILA CRUMBS HOMOLOG; 1.
DR Pfam; PF00008; EGF; 8.
DR Pfam; PF00041; fn3; 3.
DR Pfam; PF12661; hEGF; 1.
DR Pfam; PF06119; NIDO; 1.
DR PRINTS; PR00010; EGFBLOOD.
DR SMART; SM00032; CCP; 1.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 8.
DR SMART; SM00060; FN3; 3.
DR SMART; SM00539; NIDO; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 9.
DR SUPFAM; SSF49265; Fibronectin type III; 2.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS00022; EGF_1; 9.
DR PROSITE; PS01186; EGF_2; 7.
DR PROSITE; PS50026; EGF_3; 9.
DR PROSITE; PS50853; FN3; 3.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS50923; SUSHI; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT DOMAIN 32..177
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 190..231
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 260..296
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 299..331
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 372..408
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 433..469
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 472..508
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 510..546
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 549..606
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 629..665
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 684..720
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 719..817
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 818..916
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 917..1010
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DISULFID 221..230
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 286..295
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 321..330
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 398..407
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 459..468
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 498..507
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 536..545
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 577..604
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 655..664
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 710..719
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:OBS81182.1"
SQ SEQUENCE 1172 AA; 127090 MW; 8B8D13EE80EE8916 CRC64;
VNNNGIISFL KEVSQFTPVA FPIAKDRCVV AAFWADVDNR RAGDVYYREA TDPTTLQRAT
EDIRQYFPEL PDFSASWVFI ATWYRVTFFG GSSSSPVSPG ERVFDMGQXV QPVIANPLSW
QVNTFQTVLI TDGRFSFTIF NYESILWTTG THASSGGDAD GLGGIAAQVG QHRIGTQCSG
MRLGLLPVST ASVCLVLRPC LNGGKCIDDC VTGNPSYTCS CLAGFTGRRC HLGESLAPSI
FSSLKWLLLT TVFLAGSYPV PSPCLSNPCQ NGGTCVDADP GYVCECPEGF MGLDCRERIP
NDCECRNGGR CLGANTTLCQ CPPGFFGLLC EFEVTATPCN MNTQCPDGGY CMEYGGSYLC
VCHTDHNISH SLPSPCDSDP CFNGGSCDAH DDSYTCECPR GFHGRHCEKG DVKRTGKRVA
HVHHISLSLH TARPHLCSSG PCRNGGTCKE TGNEYHCTCP YRFTGRHCEI GKPDSCASGP
CHNGGTCFHY IGKYKCDCPP GFSGRHCEIA PSPCFRSPCM NGGTCEDLGA DFFCHCQPGY
TGHRCQAEVD CGRPEEVKHA TMRFNGTHMG SVALYTCDHG FSLSTLSHMR ICQPQGVWSQ
PPQCIGDSVG PQGWINMATS LCSWDLFLEV DECQSQPCLH GGSCQDLTAG YQCLCSPGXE
GVHCELGKRA PPCSSGQPFG CSPEVDACAS SPCQHGGRCE DGGGAYLCVC PEGFFGYHCE
TALRVERVEE SGVSISWSPP EGTTARQVLD GYAVTYASSD GSFRRTDFVD RSRSSHQLRA
LAAGRAYNIS VFSVKRNTNN KNDISRPAAL LTRTRPRPIE DVEVANVSAN AISVLWALHR
IQHATVSRVR VSILNPEASV AQSTEVDRSV DRLTFGDLLP GRRYIVRLTS LSGPEGAEYL
TESLASAPLN VWTRPLPPAN LTASRVTATS AHMVWDTPTP GISLEAYVIN VTTSQSTKSR
YIPNGKLVSY TVRDLLPGRR YQLSVTAVQS TEQGQLHSEP AHLYIITWMG LGMKQGPCGY
PQLLLEPTTL VPLKNTEEAP EQVNLALQLS KNSSKDTESK MEAPVCQVPI PTAVTAGLGS
KADTVSSPVK RCLAPAHGSS LRRSRFLSGK EMSATMWSSF TEFSCDNRYK KVYKVHQDIC
FKERCESTSL KKPQTGTSWG HAVLHAALWV DG
//