Database: UniProt
Entry: A0A1A6HTX3_NEOLE
ID   A0A1A6HTX3_NEOLE        Unreviewed;      1172 AA.
AC   A0A1A6HTX3;
DT   05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT   05-OCT-2016, sequence version 1.
DT   27-MAR-2024, entry version 27.
DE   RecName: Full=Sushi, nidogen and EGF-like domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
DE   Flags: Fragment;
GN   ORFNames=A6R68_20583 {ECO:0000313|EMBL:OBS81182.1};
OS   Neotoma lepida (Desert woodrat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC   Cricetidae; Neotominae; Neotoma.
OX   NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS81182.1, ECO:0000313|Proteomes:UP000092124};
RN   [1] {ECO:0000313|EMBL:OBS81182.1, ECO:0000313|Proteomes:UP000092124}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=417 {ECO:0000313|EMBL:OBS81182.1};
RC   TISSUE=Liver {ECO:0000313|EMBL:OBS81182.1};
RA   Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT   "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT   lepida.";
RL   Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OBS81182.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LZPO01017287; OBS81182.1; -; Genomic_DNA.
DR   STRING; 56216.A0A1A6HTX3; -.
DR   Proteomes; UP000092124; Unassembled WGS sequence.
DR   GO; GO:0031012; C:extracellular matrix; IEA:UniProt.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR   CDD; cd00033; CCP; 1.
DR   CDD; cd00054; EGF_CA; 8.
DR   CDD; cd00063; FN3; 3.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR   Gene3D; 2.10.25.10; Laminin; 9.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR013032; EGF-like_CS.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR003886; NIDO_dom.
DR   InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR   InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR   PANTHER; PTHR24049; CRUMBS FAMILY MEMBER; 1.
DR   PANTHER; PTHR24049:SF22; DROSOPHILA CRUMBS HOMOLOG; 1.
DR   Pfam; PF00008; EGF; 8.
DR   Pfam; PF00041; fn3; 3.
DR   Pfam; PF12661; hEGF; 1.
DR   Pfam; PF06119; NIDO; 1.
DR   PRINTS; PR00010; EGFBLOOD.
DR   SMART; SM00032; CCP; 1.
DR   SMART; SM00181; EGF; 9.
DR   SMART; SM00179; EGF_CA; 8.
DR   SMART; SM00060; FN3; 3.
DR   SMART; SM00539; NIDO; 1.
DR   SUPFAM; SSF57535; Complement control module/SCR domain; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 9.
DR   SUPFAM; SSF49265; Fibronectin type III; 2.
DR   PROSITE; PS00010; ASX_HYDROXYL; 4.
DR   PROSITE; PS00022; EGF_1; 9.
DR   PROSITE; PS01186; EGF_2; 7.
DR   PROSITE; PS50026; EGF_3; 9.
DR   PROSITE; PS50853; FN3; 3.
DR   PROSITE; PS51220; NIDO; 1.
DR   PROSITE; PS50923; SUSHI; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW   Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT   DOMAIN          32..177
FT                   /note="NIDO"
FT                   /evidence="ECO:0000259|PROSITE:PS51220"
FT   DOMAIN          190..231
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          260..296
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          299..331
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          372..408
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          433..469
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          472..508
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          510..546
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          549..606
FT                   /note="Sushi"
FT                   /evidence="ECO:0000259|PROSITE:PS50923"
FT   DOMAIN          629..665
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          684..720
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          719..817
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          818..916
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          917..1010
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DISULFID        221..230
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        286..295
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        321..330
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        398..407
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        459..468
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        498..507
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        536..545
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        577..604
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT   DISULFID        655..664
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        710..719
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:OBS81182.1"
SQ   SEQUENCE   1172 AA;  127090 MW;  8B8D13EE80EE8916 CRC64;
     VNNNGIISFL KEVSQFTPVA FPIAKDRCVV AAFWADVDNR RAGDVYYREA TDPTTLQRAT
     EDIRQYFPEL PDFSASWVFI ATWYRVTFFG GSSSSPVSPG ERVFDMGQXV QPVIANPLSW
     QVNTFQTVLI TDGRFSFTIF NYESILWTTG THASSGGDAD GLGGIAAQVG QHRIGTQCSG
     MRLGLLPVST ASVCLVLRPC LNGGKCIDDC VTGNPSYTCS CLAGFTGRRC HLGESLAPSI
     FSSLKWLLLT TVFLAGSYPV PSPCLSNPCQ NGGTCVDADP GYVCECPEGF MGLDCRERIP
     NDCECRNGGR CLGANTTLCQ CPPGFFGLLC EFEVTATPCN MNTQCPDGGY CMEYGGSYLC
     VCHTDHNISH SLPSPCDSDP CFNGGSCDAH DDSYTCECPR GFHGRHCEKG DVKRTGKRVA
     HVHHISLSLH TARPHLCSSG PCRNGGTCKE TGNEYHCTCP YRFTGRHCEI GKPDSCASGP
     CHNGGTCFHY IGKYKCDCPP GFSGRHCEIA PSPCFRSPCM NGGTCEDLGA DFFCHCQPGY
     TGHRCQAEVD CGRPEEVKHA TMRFNGTHMG SVALYTCDHG FSLSTLSHMR ICQPQGVWSQ
     PPQCIGDSVG PQGWINMATS LCSWDLFLEV DECQSQPCLH GGSCQDLTAG YQCLCSPGXE
     GVHCELGKRA PPCSSGQPFG CSPEVDACAS SPCQHGGRCE DGGGAYLCVC PEGFFGYHCE
     TALRVERVEE SGVSISWSPP EGTTARQVLD GYAVTYASSD GSFRRTDFVD RSRSSHQLRA
     LAAGRAYNIS VFSVKRNTNN KNDISRPAAL LTRTRPRPIE DVEVANVSAN AISVLWALHR
     IQHATVSRVR VSILNPEASV AQSTEVDRSV DRLTFGDLLP GRRYIVRLTS LSGPEGAEYL
     TESLASAPLN VWTRPLPPAN LTASRVTATS AHMVWDTPTP GISLEAYVIN VTTSQSTKSR
     YIPNGKLVSY TVRDLLPGRR YQLSVTAVQS TEQGQLHSEP AHLYIITWMG LGMKQGPCGY
     PQLLLEPTTL VPLKNTEEAP EQVNLALQLS KNSSKDTESK MEAPVCQVPI PTAVTAGLGS
     KADTVSSPVK RCLAPAHGSS LRRSRFLSGK EMSATMWSSF TEFSCDNRYK KVYKVHQDIC
     FKERCESTSL KKPQTGTSWG HAVLHAALWV DG
//