GenomeNet

Database: UniProt
Entry: W5NX96_SHEEP
LinkDB: W5NX96_SHEEP
Original site: W5NX96_SHEEP 
ID   W5NX96_SHEEP            Unreviewed;      1293 AA.
AC   W5NX96;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 63.
DE   RecName: Full=Attractin {ECO:0008006|Google:ProtNLM};
GN   Name=ATRN {ECO:0000313|Ensembl:ENSOARP00000002793.1};
OS   Ovis aries (Sheep).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Caprinae; Ovis.
OX   NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000002793.1, ECO:0000313|Proteomes:UP000002356};
RN   [1] {ECO:0000313|Ensembl:ENSOARP00000002793.1, ECO:0000313|Proteomes:UP000002356}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000002793.1,
RC   ECO:0000313|Proteomes:UP000002356};
RX   PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA   Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA   Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA   Wang W., Xun X.;
RT   "The sheep genome reference sequence: a work in progress.";
RL   Anim. Genet. 41:449-453(2010).
RN   [2] {ECO:0000313|Ensembl:ENSOARP00000002793.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00460}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMGL01025444; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01025445; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01025446; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01025447; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   SMR; W5NX96; -.
DR   STRING; 9940.ENSOARP00000002793; -.
DR   PaxDb; 9940-ENSOARP00000002793; -.
DR   Ensembl; ENSOART00000002848.1; ENSOARP00000002793.1; ENSOARG00000002627.1.
DR   eggNOG; KOG1388; Eukaryota.
DR   HOGENOM; CLU_003930_0_0_1; -.
DR   OMA; MNGCPSD; -.
DR   Proteomes; UP000002356; Chromosome 13.
DR   Bgee; ENSOARG00000002627; Expressed in liver and 54 other cell types or tissues.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR   CDD; cd00041; CUB; 1.
DR   CDD; cd00055; EGF_Lam; 2.
DR   Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR   InterPro; IPR001304; C-type_lectin-like.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR000859; CUB_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR011043; Gal_Oxase/kelch_b-propeller.
DR   InterPro; IPR015915; Kelch-typ_b-propeller.
DR   InterPro; IPR006652; Kelch_1.
DR   InterPro; IPR002049; LE_dom.
DR   InterPro; IPR002165; Plexin_repeat.
DR   InterPro; IPR016201; PSI.
DR   InterPro; IPR035914; Sperma_CUB_dom_sf.
DR   PANTHER; PTHR46376:SF3; ATTRACTIN; 1.
DR   PANTHER; PTHR46376; LEUCINE-ZIPPER-LIKE TRANSCRIPTIONAL REGULATOR 1; 1.
DR   Pfam; PF00431; CUB; 1.
DR   Pfam; PF01344; Kelch_1; 2.
DR   Pfam; PF13964; Kelch_6; 1.
DR   Pfam; PF00059; Lectin_C; 1.
DR   Pfam; PF01437; PSI; 2.
DR   SMART; SM00034; CLECT; 1.
DR   SMART; SM00042; CUB; 1.
DR   SMART; SM00180; EGF_Lam; 2.
DR   SMART; SM00423; PSI; 5.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 1.
DR   SUPFAM; SSF50965; Galactose oxidase, central domain; 1.
DR   SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR   PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR   PROSITE; PS01180; CUB; 1.
DR   PROSITE; PS00022; EGF_1; 1.
DR   PROSITE; PS50026; EGF_3; 1.
DR   PROSITE; PS01248; EGF_LAM_1; 1.
DR   PROSITE; PS50027; EGF_LAM_2; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW   Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW   ECO:0000256|PROSITE-ProRule:PRU00460};
KW   Lectin {ECO:0000256|ARBA:ARBA00022734};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW   ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        1143..1167
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          1..113
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          111..148
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          659..783
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
FT   DOMAIN          927..972
FT                   /note="Laminin EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50027"
FT   DISULFID        115..125
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        119..136
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        138..147
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        944..953
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT   DISULFID        956..970
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ   SEQUENCE   1293 AA;  144328 MW;  A9A4BC339F2E72E4 CRC64;
     YRLTGSSGFI TDGPGNYKYK TKCTWLIEGQ PNRIMRLRFN HFATECSWDH LYVYDGDSIY
     APLVAAFSGL IVPERDGNET VPEVVATSGY ALLHFFSDAA YNLTGFNITY NFDMCPNNCS
     GRGECKISNS SNTVQCECSE NWKGEACDIP HCVNNCGFPH RGICNSSDVR GCSCFSEWQG
     PGCSVPVPAN QSFWTREEYS DLKLPRASHK AVVNGNTMWV VGGYMFNHSD YNMVLAYDLA
     SREWLALNRS VNSVVVRYGH SLALYQDKIY MYGGKIDSTG NVTNELRVFH IHNESWVLLS
     PKAKEQYAVV GHSAHIVTLK NGRVVMLVIF GHCPLYGYIS SVQEYDLDKN TWSILHTQGA
     LVQGGYGHSS VYDDRTKALY IHGGYKAFSA NKYRLADDLY RYDVDTQMWT ILKDSRFFRY
     LHTAVIVSGT MLVFGGNTHN DTSMSHGAKC FSSDFMAYDI DCDRWSVLPR PDLHHDVNRF
     GHSAVLYNRT MYVFGGFNSL LLSDILVFTS EQCEAHQSEA ACLAAGPGVR CVWDTGSSQC
     VSWELAPEAQ EKIKSECFSK RIFDHDRCDQ LTDCYSCTAN TNGCQWCGDH CVPMNHSCAE
     GQISIFKYDQ CPKDNPMYYC NKKTSCRSCA LDQNCQWEPR NQECIALPEN ICGIGWHLVG
     NSCLKITTAK ETYDNAKLSC RNHNAFLASL TTQKKVEFVL KQLRIMQSSQ SMSKLTLTPW
     VGLRKINVSY WCWEDMSPFT NSSLQWMPSE PSDAGFCGIL SEPSTRGLKA ATCINPLNGS
     VCERPANHSA KQCRTPCALR TACGECTSGS SECMWCSNMK QCVDSNAYVA SFPFGQCMEW
     YTMSSCPPEN CSGYCTCSHC LEQPGCGWCT DPSNTGKGKC IEGSYKGPVK MPSQGPTGNS
     YPQPLLNSSM CLEDSRYNWS FIHCPACQCN GHSKCINQSI CEKCENLTTG KHCETCISGF
     YGDPTNGGKC QPCRCNGHAS LCNTNTGKCF CTTKGVKGDE CQLCEVENRY QGNPLKGTCY
     YTLLIDYQFT FSLSQEDDRY YTAINFVATP DEQNRDLDMF INASKNFNLN ITWAASFSAG
     TQAGEEMPVV SKTNIKEYKD SFSNEKFDFR NHPNITFFVY VSNFTWPIKI QIAFSQHSNF
     MDLVQFFVTF FSCFLSLLLV AAVVWKIKQS CWASRRREQL LREMQQMASR PFASVNVALE
     TDEEPPDLIG GSIKTVPKPI ALEPCFGNKA AVLSVFVRLP RGLGGIPPPG QSGLAVASAL
     VDISQQMPVV YKEKSGAVRN RKQQPPAQPG TCI
//
DBGET integrated database retrieval system