ID W5NX96_SHEEP Unreviewed; 1293 AA.
AC W5NX96;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE RecName: Full=Attractin {ECO:0008006|Google:ProtNLM};
GN Name=ATRN {ECO:0000313|Ensembl:ENSOARP00000002793.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000002793.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000002793.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000002793.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000002793.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00460}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01025444; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01025445; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01025446; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01025447; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR SMR; W5NX96; -.
DR STRING; 9940.ENSOARP00000002793; -.
DR PaxDb; 9940-ENSOARP00000002793; -.
DR Ensembl; ENSOART00000002848.1; ENSOARP00000002793.1; ENSOARG00000002627.1.
DR eggNOG; KOG1388; Eukaryota.
DR HOGENOM; CLU_003930_0_0_1; -.
DR OMA; MNGCPSD; -.
DR Proteomes; UP000002356; Chromosome 13.
DR Bgee; ENSOARG00000002627; Expressed in liver and 54 other cell types or tissues.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR CDD; cd00041; CUB; 1.
DR CDD; cd00055; EGF_Lam; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR011043; Gal_Oxase/kelch_b-propeller.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR46376:SF3; ATTRACTIN; 1.
DR PANTHER; PTHR46376; LEUCINE-ZIPPER-LIKE TRANSCRIPTIONAL REGULATOR 1; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF01344; Kelch_1; 2.
DR Pfam; PF13964; Kelch_6; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF01437; PSI; 2.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00180; EGF_Lam; 2.
DR SMART; SM00423; PSI; 5.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF50965; Galactose oxidase, central domain; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS01180; CUB; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Lectin {ECO:0000256|ARBA:ARBA00022734};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1143..1167
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1..113
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 111..148
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 659..783
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 927..972
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DISULFID 115..125
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 119..136
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 138..147
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 944..953
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 956..970
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 1293 AA; 144328 MW; A9A4BC339F2E72E4 CRC64;
YRLTGSSGFI TDGPGNYKYK TKCTWLIEGQ PNRIMRLRFN HFATECSWDH LYVYDGDSIY
APLVAAFSGL IVPERDGNET VPEVVATSGY ALLHFFSDAA YNLTGFNITY NFDMCPNNCS
GRGECKISNS SNTVQCECSE NWKGEACDIP HCVNNCGFPH RGICNSSDVR GCSCFSEWQG
PGCSVPVPAN QSFWTREEYS DLKLPRASHK AVVNGNTMWV VGGYMFNHSD YNMVLAYDLA
SREWLALNRS VNSVVVRYGH SLALYQDKIY MYGGKIDSTG NVTNELRVFH IHNESWVLLS
PKAKEQYAVV GHSAHIVTLK NGRVVMLVIF GHCPLYGYIS SVQEYDLDKN TWSILHTQGA
LVQGGYGHSS VYDDRTKALY IHGGYKAFSA NKYRLADDLY RYDVDTQMWT ILKDSRFFRY
LHTAVIVSGT MLVFGGNTHN DTSMSHGAKC FSSDFMAYDI DCDRWSVLPR PDLHHDVNRF
GHSAVLYNRT MYVFGGFNSL LLSDILVFTS EQCEAHQSEA ACLAAGPGVR CVWDTGSSQC
VSWELAPEAQ EKIKSECFSK RIFDHDRCDQ LTDCYSCTAN TNGCQWCGDH CVPMNHSCAE
GQISIFKYDQ CPKDNPMYYC NKKTSCRSCA LDQNCQWEPR NQECIALPEN ICGIGWHLVG
NSCLKITTAK ETYDNAKLSC RNHNAFLASL TTQKKVEFVL KQLRIMQSSQ SMSKLTLTPW
VGLRKINVSY WCWEDMSPFT NSSLQWMPSE PSDAGFCGIL SEPSTRGLKA ATCINPLNGS
VCERPANHSA KQCRTPCALR TACGECTSGS SECMWCSNMK QCVDSNAYVA SFPFGQCMEW
YTMSSCPPEN CSGYCTCSHC LEQPGCGWCT DPSNTGKGKC IEGSYKGPVK MPSQGPTGNS
YPQPLLNSSM CLEDSRYNWS FIHCPACQCN GHSKCINQSI CEKCENLTTG KHCETCISGF
YGDPTNGGKC QPCRCNGHAS LCNTNTGKCF CTTKGVKGDE CQLCEVENRY QGNPLKGTCY
YTLLIDYQFT FSLSQEDDRY YTAINFVATP DEQNRDLDMF INASKNFNLN ITWAASFSAG
TQAGEEMPVV SKTNIKEYKD SFSNEKFDFR NHPNITFFVY VSNFTWPIKI QIAFSQHSNF
MDLVQFFVTF FSCFLSLLLV AAVVWKIKQS CWASRRREQL LREMQQMASR PFASVNVALE
TDEEPPDLIG GSIKTVPKPI ALEPCFGNKA AVLSVFVRLP RGLGGIPPPG QSGLAVASAL
VDISQQMPVV YKEKSGAVRN RKQQPPAQPG TCI
//