ID A0A1A6GQU5_NEOLE Unreviewed; 652 AA.
AC A0A1A6GQU5;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 22-FEB-2023, entry version 16.
DE RecName: Full=C-type lectin domain-containing protein {ECO:0000259|PROSITE:PS50041};
GN ORFNames=A6R68_02760 {ECO:0000313|EMBL:OBS68688.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS68688.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS68688.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS68688.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS68688.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS68688.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01075879; OBS68688.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6GQU5; -.
DR STRING; 56216.A0A1A6GQU5; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd03600; CLECT_thrombomodulin_like; 1.
DR CDD; cd00054; EGF_CA; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR PANTHER; PTHR14789; CHONDROLECTIN VARIANT CHODLFDELTAE; 1.
DR PANTHER; PTHR14789:SF4; ENDOSIALIN; 1.
DR Pfam; PF14670; FXa_inhibition; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00179; EGF_CA; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS01187; EGF_CA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..652
FT /note="C-type lectin domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008345793"
FT TRANSMEM 581..606
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 30..147
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
SQ SEQUENCE 652 AA; 70021 MW; 88599DFB3A5A2849 CRC64;
MLLRLLLAWA AAVPTLGQAP WTLEPRAVCG PGSCYALFPR RRTFLEAWRA CRELGGNLAT
PRTPEEARRV DSLVGVGPAS GLLWIGLQRQ ARQCQPQRPL RGFLWTTGDQ DTAFTNWAQP
ATEGPCPAQR CAALEASGEH RWLEGSCTLA VDGYVCQFGF EGACPALPVE VGQAGPTVYT
TPFNLVSSEF EWLPFGSVAA VQCQAGRGVS LLCVKQPSGD VGWSQAGPLC PGTGCGPDNG
GCEHECVEEV DGGVSCRCSE GFRLAADGHS CEDPCAQAPC EQQCEPGGPQ GYSCHCRLGF
RPAEDEPHRC VDTDECQIAG VCQQMCVNYV GGFECYCSEG HELEADGISC SPAGAMGAQA
SQDLRDELLD DEEEGEDEEE AWEVFDGTWT EEQSVLWMAP TQPPDFGLAY RPNFPRDGEP
QRLHLEPTWP PPLSAPRGPY HSSVVSATRP MVISAMKPTL PSAHKTFVIP ATHPPLSPVH
PPAMAPATPP AVFPDHQIPQ IKASYPDLPF GYKPGIISAT HPARPPAYQP PTISTKAPLV
PREGVPSPKL VPWLPSVPPT AAPTALAEAG LADQSQRDDR WLLVALLVPT CVFLVVLLAL
GIVYCTRCGP HAPNKRITDC YRWVTQAGNK SPTEPMPPRG SLTGVQTCRT SV
//