ID A0A1A6GFG7_NEOLE Unreviewed; 355 AA.
AC A0A1A6GFG7;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE RecName: Full=Hyaluronan and proteoglycan link protein 2 {ECO:0008006|Google:ProtNLM};
GN ORFNames=A6R68_06486 {ECO:0000313|EMBL:OBS65003.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS65003.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS65003.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS65003.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS65003.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00323}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS65003.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01097113; OBS65003.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6GFG7; -.
DR STRING; 56216.A0A1A6GFG7; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005540; F:hyaluronic acid binding; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:InterPro.
DR CDD; cd03518; Link_domain_HAPLN_module_1; 1.
DR CDD; cd03519; Link_domain_HAPLN_module_2; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 2.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR013106; Ig_V-set.
DR InterPro; IPR000538; Link_dom.
DR PANTHER; PTHR22804; AGGRECAN/VERSICAN PROTEOGLYCAN; 1.
DR PANTHER; PTHR22804:SF8; HYALURONAN AND PROTEOGLYCAN LINK PROTEIN 2; 1.
DR Pfam; PF07686; V-set; 1.
DR Pfam; PF00193; Xlink; 2.
DR PRINTS; PR01265; LINKMODULE.
DR SMART; SM00409; IG; 1.
DR SMART; SM00408; IGc2; 1.
DR SMART; SM00406; IGv; 1.
DR SMART; SM00445; LINK; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS01241; LINK_1; 2.
DR PROSITE; PS50963; LINK_2; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00323};
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..355
FT /note="Hyaluronan and proteoglycan link protein 2"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008345559"
FT DOMAIN 35..143
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 149..243
FT /note="Link"
FT /evidence="ECO:0000259|PROSITE:PS50963"
FT DOMAIN 258..353
FT /note="Link"
FT /evidence="ECO:0000259|PROSITE:PS50963"
FT DISULFID 195..216
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00323"
FT DISULFID 305..326
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00323"
SQ SEQUENCE 355 AA; 39458 MW; E311D11A4500A4DB CRC64;
MPRRTPLPAL CCFLLPWAFT TFHQALGNPA PHPGPHYLLP PIHEVIHSHR GATATLPCVL
GTSPPSYKVR WSKVDPGELR ETLILITNGL HARGYGHLGG RASMRRGHRL DASLVIKNVR
LEDEGRYRCE LINGIEDESV ALTLRLEGVV FPYQPSRGRY QFNYFEAKQA CEEQDGRLAT
YGQLYQAWTE GLDWCNAGWL LEGSVRYPVL NARAPCGGHG RPGIRSYGPR DRTRDRYDAF
CFTSALADRL LGAIGHRLTF PGQVFFVPGR LTLSEAHAAC RRRGAVVAKV GHLYAAWKFS
GLDRCDGGWL ADGSVRFPIT TPRPRCGGLP DPGVRSFGFP RPQQAAYGTY CYAEK
//