ID H0UUF5_CAVPO Unreviewed; 947 AA.
AC H0UUF5;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=MAM domain containing glycosylphosphatidylinositol anchor 1 {ECO:0000313|Ensembl:ENSCPOP00000000593.3};
GN Name=MDGA1 {ECO:0000313|Ensembl:ENSCPOP00000000593.3};
OS Cavia porcellus (Guinea pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Caviidae;
OC Cavia.
OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000000593.3, ECO:0000313|Proteomes:UP000005447};
RN [1] {ECO:0000313|Proteomes:UP000005447}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2N {ECO:0000313|Proteomes:UP000005447};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSCPOP00000000593.3}
RP IDENTIFICATION.
RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000000593.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKN02021441; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; H0UUF5; -.
DR Ensembl; ENSCPOT00000000676.3; ENSCPOP00000000593.3; ENSCPOG00000000671.4.
DR VEuPathDB; HostDB:ENSCPOG00000000671; -.
DR GeneTree; ENSGT00940000159201; -.
DR HOGENOM; CLU_014908_0_0_1; -.
DR TreeFam; TF330345; -.
DR Proteomes; UP000005447; Unassembled WGS sequence.
DR Bgee; ENSCPOG00000000671; Expressed in heart and 12 other cell types or tissues.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0098552; C:side of membrane; IEA:UniProtKB-KW.
DR CDD; cd00096; Ig; 2.
DR CDD; cd06263; MAM; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 7.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR000998; MAM_dom.
DR PANTHER; PTHR45080; CONTACTIN 5; 1.
DR PANTHER; PTHR45080:SF23; MAM DOMAIN-CONTAINING GLYCOSYLPHOSPHATIDYLINOSITOL ANCHOR PROTEIN 1; 1.
DR Pfam; PF13927; Ig_3; 6.
DR Pfam; PF00629; MAM; 1.
DR SMART; SM00409; IG; 6.
DR SMART; SM00408; IGc2; 6.
DR SMART; SM00137; MAM; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 6.
DR PROSITE; PS50853; FN3; 1.
DR PROSITE; PS50835; IG_LIKE; 6.
DR PROSITE; PS50060; MAM_2; 1.
PE 4: Predicted;
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Reference proteome {ECO:0000313|Proteomes:UP000005447};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..947
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011534013"
FT DOMAIN 24..123
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 132..230
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 240..323
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 338..432
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 440..532
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 539..631
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 643..743
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 743..910
FT /note="MAM"
FT /evidence="ECO:0000259|PROSITE:PS50060"
FT REGION 771..790
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 771..787
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 947 AA; 105159 MW; FB20675D93908F45 CRC64;
MEVTCLLLLA LIPFHCRGQG VYAPAQAQIV HAGQACVVKE DNISERVYTI REGDTLMLQC
LVTGHPRPQV RWTKTAGSAS DKFQETSVFN ETLRIERVAR TQGGRYYCKA ENGVGVPAIK
SIRVDVQYLD EPVLTVHQTV SDVRGNFYQE KTVFLRCTVN SNPPARFIWK RGSDTLSHSQ
DNGVDIYEPL YTQGETKVLK LKNLRPQDYA SYTCQVSVRN VCGIPDKAIT FRLTNTTAPP
ALKLSVNETL VVNPGDNVTV QCLLTGGDPL PQLQWSHGPG PLPLGALAQG GTLSIPSVQA
RDSGYYNCTA TNNVGNPAKK TVNLLVRSMK NATFQITPDV IKESENIQLG QDLKLSCHVD
AVPQEKVTYQ WFKNGKPARM SKRLLVTRND PELPAVTSSL ELIDLHFSDY GTYLCVASFP
GAPVSDLSVE VNISSETVPP TISVPKGRAV VTVREGSPAE LQCEVRGKPR PPVLWSRVDK
EAALLPSGLA LEETQDGKLR LERVTRDMSG TYRCQTARYN GFNVRPREAQ VQLNVQFPPE
VEPESQDVRQ ALGRPVLLRC SLLRGNPQRI ATAVWRFKGQ LLPPPPVVPA ATEAADHADL
RLDAVTRDSS GSYECSVSND VGSAACLFQV SAKAYSPEFY FDTPNPTRSH KLSKNYSYVL
QWTQREPDAV DPVLNYRLSV RQLNQHNAMV KAIPVRRVEK GQLLEYLLTD LRVPHSYEIR
LTPYTTFGAG DMASRIIHYT EHNTCHFEDE QICGYTQDLT DNFDWTRQNA LTQNPKRSPN
TGPPTDISGT PEGYYMFIET SRPRELGDRA RLVSPLYNAS AKFYCVSFFY HMYGKHIGSL
NLLVRSRNKG TLDTHAWSLS GNKGNVWQQA HVPINPSGPF QIIFEGVRGS GYLGDIAIDD
VTLKKGECPR KQMDPNKVVV MPGSGAPRQS SLQLWGPMAI FLLALER
//