ID H9GM03_ANOCA Unreviewed; 1103 AA.
AC H9GM03;
DT 16-MAY-2012, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 3.
DT 27-MAR-2024, entry version 72.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000014918.4};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000014918.4, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000014918.4}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000014918.4};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000014918.4}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H9GM03; -.
DR Ensembl; ENSACAT00000015221.4; ENSACAP00000014918.4; ENSACAG00000015074.4.
DR GeneTree; ENSGT00940000155978; -.
DR HOGENOM; CLU_014484_0_0_1; -.
DR TreeFam; TF321302; -.
DR Proteomes; UP000001646; Unplaced.
DR Bgee; ENSACAG00000015074; Expressed in brain and 10 other cell types or tissues.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00110; LamG; 4.
DR Gene3D; 2.60.120.200; -; 6.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR15036:SF49; AXOTACTIN; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF02210; Laminin_G_2; 5.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00282; LamG; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 6.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS50025; LAM_G_DOMAIN; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1103
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5032462493"
FT DOMAIN 27..203
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 199..237
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 377..569
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 573..610
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 615..787
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 801..976
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 979..1016
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
SQ SEQUENCE 1103 AA; 121032 MW; 1349AE602AD86580 CRC64;
MAKPVRLLVP LALLVASMLS PGAGGAALEF SGTTGQWARY ARWDASSLGE LSFSLKTNVS
RALVLYLDDG GNCDFLELLI LEGRFRLRFT ISCAEPASLH LETPINDDRW HMLLLTRNYR
ETMLVVDGEA RVAEVKSKRR DMTVESDLFV GGIPPDVRLS ALTLSTVKYE MPFRGIVANL
KVGDMPPALL GSQGIRSDME YLCTKQNPCV NGGICTVINS EVQCDCSLTG FQGKFCSEVC
RPFLAKPPPS PCQFPTWLTL CCLLVSLHPP NPPPPNPCAH DGQVTISVDG ILTTTGYTQE
DYTMLGSDDF FYIGGSPNTA DLPGSPVSNN FMGCLKDVVY KNNDFKLELS RLAKEGDPKM
KINGDLVFRC ENVAALDPVT FESPEAYISL PKWNTKKTGS ISFDFRTTEP NGLLLFSHGR
LQPPKEGRSE RLHKADYFAM ELLDGYLYLL LDMGSGGIKM RASSKKVNDG EWCHVDFQRD
GRKGSISVNS RSTPFLASGE SEILDLDSEM YLGGLPENRL DLILPPEVWT AFLNYGYVGC
VRDLFIDGKS RDVRRLAEVQ SVTGVSSFCS RETLKQCSSS PCRNGGLCRE GWNRFICDCV
GTGYLGRLCE REATVLSYDG SMYMKIMLPT VMHTEAEDVS LRFMSQRAYG FMMATTSKES
ADTLRLELDG GQMKLTVNLD CVRIGCNPSK GPETLFAGQK LNDNEWHTVR VVRRGKNLQL
SVDNVTVEGQ MAGAHTRLEF HNIETGIMTE RRFISVVPSN FIGHLSSLVF NGLPYMDLCK
NGDISYCELN ARFGLRSIIA DPVTFKSKAS YLALATLQAY ASMHLFFQFK TTATDGLILF
NSGNGNDFIV VELVKGYIHY VFDLGNGPSL MKGNSDKPVN DNQWHNVIVS RDTNNVHTLK
IDSRTVTQHS NGARNLDLKG ELYIGGLSRN MFSNLPKLVA SRDGFQGCLA SVDLNGRLPD
LIADALHRIG HVERGCDGPS TTCTEDSCAN QGVCLQQWDG YTCDCTMTSY GGPVCNDPGT
TYIFGKGGAL ITYTWPPNDR PSTRADRLAV GFSTHQKNAI LVRVDSASGL GDYLQLHIVR
TKGKAREVGV GAGGASQPGA GLL
//