ID G1KIG8_ANOCA Unreviewed; 717 AA.
AC G1KIG8;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 2.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Extracellular matrix protein 2 {ECO:0000313|Ensembl:ENSACAP00000008912.3};
GN Name=ECM2 {ECO:0000313|Ensembl:ENSACAP00000008912.3};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000008912.3, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000008912.3, ECO:0000313|Proteomes:UP000001646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000008912.3,
RC ECO:0000313|Proteomes:UP000001646};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000008912.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003217706.1; XM_003217658.2.
DR RefSeq; XP_008103390.1; XM_008105183.2.
DR RefSeq; XP_008103392.1; XM_008105185.2.
DR RefSeq; XP_016847093.1; XM_016991604.1.
DR AlphaFoldDB; G1KIG8; -.
DR STRING; 28377.ENSACAP00000008912; -.
DR Ensembl; ENSACAT00000009103.4; ENSACAP00000008912.3; ENSACAG00000009093.4.
DR GeneID; 100554283; -.
DR KEGG; acs:100554283; -.
DR CTD; 1842; -.
DR eggNOG; KOG0619; Eukaryota.
DR GeneTree; ENSGT00940000159941; -.
DR HOGENOM; CLU_000288_186_2_1; -.
DR InParanoid; G1KIG8; -.
DR OrthoDB; 5361311at2759; -.
DR TreeFam; TF330031; -.
DR Proteomes; UP000001646; Chromosome 2.
DR Bgee; ENSACAG00000009093; Expressed in lung and 8 other cell types or tissues.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0005614; C:interstitial matrix; IEA:Ensembl.
DR GO; GO:0070052; F:collagen V binding; IBA:GO_Central.
DR GO; GO:0008201; F:heparin binding; IBA:GO_Central.
DR GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR GO; GO:0010811; P:positive regulation of cell-substrate adhesion; IBA:GO_Central.
DR Gene3D; 6.20.200.20; -; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 3.
DR InterPro; IPR043184; ECM2.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR003591; Leu-rich_rpt_typical-subtyp.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR46544:SF1; EXTRACELLULAR MATRIX PROTEIN 2; 1.
DR PANTHER; PTHR46544; EXTRACELLULAR MATRIX PROTEIN 2-RELATED; 1.
DR Pfam; PF13855; LRR_8; 4.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00369; LRR_TYP; 12.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF52047; RNI-like; 1.
DR PROSITE; PS51450; LRR; 2.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Leucine-rich repeat {ECO:0000256|ARBA:ARBA00022614};
KW Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..717
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003412796"
FT DOMAIN 102..159
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT REGION 169..254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 266..294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 658..677
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 169..183
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 184..214
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 215..230
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 231..254
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 277..293
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 661..675
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 717 AA; 82098 MW; D7E0F4FC3B10F325 CRC64;
MQKAVLLYIL LLISLGIALA QNENNGMQRR ARRRMRHRGL RKGSMIYRRI NKKPQVIRPI
PTTPVPIIPL IDNYGDITGI IDSLVGLDGQ ESSYNILPEK QGRCFANGMI MYENAVWSPK
PCITCLCSKG KVLCDETLCH PLTCDNPKIP EGECCPVCTN IGSSTISLGD ASEYSGDSPK
PNDHSNPVQH DRQKQVQRNE KEGEVLVKTE MHIKKQGSIK RKRIKIHKQR RKQGDTNVQM
EKEENLRAEE EKRKAYEEEE LRIEKQLKKE EEQNEIQESE RSEEEEEEIE EDDILRGDVF
RMPPRFPIPV PSTEMPPLPT GCSVLDNTVS CINAKLTQIP PIMEEEITSI ELVGNAITTI
PGAAFNGIPN LERIDLRKNN ITSSGIEPDA FKNLKNLQRL YMDGNVLVHV PVGLPSTLEE
LKINENQLHA IDENSFQGLK KLVTLELEGN KLSEANVSPL AFQPLKSLSY LRLARNRFRI
IPQGLPHSIE ELYLENNEIE EITDICFNHT KNLNTIVLRH NKLEEGRIAP LAWINHENLE
SIDLSYNKLY HVPSYLPKSL VHLVLIGNQI DRIPGFVFGH MRPGLEYLYL SFNKLDDDGI
DPVSFYGAYH SLREVFLDHN QLKSVPLGIA EMRSLNFLRL NNNKIRAVPT ERICRIRRDN
EDDEEEDDSE EDHDYEDSQL EHLHLEYNYI NTRDLSPYAF SCVRSYSSVI LKPQKIK
//