ID H9GE16_ANOCA Unreviewed; 943 AA.
AC H9GE16;
DT 16-MAY-2012, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 2.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=Thrombospondin type 1 domain containing 4 {ECO:0000313|Ensembl:ENSACAP00000008388.3};
GN Name=THSD4 {ECO:0000313|Ensembl:ENSACAP00000008388.3};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000008388.3, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000008388.3}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000008388.3};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000008388.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H9GE16; -.
DR STRING; 28377.ENSACAP00000008388; -.
DR Ensembl; ENSACAT00000008569.4; ENSACAP00000008388.3; ENSACAG00000008557.4.
DR eggNOG; KOG3538; Eukaryota.
DR eggNOG; KOG4597; Eukaryota.
DR GeneTree; ENSGT00940000156594; -.
DR HOGENOM; CLU_000660_6_0_1; -.
DR InParanoid; H9GE16; -.
DR TreeFam; TF316874; -.
DR Proteomes; UP000001646; Unplaced.
DR Bgee; ENSACAG00000008557; Expressed in hindlimb bud and 12 other cell types or tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0001527; C:microfibril; IEA:Ensembl.
DR GO; GO:0048251; P:elastic fiber assembly; IEA:Ensembl.
DR GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR GO; GO:0160054; P:microfibril assembly; IEA:Ensembl.
DR Gene3D; 2.60.120.830; -; 1.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 6.
DR InterPro; IPR045371; ADAMTS_CR_3.
DR InterPro; IPR010294; ADAMTS_spacer1.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR PANTHER; PTHR13723; ADAMTS A DISINTEGRIN AND METALLOPROTEASE WITH THROMBOSPONDIN MOTIFS PROTEASE; 1.
DR PANTHER; PTHR13723:SF281; NO LONG NERVE CORD, ISOFORM C; 1.
DR Pfam; PF19236; ADAMTS_CR_3; 1.
DR Pfam; PF05986; ADAMTS_spacer1; 1.
DR Pfam; PF19030; TSP1_ADAMTS; 5.
DR Pfam; PF00090; TSP_1; 1.
DR SMART; SM00209; TSP1; 6.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 6.
DR PROSITE; PS50092; TSP1; 5.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..943
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5032563821"
FT DOMAIN 344..431
FT /note="ADAMTS/ADAMTS-like cysteine-rich"
FT /evidence="ECO:0000259|Pfam:PF19236"
FT DOMAIN 434..548
FT /note="ADAMTS/ADAMTS-like Spacer 1"
FT /evidence="ECO:0000259|Pfam:PF05986"
FT REGION 132..268
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 586..628
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..170
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 943 AA; 105605 MW; B4E6008FB9307496 CRC64;
MMLDHYKGSL RILSSIIVLG FQLVIPQPSI EHRKVPQRIE EQNTAAEEDN TGVPGVWGSW
GPWSACSRSC SGGVMEQTRP CLPGYYYERN YRRPGQYAAP ERTLAPHQQA PHQEQQLSPY
SGHVISAIRT SVPLHRNEEA SRASFRTGVP SGGRNDSHLS KGTSRGARPS QSRRRSPKLE
RRGRNKNPIG PGKYGYGKVP YILPLQTDTG QAPHKSRRKR QANQPKSVGV PLAQPANPSH
RQRSFYQDDR RPLPSRPQPG SGSFYQPAAP HFQTFPVAQS LFHDSDLNAH LPGSRQSVQS
QASPQRAAVI VCTGAYKQYK LCNTNPCLEN RNIREIQCTS YNNKPFMGRF YEWEPFAEVK
GSQKCELNCR AMGYRFYVRQ AEKVIDGTPC DQNGTSICVS GQCKTIGCDD YLGSDKVVDK
CGICGGDNTA CKVVSGVFKH TLTNLGYHKI VEIPEGATKI NITEMSKSNN YLALRSRTGR
SIINGNWAID RPGRYEGGGT MFTYKRPNEI SSTAGESFLA DGPTNEVLDV YMIHQQPNPG
IHYEYIIPGS YVISPQVPLH RRPGEPFNGQ LDVTESVNYE EETLRRETGT HMGQPAGTFP
IIQPGRFPSH PPDNQVPAGQ PPRRNRDYNW KQIGTTECTV TCGKGSQYPV FHCVNRNTHE
EISESYCDTS TKPSPEEEPC NIFPCPAFWD IGEWSECSKT CGLGMQHRQV LCRQIYANRS
LTVQPYRCQH LEKPETTSTC QLKICSEWQI RTEWTSCSVP CGVGQRTRDV KCVSNLGDVV
DDEECNMKLR PNDIENCDMG PCAKSWFLTE WSDRCSAECG TGVRTRSVIC MTNHISSLPL
EGCGHNRPPD TTPCDNGPCL GKVEWFAGSW SQCSMECGSG TQQREVLCVR KTENSFELLD
PYECSFLERP PGQQTCYLKP CGAKWFSTEW STVSRLPFLF LYQ
//