ID G1KDU5_ANOCA Unreviewed; 3023 AA.
AC G1KDU5;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 3.
DT 24-JAN-2024, entry version 77.
DE SubName: Full=Collagen type XII alpha 1 chain {ECO:0000313|Ensembl:ENSACAP00000005138.4};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSACAP00000005138.4};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000005138.4, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000005138.4, ECO:0000313|Proteomes:UP000001646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000005138.4,
RC ECO:0000313|Proteomes:UP000001646};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000005138.4}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 28377.ENSACAP00000005138; -.
DR Ensembl; ENSACAT00000005255.4; ENSACAP00000005138.4; ENSACAG00000005078.4.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000154923; -.
DR HOGENOM; CLU_000467_0_0_1; -.
DR OrthoDB; 5353225at2759; -.
DR TreeFam; TF329914; -.
DR Proteomes; UP000001646; Chromosome 1.
DR Bgee; ENSACAG00000005078; Expressed in lung and 13 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0035987; P:endodermal cell differentiation; IBA:GO_Central.
DR CDD; cd00063; FN3; 18.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 18.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020:SF74; -; 1.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 18.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 17.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 12.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 17.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Toxin {ECO:0000256|ARBA:ARBA00022656}.
FT DOMAIN 1..75
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 98..270
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 294..383
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 398..574
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 592..681
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 683..775
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 776..863
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 865..955
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 956..1044
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1046..1136
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1156..1328
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1345..1434
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1435..1526
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1527..1616
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1617..1714
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1717..1811
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1898..1988
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1989..2078
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2080..2168
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2169..2257
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2285..2458
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 941..963
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1030..1057
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2239..2274
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2710..2861
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2892..3010
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2259..2274
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2740..2756
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2904..2918
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2980..2998
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3023 AA; 327949 MW; 89F2E01A7C980C19 CRC64;
MTWTRPSDAI EGYRITVTPT TDGPSREFTL APSSTETLLR DLTADIEYLV TISSYDNIEE
SISVSGQLTI QTGGLITTGE KKIEEIQLQR CSLSAVTDLV FLVDGSWSVG RNNFKYILDF
IVALVSAFDI GEDKTRVGIV QYSSDTRTEF NLNQYYRQRD LIEAIKNIPY KGGNTMTGEA
IEYLMRNTFV ESAGSRKDFP KVAIIITDGK SQDEVEIPAR ELRAAGVEVF SLGIKAADAK
ELKLIASQPS LTHVFNVANF DGIVDIQNEI VSQVCSGVDE QLGELVSGEE AIEPPANLVA
TQISSRSIRI TWSPSPSEIT GYKIDMIPML AGVKQQSLSV GPQTTAFNVK DLSADTEYQI
NVYAMKGLTP SEPITIMEKT QPVKVQVECS KGVDVKADIV LLVDGSYSIG IANFVKVRAF
LEVLVKSFEI SPEKVQISLV QYSRDPHTEF TLNRYNRIDD IIQAINTFPY RGGSTNTGKA
MTYVREKIFV TGRGARPNVP RVMILITDGK SSDAFKDPAI KLRNSDVEIF AVGVKDAVRT
ELEAIATPPA ETHVYTVEDF DAFQRISFEL TQSVCLRIEQ ELREIKKKSY LPARNLQISE
VGSYSFRVTW SPAGPEVLSY LVKYKVAVDG EEFMVSVPAP VTNTVLTNLL PKTTYAVSVI
AEYEDGDGPP LDGEETTLEV RGSPRNLRIT DETTDSFKVG WSPAPGNVLR YRLAYRPVAG
GERRQVTVSA NERATTLQNL KPDTRYEVSV TAEYQSGVGN PLTGHGKTEE VLGRPRDLKV
SDPTTSSLKL SWNSAPGKVQ QYLVTYTPAT GGDTKEVTLR GDTTTTVLRD LEPGTRYGLS
VTALYASGAG DALSGQGDTL EERGSPRDLV TRDITDTTVG VSWTAAPGSV NRYRIVWKSL
YGDDSGETTV PGNIINTVLE NLQPETRYKI SVLASYRSGE GAPLEGEATT EVSPSARTLR
VDDEKETSMR VSWQAAPGRV ISYRVVYRPR RGGRQMVTKV PPSSTTTVLK RLQPLTTYDI
SVIPMYKTGE GKHRQGEGTT ASPFKPPRNL KTSDSTMSSF RVTWEPAPGE VKGYKVTFHP
MGEDRRLGEL VVGPYDNTVV LEELRAGTSY KVNVFGVFEG GESMPLIGEE MTTLSDATVV
PILSTGLECK TRAEADIVLL VDGSWSIGRP NFKTVRSFIA RIVEVFDIGP DKVQIGLAQY
SGDPRTEWQL NSHKTKQSLM DAVANLPYKG GNTLTGMALN FILRNNFKPE AGMRPGARKI
GVLITDGKSQ DDIVAPSQRL KDLGVELYAV GIKNADENEL KQIASDPDET HAYNVGDFTL
LVNIVDDLTV NLCNSVKGPG DLALPPTNLV TSEVTPRSFR VSWTAPAESV DRYRVEYYPA
AGGTPQEVFV SRSETTTVLT GLKPETEYVV NVFSVVEGTN SEPLKGTETT LAIPTVRNLN
AYDITSTTMR VRWEPVSGAS GYVLLYEPVN ATVPATEKEM RLGASVNDIQ LVDLIPNTEY
TLTVHASFGD LIGDPLTTQE VTLPLSGAKS LNIRDITHSS MRVNWERAPG KVRKYKLRYK
ATEDVELKEL EVDPSRTSTV LPDLFSKTLY NVELVAVYDE GDSVPIAARA TTLPVPAPQN
LRTDQVTKTS FRGTWDHGAA DVALYRVMWG PYGGTEKQET ILNGDENTLV FENLTPDTLY
DISVTAIYPD ESESDELIGS ERTLPIIPIT TPVPKSGPRN LQVYNATSNS LTVKWDPAIG
RVQRYRITYR PTSSDGPEQS TTVGGRQNNV VLQKLQPDTP YSVTVSSIYA DGEGGQMTGR
GKTKPLNTVR NLRVYDPTTS TLNVRWDHAE GNPRQYKVFY APVSGGADEL ATVPGNTNYA
ILRNLQPDTQ YKVTVVPSYP EGDGGRVSDN GRTLIRGTPR SIQVYNPTTN SLNVQWEPAP
GPVQQYRVVY APLVGQRPSE SVVVPANMRN VLLERLVPDT PYSVNVVALY ADGEGDPSPG
QGKTLPRSGP RNIRVFDPTT NSLSVQWDHA DGPVQQYRII YSPTVGDPID EYTTVPGRRN
NVLLQPLQSD TPYKITVVAV YEDGDGGQIV GNGKTVGLLA PRNIHISDEW YTRFRVSWDP
APSPVLGYKI VYKPVGSDEP MEVFVGEVTA YTLHNLSPST SYDVNVYAQY DAGLSAPLVD
RGTTLYLNVT DLTSYNVGWD TFCVRWAAHR SASSYRLKLN PADGSRGQEI TVRGTETSHC
FTGLSPDTEY DATIFVQTPN LEGPPVSTRE RTLIKPTEPP TEAPTPPPPP TIPPARDVCK
GAKADIVFLT DASWSIGDDN FNKVVKFIFN TVGGFDLINP AGIQVSFVQY SDDPKPEFNL
NRYDDKALAL GALQNIRYKG GNTKTGKALT FIKNKVLTWE SGMRKGVPKV LVVVTDGRSQ
DEVMKAASVI QHSGFSVFVV GVADVDYHEL AKIASKPSER HVFIVDDFDA FEKIEDNLIT
FVCETATSSC PLMYLDGYTS PGFRMLEAYN LTEKYFASVP GVSLQSGSFP SYMAYRLQKN
AFVSQPTREI HPDGLPKAYT IIFLFRLLPE TPTEPFAIWQ ITDRDYKPQV GVVLDPSNKV
LSFFNKDTRG EIQTVTFEGD EVKKLFYGSF HKVHIVVTPT SAKIYIDCVE ILEKPINEGG
NITNDGYEIL GKLQKGDRKT AALELQNFDI VCNAVWTSRD RCCDIPSRRD EAKCPSLPNA
CTCAQDSVGP PGPPGPAGGP GSKGPRGERG LTGPPGDLGP RGDVGPPGPQ GPPGPQGPNG
LSITGEPGRQ GLKGDAGEPG LPGRSGSPGL TGPPGPMGPP GDRGFTGKDG PAGPRGPPGP
MGAPGVPGVA GPTGKSGKPG DRGSPGPVGL KGEKGDRGDI ASQNMMRAVA RQVCEQMING
QMTRFNQMLN QIPNDYSSNR NQPGPSGPPG PPGPPGSRGE PGAGGRPGFP GSPGMQGPPG
ERGLAGEKGE RGIGSQGPRG LPGAPGPQGE SRMGPPGATG SRGPPGPPGR PGNSGIRGPP
GPPGYCDSSQ CASIAYNGQG YPG
//