ID G3NUQ8_GASAC Unreviewed; 3069 AA.
AC G3NUQ8;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE SubName: Full=Collagen, type XII, alpha 1a {ECO:0000313|Ensembl:ENSGACP00000009077.1};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSGACP00000009077.1};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000009077.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000009077.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000009077.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSGACT00000009097.1; ENSGACP00000009077.1; ENSGACG00000006800.1.
DR GeneTree; ENSGT00940000154923; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000006800; Expressed in zone of skin and 4 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 18.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 18.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 18.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 18.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 13.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 18.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..3069
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003448829"
FT DOMAIN 26..116
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 137..309
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 333..422
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 437..613
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 631..722
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 724..815
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 816..905
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 907..996
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 997..1087
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1088..1178
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1199..1371
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1388..1476
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1477..1568
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1569..1658
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1659..1748
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1756..1844
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1846..1935
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1936..2026
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2027..2117
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2118..2206
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2207..2297
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2325..2498
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1074..1094
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2748..2901
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2935..3069
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2853..2868
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3054..3069
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3069 AA; 332194 MW; 70BDDD48D1DA2EF9 CRC64;
MKIGLSLATA AFLAALLCST DAQVEPPSDM KFKILNENSV EMSWTRPSSQ IDGFRLQVVS
EADEPVRDFT LDAYATKTKI TDLTPDLDYS VSINSYYGSD ESIPISGQLT IQSSNTSGQV
KKPKSDAIKC SVSAIADLVF LVDGSWSVGR ENFKHIRSFI ASLAGAFDIG EEKTRVAVVQ
YSTDTRTEFP LTRYSRRGDL LQAINNLPYK GGNTMTGDAI DYLLQNIFTE AAGSRKGFPK
VAMIITDGKS QDPVEEYAKR LRNIGVEIFV LGIKGADEDE LREMASTPHN RHMYNVPDFE
GIQEVQKKII EEVCFGVDEQ LGSLISGEEM VDPASNLQVT EMASKSMRVT WDASPGDITG
YKLTLNPMLP GMKRQELYIG PTQTSINVRD LSPETEYEIS LYALKGLTPS EPTIEMGKTQ
PIKVSTECSL GVDVQADVVL LVDGSYSIGK ANFAKVQGFL EVLVTAFDIG PNKVQISLVQ
YSRDSHTEFF LDTHHDIGAV VKAVRTFPYR GGSTNTGRAM TYVKDKIFIT SRGARQNVPR
VMVLITDGKS SDSFKDAATA LRNIDVEIFA VGVKDAVRSE LEAIANQPSD SHVYEVEDFD
AFKRISKELT QSICLRIEQE LLNIQKKSLL PPSDLVFSEV TSRSFRATWK APTTMVMSYM
VRFRKAEDVT ADYISIAVPG DTTTAVLPHL IPLTAYEVNV FAQYEKGDSF ALSGEETTLD
EKGSVQNLKV KDETTNSFRV SWQAAPGAVI RYRLSYIPLS GAGETLEAQT IGSETNIVLE
ELFSSTTYRV SVSAEYATGL GDEMQVDGTT KEDRGSPRNL RVSDETISTM KLEWLAARGN
VLQYRIAFKP ANGGERKEIS VKGGTTQAVL KNLQPGTEYE LFVSARYSSG LGDPLLGTGT
TLEELGSPRD LVTRDVTDTS FAAFWVAAPG NVRQYRITWR SLFTEEAGEK TIPGDVTATV
LEGLTPETRY QVSVLAGYGR GEGQPLVGEE TTDISAAGRT IAVSEETEKS MKVTWQPAPG
NVLNYRVTYK PVAGGRKLAA KVPAGTTHTV LRNLTPLTTY EIIVLPVYRS GEGKARQGEG
TTLTPYKGPR NLQTSEPAKT SFRVTWDHAP GDVKGYKVIF HPSGKDIDLE ELLVGPYDNT
VVLEELRAGT KYTVNVVGMF EGGESMPLAG EEKTTLVDAP EPPPYVASDV TCKTAAQADI
VLLVDGSWSI GRLNFKTIRS FIARMVQVFE IGPERVQIGL AQYSGDPKTE WHLDAHRTRK
SLLDAVANLP YKGGNTMTGL ALNYLLQNNF KENVGMRPKA RKIGVLITDG KSQDDVILNS
QNLRDQGIEL YAVGVKNADE NELRSIATDP DSIHMFNVVD FGFLLEIVDT LTDNLCNSVK
GPGGAPDAPT ELRTSEVTHH SFRATWLAPE DPVDKYRVEY ITLAGQQQQV FVDGTETTVV
LQSLSPLTQY MINVYSVVGE ESSGPLEGTE TTLPLSAVSR MYIFDERTTT MRVRWEQAAG
ASGYMLLYSA INATESTVEQ EMRVGRDTTE VQLVKLLPNT AYTLSLFALH GESASEPLTN
QGVTRPLPPA GKLNVRDVTH STLRLIWDAA PGPVRKYLIT YKPEEGEAKE LEVDGSVTTR
QLDSLISQTE YSLAVTPIYD EGPSQPMLGE AITDVVPAPK NLQLTEVTET SFRATWEHGA
PDVAMYRLAW VKKGDSNIES FILNNDEITY VLENLDPDTP YDVSVTAIYP DESESEDLLG
SERTLPIGSS TVPNGPPTNL VVFNETTTTL NARWNPAPGR VQNYKITYVP TAGGRSLTTQ
VGGKKTTVLL PKLTPDTEYS ITVVAVYAKG LSPELKGPGK TRPLGGVRNL QVTDPTTSTL
NVVWEAAEGN VRQYKVFYVP AAGGEDQMVQ VPGSTLNTVL KNLQSDTVYT ATVVPVYSAG
EGQRMSERGK TLMRSPVRNI QVFNPTTNTL NVRWEAATGP VVKYRVVYSP LNGARPSESV
LIPGTTTEAF LEQLLPDTGY NVGVVAMYSD GEGPAISDAG KTLPRSGPRN MRVYDPTTST
LSVSWEHADG PVTQYRITYT QTTGDPIEEY TVVPGNRNNV VLKNLDADTP YDITVTAMYA
DGAGGQLEGD GRTVGMLGPR NLLVSDEWYT RFRVSWDPAP SRVTGYKIIY QAEGSDESLE
VFAGDVTSHQ LHNLKPGTTY DLKVLAQYEA GLSAPLIGTG TTLYLNVTDL STYNVGYDTF
CIRWTPHRAA TSYRLKVNPI DPSKSGAKEI TVRGSESNYC FDGLTPDSLY EATVFTQTPN
LEGPGVKVKE RTLVKPTEVP TEPPSPPPPA TVPPALDVCK GAKADVVFLI DGSWSIGDDS
FQKVLQFVKS MTGAFDVINP RGMQVSFVQY SDDAKTEFKL NTYQDKGVVI SALQNVRYRG
GNTKTGIALK HVYEKVFTSD SGMRRNVPKV LVVVTDGRSQ DEVKKSAEKL QHSGYSVFVV
GVADVDKSEL RIIGSKPTER HVFVVDDYDA FAKIQDNLIT FICETATSTC PLIYIDGFTT
PGFRMLEAFN ITDRTFAGIN GVSMEAGSFN SYIAYRLHKD SFLNQPTKEL HPEGLPPSYT
IILLFRLLPD TTSEPFDIWQ ISDKNNNPEV GITVNPSSKT ITFYNKDTRG EIQRATFNDQ
QVKRVFHGSF HKLHISVSAE KVKLNLDCQE VAEKPIKEAN NITLDGYEVL GKLAKSAGGK
RQSATFQLQM FDIVCSLSWI SRDKCCDLPS TRDEAKCPSL PHSCTCTQDS IGPEGPPGPS
GSPGSKGPRG DRGQNGNPGP VGPRGDLGLP GAMGPPGPQG PNGLSLPGEP GRQGPKGDAG
DPGLPGLQGS PGQRGPLGPV GPSGVRGPPG KEGPSGPRGP PGPMGNPGNP GVPGITGKPG
KSGDTGNPGP VGLKGEKGER GDFASQNMMR SIARQVCEQL VSGQMSRIDT LLNQIPSGYR
SNTPGPAGPP GPPGNEGSRG EPGQPGRSGF PGNPGLPGSP GERGLAGEKG ERGSPGTGIR
GQRGPLGPPG PPGESRTGSP GATGSTGPRG PPGRQGTPGV RGPPGPSGYC DSSQCVGIPY
NGQGYTGTL
//