ID G3NVJ6_GASAC Unreviewed; 1753 AA.
AC G3NVJ6;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 68.
DE SubName: Full=Collagen, type XIV, alpha 1a {ECO:0000313|Ensembl:ENSGACP00000009365.1};
GN Name=COL14A1 {ECO:0000313|Ensembl:ENSGACP00000009365.1};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000009365.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000009365.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000009365.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 69293.ENSGACP00000009365; -.
DR Ensembl; ENSGACT00000009385.1; ENSGACP00000009365.1; ENSGACG00000007065.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR InParanoid; G3NVJ6; -.
DR OMA; VVYHPAQ; -.
DR TreeFam; TF329914; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000007065; Expressed in telencephalon and 2 other cell types or tissues.
DR GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0070831; P:basement membrane assembly; IEA:Ensembl.
DR CDD; cd00063; FN3; 7.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 7.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 7.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 8.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 3..99
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 131..304
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 330..419
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 420..511
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 512..600
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 601..691
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 719..810
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 811..901
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 902..989
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1018..1191
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 78..117
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1436..1599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1633..1753
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 93..114
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1482..1496
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1722..1744
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1753 AA; 188157 MW; 36441710FC5CCEDE CRC64;
VPAPRRLRFK VLSPSKLLIS WKEPKGDFDS YLFLYNTTPA SGGQQREIII SKSDTKVLIT
DYSPMKDYTV SVLSVSGGEQ SRPLQGRHKG ESVYQTQRLK EGKQNDKSKE TNSRPKQQVD
QFVCHTEAIA DIVILVDGSW SIGRLNFRLV RMFLENLVDA FDVGINKTRI VGLAQYSGDP
RIEWHLNAFS TKDAVIDAVK NLPYKGGNTL TGLALTYVLE NCFKPESGSR PGVPKIGILV
TDGKSQDDVI PPAESLRNAG IELFAIGVKN ADENELVSIA SEPDHTHVYN VADFNVMSSI
VEGLTKTVCE QVEQQENDIK QKYIPEKTGA PLELVTSEVT ARGFRVSWSH APGNVEKYRV
LYYPDGEGEP QEAVVDGGET SAVLQHLHSL TEYQLAVFAV YANESSEALR GSETTLGLPA
VTTLQLSDVT HSTMRARWNS VEGVSGYMLL YAPLTDDVDS VEKEVKVLDA VTEMELHGLT
PRTEYTVTVY AMYGEEASDP MTNQKSTLPL SPPSNLQFSD ITHNSAHISW APVPRGVKGY
RIVWIKTDGA VTEEVEVGPD TSYDLSGLTS LMEYSVAIFA LYKEGQSEAL TDRFTTTPVP
GPLDLRSSNV GTDGFQVSWD HSADDIVLYR LSWAPFTGGD TKEVVLSGID NQYTLTGLSP
STEYEAMLTA VFKDESESDT VSVTETTLAE TTTIATTTQG EKTALSVEKI CGVTAARRAV
RNLRLRDETT QSMEASWELQ DPHVQSYRVS YAGLRGNRRE ESVSRSSVQM RAVLQPLLPD
TQYKVTVTPV YTNGRDGISV SALGFTLPLL SPANLRVSEE WYNRFRVTWD PPPSPTAGYR
IVYQPINVPG PFLETTVGDD VNSMLLLNLL SGTEYNVQVT ASYPTGQSEP LLVNAKTLFL
GVSGMSTYQV RPNSLCVQWQ PLLRATLYRV SIQSTLNGQR QEVSLGGGAS RQCFYDLTPS
SQYQISIHTQ MQQMEGPPVS ITDMTLPPPT QTPTEPPTTE PPPTIPPAKE VCREAKADLA
FLVDGSWSIG DDNFMKITRF LYSTMGSLDL IGPDGTQVAI AQFSDDARTE FQLSSHGNKE
ALLDAIQRIR YKGGNTKTGR AIKHVKESIF TPEAGARRGV PKVLVVLTDG RSQDDVNKVS
KEMQVDGYII FAIGFADADY GELVNIASKP SDRHVFFVDD LDAVKKIEEQ LITFVCEAAT
ATCPSVLMSG NTMAGFHMME KFGLVEKEYS TIAGVSLEPG SFNSFPCYRL HRDALVSQPT
KYLHPEGLPS DYTISMMLRL LPETPEEPFA LWEILNGNDE PLVGLILDNS GKTLTFFNHD
YKGHFQTVTF EGTEIKKLFH GSFHKLHVTI SKTSVKVVLD CSVVGEKSVS AAGNITTDGV
EILGRMIQSR GRRDNSAAFQ LQMFDIICST SWASRDKCCE LPALRVEEQC PSMPHACTCS
QESKGPPGPS GPPGGPGIRG ARGDRGESGV TGPQGPVGKI GPSGPSGPPG PQGPSGLSIQ
GPPGAAGGKG ARGEIGPAGQ MGVPGSSGSP GRDGPPGARG LPGNSGPQGR QGPPGPLGSP
GAPGAHGPVG SAGTQGDQGL PGPSGTKGDK GERGDVQSQA AVHAIARQVC EQLIQSHLSR
YNSILNQIPA QAATSVRTVP GPPGEPGRRG SPGPQGEPGP AGRPGFPGAS GQNGLPGERG
LLGDKGERGS PGIGSQGPRG QSGPPGTLPS STPCPPSSGL PGEGRTGNTG TPGRTGNPGY
CDQNSCLGYN VGG
//