ID G3TVS2_LOXAF Unreviewed; 1871 AA.
AC G3TVS2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE SubName: Full=Collagen type XII alpha 1 chain {ECO:0000313|Ensembl:ENSLAFP00000019680.1};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSLAFP00000019680.1};
OS Loxodonta africana (African elephant).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000019680.1, ECO:0000313|Proteomes:UP000007646};
RN [1] {ECO:0000313|Ensembl:ENSLAFP00000019680.1, ECO:0000313|Proteomes:UP000007646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000019680.1,
RC ECO:0000313|Proteomes:UP000007646};
RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Loxodonta africana (African elephant).";
RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLAFP00000019680.1}
RP IDENTIFICATION.
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000019680.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSLAFT00000027596.1; ENSLAFP00000019680.1; ENSLAFG00000010089.3.
DR GeneTree; ENSGT00940000154923; -.
DR HOGENOM; CLU_002527_2_0_1; -.
DR Proteomes; UP000007646; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 10.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 10.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF20; COLLAGEN ALPHA-1(XXI) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 10.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 10.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 7.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 10.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007646};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 9..181
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 197..286
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 287..377
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 378..468
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 469..558
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 565..659
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 660..745
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 746..836
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 837..927
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 928..1016
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1017..1104
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1133..1306
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1089..1123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1560..1706
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1740..1871
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1107..1123
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1588..1607
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1634..1648
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1750..1764
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1828..1846
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1871 AA; 202529 MW; C8E3C9E805908A25 CRC64;
ECSVSAWTDL VFLVDGSWSV GRNNFKYILD FIAALVSAFD IGEAKTRVAL AQYSGDPRTE
WQLNAHRDKQ SLLQAVANLP YKGGNTLTGM ALNFIRQQSF KTQAGMRPRA RKIGVLITDG
KSQDDVEGPS KKLKDEGVEL FAVGIKNADE VELKMIATDP DDTHAYNVAD FESLSRIVDD
LTINLCNSVK GPGDLEAPSN LVISERTHRS FRVSWTPPSD SVDRYKVEYY PVSGGKRQEF
YVSRMETSTV LKDLKPETEY VVNVYSVVED EYSEPLKGTE KTLPVPVVSL NIYDVGPTNM
RVQWQPVGGA TGYSLSYEPI NATEPTKAKE MRVGPTVNEV QLTDLIPNTE YAVTVQAALH
DLTSEPVTAR QVTSPLPGPR DLKLRDVTHS TMNVLWEPAP GKVRKYIIRY KTPEEDAKEV
EADRSRTNTP LRDLFSQTLY TISVSAVYDE GESPPVTAQD TTKHVPAPTN LRITEVTPET
FRGTWDHGAS DVSLYRITWA PFGSSDKMET ILNGDENTLV FQNLNPNTLY EVSVTAIYPD
ESESDDLVGS ERTGFFDNLS APKSGPRNLQ VYNATSNSLT VKWDPASGRV QKYRITYQPS
TGEGNEQTTT IGGRQNSVVL QKLKPDTPYT ITVSSLYPDG EGGRMTGRGK TKPLNTVRNL
RVYDPSTSTL NVRWDHAEGN PRQYKLFYAP TAGGPEELVP IPGNTNYAIL RNLQPDTPYT
VTVVPVYSEG DGGRTSDTGR TLVRGLARNV QVYNPTPNSL DVRWDPAPGP VQQYRIVYSP
VAGTRPSESI VVPGNTGMVH LERLIPDTPY SVNIVALYSD GEGNPSPGQG RTLPRSGPRN
LRVFGETTNS LSVAWDHADG PVQQYRIIYS PTVGDPIDEY TTVPGRRNNV ILQPLQPDTP
YKITVIPVYE DGDGGHLTGN GRTVGLLPPQ NIHISDEWYT RFRVSWDPSP SPVLGYKIVY
KPVDSSEPME AFVGEVTSYT LHNLNPSTTY DVNVYAQYDS GLSVPLTDKG TTLYLNVTDL
KTYQVGWDTF CVRWSPHRAA TSYRLKLNPA DGTRGQEITV RGSETSHCFT GLSPDTEYGV
TVFVQTPNLE GPGVPVKEHT TVKPTEAPTE PPTPPPPPTI PPARDVCKGA KADIVFLTDA
SWSIGDDNFN KVVKFIFNTV GAFDEISPAG IQVSFVQYSD EVKSEFKLNT YNDKAQALGA
LQNIRYRGGN TRTGKALTFI KEKVLTWESG MRKNVPKVLV VVTDGRSQDE VKKAALIIQQ
SGFSVFVVGV ADVEYNELAN IASKPSERHV FIVDDFESFE KIEDNLITFV CETATSNCPL
IYLDGYTSPG FKMLEAYNLT EKNFASVQGV SLESGSFPSY SAYRLQKNAF VNQPTVDLHP
NGLPPSYTII LLFRLLPETP NDPFAIWQIT DRDYKPQVGV IVDPSSKTLS FFNKDTRGEV
QTITFDTDEV KTLFYGSFHK IHIVVTSKSV KIYIDCYEII EKDIKEAGNI TTDGYEILGK
LLKGERKSAT FQIQNFDIVC SPVWTSRDRC CDIPSRRDEA KCPALPNACT CTQDSVGPPG
PPGPAGGPGA KGPRGERGIS GAIGPPGPRG DTGPPGPQGP PGPQGPNGLS IPGEQGRQGM
KGDAGEPGLP GRTGTPGLPG PPGPMGPPGD RGFTGKDGAM GPRGPPGPPG SPGSPGVTGA
SGKPGKPGDH GRPGPSGLKG EKGDRGDIAS QNMMRAVARQ VCEQLISGQM NRFNQMLNQI
PNDYHSNRNQ PGPPGPPGPP GNAGARGEPG PGGQPGFPGS PGMQGPPGER GLPGEKGERG
IGSPGPRGSP GPPGPQGESR TGPPGSTGSR GPPGPPGRPG NLGIRGPPGP PGYCDSSQCA
SIPYNGQGYP G
//