GenomeNet

Database: UniProt
Entry: G3TVS2_LOXAF
LinkDB: G3TVS2_LOXAF
Original site: G3TVS2_LOXAF 
ID   G3TVS2_LOXAF            Unreviewed;      1871 AA.
AC   G3TVS2;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   27-MAR-2024, entry version 58.
DE   SubName: Full=Collagen type XII alpha 1 chain {ECO:0000313|Ensembl:ENSLAFP00000019680.1};
GN   Name=COL12A1 {ECO:0000313|Ensembl:ENSLAFP00000019680.1};
OS   Loxodonta africana (African elephant).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX   NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000019680.1, ECO:0000313|Proteomes:UP000007646};
RN   [1] {ECO:0000313|Ensembl:ENSLAFP00000019680.1, ECO:0000313|Proteomes:UP000007646}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000019680.1,
RC   ECO:0000313|Proteomes:UP000007646};
RA   Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Loxodonta africana (African elephant).";
RL   Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLAFP00000019680.1}
RP   IDENTIFICATION.
RC   STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000019680.1};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSLAFT00000027596.1; ENSLAFP00000019680.1; ENSLAFG00000010089.3.
DR   GeneTree; ENSGT00940000154923; -.
DR   HOGENOM; CLU_002527_2_0_1; -.
DR   Proteomes; UP000007646; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 10.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 10.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF20; COLLAGEN ALPHA-1(XXI) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF00041; fn3; 10.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 10.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 7.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50853; FN3; 10.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007646};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          9..181
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          197..286
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          287..377
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          378..468
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          469..558
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          565..659
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          660..745
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          746..836
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          837..927
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          928..1016
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1017..1104
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1133..1306
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          1089..1123
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1560..1706
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1740..1871
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1107..1123
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1588..1607
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1634..1648
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1750..1764
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1828..1846
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1871 AA;  202529 MW;  C8E3C9E805908A25 CRC64;
     ECSVSAWTDL VFLVDGSWSV GRNNFKYILD FIAALVSAFD IGEAKTRVAL AQYSGDPRTE
     WQLNAHRDKQ SLLQAVANLP YKGGNTLTGM ALNFIRQQSF KTQAGMRPRA RKIGVLITDG
     KSQDDVEGPS KKLKDEGVEL FAVGIKNADE VELKMIATDP DDTHAYNVAD FESLSRIVDD
     LTINLCNSVK GPGDLEAPSN LVISERTHRS FRVSWTPPSD SVDRYKVEYY PVSGGKRQEF
     YVSRMETSTV LKDLKPETEY VVNVYSVVED EYSEPLKGTE KTLPVPVVSL NIYDVGPTNM
     RVQWQPVGGA TGYSLSYEPI NATEPTKAKE MRVGPTVNEV QLTDLIPNTE YAVTVQAALH
     DLTSEPVTAR QVTSPLPGPR DLKLRDVTHS TMNVLWEPAP GKVRKYIIRY KTPEEDAKEV
     EADRSRTNTP LRDLFSQTLY TISVSAVYDE GESPPVTAQD TTKHVPAPTN LRITEVTPET
     FRGTWDHGAS DVSLYRITWA PFGSSDKMET ILNGDENTLV FQNLNPNTLY EVSVTAIYPD
     ESESDDLVGS ERTGFFDNLS APKSGPRNLQ VYNATSNSLT VKWDPASGRV QKYRITYQPS
     TGEGNEQTTT IGGRQNSVVL QKLKPDTPYT ITVSSLYPDG EGGRMTGRGK TKPLNTVRNL
     RVYDPSTSTL NVRWDHAEGN PRQYKLFYAP TAGGPEELVP IPGNTNYAIL RNLQPDTPYT
     VTVVPVYSEG DGGRTSDTGR TLVRGLARNV QVYNPTPNSL DVRWDPAPGP VQQYRIVYSP
     VAGTRPSESI VVPGNTGMVH LERLIPDTPY SVNIVALYSD GEGNPSPGQG RTLPRSGPRN
     LRVFGETTNS LSVAWDHADG PVQQYRIIYS PTVGDPIDEY TTVPGRRNNV ILQPLQPDTP
     YKITVIPVYE DGDGGHLTGN GRTVGLLPPQ NIHISDEWYT RFRVSWDPSP SPVLGYKIVY
     KPVDSSEPME AFVGEVTSYT LHNLNPSTTY DVNVYAQYDS GLSVPLTDKG TTLYLNVTDL
     KTYQVGWDTF CVRWSPHRAA TSYRLKLNPA DGTRGQEITV RGSETSHCFT GLSPDTEYGV
     TVFVQTPNLE GPGVPVKEHT TVKPTEAPTE PPTPPPPPTI PPARDVCKGA KADIVFLTDA
     SWSIGDDNFN KVVKFIFNTV GAFDEISPAG IQVSFVQYSD EVKSEFKLNT YNDKAQALGA
     LQNIRYRGGN TRTGKALTFI KEKVLTWESG MRKNVPKVLV VVTDGRSQDE VKKAALIIQQ
     SGFSVFVVGV ADVEYNELAN IASKPSERHV FIVDDFESFE KIEDNLITFV CETATSNCPL
     IYLDGYTSPG FKMLEAYNLT EKNFASVQGV SLESGSFPSY SAYRLQKNAF VNQPTVDLHP
     NGLPPSYTII LLFRLLPETP NDPFAIWQIT DRDYKPQVGV IVDPSSKTLS FFNKDTRGEV
     QTITFDTDEV KTLFYGSFHK IHIVVTSKSV KIYIDCYEII EKDIKEAGNI TTDGYEILGK
     LLKGERKSAT FQIQNFDIVC SPVWTSRDRC CDIPSRRDEA KCPALPNACT CTQDSVGPPG
     PPGPAGGPGA KGPRGERGIS GAIGPPGPRG DTGPPGPQGP PGPQGPNGLS IPGEQGRQGM
     KGDAGEPGLP GRTGTPGLPG PPGPMGPPGD RGFTGKDGAM GPRGPPGPPG SPGSPGVTGA
     SGKPGKPGDH GRPGPSGLKG EKGDRGDIAS QNMMRAVARQ VCEQLISGQM NRFNQMLNQI
     PNDYHSNRNQ PGPPGPPGPP GNAGARGEPG PGGQPGFPGS PGMQGPPGER GLPGEKGERG
     IGSPGPRGSP GPPGPQGESR TGPPGSTGSR GPPGPPGRPG NLGIRGPPGP PGYCDSSQCA
     SIPYNGQGYP G
//
DBGET integrated database retrieval system