ID G3T503_LOXAF Unreviewed; 3063 AA.
AC G3T503;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=Collagen type XII alpha 1 chain {ECO:0000313|Ensembl:ENSLAFP00000008467.3};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSLAFP00000008467.3};
OS Loxodonta africana (African elephant).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000008467.3, ECO:0000313|Proteomes:UP000007646};
RN [1] {ECO:0000313|Ensembl:ENSLAFP00000008467.3, ECO:0000313|Proteomes:UP000007646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000008467.3,
RC ECO:0000313|Proteomes:UP000007646};
RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Loxodonta africana (African elephant).";
RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLAFP00000008467.3}
RP IDENTIFICATION.
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000008467.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSLAFT00000010100.3; ENSLAFP00000008467.3; ENSLAFG00000010089.3.
DR GeneTree; ENSGT00940000154923; -.
DR HOGENOM; CLU_000467_0_0_1; -.
DR Proteomes; UP000007646; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 18.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 3.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 18.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF20; COLLAGEN ALPHA-1(XXI) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 18.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 18.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 11.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 18.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007646};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..3063
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003454879"
FT DOMAIN 27..117
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 140..316
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 336..426
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 440..616
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 634..721
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 725..816
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 817..905
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 907..999
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1000..1087
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1089..1179
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1227..1366
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1382..1471
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1472..1562
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1563..1653
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1654..1749
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1752..1846
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1847..1932
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1933..2023
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2024..2114
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2115..2203
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2204..2291
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2320..2493
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 799..827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1076..1099
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2276..2310
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2747..2893
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2927..3063
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2294..2310
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2775..2794
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2821..2835
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2937..2951
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3015..3033
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3063 AA; 333126 MW; 0B609A67C355CCB6 CRC64;
MRNRLPPALA ALGAALLLSS IEAEVDPPSD LNFKIIDENT VHMSWARPAD PIVGYRITVD
PTTEGPTKEF TLAASTTETL LSDLIPEIEY VVTIASYDEV EESVPVIGQL TIQTGGPTKP
GEKKPRKTEI QKCSVSAWTD LVFLVDGSWS VGRNNFKYIL DFIAALVSAF DIGEAKTRVG
VIQYSSDTRT EFNLNQYYQR DELLAAIKKI PYKGGNTMTG DAIDYLIKNT FTESAGARVG
FPKVAIIITD GKSQDEVEIP ARELRNIGVE VFSLGIKAAD AKELKQIAST PSLNHVFNVA
NFDAIVDIQN EIISQVCSGV DEQLGELVSG EEVIEPPSNL IATEVSSKYI KLSWSPSPSP
VTSYKVLLTP MTAGSRQHAL SVGPQTTMLS VRDLSADTEY QISVSAMKGL TSSEPVSIME
KTQPMKVQVE CSRGVDIKAD IVFLVDGSYS IGIANFVKVR AFLEVLVKSF DISPNRVQIS
LVQYSRDPHT EFTLKKFTKV EDIIEAINTF PYRGGSTNTG KAMTYVREKI FVPSKGSRSN
VPKVMILITD GKSSDAFRDP AIKLRNSDVE IFAVGVKDAV RSELEAIASP PPETHVFTVE
DFDAFQRISF ELTQSICLRI EQELAAIKKK AYVPPKNLTF SEVTSYSFKS NWSPAGENVF
SYHITYKEMA GDGEVTVVEP ASSTSVVLTN LKPETQYLVN VTAEYEDGFS VPISGEETTE
EVKGAPRNLK VTDETTDSFK ITWTQAPGRV LRYRIIYKPV NGGESKEVTT PANQRRRTLE
NLTPDTKYEV SVIPEYYSGP GSPLTGNAAT EEVRGNPRDL KVSDPTTSTM KLSWTRAPGK
VKQYLITYTA VAGGETQEVT VRGDTTNTVL SGLKEGTQYA LSVTALYASG AGDALFGEGT
TLEERGSPQN LVTKDITDTS IGAYWTSAPG MVRGYRVSWK SLYDDIETGE KNLPGDSIHT
VIENLQPETK YKISVFATYT SGEGEPLTGE ATTEISQESK TLKVDEETEN TMRVTWKPAP
GKVVNYRVIY RPRGGGKQMV AKVPPTVTST VLKRLQPQTT YDITVLPIYK TGEGKLRQGS
GTTASRFKSP RNLKTSDPTM SSFRVTWEPA PGEVKGYKVT FHPTGDDRRL GELVVGPYDN
TVVLEELRAG TTYKVNVFGM FDGGESSPLV GQEMTTLSDT TVMPVLSSGM ERLTRAEADI
VIVMIVEHCG PNLRPVKSSF SYVSLEGPKE YKCTLAQYSG DPRTEWQLNA HRDKQSLLQA
VANLPYKGGN TLTGMALNFI RQQSFKTQAG MRPRARKIGV LITDGKSQDD VEGPSKKLKD
EGVELFAVGI KNADEVELKM IATDPDDTHA YNVADFESLS RIVDDLTINL CNSVKGPGDL
EAPSNLVISE RTHRSFRVSW TPPSDSVDRY KVEYYPVSGG KRQEFYVSRM ETSTVLKDLK
PETEYVVNVY SVVEDEYSEP LKGTEKTLPV PVVSLNIYDV GPTNMRVQWQ PVGGATGYSL
SYEPINATEP TKAKEMRVGP TVNEVQLTDL IPNTEYAVTV QAALHDLTSE PVTARQVTSP
LPGPRDLKLR DVTHSTMNVL WEPAPGKVRK YIIRYKTPEE DAKEVEADRS RTNTPLRDLF
SQTLYTISVS AVYDEGESPP VTAQDTTKHV PAPTNLRITE VTPETFRGTW DHGASDVSLY
RITWAPFGSS DKMETILNGD ENTLVFQNLN PNTLYEVSVT AIYPDESESD DLVGSERTVR
LIPLTTQAPK SGPRNLQVYN ATSNSLTVKW DPASGRVQKY RITYQPSTGE GNEQTTTIGG
RQNSVVLQKL KPDTPYTITV SSLYPDGEGG RMTGRGKTKP LNTVRNLRVY DPSTSTLNVR
WDHAEGNPRQ YKLFYAPTAG GPEELVPIPG NTNYAILRNL QPDTPYTVTV VPVYSEGDGG
RTSDTGRTLV RGLARNVQVY NPTPNSLDVR WDPAPGPVQQ YRIVYSPVAG TRPSESIVVP
GNTGMVHLER LIPDTPYSVN IVALYSDGEG NPSPGQGRTL PRSGPRNLRV FGETTNSLSV
AWDHADGPVQ QYRIIYSPTV GDPIDEYTTV PGRRNNVILQ PLQPDTPYKI TVIPVYEDGD
GGHLTGNGRT VGLLPPQNIH ISDEWYTRFR VSWDPSPSPV LGYKIVYKPV DSSEPMEAFV
GEVTSYTLHN LNPSTTYDVN VYAQYDSGLS VPLTDKGTTL YLNVTDLKTY QVGWDTFCVR
WSPHRAATSY RLKLNPADGT RGQEITVRGS ETSHCFTGLS PDTEYGVTVF VQTPNLEGPG
VPVKEHTTVK PTEAPTEPPT PPPPPTIPPA RDVCKGAKAD IVFLTDASWS IGDDNFNKVV
KFIFNTVGAF DEISPAGIQV SFVQYSDEVK SEFKLNTYND KAQALGALQN IRYRGGNTRT
GKALTFIKEK VLTWESGMRK NVPKVLVVVT DGRSQDEVKK AALIIQQSGF SVFVVGVADV
EYNELANIAS KPSERHVFIV DDFESFEKIE DNLITFVCET ATSNCPLIYL DGYTSPGFKM
LEAYNLTEKN FASVQGVSLE SGSFPSYSAY RLQKNAFVNQ PTVDLHPNGL PPSYTIILLF
RLLPETPNDP FAIWQITDRD YKPQVGVIVD PSSKTLSFFN KDTRGEVQTI TFDTDEVKTL
FYGSFHKIHI VVTSKSVKIY IDCYEIIEKD IKEAGNITTD GYEILGKLLK GERKSATFQI
QNFDIVCSPV WTSRDRCCDI PSRRDEAKCP ALPNACTCTQ DSVGPPGPPG PAGGPGAKGP
RGERGISGAI GPPGPRGDTG PPGPQGPPGP QGPNGLSIPG EQGRQGMKGD AGEPGLPGRT
GTPGLPGPPG PMGPPGDRGF TGKDGAMGPR GPPGPPGSPG SPGVTGASGK PGKPGDHGRP
GPSGLKGEKG DRGDIASQNM MRAVARQVCE QLISGQMNRF NQMLNQIPND YHSNRNQPGP
PGPPGPPGNA GARGEPGPGG QPGFPGSPGM QGPPGERGLP GEKGERGIGS PGPRGSPGPP
GPQGESRTGP PGSTGSRGPP GPPGRPGNLG IRGPPGPPGY CDSSQCASIP YNGQGYPGML
LPL
//