ID G3U1Z7_LOXAF Unreviewed; 1512 AA.
AC G3U1Z7;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE RecName: Full=Collagen type XX alpha 1 chain {ECO:0008006|Google:ProtNLM};
OS Loxodonta africana (African elephant).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000021855.1, ECO:0000313|Proteomes:UP000007646};
RN [1] {ECO:0000313|Ensembl:ENSLAFP00000021855.1, ECO:0000313|Proteomes:UP000007646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000021855.1,
RC ECO:0000313|Proteomes:UP000007646};
RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Loxodonta africana (African elephant).";
RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLAFP00000021855.1}
RP IDENTIFICATION.
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000021855.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9785.ENSLAFP00000021855; -.
DR Ensembl; ENSLAFT00000035451.1; ENSLAFP00000021855.1; ENSLAFG00000027493.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR HOGENOM; CLU_002527_0_0_1; -.
DR InParanoid; G3U1Z7; -.
DR OMA; DISGCYG; -.
DR TreeFam; TF329914; -.
DR Proteomes; UP000007646; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 4.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 5.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF39; COLLAGEN ALPHA-1(XX) CHAIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00041; fn3; 4.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 5.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 4.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50853; FN3; 4.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007646};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 15..190
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 212..301
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 392..483
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 484..573
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 591..684
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 341..361
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 786..819
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1291..1448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1486..1512
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 788..807
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1512 AA; 159016 MW; 9838684B101CB357 CRC64;
PTGAPFHCTP AVPMDMVFLV DGSWSTGHRN FQLITDFLAS VVSPFEIDPN KVQVGLTQYS
GDPQTEWDLN SFSTKEEVFS AIGRLHYKGG NTFIGLALTH VLEQNLRPAA GLRVEAVKVV
VLVTDGKSQD DACTAARVLK SLSTDVFTVG VKNADEAELR LLASQPLDIT VHNVLDFPQL
GTLSSLLSRL ICHHVQGRGP RQRDPALDSL PAPTGLVLTP VTCSSIRLSW TPVPQPPLKY
LIVWRPSRGS APREVVVETP TSSAELRNLT SSTEYLMSVL PVYAGGVGEG LQGLATTLPL
SPPHTLALAT VTPQTIHLTW QNVPGATQYL VQCLLTPPKG EEEGREVQQG RSGWPDGGGA
GGKNYAISVQ SLRGSEVSEA RGVQARTPAL APPRHLDFEN VTHNSARVLW EGPQTPARLC
RATYISSKGD HLGQVEIPGN ASSATLGPLS SSTTYTVRVT CVYPGGGSSA LTGRVTTQKV
PSPSQLTVME MPGDEVQLEW AAAAASDVLV YQIKWTPLSD GKAHEISVPG NLCSAVLPGL
GRNAEYDITI LAYYHDGAQS DPVSLCYTPC APSSVWGASA TRDLCAVSQS PPSNLALASE
TPNTLKVSWT PPHGHVLHYR LTYALASSSG PETSVGLISV PGPRSHVTLP DLLAATKYRV
LVSAVYGAGE SMAVSATGQT AGSQGSGHQV MSKHHSIPLV SLTFWPDCRP RQGRTAPGTA
ATVSTLPGLE NGFLVAYCVA LPSGGVQPQA AVCSKQTVGA PEATGCHPCP VPSQRRLRAP
PQGLTQLSAD HRGAQPGATS LPAQGTDAAR DTSRGGMERR GWQGLKAGPS LALRSCAPPY
HPLPSLRPLP STKILCPTLP PAVLPQAPPT GFSSAASVSD TQLTRRARWG AGRQRGSRRL
QGNLLERPLW VQAPGHLVQR LGLSPKPEFC TGAISFEDSC LLLDVLRSSS YSNRNLKMQK
VYAAPSQTTA QHHSGFSPPI LLSISNRHRQ GSMHKYRHRT QCDPGTGTML SAVKGSLPEL
QDVAGQVLGA HGFDPRANEG QSHWEQPYSP FPRPSTSLDP AACPVSTLDS LGQGKFQGKL
RLRGHSGRGP TPSPGIGLCP PLGQAWPAHT LDGVPPCSDI HLAALPPEHT IIFLLRLLPE
TPREAFVLGQ ITTEDFQPAL GVLLDAGWKS LTYFNHHPSA ALQGVTFHLQ DVKKIFFGSF
HKVHVAMGRS KVQLYVDCQK VAEKPFGELD SLPTTGFVML GRLAKARGPR SSSATFQLQT
LQIVCNGSWA EEDRCCNLPA LGDAETCPTS PSVCTCSSET PGPSGPQGPP GLPGKSSTPG
EQGFPGPRGE PGPPGQMGPE GPGGQQGSPG TQGCTIGAVG PPGIKREKGD HGLPGSQGLP
GHHGAPGRVS LQGPKGMRGL EGTDGPPGPP GPRGLQGVAG TRGAGGERGP PGAVGPTGLP
GPKGERGEKG ELQSLATIYQ LVSQACESAI QTHMLKFNSF LHESTRPPMP VLEGKMRPGG
PGPPGVHSKA GP
//