Database: UniProt
Entry: K7G4Y9_PELSI
ID   K7G4Y9_PELSI            Unreviewed;      1156 AA.
AC   K7G4Y9;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 49.
DE   RecName: Full=Collagen type VII alpha 1 chain {ECO:0008006|Google:ProtNLM};
OS   Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC   Trionychidae; Pelodiscus.
OX   NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000015350.1, ECO:0000313|Proteomes:UP000007267};
RN   [1] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG   Soft-shell Turtle Genome Consortium;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX   PubMed=23624526; DOI=10.1038/ng.2615;
RA   Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA   White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA   Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA   Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA   Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT   "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT   into the development and evolution of the turtle-specific body plan.";
RL   Nat. Genet. 45:701-706(2013).
RN   [3] {ECO:0000313|Ensembl:ENSPSIP00000015350.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCU01138587; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01138588; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01138589; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01138590; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01138591; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01138592; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01138593; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; K7G4Y9; -.
DR   STRING; 13735.ENSPSIP00000015350; -.
DR   Ensembl; ENSPSIT00000015422.1; ENSPSIP00000015350.1; ENSPSIG00000013727.1.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000153769; -.
DR   HOGENOM; CLU_275826_0_0_1; -.
DR   OMA; IARTHTM; -.
DR   Proteomes; UP000007267; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 8.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR46708:SF2; FIBRONECTIN TYPE-III DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR46708; TENASCIN; 1.
DR   Pfam; PF00041; fn3; 8.
DR   Pfam; PF00092; VWA; 1.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 9.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 6.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS50853; FN3; 8.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..1156
FT                   /note="Collagen type VII alpha 1 chain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003904977"
FT   DOMAIN          34..207
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          228..324
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          414..503
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          504..592
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          595..683
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          686..773
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          776..864
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          869..957
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          958..1046
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
SQ   SEQUENCE   1156 AA;  124696 MW;  995AB1DBE0197657 CRC64;
     MRRWILFLAV LLAPAPALAQ KRNQGTCINV YAADIVFLVD GSSSIGRANF RMIRSFMEDL
     VRPFVHVVGE RGVRFGAVQY SDDPRLEFTL SQHPNGTEVR RAIQQLSYKT GNTRTGAGLR
     YIADNFFGPT QIRPGVPQVC ILITDGKSQD DTEQPSVKLK AQGTKVFAVG IKNADRAELM
     SVASTPSDDY FFYVNDFKIL STLLPLVSRR VCASTGGVLQ TGSQVYSGPS NLVFVEQAAD
     ALRIRWTAAG GPVTGYKVQY VPLTSLGQQV TAEMKEVSLS PGETSTVLRQ LRGGTDYLVT
     VIAQYANSIG ESVSGTGRTE ALSGVLNFRV VEAGPSFLRL AWAAALESLQ GYRITYMAQG
     EAQAEEMSLG ANSVSIMLSN LRPNTDYVVT LQPVSQQQTV APTRITGRTL RLERVQQLTV
     ENISQQSMVV TWRGVSGATG YRVSWGLPSG QDIRKFDVDA SKNSYLLTGL QPGTDYLLTV
     MPLYGQIEGP PASIRRRTET GIVQSLRTVI LGPTSIQVLW NIIRDARGYR LEWKRATGAE
     PPQTVTLPTN IDRYQLTGLQ PATEYRITLY TLYDGREVAT PVTISQTGVE PQVGSISDLR
     VVGTVGKRLR LAWGGVPGAT EYKIVLRSSQ DGSETTRQIP GTQTMLELDD LSEEVTYIVR
     VSALIGRREG SAVPLSIRIG PQPVGSITNL RVAEIRPNQL RVTWSGLPGA TSYKLTWRAS
     DGQEISRVLP ADRTSFSIEG LQAGAAYVVG VSALVGDREG SPVTIPVRTA PEQVGMVSSL
     KVLSSRSNVV RVTWVGVPGA TAYRVVWSRR DGGSESSKLV SGDTSSFDIL NLEGGVSYTV
     KVTALIGNRE GDPVSIVVTT PAEVAPVQPV GNLQVIDSSE QRVRLTWSPA PGSSGYRLSW
     RPVAGGPERS QLLPPNVRSY DIEGLEAGVR YEIRITSLVG SQESEPVGIA VSTAPLGRVT
     NFRVTETQDD SVTLAWTPAP GATGYLLTWK LPQEGGEMQR MLPGSATSHQ VSGLRLGHRY
     VFTIQPLFGS TRGMESSVTD RTVCRDARGD VIFLVHGTRD SAYSAEAVRA LLTNTVSALG
     RLGPDATQVW GAGLWVSPLT HLTHMPLVLP PALGAASGHP VSPLPLAWDR HSAPCATRSP
     AAVPWELPPA LPGSTC
//