Database: UniProt
Entry: H2PR37_PONAB
ID   H2PR37_PONAB            Unreviewed;      1780 AA.
AC   H2PR37; A0A2J8X2W9;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   25-MAY-2022, sequence version 2.
DT   27-MAR-2024, entry version 63.
DE   SubName: Full=COL14A1 isoform 4 {ECO:0000313|EMBL:PNJ76357.1};
DE   SubName: Full=Collagen type XIV alpha 1 chain {ECO:0000313|Ensembl:ENSPPYP00000021145.3};
GN   Name=COL14A1 {ECO:0000313|Ensembl:ENSPPYP00000021145.3};
GN   ORFNames=CR201_G0005542 {ECO:0000313|EMBL:PNJ76357.1};
OS   Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pongo.
OX   NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000021145.3, ECO:0000313|Proteomes:UP000001595};
RN   [1] {ECO:0000313|Ensembl:ENSPPYP00000021145.3, ECO:0000313|Proteomes:UP000001595}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Wilson R.K., Mardis E.;
RT   "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL   Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:PNJ76357.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Susie {ECO:0000313|EMBL:PNJ76357.1};
RA   Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R., Chaisson M.,
RA   Hoppe E., Hill C., Pang A., Hillier L., Baker C., Armstrong J.,
RA   Shendure J., Paten B., Wilson R., Chao H., Schneider V., Ventura M.,
RA   Kronenberg Z., Murali S., Gordon D., Cantsilieris S., Munson K., Nelson B.,
RA   Raja A., Underwood J., Diekhans M., Fiddes I., Haussler D., Eichler E.;
RT   "High-resolution comparative analysis of great ape genomes.";
RL   Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|Ensembl:ENSPPYP00000021145.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NDHI03003373; PNJ76357.1; -; Genomic_DNA.
DR   Ensembl; ENSPPYT00000021988.3; ENSPPYP00000021145.3; ENSPPYG00000018846.3.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000153769; -.
DR   HOGENOM; CLU_002527_2_0_1; -.
DR   TreeFam; TF329914; -.
DR   Proteomes; UP000001595; Chromosome 8.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 8.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF00041; fn3; 8.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 8.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 6.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50853; FN3; 7.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001595};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          32..122
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          158..330
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          355..444
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          445..536
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          537..626
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          627..715
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          737..829
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          831..921
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1032..1205
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          124..145
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1462..1613
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1644..1766
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1561..1575
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1658..1675
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1710..1747
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1780 AA;  191743 MW;  9D20062BC130E067 CRC64;
     MKIFQRKMRY WLLPPFLAIV YFCTIVQGQV APPTRLRYNV ISHDSIQISW KAPRGKFGGY
     KLLVTPASGG KTNQLNLQNT ATKAIIQGLM PDQNYTVQII AYNKDKESKP AQGQFRIKDL
     EKRKDPKPRV KVVDRGNGSR PSSPEEVKFV CQTPAIADIV ILVDGSWSIG RFNFRLVRLF
     LENLVTAFDV GSEKTRIGLA QYSGDPRIEW HLNAFSTKDE VIEAVRNLPY KGGNTLTGLA
     LNYIFENSFK PEAGSRTGVS KIGILITDGK SQDDIIPPSR NLRESGVELF AIGVKNADVN
     ELQEIASEPD STHVYNVAEF DLMHTVVESL TRTLCSRVEE QDREIKASAH AITGPPTELI
     TSEVTASSFM VNWTHAPGNV EKYRVVYYPT RGGKPDEVVV DGSVSSTVLK NLMSLTEYQI
     AVFAIYAHTA SEGLRGTETT LALPMASELL LYDVTENSMR VKWDAVPGAS GYLILYAPLT
     EGLAGDEKEM KIGETHTDIE LSGLLPNTEY TVTVYAMFGE EASDPVTGQE TTLALSPPRN
     LRISNVGSNS ARLTWDPTSR QINGYRIVYN NADGTEINEV EVDPITTFPL KGLTPLTEYT
     IAIFSIYDEG QSEPLTGVFT TEEVPAQQYL EIDEVTTDSF RVTWHPLSAD EGLHKLMWIP
     VYGGKTEEVV LKEEQDSHVI EGLEPGTEYE VSLLAVLDDG SESEVVTAVG TTLDSFWTEP
     ATTIVPTTPV TSVFQTGIRN LVVGDETTSS LRVKWDISDS DVQQFRVTYM TAQGDPEEEV
     VGTVMVPGSQ NILLLKPLLP DTEYKVTVTP IYTDGEGVSV SAPGKTLPSS GPQNLRVSEE
     WYNRLRITWD PPSSPVKGYR IVYKPVSVPG PTLETFVGAD INTILITNLL SGMDYNVKIF
     ASQASGFSDA LTGMVKTLFL GVTNLQAKHV EMTSLCAHWQ VHRHATAYRV VIESPQDRQK
     QESTVGGGTT RHCFYGLQPD SEYKISVYTK LQEIEGPSVS IMEKTHSLPT QPPTFPPTIP
     PAKEVCKAAK ADLVFMVDGS WSIGDENFNK IISFLYSTVG ALNKIGTDGT QVAMVQFTDD
     PRTEFKLNAY KTKETLLDAI KHISYKGGNT KTGKAIKYVR DTLFTAESGT RRGIPKVIVV
     ITDGRSQDDV NKISREMQLD GYSIFAIGVA DADYSELVSI GSKPSARHVF FVDDFDAFKK
     IEDELITFVC ETASATCPMV HKDGIDLAGF KMMEMFGLVE KDFSSVEGVS MEPGTFNVFP
     CYQLHKDALV SQPTRYLHPE GLPSDYTISF LFRILPDTPQ EPFALWEILN KNSDPLVGVI
     LDNGGKTLTY FNYDQSGDFQ TVTFEGPEIR KIFYGSFHKL HIVVSETLVK VVIDCKQVGE
     KAMNASANIT SDGVEVLGKM VRSRGPGGNS APFQLQMFDI VCSTSWADTD KCCELPGLRD
     DESCPDLPHS CSCSETNEVA LGPAGPPGGP GLRGPKGQQG EPGPKGPDGP RGELGLPGPQ
     GPPGPQGPSG LSIQGMPGMP GEKGEKGDTG LPGPQGIPGG VGSPGRDGSP GQRGLPGKDG
     SSGPPGPPGP IGIPGAPGVP GITGSMGPQG ALGPPGVPGA KGERGERGDL QSQAMVRSVA
     RQVCEQLIQS HMARYTAILN QIPSHSSSIR TVQGPPGEPG RPGSPGAPGE QGPPGTPGFP
     GNAGVPGTPG ERGLTGIKGE KGNPGVGTQG PRGPPGPAGP SGESRPGSPG PPGSPGPRGP
     PGHLGVPGPQ GPSGQPGYCD PSSCSAYGVR DLIPYNDYQH
//