ID H2PR37_PONAB Unreviewed; 1780 AA.
AC H2PR37; A0A2J8X2W9;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 2.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=COL14A1 isoform 4 {ECO:0000313|EMBL:PNJ76357.1};
DE SubName: Full=Collagen type XIV alpha 1 chain {ECO:0000313|Ensembl:ENSPPYP00000021145.3};
GN Name=COL14A1 {ECO:0000313|Ensembl:ENSPPYP00000021145.3};
GN ORFNames=CR201_G0005542 {ECO:0000313|EMBL:PNJ76357.1};
OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pongo.
OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000021145.3, ECO:0000313|Proteomes:UP000001595};
RN [1] {ECO:0000313|Ensembl:ENSPPYP00000021145.3, ECO:0000313|Proteomes:UP000001595}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Wilson R.K., Mardis E.;
RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:PNJ76357.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Susie {ECO:0000313|EMBL:PNJ76357.1};
RA Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R., Chaisson M.,
RA Hoppe E., Hill C., Pang A., Hillier L., Baker C., Armstrong J.,
RA Shendure J., Paten B., Wilson R., Chao H., Schneider V., Ventura M.,
RA Kronenberg Z., Murali S., Gordon D., Cantsilieris S., Munson K., Nelson B.,
RA Raja A., Underwood J., Diekhans M., Fiddes I., Haussler D., Eichler E.;
RT "High-resolution comparative analysis of great ape genomes.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSPPYP00000021145.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NDHI03003373; PNJ76357.1; -; Genomic_DNA.
DR Ensembl; ENSPPYT00000021988.3; ENSPPYP00000021145.3; ENSPPYG00000018846.3.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR HOGENOM; CLU_002527_2_0_1; -.
DR TreeFam; TF329914; -.
DR Proteomes; UP000001595; Chromosome 8.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 8.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 7.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000001595};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 32..122
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 158..330
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 355..444
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 445..536
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 537..626
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 627..715
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 737..829
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 831..921
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1032..1205
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 124..145
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1462..1613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1644..1766
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1561..1575
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1658..1675
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1710..1747
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1780 AA; 191743 MW; 9D20062BC130E067 CRC64;
MKIFQRKMRY WLLPPFLAIV YFCTIVQGQV APPTRLRYNV ISHDSIQISW KAPRGKFGGY
KLLVTPASGG KTNQLNLQNT ATKAIIQGLM PDQNYTVQII AYNKDKESKP AQGQFRIKDL
EKRKDPKPRV KVVDRGNGSR PSSPEEVKFV CQTPAIADIV ILVDGSWSIG RFNFRLVRLF
LENLVTAFDV GSEKTRIGLA QYSGDPRIEW HLNAFSTKDE VIEAVRNLPY KGGNTLTGLA
LNYIFENSFK PEAGSRTGVS KIGILITDGK SQDDIIPPSR NLRESGVELF AIGVKNADVN
ELQEIASEPD STHVYNVAEF DLMHTVVESL TRTLCSRVEE QDREIKASAH AITGPPTELI
TSEVTASSFM VNWTHAPGNV EKYRVVYYPT RGGKPDEVVV DGSVSSTVLK NLMSLTEYQI
AVFAIYAHTA SEGLRGTETT LALPMASELL LYDVTENSMR VKWDAVPGAS GYLILYAPLT
EGLAGDEKEM KIGETHTDIE LSGLLPNTEY TVTVYAMFGE EASDPVTGQE TTLALSPPRN
LRISNVGSNS ARLTWDPTSR QINGYRIVYN NADGTEINEV EVDPITTFPL KGLTPLTEYT
IAIFSIYDEG QSEPLTGVFT TEEVPAQQYL EIDEVTTDSF RVTWHPLSAD EGLHKLMWIP
VYGGKTEEVV LKEEQDSHVI EGLEPGTEYE VSLLAVLDDG SESEVVTAVG TTLDSFWTEP
ATTIVPTTPV TSVFQTGIRN LVVGDETTSS LRVKWDISDS DVQQFRVTYM TAQGDPEEEV
VGTVMVPGSQ NILLLKPLLP DTEYKVTVTP IYTDGEGVSV SAPGKTLPSS GPQNLRVSEE
WYNRLRITWD PPSSPVKGYR IVYKPVSVPG PTLETFVGAD INTILITNLL SGMDYNVKIF
ASQASGFSDA LTGMVKTLFL GVTNLQAKHV EMTSLCAHWQ VHRHATAYRV VIESPQDRQK
QESTVGGGTT RHCFYGLQPD SEYKISVYTK LQEIEGPSVS IMEKTHSLPT QPPTFPPTIP
PAKEVCKAAK ADLVFMVDGS WSIGDENFNK IISFLYSTVG ALNKIGTDGT QVAMVQFTDD
PRTEFKLNAY KTKETLLDAI KHISYKGGNT KTGKAIKYVR DTLFTAESGT RRGIPKVIVV
ITDGRSQDDV NKISREMQLD GYSIFAIGVA DADYSELVSI GSKPSARHVF FVDDFDAFKK
IEDELITFVC ETASATCPMV HKDGIDLAGF KMMEMFGLVE KDFSSVEGVS MEPGTFNVFP
CYQLHKDALV SQPTRYLHPE GLPSDYTISF LFRILPDTPQ EPFALWEILN KNSDPLVGVI
LDNGGKTLTY FNYDQSGDFQ TVTFEGPEIR KIFYGSFHKL HIVVSETLVK VVIDCKQVGE
KAMNASANIT SDGVEVLGKM VRSRGPGGNS APFQLQMFDI VCSTSWADTD KCCELPGLRD
DESCPDLPHS CSCSETNEVA LGPAGPPGGP GLRGPKGQQG EPGPKGPDGP RGELGLPGPQ
GPPGPQGPSG LSIQGMPGMP GEKGEKGDTG LPGPQGIPGG VGSPGRDGSP GQRGLPGKDG
SSGPPGPPGP IGIPGAPGVP GITGSMGPQG ALGPPGVPGA KGERGERGDL QSQAMVRSVA
RQVCEQLIQS HMARYTAILN QIPSHSSSIR TVQGPPGEPG RPGSPGAPGE QGPPGTPGFP
GNAGVPGTPG ERGLTGIKGE KGNPGVGTQG PRGPPGPAGP SGESRPGSPG PPGSPGPRGP
PGHLGVPGPQ GPSGQPGYCD PSSCSAYGVR DLIPYNDYQH
//