ID G1T933_RABIT Unreviewed; 1796 AA.
AC G1T933;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=Collagen type XIV alpha 1 chain {ECO:0000313|Ensembl:ENSOCUP00000013126.2};
GN Name=COL14A1 {ECO:0000313|Ensembl:ENSOCUP00000013126.2};
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; Oryctolagus.
OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000013126.2, ECO:0000313|Proteomes:UP000001811};
RN [1] {ECO:0000313|Ensembl:ENSOCUP00000013126.2, ECO:0000313|Proteomes:UP000001811}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thorbecke inbred {ECO:0000313|Ensembl:ENSOCUP00000013126.2,
RC ECO:0000313|Proteomes:UP000001811};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSOCUP00000013126.2}
RP IDENTIFICATION.
RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000013126.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAGW02013153; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02013154; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02013155; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_002710799.1; XM_002710753.3.
DR SMR; G1T933; -.
DR STRING; 9986.ENSOCUP00000013126; -.
DR PaxDb; 9986-ENSOCUP00000013126; -.
DR Ensembl; ENSOCUT00000015277.4; ENSOCUP00000013126.2; ENSOCUG00000015234.4.
DR GeneID; 100353761; -.
DR KEGG; ocu:100353761; -.
DR CTD; 7373; -.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR HOGENOM; CLU_002527_2_0_1; -.
DR InParanoid; G1T933; -.
DR OMA; REMQSDX; -.
DR OrthoDB; 5353225at2759; -.
DR TreeFam; TF329914; -.
DR Proteomes; UP000001811; Chromosome 3.
DR Bgee; ENSOCUG00000015234; Expressed in aorta and 16 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0005614; C:interstitial matrix; IEA:Ensembl.
DR GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR GO; GO:0048873; P:homeostasis of number of cells within a tissue; IEA:Ensembl.
DR GO; GO:0061050; P:regulation of cell growth involved in cardiac muscle cell development; IEA:Ensembl.
DR GO; GO:0003229; P:ventricular cardiac muscle tissue development; IEA:Ensembl.
DR CDD; cd00063; FN3; 7.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 5.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 7.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000001811};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 32..122
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 158..330
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 355..444
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 445..536
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 537..626
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 627..715
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 737..829
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 831..921
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1032..1205
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1462..1610
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1644..1796
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1490..1506
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1561..1575
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1658..1675
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1710..1747
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1796 AA; 193538 MW; BAC37F4676D39373 CRC64;
MKIFQCKMRY WLLPAFLTIA YFCTIVQGQV APPTRLRYSV LSHDSIQISW KAPRGKFGGY
KLLVTPASGG KTNQLNLQNT ATKAIIQGLM PEQNYTVQII AYNKDKESKP AQGQFRIKDL
EKRKDPKPRV KVVDKGNGSR PSVPEEVKFV CQTPAIADIV ILVDGSWSIG RFNFRLVRLF
LENLVTAFNV GSEKTRVGLA QYSGDPRIEW HLNAFSTKDE VIEAVRNLPY KGGNTLTGLA
LNYIFENSFK PEAGSRTGVP KIGILITDGK SQDDIIPPSR NLRESGVELF AIGVKNADVN
ELQEIASEPD STHVYNVAEF DLMHTVVESL TRTVCSRVEE QDREIKASAL ATVGPPTELI
TSEVTARSFM VNWTHAPGKV QKYRVVYYPT RGGKPEEVVV DGSVSSTVLK SLMSLTEYQI
AVFAIYAHTA SEGLRGTETT LALPMASDLE LYDVTENSMR VKWDAVPGAS GYLILYAPLT
EGLAGDEKEM KIGETYTDIE LSGLSPNTEY TVTVYAMFGE EASDPVTGQE TTLPLSPPRN
LRISNVGSNS ARLTWDPTSR KISGYRIVYN NADGTEINEV EVDPITTFPL KGLTPLTEYT
VAIFSMYEEG QSEPLTGVFT TEEVPAQQYL EIDEVTTDSF RVTWHPLSAD EGQHKLMWIP
VYGGKTQEVV LREEQDSHVI EGLEPGTEYE VSLLAVLDDG SESEVVTAVG TTLDSFWTEP
VTTIAPTSSV TSVFQTGIRN LVVDDETTSS LRVKWDISDS NVEQFRVTYL TAQGDPAEEV
VGTVMVPGRQ NSLLLKSLLP DTEYKVTVTP IYSDGEGVSV SAPGKTLPSS GPQNLRVSEE
WYNRLRITWD PPPSPVKGYR IVYKPVSVTG PTLETFVGAD INTILITNLL SGMDYNVKIF
ASQASGFSDA LTGMVKTLFL GVTNLQANQI EMTSLCAQWQ MHRHATAYRV VIESLQDTQK
QESTVSGGTT RHCFYGLQPD SEYKISVYTK LQELEGPSVS IMEKTESFPT EPPTFPPTIP
PAREVCKAAK ADLVFMVDGS WSIGDDNFNK IINFLYSTVG ALDKIGTDGT QVAMVQFTDD
PRTEFKLNAY ETKETLLDAI KRISYKGGNT KTGKAIKHVR DTLFTAESGT RRGIPKVIVV
ITDGRSQDDV NKISREMQSD GYNIFAVGVA DADYSELVNI GSKPSARHVF FVDDFDAFKK
IEDELITFVC ETASATCPLL HKDGIDLAGF KMMEMFGLVE KDFSLVEGVS MEPGTFNVYP
CYQLHKDALV SQPTKYLHPE GLPSDYTISF LFRILPDTPQ EPFALWEILN KNSDPLVGII
LDNGGKTLTY FNYDYSGDFQ TVTFEGPEIK KIFYGSFHKL HVIVSKTLVK VVIDCKEVGE
KAINASANIT ADGVEVLGRM VRSRGPNGNS APFQLQMFDI VCSTSWANRD KCCELPGLRD
EESCPDLPHS CSCSETNELA LGPAGPPGGP GLRGPKGQQG EQGPKGPDGP RGETGPPGPQ
GPPGPQGPSG LSIQGMPGMP GEKGEKGDTG LPGPQGIPGG IGSPGRDGSP GQRGFPGKDG
SSGPPGPPGP IGIPGAPGIP GITGSTGPQG ALGPPGVPGP KGERGERGDL QSQAMVRAVA
RQVCEQLIQS HMARYTAILN QIPSHSSSIR TIQGPPGEPG RPGSPGTPGE QGPPGSPGFP
GSAGVPGTPG ERGLTGIKGE KGNPGIGTQG PRGPPGPAGP SGESRPGSPG PPGSPGPRGP
PGHLGVPGPQ GPSGQPGYCD PSSCSAYGVG APHPDQPEFT PVQDEDEAME LWGPGI
//