ID G1PGP5_MYOLU Unreviewed; 1435 AA.
AC G1PGP5;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE RecName: Full=Collagen type XX alpha 1 chain {ECO:0008006|Google:ProtNLM};
OS Myotis lucifugus (Little brown bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000009820.2, ECO:0000313|Proteomes:UP000001074};
RN [1] {ECO:0000313|Ensembl:ENSMLUP00000009820.2, ECO:0000313|Proteomes:UP000001074}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSMLUP00000009820.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAPE02057339; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 59463.ENSMLUP00000009820; -.
DR Ensembl; ENSMLUT00000010773.2; ENSMLUP00000009820.2; ENSMLUG00000010733.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR HOGENOM; CLU_002527_0_0_1; -.
DR InParanoid; G1PGP5; -.
DR OMA; DISGCYG; -.
DR TreeFam; TF329914; -.
DR Proteomes; UP000001074; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 4.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 5.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF39; COLLAGEN ALPHA-1(XX) CHAIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00041; fn3; 4.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 5.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 3.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50853; FN3; 4.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000001074};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 15..190
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 219..310
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 311..397
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 399..490
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 599..690
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 681..711
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 963..989
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1298..1435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1435 AA; 151061 MW; 146C255FBC853F98 CRC64;
PAGLQFHCMP PTPADMIFLV DGSWSIGHSH FQQVKDFLAS IIEPFEIGPD KVQVGLTQYS
GDPQTEWDLN TFSTKEEVLA AVHSLHYRGG NTFTGLALTH VLEQNLRPAA GPRPEAAKVV
ILVTDGKSQD DACAAGRILK SLDVDIFAVG VKNADEAELR LLASPPLDIT VHHVQDFPQL
STLAGLLSRL VCQQVQGRSP RPGPVKPAAA PPTLDPLSAP TSLVLTQVTS SSVHLSWTPA
LQPPLKYLLV WRPSRGGAPQ EVEVDGPSTS TELHGLASGT EYLLSVFPVY EAGVGEGLRG
LVTTVPLPPP QALALATVTP RTIHLTWQPS AGATQYLVRY SLASPKGEEE GREVQVGQPE
VLLGSLEPGR DYDIWVQSLQ GAQASEARGI HARTPPLAPP TPLIFSDVSH DSARVSWEGT
LRPVQLFRVS YLSSKGSHSG QIEAPGNATS VTLGPLSSST TYTVRVTRLY PGGGSSTTTG
RLTTRKVPGP SQLSVTELPG DAVRLAWAAA AASGVLAYQI KWTPLGDRKA HEISVPGSLC
TAVLPGLGGP LEHEITILAS YRDGAHSDPV SFHYSPRACC CLLAQPLLTP HCPSVSRSPP
SNLTLASESP NSLRVSWTPP TGHVLHYRLT YMLASGSGPE KSVSVPGPRS HAMLPDLLAA
TTYRVLVSAV YGAGESVAVT ATGQTGPRQP PAPRTVGPLN RGPPTGGRPC RCQHRARAAV
EQKGIGFLRE DFAVETAETS KAPAGWPGPS SARPKAGDRV LALYLCRGQG SRAAHRAARR
QAGGRAHRPW QMTRRGAQHL AVVPSLPVPE ARGQQVQGPA RGEAAVGLQP ISSRRRPTAQ
GLLPADPDLL SPLSSLPSPP PRWLPCRLTP LQEAPQGSQS RLLRLVQGHC PKGHSPPEPL
LTKLWAGRGA TQSPGECETL PRLTRGRSCG QAALPSGTGR GARPVLMAES LWGLPGVSNE
AAPFRPRSRA GPPVTGSGAR RAVTPVRSHL SAEHPVGRAA RSLVRWVVRL LGIRDCGPGQ
TAPWGGRVSS RRKVGTPHFG GLPALHPDTC FSFSSPPTCL SLGLHRGPAG FDLMAAFGLV
EREYASIRGV AMEPSAFGST RTFTLFRDAQ LTRRASDIHP AALPPEHTVV FLLRLLPETP
REAFALWQMT AEDFRPVLGV LLDAGRKSLT YFSHDPRAAL QEVTFDLPEV RRIFFGSFHK
VHVAVGHSKV QLYVDCRKVA ERPIGEAGSP PATGFITLGR LAKARGPRSS SAALQLQMLQ
IVCSDTWAEE DRCCELPASK NGETCPAFPP ACTCSSQIPG PPGPQGPPGL PGRNGASGQQ
GFPGPRGEPG PPGQTGPRGP GGHQGSPGTQ GRTVQGPVGP PGVKGEKGDH GLPGLQGDPG
HQGDPGEAGL QGPKGMRGLE GTAGLPGPPG PRGYPGMAGA RGTSGERGPP GTVGP
//