ID A0A151P5X9_ALLMI Unreviewed; 1330 AA.
AC A0A151P5X9;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE RecName: Full=Alpha-1 type I collagen {ECO:0000256|ARBA:ARBA00044312};
GN Name=COL1A1 {ECO:0000313|EMBL:KYO44380.1};
GN ORFNames=Y1Q_0012139 {ECO:0000313|EMBL:KYO44380.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO44380.1};
RN [1] {ECO:0000313|EMBL:KYO44380.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO44380.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO44380.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03000817; KYO44380.1; -; Genomic_DNA.
DR STRING; 8496.A0A151P5X9; -.
DR eggNOG; KOG3544; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF569; COLLAGEN ALPHA-1(I) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 6.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KYO44380.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1330
FT /note="Alpha-1 type I collagen"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007586719"
FT TRANSMEM 387..406
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 467..490
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 510..528
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 534..553
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 656..677
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 683..701
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 713..736
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 773..800
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 812..837
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 876..895
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 938..956
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 962..981
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 31..89
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1090..1330
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 98..358
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 989..1113
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..146
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 174..203
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 994..1008
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1075..1091
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1330 AA; 139054 MW; 68BCB97817961345 CRC64;
MFSFVDSRLL LLIAATVLLT KGQGEEDIQT GSCIQDGLAY NNTDVWKPEP CQICVCDNGN
ILCDDVICDD TSDCTNAEIP FGECCPICPD TAGSSTYPKS TGVEGPKGDT GPRGQRGLPG
PPGRDGIPGQ PGLPGLPGPP GPPGLGGNFA PQMAYGYGDE TKSAGISVPG PMGPAGPRGL
PGPPGSPGPQ GFQGPPGEPG EPGASGPMGP RGPAGPPGKN GDDGEAGKPG RPGERGPPGP
QGARGLPGTA GLPGMKGHRG FSGLDGAKGD AGPSGPKGEP GSPGENGAPG QMGPRGLPGE
RGRPGPSGPA GARGNDGSPG AAGPPGPTGP AGPPGFPGAA GAKGETGPQG SRGSEGPQGA
RVITLLSCLL SSRATLVLMV KLVPKVQLVL LVLLVLLASL ALVAHLDPRV PAVLLAPRVT
VVNPVLKATR ETLVQKESLV LLVSKAHLVQ LVKKAREEPV VSPALEVFLA LLANVVLLEA
VVSLALMAFL VPRVPLVNVV PLALLVPKDL LVNLDALVSL VSLVPRVLLE AQVAQVLMAR
LVHLAPLVLL VSLANLVREE LLDPLVLLAQ LVRMVKLVPK VLLALLVLLE REVNKVLLVL
LDSRVCPVLL AHLVNLASLV NRVFLEMLVL LVQLVQEARE VSLVSVVSKV NQVHRVHVVL
TVLPVTMVLR VMLVLLVLLV AKVLPVCRVC LVSVVLLVCL VPRVTEAILV PKALMLVPLV
TRVKLVLLAL LVPLVLVVPL EIVVSLVHLA LLDSLVPLVL MDNLVLKVNL VMLVLKVMLV
LQALLDPLVL LDLLALLVLL DPKVLVVVLD PLVLLVSLVL LEELVHLALL VTSVFLAHQA
PVEKKALKDP VVRLALLDAP VNLDLLAHQD LLARRALLVV MVPLVLLARE VSLVSPAHLA
NLANKVHLAP LVNAVLLVQW DHLAWLDLLV KLDVRVLLVL KVLLVAMALL VPRVTVVRLA
PLVLLVLPVP LELLALLALL ARMEIVGPQG ARGDKGETGE HGDRGMKGHR GFPGPQGPSG
PAGSPGEQGP SGASGPAGPR GPPGSAGTPG KDGLNGLPGP IGPPGPRGRT GDVGPAGPPG
PPGPPGPPGA PSGGFDFSFM PQPPQEKAHD PGRYYRADDA NVMRDRDLEM CHNDWKSGEY
WIDPNQGCNL DAIKVYCNME TGETCVHPTQ ATIAQKNWYM SKNPKEKKHI WFGETMSDGF
QFEYGGEGSN PADVAIQLTF LRLMSTEASQ NITYHCKNSV AYMDQETGNL KKALLLQGSN
EIEIRAEGNS RFTYGVTEDG CTTHTGAWGK TVIEYKTTKT SRLPVIDVAP MDVGAQDQEF
GIVIGPVCFL
//