Database: UniProt
Entry: A0A151P5X9_ALLMI
ID   A0A151P5X9_ALLMI        Unreviewed;      1330 AA.
AC   A0A151P5X9;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 25.
DE   RecName: Full=Alpha-1 type I collagen {ECO:0000256|ARBA:ARBA00044312};
GN   Name=COL1A1 {ECO:0000313|EMBL:KYO44380.1};
GN   ORFNames=Y1Q_0012139 {ECO:0000313|EMBL:KYO44380.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO44380.1};
RN   [1] {ECO:0000313|EMBL:KYO44380.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO44380.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO44380.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03000817; KYO44380.1; -; Genomic_DNA.
DR   STRING; 8496.A0A151P5X9; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF569; COLLAGEN ALPHA-1(I) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF00093; VWC; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:KYO44380.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..1330
FT                   /note="Alpha-1 type I collagen"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5007586719"
FT   TRANSMEM        387..406
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        467..490
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        510..528
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        534..553
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        656..677
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        683..701
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        713..736
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        773..800
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        812..837
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        876..895
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        938..956
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        962..981
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          31..89
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          1090..1330
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          98..358
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          989..1113
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        124..146
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        174..203
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        994..1008
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1075..1091
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1330 AA;  139054 MW;  68BCB97817961345 CRC64;
     MFSFVDSRLL LLIAATVLLT KGQGEEDIQT GSCIQDGLAY NNTDVWKPEP CQICVCDNGN
     ILCDDVICDD TSDCTNAEIP FGECCPICPD TAGSSTYPKS TGVEGPKGDT GPRGQRGLPG
     PPGRDGIPGQ PGLPGLPGPP GPPGLGGNFA PQMAYGYGDE TKSAGISVPG PMGPAGPRGL
     PGPPGSPGPQ GFQGPPGEPG EPGASGPMGP RGPAGPPGKN GDDGEAGKPG RPGERGPPGP
     QGARGLPGTA GLPGMKGHRG FSGLDGAKGD AGPSGPKGEP GSPGENGAPG QMGPRGLPGE
     RGRPGPSGPA GARGNDGSPG AAGPPGPTGP AGPPGFPGAA GAKGETGPQG SRGSEGPQGA
     RVITLLSCLL SSRATLVLMV KLVPKVQLVL LVLLVLLASL ALVAHLDPRV PAVLLAPRVT
     VVNPVLKATR ETLVQKESLV LLVSKAHLVQ LVKKAREEPV VSPALEVFLA LLANVVLLEA
     VVSLALMAFL VPRVPLVNVV PLALLVPKDL LVNLDALVSL VSLVPRVLLE AQVAQVLMAR
     LVHLAPLVLL VSLANLVREE LLDPLVLLAQ LVRMVKLVPK VLLALLVLLE REVNKVLLVL
     LDSRVCPVLL AHLVNLASLV NRVFLEMLVL LVQLVQEARE VSLVSVVSKV NQVHRVHVVL
     TVLPVTMVLR VMLVLLVLLV AKVLPVCRVC LVSVVLLVCL VPRVTEAILV PKALMLVPLV
     TRVKLVLLAL LVPLVLVVPL EIVVSLVHLA LLDSLVPLVL MDNLVLKVNL VMLVLKVMLV
     LQALLDPLVL LDLLALLVLL DPKVLVVVLD PLVLLVSLVL LEELVHLALL VTSVFLAHQA
     PVEKKALKDP VVRLALLDAP VNLDLLAHQD LLARRALLVV MVPLVLLARE VSLVSPAHLA
     NLANKVHLAP LVNAVLLVQW DHLAWLDLLV KLDVRVLLVL KVLLVAMALL VPRVTVVRLA
     PLVLLVLPVP LELLALLALL ARMEIVGPQG ARGDKGETGE HGDRGMKGHR GFPGPQGPSG
     PAGSPGEQGP SGASGPAGPR GPPGSAGTPG KDGLNGLPGP IGPPGPRGRT GDVGPAGPPG
     PPGPPGPPGA PSGGFDFSFM PQPPQEKAHD PGRYYRADDA NVMRDRDLEM CHNDWKSGEY
     WIDPNQGCNL DAIKVYCNME TGETCVHPTQ ATIAQKNWYM SKNPKEKKHI WFGETMSDGF
     QFEYGGEGSN PADVAIQLTF LRLMSTEASQ NITYHCKNSV AYMDQETGNL KKALLLQGSN
     EIEIRAEGNS RFTYGVTEDG CTTHTGAWGK TVIEYKTTKT SRLPVIDVAP MDVGAQDQEF
     GIVIGPVCFL
//