GenomeNet

Database: UniProt
Entry: A0A093PT51_PHACA
LinkDB: A0A093PT51_PHACA
Original site: A0A093PT51_PHACA 
ID   A0A093PT51_PHACA        Unreviewed;      1961 AA.
AC   A0A093PT51;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 1.
DT   27-MAR-2024, entry version 32.
DE   SubName: Full=Collagen alpha-1(XII) chain {ECO:0000313|EMBL:KFW79988.1};
DE   Flags: Fragment;
GN   ORFNames=N336_01354 {ECO:0000313|EMBL:KFW79988.1};
OS   Phalacrocorax carbo (Great cormorant) (Pelecanus carbo).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Suliformes; Phalacrocoracidae;
OC   Phalacrocorax.
OX   NCBI_TaxID=9209 {ECO:0000313|EMBL:KFW79988.1, ECO:0000313|Proteomes:UP000053238};
RN   [1] {ECO:0000313|EMBL:KFW79988.1, ECO:0000313|Proteomes:UP000053238}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_N336 {ECO:0000313|EMBL:KFW79988.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL415529; KFW79988.1; -; Genomic_DNA.
DR   Proteomes; UP000053238; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 13.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 13.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00041; fn3; 13.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 13.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 10.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50853; FN3; 12.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KFW79988.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053238};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          6..96
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          97..185
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          187..277
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          297..469
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          485..574
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          575..666
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          667..758
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          759..854
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          857..951
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1038..1128
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1129..1219
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1220..1308
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1309..1397
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1425..1598
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          173..198
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1852..1961
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1880..1900
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1926..1940
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KFW79988.1"
FT   NON_TER         1961
FT                   /evidence="ECO:0000313|EMBL:KFW79988.1"
SQ   SEQUENCE   1961 AA;  214803 MW;  26565C103606FC53 CRC64;
     LAERGSPRNL VTTDITDTTV GLSWTPAPGA VNDYRIVWKS LYDDTMGEKR VPGNTVDAVL
     EGLEPETKYR ISVYATYSSG EGDPVEGEAF TDVSPNARTV TVDNETENTM RVTWQPSPGK
     VLSYRVYYRP RSGGRQMFGK VNAPATSIVL KRLKPRTTYD LSVVPIYDFG QGKSRKAEGT
     TASPFKPPRN LRTSDSTMSS FRVTWEPAPG RVKGYKVTFH PTEEDGNLGE LIVGPYDSTV
     VLEELRAGTT YKVNVFGMFE GGESSPLVGQ EMTTLSDTTS EPFLSRGLEC RTRAEADIVL
     LVDGSWSIGR PNFKTIRNFI ARIVEVFDIG PDKVQIGLAQ YSGDPRTEWN LNAYRTKQAL
     LEAVANLPYK GGNTLTGMAL DFILKNNFKQ DAGLRPRSRK IGVLITDGKS QDDVVTPSRR
     LRDEGVELYA IGIKNADENE LKQIATDPDD IHAYNVADFS FLASIVDDVT TNLCNSVKGP
     GDLPPPSNLV ISEVTPRSFR LRWSPPPESV DRYRVEYYPT SGGSPQQFYV SRMETTTVLK
     DLKPETEYVV NVFSVVEDES SEPLIGMETT LPISSVRNLN VYDIGSTSMR VRWEPLDGAT
     GYLLTYEPVN ATVPTTEKEM RVGPSVNEVQ LVDLIPNTEY TLTAYVLFGD ITSDPLTTQE
     VTLPLPGPRG LTIQDVTHSS MNVLWDPAPG KVRKYILRYK IADEADGKEV EIDRLKTSTT
     LSGLSSQTLY NVKVVAVYDE GESLPINAEA VTQPVPAPVN LRITDITTNS FRGTWDHGAP
     DVSLYRITWG PYGRPEKQET ILNRDENSLI FENLNPDTLY DVSITAIYPD ESESDDLIGS
     ERTLPLVPIT TQAPKSGPRN LQVYNATSHS LTVKWDPASG RVQRYRIIYQ PISGDGPEQS
     TMVGGRQNSV VIQKLQPDTP YAITVSSMYA DGEGGRMTGR GRTKPLTTVK NLLVYDPTTS
     TLNVRWDHAE GSPRQYKVFY GPTAGGAEEM VNMPGNTNYV ILRTLEPNTP YTVTVVPVFP
     EGDGGRASDT GRTLERGTPR NIQVYNPTPN SMNVRWEPAP GPVQQYRINY SPLSGPRPSE
     SIVVPGNTRD VMLERLTPDT AYSINVIALY ADGEGNPSQA QGRTLPRSGP RNVRVFDETT
     NSLSVQWDHA DGPVQQYRII YSPTVGDPID EYITVPGIRN NVILQPLQSD TPYKITVVAV
     YEDGDGGQLT GNGRTVGLLP PQNMYITDEW YTRFRVSWDP SPSPVLGYKI VYKPVGSNEP
     MEVFVGEVTS YTLHNLSPST TYDVNVYAQY DSGMSIPLTD QGTTLYLNVT DLTSYKVGWD
     TFCIRWSAHR SATSYRLKLN PADGSRGQEI TVRGSETSHC FTGLSPDTEY NATVFVQTPN
     LEGPPVSMRE RTVLKPTEAP TLPPTPPPPP TIPPARDVCR GAKADIVFLT DASWSIGDDN
     FNKVVKFVFN TVGAFDLINP AGIQVSFVQY SDEAKSEFKL NTFDDKAQAL GALQNIQYRG
     GNTRTGKALT FIKEKVLTWE SGMRRGVPKV LVVVTDGRSQ DEVRKAATVI QHSGFSVFVV
     GVADVDYNEL AKIASKPSER HVFIVDDFDA FEKIQDNLVT FVCETATSTC PLIYLEGYTS
     PGFKMLESYN LTEKHFASVQ GVSLESGSFP SHVAYRLHKN AFVSQPIREI HPEGLPQAYT
     IILLFRLLPE SPNEPFAIWQ ITDRDYKPQV GVVLDPASKV LSFFNKDTRG EVQTVTFDND
     EVKKIFYGSF HKVHIVVTSS NVKIYIDCSE ILEKPIKEAG NITTDGYEIL GKLLKGDRRS
     ATLEIQNFDI VCSPVWTSRD RCCDLPSMRD EAKCPALPNA CTCTQDSVGP PGPPGPAGGP
     GAKGPRGERG LTGSSGPPGP RGETGPPGPQ GPPGPQGPNG LSIPGEPGRQ GMKGDAGQPG
     LPGRSGTPGL PGPPGPVGPP GERGFTGKDG PTGPRGPPGP A
//
DBGET integrated database retrieval system