ID A0A093PT51_PHACA Unreviewed; 1961 AA.
AC A0A093PT51;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Collagen alpha-1(XII) chain {ECO:0000313|EMBL:KFW79988.1};
DE Flags: Fragment;
GN ORFNames=N336_01354 {ECO:0000313|EMBL:KFW79988.1};
OS Phalacrocorax carbo (Great cormorant) (Pelecanus carbo).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Suliformes; Phalacrocoracidae;
OC Phalacrocorax.
OX NCBI_TaxID=9209 {ECO:0000313|EMBL:KFW79988.1, ECO:0000313|Proteomes:UP000053238};
RN [1] {ECO:0000313|EMBL:KFW79988.1, ECO:0000313|Proteomes:UP000053238}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N336 {ECO:0000313|EMBL:KFW79988.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL415529; KFW79988.1; -; Genomic_DNA.
DR Proteomes; UP000053238; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 13.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 13.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00041; fn3; 13.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 13.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 10.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 12.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KFW79988.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053238};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 6..96
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 97..185
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 187..277
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 297..469
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 485..574
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 575..666
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 667..758
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 759..854
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 857..951
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1038..1128
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1129..1219
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1220..1308
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1309..1397
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1425..1598
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 173..198
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1852..1961
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1880..1900
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1926..1940
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFW79988.1"
FT NON_TER 1961
FT /evidence="ECO:0000313|EMBL:KFW79988.1"
SQ SEQUENCE 1961 AA; 214803 MW; 26565C103606FC53 CRC64;
LAERGSPRNL VTTDITDTTV GLSWTPAPGA VNDYRIVWKS LYDDTMGEKR VPGNTVDAVL
EGLEPETKYR ISVYATYSSG EGDPVEGEAF TDVSPNARTV TVDNETENTM RVTWQPSPGK
VLSYRVYYRP RSGGRQMFGK VNAPATSIVL KRLKPRTTYD LSVVPIYDFG QGKSRKAEGT
TASPFKPPRN LRTSDSTMSS FRVTWEPAPG RVKGYKVTFH PTEEDGNLGE LIVGPYDSTV
VLEELRAGTT YKVNVFGMFE GGESSPLVGQ EMTTLSDTTS EPFLSRGLEC RTRAEADIVL
LVDGSWSIGR PNFKTIRNFI ARIVEVFDIG PDKVQIGLAQ YSGDPRTEWN LNAYRTKQAL
LEAVANLPYK GGNTLTGMAL DFILKNNFKQ DAGLRPRSRK IGVLITDGKS QDDVVTPSRR
LRDEGVELYA IGIKNADENE LKQIATDPDD IHAYNVADFS FLASIVDDVT TNLCNSVKGP
GDLPPPSNLV ISEVTPRSFR LRWSPPPESV DRYRVEYYPT SGGSPQQFYV SRMETTTVLK
DLKPETEYVV NVFSVVEDES SEPLIGMETT LPISSVRNLN VYDIGSTSMR VRWEPLDGAT
GYLLTYEPVN ATVPTTEKEM RVGPSVNEVQ LVDLIPNTEY TLTAYVLFGD ITSDPLTTQE
VTLPLPGPRG LTIQDVTHSS MNVLWDPAPG KVRKYILRYK IADEADGKEV EIDRLKTSTT
LSGLSSQTLY NVKVVAVYDE GESLPINAEA VTQPVPAPVN LRITDITTNS FRGTWDHGAP
DVSLYRITWG PYGRPEKQET ILNRDENSLI FENLNPDTLY DVSITAIYPD ESESDDLIGS
ERTLPLVPIT TQAPKSGPRN LQVYNATSHS LTVKWDPASG RVQRYRIIYQ PISGDGPEQS
TMVGGRQNSV VIQKLQPDTP YAITVSSMYA DGEGGRMTGR GRTKPLTTVK NLLVYDPTTS
TLNVRWDHAE GSPRQYKVFY GPTAGGAEEM VNMPGNTNYV ILRTLEPNTP YTVTVVPVFP
EGDGGRASDT GRTLERGTPR NIQVYNPTPN SMNVRWEPAP GPVQQYRINY SPLSGPRPSE
SIVVPGNTRD VMLERLTPDT AYSINVIALY ADGEGNPSQA QGRTLPRSGP RNVRVFDETT
NSLSVQWDHA DGPVQQYRII YSPTVGDPID EYITVPGIRN NVILQPLQSD TPYKITVVAV
YEDGDGGQLT GNGRTVGLLP PQNMYITDEW YTRFRVSWDP SPSPVLGYKI VYKPVGSNEP
MEVFVGEVTS YTLHNLSPST TYDVNVYAQY DSGMSIPLTD QGTTLYLNVT DLTSYKVGWD
TFCIRWSAHR SATSYRLKLN PADGSRGQEI TVRGSETSHC FTGLSPDTEY NATVFVQTPN
LEGPPVSMRE RTVLKPTEAP TLPPTPPPPP TIPPARDVCR GAKADIVFLT DASWSIGDDN
FNKVVKFVFN TVGAFDLINP AGIQVSFVQY SDEAKSEFKL NTFDDKAQAL GALQNIQYRG
GNTRTGKALT FIKEKVLTWE SGMRRGVPKV LVVVTDGRSQ DEVRKAATVI QHSGFSVFVV
GVADVDYNEL AKIASKPSER HVFIVDDFDA FEKIQDNLVT FVCETATSTC PLIYLEGYTS
PGFKMLESYN LTEKHFASVQ GVSLESGSFP SHVAYRLHKN AFVSQPIREI HPEGLPQAYT
IILLFRLLPE SPNEPFAIWQ ITDRDYKPQV GVVLDPASKV LSFFNKDTRG EVQTVTFDND
EVKKIFYGSF HKVHIVVTSS NVKIYIDCSE ILEKPIKEAG NITTDGYEIL GKLLKGDRRS
ATLEIQNFDI VCSPVWTSRD RCCDLPSMRD EAKCPALPNA CTCTQDSVGP PGPPGPAGGP
GAKGPRGERG LTGSSGPPGP RGETGPPGPQ GPPGPQGPNG LSIPGEPGRQ GMKGDAGQPG
LPGRSGTPGL PGPPGPVGPP GERGFTGKDG PTGPRGPPGP A
//