ID C3Z9G2_BRAFL Unreviewed; 1390 AA.
AC C3Z9G2;
DT 28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 28-JUL-2009, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
GN ORFNames=BRAFLDRAFT_86048 {ECO:0000313|EMBL:EEN50891.1};
OS Branchiostoma floridae (Florida lancelet) (Amphioxus).
OC Eukaryota; Metazoa; Chordata; Cephalochordata; Leptocardii; Amphioxiformes;
OC Branchiostomatidae; Branchiostoma.
OX NCBI_TaxID=7739;
RN [1] {ECO:0000313|EMBL:EEN50891.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S238N-H82 {ECO:0000313|EMBL:EEN50891.1};
RC TISSUE=Testes {ECO:0000313|EMBL:EEN50891.1};
RX PubMed=18563158; DOI=10.1038/nature06967;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Putnam N.H., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U.,
RA Kawashima T., Robinson-Rechavi M., Shoguchi E., Terry A., Yu J.-K.,
RA Benito-Gutierrez E.L., Dubchak I., Garcia-Fernandez J., Gibson-Brown J.J.,
RA Grigoriev I.V., Horton A.C., de Jong P.J., Jurka J., Kapitonov V.V.,
RA Kohara Y., Kuroki Y., Lindquist E., Lucas S., Osoegawa K., Pennacchio L.A.,
RA Salamov A.A., Satou Y., Sauka-Spengler T., Schmutz J., Shin-I T.,
RA Toyoda A., Bronner-Fraser M., Fujiyama A., Holland L.Z., Holland P.W.H.,
RA Satoh N., Rokhsar D.S.;
RT "The amphioxus genome and the evolution of the chordate karyotype.";
RL Nature 453:1064-1071(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG666599; EEN50891.1; -; Genomic_DNA.
DR RefSeq; XP_002594880.1; XM_002594834.1.
DR STRING; 7739.C3Z9G2; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; C3Z9G2; -.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR Gene3D; 2.60.120.1000; -; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 2.
DR Pfam; PF01391; Collagen; 6.
DR SMART; SM00038; COLFI; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1193..1390
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 222..367
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 390..529
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 752..897
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1081..1121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..250
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..282
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 339..359
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 503..517
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1390 AA; 149624 MW; E3BAD4252E5F55B4 CRC64;
MNNEPPRKLA STKETPRTVD PVGVDVLRGL HLIDSHKTAP EGVIRGDYGV ILQNKARINT
RVSRVFKQGI PSDFAFILVF QSFKNFTTHL FSIKDKQKKT RLAIRIGAKD IQFYYADRKN
FPGRIRGCLI IQRTMGAICQ FEMYYDPAVA AQYCDYIRQR CLNSILDLDS VLSLHEGRVH
NMNMHGGSPT EVPDTVAAAT PTVLPLSPFH RVKIMPTTTT QESPWYPIFT GQPEEDMEPT
SVSTTTTTLP TPGNPTPPSS TAHTGHAALT LSPSSSHKVT SAGGASLPAA VTHSPASLPP
SLPSAYPISP PGEADRSPTV EPPMGATIPS AVGSLDPRPA SGGQTSTGEG SSGSPDHQEP
KPSTEPPVIF YVEDSEGETL FYIVKGEKGD KGRAGIRGLQ GYPGPQGRPG QPGPPGFSGR
TGSPGFSGLK GEKGDPGWIP LVAIPGRKGD KGFPGIPGPP GRKGLKGDQG EDGPMGLPGE
KGPPGQLGQE GLNGFQGRQG KSGPPGPPGP KGETGDPGPL GPPGLRGIDG EKLLSFNSYR
VREGLLGNRE KSEDRAFLET LVQEAHPDLT EDQVSKEIQD LGDPKVKRVC REIKVNLVIL
DLLDPRAIPV ILAFLVKMDQ PALRERADQW DLRVCLALLD LMATKEEQGG LEDWEKVASR
ASRETLGDLV NLVGRETEVQ VASLVQQVEK GKMVMKVLLA HLDYLAHLAQ KVRKVSQVYP
AEGERSDLRV HKERKVFQDQ RANLAKKVLQ VSRGHRDQMG EPGARGAMGT VGDQGLQGRD
GEPGPPGFEG LRGPRGSKGA MGKLGPPGKD GPPGAPGLAS EEPGEKGEKG QEGYPGDSGP
VGQQGQRGDP GLKGPSGPVG LPGAIGPPGL RGNRGARGFP GFQGDPGPVG PEGQEGGRGE
TLEILVLKEK WDYLDILEML SSEAQEELLV NLDLQDLQVH PVLMDQKGSQ ATEAFLVCQA
LMEYAVKMVI KANRALWVHQ ESEVSEEVQA TWGKRGMLVH LDYQAQQAFL ALMAFLEHGD
QREILANLVR SVSRVPLGSQ AGGARWGLLA RQGSKGDEGN RGWNGFPGPL GIKGTVGEYG
FDGRLGPKGP PGNEGLKGVK GPSGPDGKPG PEGRPGPPTS EEFDAAIKLL LQNTQREFDI
EEEQSGSGSG FGSGAGSFLI RKADHIAKLT NLTHAALSHV PQVPQRDQQQ MHMDVFETLQ
YLTSYIESIK NPLGTKENPA RTCRDLVDCK YKRDDGKYWI DPNLGCSTDA IEVYCNFTSG
GQTCVKPATP GRFNFTIGKV QMNFLHLLSA EAAQVVTVHC KNSPVWRTPH RSIPGVKFKT
WSGPVYHYGG PFQPEVLEDN CMKEDGKWHK TRLLFTTTDV HHLPISDVVI PEDKRLGLHY
RIEISPVCFI
//