ID A9V9R3_MONBE Unreviewed; 1344 AA.
AC A9V9R3;
DT 05-FEB-2008, integrated into UniProtKB/TrEMBL.
DT 05-FEB-2008, sequence version 1.
DT 24-JAN-2024, entry version 51.
DE RecName: Full=FG-GAP repeat protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=MONBRDRAFT_11601 {ECO:0000313|EMBL:EDQ85769.1};
OS Monosiga brevicollis (Choanoflagellate).
OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Monosiga.
OX NCBI_TaxID=81824 {ECO:0000313|EMBL:EDQ85769.1, ECO:0000313|Proteomes:UP000001357};
RN [1] {ECO:0000313|EMBL:EDQ85769.1, ECO:0000313|Proteomes:UP000001357}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MX1 / ATCC 50154 {ECO:0000313|Proteomes:UP000001357};
RX PubMed=18273011; DOI=10.1038/nature06617;
RG JGI Sequencing;
RA King N., Westbrook M.J., Young S.L., Kuo A., Abedin M., Chapman J.,
RA Fairclough S., Hellsten U., Isogai Y., Letunic I., Marr M., Pincus D.,
RA Putnam N., Rokas A., Wright K.J., Zuzow R., Dirks W., Good M.,
RA Goodstein D., Lemons D., Li W., Lyons J.B., Morris A., Nichols S.,
RA Richter D.J., Salamov A., Bork P., Lim W.A., Manning G., Miller W.T.,
RA McGinnis W., Shapiro H., Tjian R., Grigoriev I.V., Rokhsar D.;
RT "The genome of the choanoflagellate Monosiga brevicollis and the origin of
RT metazoans.";
RL Nature 451:783-788(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH991571; EDQ85769.1; -; Genomic_DNA.
DR RefSeq; XP_001749484.1; XM_001749432.1.
DR EnsemblProtists; EDQ85769; EDQ85769; MONBRDRAFT_11601.
DR GeneID; 5894662; -.
DR KEGG; mbr:MONBRDRAFT_11601; -.
DR eggNOG; ENOG502SIJX; Eukaryota.
DR InParanoid; A9V9R3; -.
DR Proteomes; UP000001357; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 6.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR013519; Int_alpha_beta-p.
DR InterPro; IPR028994; Integrin_alpha_N.
DR PANTHER; PTHR44103; PROPROTEIN CONVERTASE P; 1.
DR PANTHER; PTHR44103:SF1; PROPROTEIN CONVERTASE P; 1.
DR Pfam; PF13517; FG-GAP_3; 9.
DR SMART; SM00191; Int_alpha; 5.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 3.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000001357};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1344
FT /note="FG-GAP repeat protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002742873"
FT TRANSMEM 1111..1131
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
SQ SEQUENCE 1344 AA; 143196 MW; 19F8DDCDC8E10805 CRC64;
MARLLVLTVG FCWVGLVAAD GDWAATSVLG RLDGPIALTL ADLDNNSLPD IVSVSSNDNT
ICWFANLGNG SFEHTPRIIS TSAGGVSTVT TADINGDGWL DVVAGLTNDD LVAWYRNNNG
TISTLPEIIA VSVLGVSSVA TADVDGDGYV DVISASAGDN SVAWYPNNRR GGLYLQRHVV
SRQVWGVKDV FAADMDADGD MDLLCACFTA NGVLLFRNDG AGLFGNAERV ATSSVLGAFD
VQAADLDGDG HLDVLVAAAG QEQVAWFRNL GNATFEPRRR IIDAAANGAV SAFAVDFDND
GHLDVLAAQA INNTIVWYHN LGNGAAFSSP NVVSTDSWGA AAVAAADLDA DGLPDMVSAS
FADDKVAWYR HEGPRVQFDR PQTLAARYAL ATTGDLDHDG RVDLILGSFT KNAHSVAWRR
SLNLANESLA FDAPRAVVAH PIGPQALFVR DLDSDGNLDL LYAFKDQDQD SHLAWARNDG
AGNFATAPAQ LLYSSSHAIL STTVADLDGN GHPDVLFLDA AQARLAWCAN NGLGQFDTPS
LVAQLEYPPS CLSTADLDGD GHVDVIRCVF QRHAIEWLRN MGNGSFASGW LLVTSQTQGP
VQAQPADMDG DGVLDIVCAS LHDAKLAWYH NDGLGHFVEH VISQQLPDVL FVRLADLTLD
GSLDILVAQN SGALSWFGNE GAGTFAASVP IRFANILFLQ SMDVVDLDGD GLPDILCGDA
FDNTVTWFRS IGIKPAFSQA RVLPAFVHRP LMAVPGDLNG DGHIDVVSAS TDDSRLTWYP
NPGNGAFTGT QHIISQQILT IMSLAVADLD NDQDLDILCA GSGLDQVAWF QNNGGGHFAD
EPRFISTAAA GAVAVLAADL DGDGALDAVV GRGDDNTIVW HRNNGQGEFG PATPLSTTAH
MPWGLFAADL DDDGDLDVLS ANYGNDRISW YRNNGAAGFG TELVITASAD GVRGVFAADL
DNDGDLDVLS ASINDNKVAW YVNDGDGNFN FQERVISTSI NGASAISAVD LDGDGDQDVL
VSGISGNTVA WFENLGDGTF VPVPVVLTTE LYGASWVAAA DFDNDTRLDI LSPAMRAGDI
QWFHNPGTTA VPSASSPLLV PSTASGSSDH VVVVVVVVVV ALVALVFGVV LTARYRRKRF
NAVRQALYGD NTMPMSSAEQ IQLCLSQFQA SCARNAGLDP SSTTTTRSLN LVSRATRVQK
PFKAQLQSDS ALVWVQIIVS SDVKFQAVPL RHASELQGRA VLALKTFERI ELRAGGNLCF
DRVPTWWPGR RSIPIALVLE FADQGDLRTH LRKRIGQLTT NTKTVSRFVS KDHDYYRAAQ
LDDLPFRSVT FLIDCQGLPC MDLN
//