ID A0A1E2WPQ3_9NOSO Unreviewed; 1344 AA.
AC A0A1E2WPQ3;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=DUF4114 domain-containing protein {ECO:0000259|Pfam:PF13448};
GN ORFNames=A4S05_33760 {ECO:0000313|EMBL:ODH00320.1};
OS Nostoc sp. KVJ20.
OC Bacteria; Cyanobacteriota; Cyanophyceae; Nostocales; Nostocaceae; Nostoc.
OX NCBI_TaxID=457944 {ECO:0000313|EMBL:ODH00320.1, ECO:0000313|Proteomes:UP000094607};
RN [1] {ECO:0000313|EMBL:ODH00320.1, ECO:0000313|Proteomes:UP000094607}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KVJ20 {ECO:0000313|EMBL:ODH00320.1,
RC ECO:0000313|Proteomes:UP000094607};
RA Wen L., He K., Yang H.;
RL Submitted (FEB-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ODH00320.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSSA01000155; ODH00320.1; -; Genomic_DNA.
DR RefSeq; WP_069074135.1; NZ_KV757740.1.
DR OrthoDB; 517984at2; -.
DR Proteomes; UP000094607; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0008305; C:integrin complex; IEA:InterPro.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR GO; GO:0007155; P:cell adhesion; IEA:InterPro.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 8.
DR InterPro; IPR025193; DUF4114.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR013519; Int_alpha_beta-p.
DR InterPro; IPR000413; Integrin_alpha.
DR InterPro; IPR028994; Integrin_alpha_N.
DR PANTHER; PTHR23221; GLYCOSYLPHOSPHATIDYLINOSITOL PHOSPHOLIPASE D; 1.
DR PANTHER; PTHR23221:SF7; PHOSPHATIDYLINOSITOL-GLYCAN-SPECIFIC PHOSPHOLIPASE D; 1.
DR Pfam; PF13448; DUF4114; 1.
DR Pfam; PF01839; FG-GAP; 12.
DR PRINTS; PR01185; INTEGRINA.
DR SMART; SM00191; Int_alpha; 14.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 3.
DR PROSITE; PS51470; FG_GAP; 4.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1266..1340
FT /note="DUF4114"
FT /evidence="ECO:0000259|Pfam:PF13448"
FT REGION 946..1013
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 955..977
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1344 AA; 134635 MW; 691E3F31B008AB1F CRC64;
MADILFNLSS LDGSNGFAIN GINANDSSGT SVSNAGDVNN DGIDDLIIGA PGGGQSYVVF
GCTNVFSASF DLSNLDGSNG FIINGSVPDS AGISVSNAGD INGDNIDDLI IGAPGALLGA
GQSYVVFGGN SLNASLDLSS LDGSNGFAIN GINGANSDGI SDSSGQSVSS AGDINNDGID
DLIIGASSAA SGAGQSYVVF GSTSPFSASF DLSSLDGSNG FSINGSGTDS SGTSVSSAGD
INGDNIDDLI IGASGTGKSY VVFGSTSAFS ASFNLSSLDG SNGFSINGNA SDFSGASVSN
AGDVNNDDID DLIIGAYNAA SGAGESYVVF GSTSAFNANF DLSSLDGSNG FAIKGINGND
VSGYSVSAAG DVNDDDIADL IIGAPFALSG AGQSYVVFGS SDAFGANLDL SNLDESQGFA
INGINGTDSS GISVSAAGDI NDDGADDLLV GASGASLGAG QSYVIFGTPQ PEPQSEPEPQ
PQVIFNLSSL DGSNGFAING INANDSSGTS VSNAGDVNND GIDDLIIGAP FVGSGAGQSY
VVFGSTNAFN TNLDLSSLDG SNGFAINGSA TNSLGTSVSN AGDINGDNID DLIIGAPGAL
LGAGQSYVVF GGNSLNASLD LSSLDGSNGF AINGINGANS DGISDSSGKS VSSAGDINND
GIDDLIIGAS SAASGAGQSY VVFGSTSPFS ASFDLSSLDG SNGFAINGSG TDSSGTSVSS
AGDINGDNID DLIIGASGTG KSYVVFGSTS AFSASFDLSS LDGSNGFSIN GNVSDFSGAS
VSNAGDVNND DIDDLIIGAY NAASGAGESY VVFGSTSAFN ANFDLSSLDG SNGFAIKGIN
GNDVSGYSVS AAGDVNDDDI ADLIIGAPFA LSGAGQSYVV FGSSDAFGAN LDLSNLDESQ
GFAINGIDGT DSSGISVSAA GDINDDGADD LLVGASGASL GAGQSYVIFG TPQPEPQSEP
EPQSEPEPQP EPEPQPEPQP EPEPEPEPEP KPEPEPEPEP QLQLANSGND VFNIKGDNDT
VTLKVAIAGS NSNLVNEFGV YTVDDATGKI DGIAPGEAGY ARKALEEGQV ILSAIADPPN
GFSNSPLNGL LEFPSDTNLR FYLVKNSTTE NVLSNITPIT DVLFSDPSNQ KITSLDNNAF
SLAWKDGSGN STNDFQNLVV TIQPTNDPLP LGTSLQGQPQ RELIDLRDVE QEVAATFVVN
REAEFNNFVG FYKVADENGG IDTNGDGQAD ILVGQAGYAQ AAVRQRVVGI DLSVNDQGTA
TSTGTFEPGS IFAPFIIING RPDAILDTNS NNDPAVYFSF LGANADKVDH IRLLGNNTFG
FEDLAGGGDK DFNDVIVRAN LSII
//