6 results on '"Mustapha Bouakkaz"'
Search Results
2. Efficiently mining frequent itemsets applied for textual aggregation
- Author
-
Mustapha Bouakkaz, Youcef Ouinten, Philippe Fournier-Viger, Sabine Loudcher, Université Amar Telidji - Laghouat, Entrepôts, Représentation et Ingénierie des Connaissances (ERIC), Université Lumière - Lyon 2 (UL2)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Université de Lyon, and Université de Moncton
- Subjects
Information retrieval ,OLAP ,[SHS.STAT]Humanities and Social Sciences/Methods and statistics ,business.industry ,Computer science ,Data stream mining ,media_common.quotation_subject ,Online analytical processing ,Aggregate (data warehouse) ,Closed keywords ,02 engineering and technology ,computer.software_genre ,Term (time) ,Textual aggregation ,Text mining ,Artificial Intelligence ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Data mining ,business ,Function (engineering) ,computer ,media_common - Abstract
International audience; Abstract Text mining approaches are commonly used to discover relevant information and relationships in hugeamounts of text data. The term data mining refers to methods for analyzing data with the objective of finding patternsthat aggregate the main properties of the data. The merger between the data mining approaches and on-line analyticalprocessing (OLAP) tools allows us to refine techniques used in textual aggregation. In this paper, we propose a novel aggregation function for textual data based on the discovery of frequent closed patterns in a generated documents/keywords matrix. Our contribution aims at using a data mining technique, mainly a closed pattern mining algorithm, to aggregate keywords. An experimental study on areal corpus of more than 700 scientific papers collected on Microsoft Academic Search shows that the proposed algorithm largely outperforms four state-of-the-art textual aggregation methods in terms of recall, precision, F-measure and runtime.
- Published
- 2017
3. A New Tool for Textual Aggregation in OLAP Context
- Author
-
Mustapha Bouakkaz, Sabine Loudcher, Youcef Ouinten, Université Amar Telidji - Laghouat, Entrepôts, Représentation et Ingénierie des Connaissances (ERIC), Université Lumière - Lyon 2 (UL2)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Université de Lyon, and Loudcher, Sabine
- Subjects
Algorithm ,Aggregation ,OLAP ,[SHS.STAT]Humanities and Social Sciences/Methods and statistics ,[SHS.STAT] Humanities and Social Sciences/Methods and statistics ,Textual data - Abstract
International audience; We present in this paper a system for textual aggregation from scientific documents in the online analytical processing (OLAP) context. The system extracts keywords automatically from a set of documents according to the lists compiled in the Microsoft Academia Search web site. It gives the user the possibility to choose their methods of aggregation among the implemented ones. That is TOP-Keywords, TOPIC, TUBE, TAG, BienCube and GOTA. The performance of the chosen methods, in terms of recall, precision, F-measure and runtime, is investigated with two real corpora ITINNOVATION and OHSUMED with 600 and 13,000 scientific articles respectively, other corpora can be integrated to the system by users.
- Published
- 2016
4. OLAP Textual Aggregation Approach using the Google Similarity Distance
- Author
-
Sabile Loudcher, Mustapha Bouakkaz, Youcef Ouinten, Equipe de Recherche en Ingénierie des Connaissances (ERIC), Université Lumière - Lyon 2 (UL2), and Université Amar Telidji - Laghouat
- Subjects
Decision support system ,Textual Aggregation ,Information Systems and Management ,Computer science ,media_common.quotation_subject ,02 engineering and technology ,Similarity distance ,computer.software_genre ,Management Information Systems ,[SHS]Humanities and Social Sciences ,Semantic similarity ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,High level analysis ,Function (engineering) ,K-means ,media_common ,Information retrieval ,OLAP ,Online analytical processing ,Google Similarity ,k-means clustering ,Data warehouse ,020201 artificial intelligence & image processing ,Data mining ,Statistics, Probability and Uncertainty ,computer - Abstract
International audience; Data warehousing and On-Line Analytical Processing (OLAP) are essential elements to decision support. In the case of textual data, decision support requires new tools, mainly textual aggregation functions, for better and faster high level analysis and decision making. Such tools will provide textual measures to users who wish to analyse documents online. In this paper, we propose a new aggregation function for textual data in an OLAP context based on the K-means method. This approach will highlight aggregates semantically richer than those provided by classical OLAP operators. The distance used in K-means is replaced by the Google similarity distance which takes into account the semantic similarity of keywords for their aggregation. The performance of our approach is analyzed and compared to other methods such as Topkeywords, TOPIC, TuBE and BienCube. The experimental study shows that our approach achieves better performances in terms of recall, precision,F-measure complexity and runtime.
- Published
- 2016
5. GOTA: Using the Google Similarity Distance for OLAP Textual Aggregation
- Author
-
Youcef Ouinten, Sabine Loudcher, Mustapha Bouakkaz, Equipe de Recherche en Ingénierie des Connaissances (ERIC), Université Lumière - Lyon 2 (UL2), and Loudcher, Sabine
- Subjects
Multidimensional analysis ,Information retrieval ,OLAP ,[SHS.STAT]Humanities and Social Sciences/Methods and statistics ,Computer science ,Online analytical processing ,k-means clustering ,Context (language use) ,Unstructured data ,Google Similrity ,computer.software_genre ,Data warehouse ,Semantic similarity ,[SHS.STAT] Humanities and Social Sciences/Methods and statistics ,Data mining ,Textual Data ,Cluster analysis ,computer ,Aggregation Function - Abstract
International audience; With the tremendous growth of unstructured data in the Business Intelligence, there is a need for incorporating textual data into data warehouses, to provide an appropriate multidimensional analysis (OLAP) and develop new approaches that take into account the textual content of data. This will provide textual measures to users who wish to analyse documents online. In this paper, we propose a new aggregation function for textual data in an OLAP context. For aggregating keywords, our contribution is to use a data mining technique, such as kmeans, but with a distance based on the Google similarity distance. Thus our approach considers the semantic similarity of keywords for their aggregation. The performance of our approach is analyzed and compared to another method using the k-bisecting clustering algorithm and based on the Jensen-Shannon divergence for the probability distributions. The experimental study shows that our approach achieves better performances in terms of recall, precision,F-measure complexity and runtime.
- Published
- 2015
6. Automatic textual aggregation approach of scientific articles in OLAP context.
- Author
-
Mustapha, Bouakkaz, Sabine, Loudcher, and Youcef, Ouinten
- Published
- 2014
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.