Author: "Nguyen, Trinh" / Search Limiters: Peer Reviewed / Topic: data mining - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Nguyen, Trinh"' showing total 10 results

Start Over Author "Nguyen, Trinh" Search Limiters Peer Reviewed Topic data mining

10 results on '"Nguyen, Trinh"'

1. New approaches for mining high utility itemsets with multiple utility thresholds

Author: Huynh, Bao, Tung, N. T., Nguyen, Trinh D. D., Trinh, Cuong, Snasel, Vaclav, and Nguyen, Loan
Published: 2024
Full Text: View/download PDF

2. An efficient method for mining multi-level high utility Itemsets

Author: Tung, N. T., Nguyen, Loan T. T., Nguyen, Trinh D. D., and Vo, Bay
Published: 2022
Full Text: View/download PDF

3. Mining Association Rules from a Single Large Graph.

Author: Huynh, Bao, Nguyen, Lam B. Q., Nguyen, Duc H. M., Nguyen, Ngoc Thanh, Nguyen, Hung-Son, Pham, Tuyn, Pham, Tri, Nguyen, Loan T. T., Nguyen, Trinh D. D., and Vo, Bay
Abstract: Knowledge mining from single graph plays an important role in decision support systems on single graphs such as social networks, bioinformatics, etc. In recent years, the problem of Frequent Subgraph Mining (FSM) from a single graph have been developed and attracted several studies. However, the problem of mining association rules or links from frequent subgraphs has not had many contributions. In this article, we state the problem of direct mining association rules from frequent subgraphs. Existing approaches on this topic perform the task in two phases. First, they traverse the search space to directly discover parent-child relationships from the discovered frequent subgraphs, then association rules are generated. We propose a one-phase algorithm, named So-GPARs, to generate rules as soon as frequent supergraphs are constructed from already existing frequent subgraphs. Our experiments on three single graph datasets show that the one-phase algorithm is more efficient than the two-phase algorithm in terms runtime of the rules generating phase. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. MLC-miner: Efficiently discovering multi-level closed high utility patterns from quantitative hierarchical transaction databases.

Author: Nguyen, Trinh D.D., Tung, N.T., Nguyen, Loan T.T., Pham, Thiet T., and Vo, Bay
Subjects: *EVIDENCE gaps, *EXTRACTION techniques, *PATTERNS (Mathematics), *DATA mining, *DATABASES
Abstract: High utility pattern mining (HUPM) extends frequent pattern mining (FPM) by including item significance and quantity, which determine their utility, in transaction databases. A pattern is a high utility if its utility satisfies the minimum threshold. Several efficient approaches have been developed to address this task. However, more extensive databases combined with lower thresholds can lead to an explosion in the number of discovered patterns. Use the closed version of the pattern to decrease patterns while keeping information. To improve real-world applicability, HUPM research now considers item connections. Categorizing items creates an abstraction model that can help decision-makers. This new mining task is generalized high utility pattern mining (GHUPM). However, GHUPM significantly increases the complexity of the mining task, resulting in longer mining times, higher memory usage, and a more significant number of discovered patterns. To date, none of the proposed approaches in GHUPM have utilized the closed representation to shrink the size of the result set. This study introduces MLC-Miner, a multi-level closed high utility pattern extraction technique, to fill this research gap. MLC-Miner combines several efficient methods to enhance mining performance and reduce memory footprints. Experimental results show that MLC-Miner can handle up to 5 million transactions efficiently. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. An efficient method for mining High-Utility itemsets from unstable negative profit databases.

Author: Tung, N.T., Nguyen, Trinh D.D., Nguyen, Loan T.T., and Vo, Bay
Subjects: *CONSUMER behavior, *DATABASES, *PROBLEM solving
Abstract: The study of High-Utility Itemset Mining (HUIM) and Frequent Itemset Mining (FIM) is crucial since it explains consumer behavior and offers actionable advice to improve business results. HUIM algorithms have been successfully established to identify high-utility itemsets, including those with negative utilities. The problem with these approaches is that they presume incorrectly that items with negative utility across transactions would always be losses. Products with positive profitability may seem negative when combined with other items to increase sales or reduce inventory. Using strict upper-bound approaches, this paper presents strategies for making database scanning more efficient and reducing the number of prospective candidates. We also prove that it is correct to use the proposed upper-bounds for pruning on several types of items in the database. Based on all the proposed solutions, we develop a novel algorithm to solve this problem efficiently. To demonstrate their efficiency, the algorithms are tested against states-of-art HUIM algorithm on diverse datasets with regard to size and characteristics with unstable negative profits. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. Mining Maximal High Utility Itemsets on Dynamic Profit Databases.

Author: Nguyen, Loan T. T., Vu, Dinh-Bao, Nguyen, Trinh D. D., and Vo, Bay
Subjects: DATABASES, PROFIT, PRUNING, DATA structures, CRYPTOCURRENCY mining
Abstract: To overcome the limitation of high-utility itemset mining, more compact, lossless, and concise representations of high utility itemsets (HUIs) have been proposed in previous works, such as closed HUIs (CHUIs) or maximal HUIs (MHUIs). Focusing into MHUI mining, in this article, we present efficient approaches to directly mine MHUIs from transactional databases without generating any candidates. The proposed algorithms, which all execute in one phase, utilize many efficient data structures and pruning techniques such as EUCP combined with EUCS, CUIP combined with FUCS, and the P-set structure to significantly reduce the search space and remove nonpromising itemsets, thus, increase the performance of the MHUI mining process. Furthermore, while previous works assumed that the unit profit of items is fixed, which is not practical in many real-world applications, our work resolved this issue by applying a new utility calculation into the mining process to reflect the true nature of real-world databases, thus, generating more accurate results. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

7. Parallel approaches to extract multi-level high utility itemsets from hierarchical transaction databases.

Author: Nguyen, Trinh D.D., Tung, N.T., Pham, Thiet, and Nguyen, Loan T.T.
Subjects: *MULTICORE processors, *DATABASES, *DATA mining, *KNAPSACK problems, *PARALLEL algorithms
Abstract: In the field of data mining, high utility itemset mining (HUIM) is a relevant mining task, with the aim of analyzing customer transaction databases. HUIM consists of exploiting the set of items that are often purchased together and yield high profit value. In real-world applications, transaction databases often come with item categorization, stored in a taxonomy. Items in these databases can be clustered into specific categories at higher levels of abstraction. Extracting and analyzing itemsets discovered from different levels of abstraction can provide more useful insights into customer behaviors. However, considering item taxonomy increases the problem's complexity, hence prolonging the execution time needed to explore the search space. Parallelism is thus employed to address this drawback, but previous approaches are not efficient as they only adopt simple scheduling strategies or do not utilize the full capabilities of a multi-core processor. This work introduces three new efficient strategies to significantly boost the performance of the multi-level high utility itemset mining task using multi-core processing. Two new algorithms, called MCML+ and MCML++, are also proposed by adopting the suggested strategies. Extensive experiments on several large databases show that our proposed algorithms have better performance compared to previous approaches in terms of running time and scalability, up to 4.0 times better than the previous parallelized algorithm, the MCML-Miner algorithm; and over 9.0 times faster than the original sequential algorithm, the MLHUI-Miner algorithm. • Parallelism is applied at many parts of the algorithm to improve mining performance. • Adopting Shortest Jobs First strategy to lower average waiting time. • Two new algorithms were developed, namely MCML+ and MCML++ • Evaluating the proposed algorithms on many databases to assess their performance. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

8. Efficient algorithms for mining closed high utility itemsets in dynamic profit databases.

Author: Nguyen, Trinh D.D., Nguyen, Loan T.T., Vu, Lung, Vo, Bay, and Pedrycz, Witold
Subjects: *CRYPTOCURRENCY mining, *DATABASES, *ALGORITHMS, *PROCESS mining
Abstract: • Apply dynamic utility calculation framework to mine closed high utility itemsets. • Dramatically reduce the cost of database scans by adopting the P-set structure. • Propose a new CHUIM algorithm, named iEFIM-Closed. • Conduct experiments to measure the performance of the proposed algorithm. The problem of discovering high-utility itemsets (HUIs) in transaction databases, which is an extension of Frequent Itemset Mining, is a commonly encountered mining task. Researchers have proposed algorithms to efficiently mine highly profitable itemsets in customer transaction databases, in which the unit profits of items are fixed. However, this assumption does not reflect the true nature of the utility measure of items in real-life transaction databases, which might vary over time. Moreover, since this important characteristic is ignored by all the current HUI mining algorithms, they are either not applicable to this type of database, or they generate inaccurate results. In addition, the HUI mining algorithms' traditional limitation is that they produce a huge number of HUIs for users. In this paper, we define the problem of mining a lossless, concise and compact representation of HUIs, called closed HUIs (CHUIs), in dynamic unit profit databases. Based on newly defined of utility measure, a novel algorithm, called iEFIM-Closed, is introduced. This relies on this new utility measure, a novel compact database format to reduce the cost of database scans and increase the efficiency of the mining process. Experimental evaluations show that iEFIM-Closed significantly outperforms state-of-the-art algorithms for mining CHUIs on sparse databases with dynamic profit, and has competitive performance in dense databases in terms of runtime, the cost of database scans and scalability. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

9. An efficient strategy for mining high-efficiency itemsets in quantitative databases.

Author: Huynh, Bao, Tung, N.T., Nguyen, Trinh D.D., Bui, Quang-Thinh, Nguyen, Loan T.T., Yun, Unil, and Vo, Bay
Abstract: The classic problems in itemset mining involve finding frequent itemsets and high-utility itemsets. However, frequent itemset mining has the disadvantage of not paying attention to the profit of products, while high-utility itemset mining does not address the issue of the cost price of the products. Therefore, neither can locate products with high-efficiency value on investment. To overcome these problems, the high-efficiency itemset mining (HEIM) problem was proposed. Despite its practicality, this issue has received little attention. The algorithms proposed to exploit high-efficiency itemsets (HEI) still use ineffective strategies on dense databases and unstrict upper bounds, requiring a lot of time and memory. To address the current issues with HEIM, the paper proposes tight upper bounds for the early pruning of candidates. Several techniques are also proposed, such as combining similar transactions and saving promising transaction locations, to reduce the cost of database scanning. Finally, the techniques are combined to propose a novel way to implement the MHEI (an efficient strategy for Mining High-Efficiency Itemsets in quantitative databases) to optimize the HEIM process. The experimental process also shows that the proposed algorithm has performance better than the state-of-the-art algorithm in HEIM. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. Mining high-utility itemsets in dynamic profit databases.

Author: Nguyen, Loan T.T., Nguyen, Phuc, Nguyen, Trinh D.D., Vo, Bay, Fournier-Viger, Philippe, and Tseng, Vincent S.
Subjects: *MINING methodology, *PROFIT, *DATABASES, *CRITICAL currents, *RETAIL stores
Abstract: High-Utility Itemset (HUI) mining is an important data-mining task which has gained popularity in recent years due to its applications in numerous fields. HUI mining aims at discovering itemsets that have high utility (e.g., yield a high profit) in transactional databases. Although several algorithms have been designed to enumerate all HUIs, an important issue is that they assume that the utilities (e.g., unit profits) of items are static. But this simplifying assumption does not hold in real-life situations. For example, the unit profits of items often vary over time in a retail store due to fluctuating supply costs and promotions. Ignoring this important characteristic of real-life transactional databases makes current HUI-mining algorithms inapplicable in many real-world applications. To address this critical limitation of current HUI-mining techniques, this paper studies the novel problem of mining HUIs in databases having dynamic unit profits. To accurately assess the utility of any itemset in this context, a redefined utility measure is introduced. Furthermore, a novel algorithm named MEFIM (Modified EFficient high-utility Itemset Mining), which relies on a novel compact database format to discover the desired itemsets efficiently, is designed. An improved version of the MEFIM algorithm, named i MEFIM, is also introduced. This algorithm employs a novel structure called P-set to reduce the number of transaction scans and to speed up the mining process. Experimental results show that the proposed algorithms considerably outperform the state-of-the-art HUI-mining algorithms on dynamic profit databases in terms of runtime, memory usage, and scalability. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

10 results on '"Nguyen, Trinh"'

1. New approaches for mining high utility itemsets with multiple utility thresholds

2. An efficient method for mining multi-level high utility Itemsets

3. Mining Association Rules from a Single Large Graph.

4. MLC-miner: Efficiently discovering multi-level closed high utility patterns from quantitative hierarchical transaction databases.

5. An efficient method for mining High-Utility itemsets from unstable negative profit databases.

6. Mining Maximal High Utility Itemsets on Dynamic Profit Databases.

7. Parallel approaches to extract multi-level high utility itemsets from hierarchical transaction databases.

8. Efficient algorithms for mining closed high utility itemsets in dynamic profit databases.

9. An efficient strategy for mining high-efficiency itemsets in quantitative databases.

10. Mining high-utility itemsets in dynamic profit databases.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

10 results on '"Nguyen, Trinh"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources