2,130 results on '"bayes classifier"'
Search Results
2. Bayes Classification Using an Approximation to the Joint Probability Distribution of the Attributes
- Author
-
Hosein, Patrick, Baboolal, Kevin, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Fred, Ana, editor, Hadjali, Allel, editor, Gusikhin, Oleg, editor, and Sansone, Carlo, editor
- Published
- 2024
- Full Text
- View/download PDF
3. An Efficient IoT-Fog-Cloud Resource Allocation Framework Based on Two-Stage Approach
- Author
-
Ismail Zahraddeen Yakubu and M. Murali
- Subjects
Cloud computing ,fog computing ,Internet of Things (IoT) ,task classifier ,Bayes classifier ,resource allocation ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
With the advent of the Internet of Things (IoT) paradigm and the prolific growth in technology, the volume of data generated by intelligent devices has increased tremendously. Cloud computing provides unlimited processing and storage capabilities to process and store the generated data. However, the cloud computing paradigm is associated with high transmission latency, high energy consumption, and a lack of location awareness. On the other hand, the data generated by the intelligent devices is delay-sensitive and needs to be processed on the fly. Thus, cloud computing isn’t suitable for the execution of this delay-sensitive data. To curtail the issues associated with the cloud paradigm, the fog paradigm, which allows data to be processed at the proximity of IoT devices, was introduced. One common feature of the fog paradigm is its limitations in capabilities, which make it unsuitable for processing large volumes of data. To ensure the smooth execution of delay-sensitive application tasks and the large volume of data generated, there is a need for the fog paradigm to collaborate with the cloud paradigm to achieve a common goal. In this paper, an efficient resource allocation framework is proposed to efficiently and effectively utilise the fog and cloud resources for executing delay-sensitive tasks and the huge volume of data generated by end users. The allocation of resources to tasks is done in two stages. Firstly, the tasks in the arrival queue are classified based on the task guarantee ratio on the cloud and fog layers and allocated to suitable resources in the layers of their respective classes. Secondly, we apply Bayes’ classifier to previous allocation history data to classify newly arrived tasks and allocate suitable resources to the tasks for execution in the layers of their respective classes. A Crayfish Optimization Algorithm (COA) is used to generate an optimal resource allocation in both the fog and cloud layers that reduces the delay and execution time of the system. The proposed method is implemented using the iFogSim simulator toolkit, and the execution results prove more promising in comparison with the state-of-the-art methods.
- Published
- 2024
- Full Text
- View/download PDF
4. A Distributed Anomaly Detection Scheme Based on Correlation Awareness in WSN.
- Author
-
Wang, Zhongmin, Gao, Rui, Gao, Cong, Chen, Yanping, and Wang, Fengwei
- Subjects
ANOMALY detection (Computer security) ,WIRELESS sensor nodes ,WIRELESS sensor networks ,BAYESIAN analysis ,OUTLIER detection - Abstract
Wireless sensor devices are affected by internal constraints and the external environment, generating abnormal data. Currently, many anomaly detection schemes ignore the correlation between screening nodes, wasting resources due to excessive communication. Therefore, this paper proposes a distributed anomaly detection scheme based on adaptive grouping using the correlation between nodes in wireless sensor networks. Limiting the scope of collaboration between nodes can reduce the waste of resources due to excessive communication. Since the computing resources of sensor nodes are limited, an edge-cloud framework is established. The scheme uses Spatio-temporal correlation and graph theory for wireless sensor networks to determine node groups with solid correlations on the cloud server. Based on the grouping results, anomaly detection is implemented locally. A Bayesian network model is constructed at the node within the group, and outlier detection is realized by inference on nodes. A correlation consistency evaluation method is proposed to improve anomaly detection accuracy to check the data consistency on the cluster head. The proposed scheme is verified by a generated data set and the real data of Intel Berkeley Research Lab. The effectiveness of the proposed method is verified by comparing it with three existing algorithms. Experimental results show that the method improves detection accuracy and reduces false detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Machine Learning based on Probabilistic Models Applied to Medical Data: The Case of Prostate Cancer
- Author
-
Anaclet Tshikutu Bikengela, Remy Mutapay Tshimona, Pierre Kafunda Katalay, Simon Ntumba Badibanga, and Eugène Mbuyi Mukendi
- Subjects
classification ,mixture models ,gaussian mixture ,bayesian networks ,bayes classifier ,Information technology ,T58.5-58.64 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
The growth in the amount of data in companies puts analysts in difficulties when extracting hidden knowledge from data. Several models have emerged that focus on the notion of distances while ignoring the notion of conditional probability density. This research study focuses on segmentation using mixture models and Bayesian networks for medical data mining. As enterprise data becomes large, there is a way to apply data mining methods to make sense of it using classification methods. We designed different models with different architectures and then applied these models to the medical database. The algorithms were implemented for the real data. The objective is to classify individuals according to the conditional probability density of random variables, in addition to identifying causalities between traits from tests of conditional independence and a correlation measure, both based on χ2. After a quick illustration of several models (decision tree, SVM, K-means, Bayes), we applied our method to data from an epidemiological study (done at the University of Kinshasa University clinics) of case-control of prostate cancer. Thus, we found after interpretation of the results followed by discussion that our model allows us to classify a new individual with an accuracy of 96%.
- Published
- 2023
- Full Text
- View/download PDF
6. Deep Probabilistic Learning
- Author
-
Jo, Taeho and Jo, Taeho
- Published
- 2023
- Full Text
- View/download PDF
7. Segmentation of Cattle Using Color-Based Skin Detection Approach
- Author
-
Agarwal, Diwakar, Howlett, Robert J., Series Editor, Jain, Lakhmi C., Series Editor, Reddy, V. Sivakumar, editor, Prasad, V. Kamakshi, editor, Wang, Jiacun, editor, and Reddy, K. T. V., editor
- Published
- 2023
- Full Text
- View/download PDF
8. Classification of observations into von Mises-Fisher populations with unknown parameters.
- Author
-
Jana, Nabakumar and Dey, Santanu
- Subjects
- *
PARAMETERS (Statistics) , *BAYES' estimation , *MAXIMUM likelihood statistics , *SUPPORT vector machines - Abstract
The problem of classification of an observation into von Mises-Fisher populations is considered when the concentration parameters and the mean directions are unknown. For two von Mises-Fisher distributions, we propose the restricted maximum likelihood estimators (MLEs), and Bayes estimators of the concentration parameters. The MLEs and restricted MLEs of the concentration parameters are compared in terms of risks. When the concentration parameters are ordered, we propose classification rules using the restricted MLEs and Bayes estimators of the parameters. For two populations, we also derive predictive Bayes classification rules using informative priors for the concentration parameters. We derive the likelihood ratio-based classification rule. Nonparametric rules such as k-NN rule, support vector machine classifier, and kernel density-based rule are also studied. For more than two populations, we suggest classification rules. Comparisons among the proposed rules have been carried out with respect to the expected probability of misclassification. Applications of the rules are described using directional data sets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. Categorical classifiers in multiclass classification with imbalanced datasets.
- Author
-
Carpita, Maurizio and Golia, Silvia
- Subjects
- *
CLASSIFICATION , *PERFORMANCE theory , *PROBABILITY theory - Abstract
This paper discusses, in a multiclass classification setting, the issue of the choice of the so‐called categorical classifier, which is the procedure or criterion that transforms the probabilities produced by a probabilistic classifier into a single category or class. The standard choice is the Bayes Classifier (BC), but it has some limits with rare classes. This paper studies the classification performance of the BC versus two alternatives, that are the Max Difference Classifier (MDC) and Max Ratio Classifier (MRC), through an extensive simulation and some case studies. The results show that both MDC and MRC are preferable to BC in a multiclass setting with imbalanced data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Vis-NIR Spectroscopy Combined with Bayes Classifier Applied to Wine Multi-brand Identification
- Author
-
Chen, Xianghui, Li, Jiaqi, Chang, Nailiang, Chen, Jiemei, Fang, Lifang, Pan, Tao, Chu, Xiaoli, editor, Guo, Longhai, editor, Huang, Yue, editor, and Yuan, Hongfu, editor
- Published
- 2022
- Full Text
- View/download PDF
11. Diagnosis and Detection of Plant Diseases Using Data Mining Techniques
- Author
-
Bhati, Harshita, Rathore, Monika, Xhafa, Fatos, Series Editor, Gupta, Deepak, editor, Polkowski, Zdzislaw, editor, Khanna, Ashish, editor, Bhattacharyya, Siddhartha, editor, and Castillo, Oscar, editor
- Published
- 2022
- Full Text
- View/download PDF
12. Detecting Phishing Websites Using Neural Network and Bayes Classifier
- Author
-
Partheepan, Ravinthiran, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Awan, Irfan, editor, Benbernou, Salima, editor, Younas, Muhammad, editor, and Aleksy, Markus, editor
- Published
- 2022
- Full Text
- View/download PDF
13. Deep Learning vs. Traditional Approaches to Malware Traffic Classification – A Comparative Study
- Author
-
Krupski, Jacek, Rybicki, Damian, Graniszewski, Waldemar, Iwanowski, Marcin, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Choraś, Michal, editor, Choraś, Ryszard S., editor, Kurzyński, Marek, editor, Trajdos, Paweł, editor, Pejaś, Jerzy, editor, and Hyla, Tomasz, editor
- Published
- 2022
- Full Text
- View/download PDF
14. The prognostic utility of serum thyrotropin in hospitalized Covid-19 patients: statistical and machine learning approaches.
- Author
-
Pappa, E., Gourna, P., Galatas, G., Manti, M., Romiou, A., Panagiotou, L., Chatzikyriakou, R., Trakas, N., Feretzakis, G., and Christopoulos, C.
- Abstract
Purpose: To assess the prognostic value of serum TSH in Greek patients with COVID-19 and compare it with that of commonly used prognostic biomarkers. Methods: Retrospective study of 128 COVID-19 in patients with no history of thyroid disease. Serum TSH, albumin, CRP, ferritin, and D-dimers were measured at admission. Outcomes were classified as "favorable" (discharge from hospital) and "adverse" (intubation or in-hospital death of any cause). The prognostic performance of TSH and other indices was assessed using binary logistic regression, machine learning classifiers, and ROC curve analysis. Results: Patients with adverse outcomes had significantly lower TSH compared to those with favorable outcomes (0.61 versus 1.09 mIU/L, p < 0.001). Binary logistic regression with sex, age, TSH, albumin, CRP, ferritin, and D-dimers as covariates showed that only albumin (p < 0.001) and TSH (p = 0.006) were significantly predictive of the outcome. Serum TSH below the optimal cut-off value of 0.5 mIU/L was associated with an odds ratio of 4.13 (95% C.I.: 1.41–12.05) for adverse outcome. Artificial neural network analysis showed that the prognostic importance of TSH was second only to that of albumin. However, the prognostic accuracy of low TSH was limited, with an AUC of 69.5%, compared to albumin's 86.9%. A Naïve Bayes classifier based on the combination of serum albumin and TSH levels achieved high prognostic accuracy (AUC 99.2%). Conclusion: Low serum TSH is independently associated with adverse outcome in hospitalized Greek patients with COVID-19 but its prognostic utility is limited. The integration of serum TSH into machine learning classifiers in combination with other biomarkers enables outcome prediction with high accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
15. A probability estimation-based feature reduction and Bayesian rough set approach for intrusion detection in mobile ad-hoc network.
- Author
-
Prasad, Mahendra, Tripathi, Sachin, and Dahal, Keshav
- Subjects
AD hoc computer networks ,ROUGH sets ,FUZZY logic ,PROBABILITY theory ,FUZZY systems - Abstract
A mobile ad-hoc network is a small and temporary network. This network has a different working principle and structure than wired networks. A source node transfers data to the destination node through intermediate nodes. Due to mobility of node, this network is more vulnerable to routing attacks. Many security mechanisms protect the network from intrusions, such as cryptography based, lightweight, and heavyweight techniques. But, these are not powerful enough mechanisms for mobile ad-hoc networks to mitigate routing attacks. Therefore, we have proposed an enhanced intrusion detection system for the mobile ad-hoc network that handles routing attacks. This method mainly generates 11 sub-datasets and also evaluates their quality using a fuzzy logic system. We suggest a probabilistic approach for feature ranking. The next process removes ineffective features from training and test sets. We have applied a Bayesian rough set classifier that classifies the behavior of mobile nodes using incoming packets. The Bayes classifier is applied for ambiguous and unknown samples. Experimental results show that the average detection accuracy is 94.37% for blackhole attack and 99% for wormhole attack. The proposed method performs better than existing intrusion detection methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
16. COPULA-BASED FUNCTIONAL BAYES CLASSIFICATION WITH PRINCIPAL COMPONENTS AND PARTIAL LEAST SQUARES.
- Author
-
Wentian Huang and Ruppert, David
- Abstract
We present a new functional Bayes classifier that uses principal component (PC) or partial least squares (PLS) scores from the common covariance function, that is, the covariance function marginalized over groups. When the groups have different covariance functions, the PC or PLS scores need not be independent or even uncorrelated. We use copulas to model the dependence. Our method is semiparametric; the marginal densities are estimated nonparametrically by kernel smoothing and the copula is modeled parametrically. We focus on Gaussian and t-copulas, but other copulas could be used. The strong performance of our methodology is demonstrated through simulation, real data examples, and asymptotic properties. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. Study on Height Prediction of Water Flowing Fractured Zone in Deep Mines Based on Weka Platform.
- Author
-
Bai, Liyang, Liao, Changlong, Wang, Changxiang, Zhang, Meng, Meng, Fanbao, Fan, Mingjin, and Zhang, Baoliang
- Abstract
Accurately predicting the height of water flowing fractured zone is of great significance to coal mine safety mining. In recent years, most mines in China have entered deep mining. Aiming at the problem that it is difficult to accurately predict the height of water flowing fractured zone under the condition of large mining depth, the mining depth, height mining, inclined length of working face and coefficient of hard rock lithology ratio are selected as the main influencing factors of the height of water flowing fractured zone. The relationship between various factors and the height of water flowing fractured zone is analyzed by SPSS software. Based on the data mining tool Weka platform, Bayesian classifier, artificial neural network and support vector machine model are used to mine and analyze the measured data of water flowing fractured zone, and the detailed accuracy, confusion matrix and node error rate are compared. The results show that, the accuracy rate of instance classification of the three models is greater than 60%. The accuracy of the artificial neural network model is the highest and the node error rate is the lowest. In general, the training effect of the artificial neural network model is the best. By predicting engineering examples, the prediction accuracy of the model reaches 80%, and a good prediction effect is obtained. The height prediction system of water flowing fractured zone is developed based on VB language, which can provide a reference for the prediction of the height failure grade of water flowing fractured zone. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Semiconcept and concept representations.
- Author
-
Gégény, Dávid, Kovács, László, and Radeleczki, Sándor
- Subjects
- *
ROUGH sets , *GENERALIZATION , *ALGORITHMS - Abstract
In FCA, we often deal with a formal context K = (G , M , I) that is only partially known, i.e. only the attributes that belong to an observable set N ⊂ M are known. There must also exist a part H of the object set G – called a training set – that consists of elements with all attributes known. The concepts of K have to be determined using the subcontexts corresponding to the training object set H and to the observable attribute set N. In our paper, this problem is examined within the extended framework of the semiconcepts of the original context, which are generalizations of its concepts. Each semiconcept of the original context induces a semiconcept in both subcontexts. In this way, each semiconcept of the context is represented by an induced pair of semiconcepts, which can also be considered its approximations — as in the case of rough sets. We describe the properties of the mapping defined by this representation and prove that the poset formed by these semiconcept pairs is a union of two complete lattices. We show that these induced semiconcept pairs can be generated by using a simplified representation of them. As the number of semiconcepts grows exponentially with the size of the training set and the observable attribute set, an algorithm that selects the representation pairs for which their support and relevance reach a certain threshold is also presented. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Face Sketch Recognition: Gender Classification Using Eyebrow Features and Bayes Classifier
- Author
-
Ounachad, Khalid, Oualla, Mohamed, Sadiq, Abdelalim, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Ben Ahmed, Mohamed, editor, Rakıp Karaș, İsmail, editor, Santos, Domingos, editor, Sergeyeva, Olga, editor, and Boudhir, Anouar Abdelhakim, editor
- Published
- 2021
- Full Text
- View/download PDF
20. A Bayesian Classifier Combination Methodology for Early Detection of Endotracheal Obstruction of COVID-19 Patients in ICU
- Author
-
Suárez-Díaz, Francisco J., Navarro-Mesa, Juan L., Ravelo-García, Antonio G., Fernández-López, Pablo, Suárez-Araujo, Carmen Paz, Pérez-Acosta, Guillermo, Santana-Cabrera, Luciano, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Rojas, Ignacio, editor, Joya, Gonzalo, editor, and Catala, Andreu, editor
- Published
- 2021
- Full Text
- View/download PDF
21. Probabilistic Learning
- Author
-
Jo, Taeho and Jo, Taeho
- Published
- 2021
- Full Text
- View/download PDF
22. Image Retrieval Method Combining Bayes and SVM Classifier Based on Relevance Feedback with Application to Small-scale Datasets
- Author
-
Siyu Lai, Qinghua Yang, Wenjin He, Yuanzhong Zhu, and Juan Wang
- Subjects
Bayes classifier ,image retrieval ,relevance feedback ,support vector machine classifier ,transfer learning ,Engineering (General). Civil engineering (General) ,TA1-2040 - Abstract
A vast amount of images has been generated due to the diversity and digitalization of devices for image acquisition. However, the gap between low-level visual features and high-level semantic representations has been a major concern that hinders retrieval accuracy. A retrieval method based on the transfer learning model and the relevance feedback technique was formulated in this study to optimize the dynamic trade-off between the structural complexity and retrieval performance of the small- and medium-scale content-based image retrieval (CBIR) system. First, the pretrained deep learning model was fine-tuned to extract features from target datasets. Then, the target dataset was clustered into the relative and irrelative image library by exploring the Bayes classifier. Next, the support vector machine (SVM) classifier was used to retrieve similar images in the relative library. Finally, the relevance feedback technique was employed to update the parameters of both classifiers iteratively until the request for the retrieval was met. Results demonstrate that the proposed method achieves 95.87% in classification index F1 - Score, which surpasses that of the suboptimal approach DCNN-BSVM by 6.76%. The performance of the proposed method is superior to that of other approaches considering retrieval criteria as average precision, average recall, and mean average precision. The study indicates that the Bayes + SVM combined classifier accomplishes the optimal quantities more efficiently than only either Bayes or SVM classifier under the transfer learning framework. Transfer learning skillfully excels training from scratch considering the feature extraction modes. This study provides a certain reference for other insights on applications of small- and medium-scale CBIR systems with inadequate samples.
- Published
- 2022
- Full Text
- View/download PDF
23. Application of Bayes Classifier to Assess the State of Unbalance Wheel
- Author
-
Prażnowski, Krzysztof, Mamala, Jarosław, Chaari, Fakher, Series Editor, Haddar, Mohamed, Series Editor, Kwon, Young W., Series Editor, Gherardini, Francesco, Series Editor, Ivanov, Vitalii, Series Editor, Cavas-Martínez, Francisco, Series Editor, Trojanowska, Justyna, Series Editor, Królczyk, Grzegorz M., editor, Niesłony, Piotr, editor, and Królczyk, Jolanta, editor
- Published
- 2020
- Full Text
- View/download PDF
24. Analysis of water quality at hydrographic basin scale using satellite images, co-occurrence matrices and Bayes classifier
- Author
-
M. G. G. Silva, D. J. Silva, P. D. Costa, R. C. Silva, T. E. B. Cassimiro, L. S. Amorim, D. A. Rocha, and Z. M. A. Peixoto
- Subjects
bayes classifier ,co-occurrence matrix ,satellite image ,water quality ,Water supply for domestic and industrial purposes ,TD201-500 ,River, lake, and water-supply engineering (General) ,TC401-506 - Abstract
Given the increased risks of water scarcity and the presence of polluting agents in water resources, this paper aims at the development and presentation of a computational tool capable of assessing water quality based on digital processing techniques applied to satellite images. Initially, a database was created for Brazilian regions, consisting of hydrographic basins' satellite images associated with the Water Quality Index (WQI), according to the criteria established by the National Water Agency (ANA). Hitherto, the database consisted of 85 images, 61 were used in the training stage and 24 in the testing stage. In both stages, the images were subjected to thresholding using Otsu's Method, binarization, linear expansion on saturation, application of a Laplacian filter, extraction of characteristics by using co-occurrence matrices and classification by the Bayes Discriminant. Such techniques were also implemented on a computational platform in the MATLAB® environment, responsible for the interface between the system and users. The proposed system presented an approximate 70% success rate regarding the classification of WQIs, which can be improved as more information is made available to improve the databases. HIGHLIGHTS Application of satellite images and processing techniques to assess water quality.; Assessment of water quality in hydrographic basins without on-site measurements.; Application of Otsu's method, linear expansion by saturation and Laplacian filter.; Water features extraction and classification by co-occurrence matrices and Bayesian Linear Discriminant Analysis Classifier.;
- Published
- 2021
- Full Text
- View/download PDF
25. Visible and Near-Infrared Spectroscopy Combined With Bayes Classifier Based on Wavelength Model Optimization Applied to Wine Multibrand Identification
- Author
-
Tao Pan, Jiaqi Li, Chunli Fu, Nailiang Chang, and Jiemei Chen
- Subjects
visible and near-infrared spectroscopy ,wine ,multibrand identification ,Bayes classifier ,equidistant combination wavelength screening ,wavelength step-by-step phase-out ,Nutrition. Foods and food supply ,TX341-641 - Abstract
The identification of high-quality wine brands can avoid adulteration and fraud and protect the rights and interests of producers and consumers. Since the main components of wine are roughly the same, the characteristic components that can distinguish wine brands are usually trace amounts and not unique. The conventional quantitative detection method for brand identification is complicated and difficult. The naive Bayes (NB) classifier is an algorithm based on probability distribution, which is simple and particularly suitable for multiclass discriminant analysis. However, the absorbance probability between spectral wavelengths is not necessarily strongly independent, which limits the application of Bayes method in spectral pattern recognition. This research proposed a Bayes classifier algorithm based on wavelength optimization. First, a large-scale wavelength screening for equidistant combination (EC) was performed, and then wavelength step-by-step phase-out (WSP) was carried out to reduce the correlation between wavelengths and improve the accuracy of Bayes discrimination. The proposed EC-WSP-Bayes method was applied to the 5-category discriminant analysis of wine brand identification based on visible and near-infrared (Vis-NIR) spectroscopy. Among them, four types of wine brands were collected from regular sales channels as identification brands. The fifth type of samples was composed of 21 other commercial brand wines and home-brewed wines from various sources, as the interference brand. The optimal EC-WSP-Bayes model was selected, the corresponding wavelength combination was 404, 600, 992, 2,070, 2,266, and 2,462 nm located in the visible light, shortwave NIR, and combination frequency regions. In modeling and independent validation, the total recognition accuracy rate (RARTotal) reached 98.1 and 97.6%, respectively. The technology is quick and easy, which is of great significance to regulate the alcohol market. The proposed model of less-wavelength and high-efficiency (N = 6) can provide a valuable reference for small special instruments. The proposed integrated chemometric method can reduce the correlation between wavelengths, improve the recognition accuracy, and improve the applicability of the Bayesian method.
- Published
- 2022
- Full Text
- View/download PDF
26. Enhance Object Detection Capability with the Object Relation
- Author
-
Li, Mei-Chen, Sharma, Lokesh, Wu, Shih-Lin, Barbosa, Simone Diniz Junqueira, Editorial Board Member, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Esposito, Christian, editor, Hong, Jiman, editor, and Choo, Kim-Kwang Raymond, editor
- Published
- 2019
- Full Text
- View/download PDF
27. Data Mining Tasks and Paradigms
- Author
-
Wan, Cen, Jain, Lakhmi C., Editor-in-Chief, Wu, Xindong, Editor-in-Chief, Brahnam, Sheryl, Series Editor, Cook, Diane J, Series Editor, Domingo-Ferrer, Josep, Series Editor, Gabryś, Bogdan, Series Editor, Herrera, Francisco, Series Editor, Mamitsuka, Hiroshi, Series Editor, Phoha, Vir V., Series Editor, Siebes, Arno, Series Editor, de Wilde, Philippe, Series Editor, and Wan, Cen
- Published
- 2019
- Full Text
- View/download PDF
28. Predicting Traffic Conditions Using Knowledge-Growing Bayes Classifier
- Author
-
Emir Husni, Surya Michrandi Nasution, Kuspriyanto, and Rahadian Yusuf
- Subjects
Bayes classifier ,congestion ,knowledge growing Bayes classifier ,traffic prediction ,machine learning ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Congestion often hinders human mobility. This problem occurs due to the constant increase in vehicles every year. Reliable predictions of traffic conditions would allow drivers to choose their routes to avoid traffic jams while providing the police with traffic management strategies. Therefore, this paper tests the ability of various machine learning methods to predict traffic conditions. The study assesses Neural Networks, Bayes Classifier, Decision Trees, SVM, Deep Neural Network, and Deep Learning. Of these methods, the Decision Tree, Deep Neural Network, and Bayes Classifier show the highest performance in predicting traffic conditions using static data testing. However, in dynamic testing to assess the growth of knowledge, the performance of the Knowledge-Growing Decision Tree tends to decrease as the training data grows. Its performance decreased 3.89 points (88.24% to 84.35%) in accuracy, and 7.55 points (76.25% to 68.70%) for each precision, recall, and F1 Score. Conversely, the Knowledge-Growing Deep Neural Network and Bayes Classifier had a better performance than Decision Tree. The performances of Knowledge-Growing Deep Neural Network increased slightly by 0.35 points (93.38% to 93.73%) for accuracy and 0.69 points (86.77% to 87.64%) in other measurements. Although its performance increased, the processing time takes very long, namely 139452.76 seconds and 318832.80 seconds for sub-scheme (a) and (b), respectively. Meanwhile, the Knowledge-Growing Bayes Classifier offers a greater performance increase of 2.3 points (80.52% to 82.82%) for the accuracy and 4.6 points (65.63% to 61.03%) for the other performance measurements. In addition, it also scored better for processing time, as predictions only take 3 seconds using sub-scheme (a), and 7 seconds when using sub-scheme (b). Therefore, the paper proposes the Knowledge-Growing Bayes Classifier to predict rapidly changing traffic conditions. This method outperform the others. These can be attributed to its ability to 1) adjust to ever-changing the traffic conditions; 2) predict the result as soon as the data are acquired; and 3) make decentralized predictions.
- Published
- 2020
- Full Text
- View/download PDF
29. Down syndrome detection using modified adaboost algorithm.
- Author
-
V. K., Vincy Devi and R., Rajesh
- Subjects
DOWN syndrome ,GENETIC code ,FACIAL expression ,DEVELOPMENTAL delay ,HUMAN body ,HUMAN facial recognition software - Abstract
In human body genetic codes are stored in the genes. All of our inherited traits are associated with these genes and are grouped as structures generally called chromosomes. In typical cases, each cell consists of 23 pairs of chromosomes, out of which each parent contributes half. But if a person has a partial or full copy of chromosome 21, the situation is called Down syndrome. It results in intellectual disability, reading impairment, developmental delay, and other medical abnormalities. There is no specific treatment for Down syndrome. Thus, early detection and screening of this disability are the best styles for down syndrome prevention. In this work, recognition of Down syndrome utilizes a set of facial expression images. Solid geometric descriptor is employed for extracting the facial features from the image set. An AdaBoost method is practiced to gather the required data sets and for the categorization. The extracted information is then assigned and used to instruct the Neural Network using Backpropagation algorithm. This work recorded that the presented model meets the requirement with 98.67% accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
30. PHISH-SAFE: URL Features-Based Phishing Detection System Using Machine Learning
- Author
-
Jain, Ankit Kumar, Gupta, B. B., Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Bokhari, M. U., editor, Agrawal, Namrata, editor, and Saini, Dharmendra, editor
- Published
- 2018
- Full Text
- View/download PDF
31. Cochain-SC: An Intra- and Inter-Domain Ddos Mitigation Scheme Based on Blockchain Using SDN and Smart Contract
- Author
-
Zakaria Abou El Houda, Abdelhakim Senhaji Hafid, and Lyes Khoukhi
- Subjects
DDoS ,entropy ,Bayes classifier ,SDN ,intra-domain mitigation ,inter-domain collaboration ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
With the exponential growth in the number of insecure devices, the impact of Distributed Denial-of-Service (DDoS) attacks is growing rapidly. Existing DDoS mitigation schemes are facing obstacles due to low flexibility, lack of resources, and high cost. The new emerging technologies, such as blockchain, introduce new opportunities for low-cost, efficient and flexible DDoS attacks mitigation across multiple domains. In this paper, we propose a blockchain-based approach, called Cochain-SC, which combines two levels of mitigation, intra-domain and inter-domain DDoS mitigation. For intra-domain, we propose an effective DDoS mitigation method in the context of software defined networks (SDN); it consists of three schemes: (1) Intra Entropy-based scheme (I-ES) to measure, using sFlow, the randomness of data inside the domain; (2) Intra Bayes-based scheme (I-BS) to classify, based on entropy values, illegitimate flows; and (3) Intra-domain Mitigation (I-DM) scheme to effectively mitigate illegitimate flows inside the domain. For inter-domain, we propose a collaborative DDoS mitigation scheme based on blockchain; it uses the concept of smart contracts (i.e., Ethereum's smart contracts) to facilitate the collaboration among SDN-based domains (i.e., Autonomous System: AS) to mitigate DDoS attacks. For this aim, we design a novel and secure scheme that allows multiple SDN-based domains to securely collaborate and transfer attack information in a decentralized manner. Combining intra- and inter-domain DDoS mitigation, Cochain-SC allows an efficient mitigation along the path of an ongoing attack and an effective mitigation near the origin of the attack. This allows reducing the enormous cost of forwarding packets, across multiple domains, which consist mostly of useless amplified attack traffic. To the best of our knowledge, Cochain-SC is the first scheme that proposes to deal with both intra-domain and inter-domain DDoS attacks mitigation combining SDN, blockchain and smart contract. The implementation of Cochain-SC is deployed on Ethereum official test network Ropsten. Moreover, we conducted extensive experiments to evaluate our proposed approach; the experimental results show that Cochain-SC achieves flexibility, efficiency, security, cost effectiveness, and high accuracy in detecting illegitimate flows, making it a promising approach to mitigate DDoS attacks.
- Published
- 2019
- Full Text
- View/download PDF
32. Wall and Object Detection with FRI and Bayes-Classifier for Autonomous Robot
- Author
-
Bartók, Roland, Bouzid, Ahmed, Vásárhelyi, József, Kiss, Márton L., Jármai, Károly, editor, and Bolló, Betti, editor
- Published
- 2017
- Full Text
- View/download PDF
33. Analysis of water quality at hydrographic basin scale using satellite images, co-occurrence matrices and Bayes classifier.
- Author
-
Silva, M. G. G., Silva, D. J., Costa, P. D., Silva, R. C., Cassimiro, T. E. B., Amorim, L. S., Rocha, D. A., and Peixoto, Z. M. A.
- Subjects
REMOTE-sensing images ,WATER quality ,WATER analysis ,WATER shortages ,WATER supply - Abstract
Given the increased risks of water scarcity and the presence of polluting agents in water resources, this paper aims at the development and presentation of a computational tool capable of assessing water quality based on digital processing techniques applied to satellite images. Initially, a database was created for Brazilian regions, consisting of hydrographic basins' satellite images associated with the Water Quality Index (WQI), according to the criteria established by the National Water Agency (ANA). Hitherto, the database consisted of 85 images, 61 were used in the training stage and 24 in the testing stage. In both stages, the images were subjected to thresholding using Otsu's Method, binarization, linear expansion on saturation, application of a Laplacian filter, extraction of characteristics by using co-occurrence matrices and classification by the Bayes Discriminant. Such techniques were also implemented on a computational platform in the MATLAB® environment, responsible for the interface between the system and users. The proposed system presented an approximate 70% success rate regarding the classification of WQIs, which can be improved as more information is made available to improve the databases. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
34. A DEPTH-BASED MODIFICATION OF THE K-NEAREST NEIGHBOUR METHOD.
- Author
-
VENCÁLEK, ONDŘEJ and HLUBINKA, DANIEL
- Subjects
PROBLEM solving ,NEIGHBORS - Abstract
We propose a new nonparametric procedure to solve the problem of classifying objects represented by d-dimensional vectors into K ≥ 2 groups. The newly proposed classifier was inspired by the k nearest neighbour (kNN) method. It is based on the idea of a depth-based distributional neighbourhood and is called k nearest depth neighbours (kNDN) classifier. The kNDN classifier has several desirable properties: in contrast to the classical kNN, it can utilize global properties of the considered distributions (symmetry). In contrast to the maximal depth classifier and related classifiers, it does not have problems with classification when the considered distributions differ in dispersion or have unequal priors. The kNDN classifier is compared to several depth-based classifiers as well as the classical kNN method in a simulation study. According to the average misclassification rates, it is comparable to the best current depthbased classifiers. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
35. Neutral Zone Classifiers Using a Decision-Theoretic Approach With Application to DNA Array Analyses
- Author
-
Yu, Hua, Jeske, Daniel R., Ruegger, Paul, and Borneman, James
- Subjects
Statistics ,Biostatistics ,Environmental Monitoring/Analysis ,Agriculture ,Statistics for Life Sciences, Medicine, Health Sciences ,Classification ,Bayes classifier ,Gaussian Mixture model ,Microbial Community Profiling - Abstract
Two-class neutral zone classifiers were recently proposed for use in microbial community profiling applications. These classifiers allow a region of neutrality for cases where probe hybridization outcomes are too ambiguous to have adequate confidence in assigning a “binding” or “no binding” result. In this paper, we generalize the idea of neutral zone classifiers to an arbitrary number of classes and apply it to improve the process of microbial community profiling by considering a third class for the outcome of probe hybridization experiments, “partial binding.” We introduce a family of class distributions that uses a mixture of Gaussian distributions as a model for a Box–Cox power transformation of the raw intensity measurements. Stratified cross-validation analyses are used to assess the efficacy of the proposed three-class neutral zone classifier. This article has supplementary material online.
- Published
- 2010
36. Bringing Intelligence to Software Defined Networks: Mitigating DDoS Attacks.
- Author
-
Abou El Houda, Zakaria, Khoukhi, Lyes, and Senhaji Hafid, Abdelhakim
- Abstract
As one of the most devastating types of Distributed Denial of Service (DDoS) attacks, Domain Name System (DNS) amplification attack represents a big threat and one of the main Internet security problems to nowadays networks. Many protocols that form the Internet infrastructure expose a set of vulnerabilities that can be exploited by attackers to carry out a set of attacks. DNS, one of the most critical elements of the Internet, is among these protocols. It is vulnerable to DDoS attacks mainly because all exchanges in this protocol use User Datagram Protocol (UDP). These attacks are difficult to defeat because attackers spoof the IP address of the victim and flood him with valid DNS responses coming from legitimate DNS servers. In this paper, we propose an efficient and scalable solution, called WisdomSDN, to effectively mitigate DNS amplification attack in the context of software defined networks (SDN). WisdomSDN covers both detection and mitigation of illegitimate DNS requests and responses. WisdomSDN consists of: (1) a novel proactive and stateful scheme (PAS) to perform one-to-one mapping between DNS requests and DNS responses; it operates proactively by sending only legitimate responses, excluding amplified illegitimate DNS responses; (2) a machine learning DDoS detection module to detect, in real-time, illegitimate DNS requests. This module consists of (a) Flow statistics collection scheme (FSC) to gather the features of flows in an efficient and scalable way using sFlow protocol; (b) Entropy calculation scheme (ECS) to measure randomness of network traffic; and (c) Bayes Network based Filtering scheme (BNF) to classify, based on entropy values, illegitimate DNS requests; and (3) DNS Mitigation scheme (DM) to effectively mitigate illegitimate DNS requests. The experimental results show that, compared to state-of-art, WisdomSDN can effectively detect/mitigate DNS amplification attack quickly with high detection rate, less false positive rate, and low overhead making it a promising solution to mitigate DNS amplification attack in a SDN environment. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
37. Prudence when assuming normality: An advice for machine learning practitioners.
- Author
-
Yousef, Waleed A.
- Subjects
- *
MACHINE learning , *PRUDENCE , *CENTRAL limit theorem , *ADVICE , *RECEIVER operating characteristic curves - Abstract
• In binary classification, the input feature vector is given a score to be compared to a threshold for class prediction. • The normal assumption of the score is sometimes severely violated even under the multinormal assumption of the feature vector. • The article proves this mathematically to provide an advice for practitioners to avoid blind assumptions of normality. • On the other hand, the article illustrate some of the expected results of the AUC under multinormal assumption. • Therefore, the message of the article is not to avoid the normal assumption; however, a prudence is needed. In a binary classification problem the feature vector (predictor) is the input to a scoring function that produces a decision value (score), which is compared to a particular chosen threshold to provide a final class prediction (output). Although the normal assumption of the scoring function is important in many applications, sometimes it is severely violated even under the simple multinormal assumption of the feature vector. This article proves this result mathematically with a counterexample to provide an advice for practitioners to avoid blind assumptions of normality. On the other hand, the article provides a set of experiments that illustrate some of the expected and well-behaved results of the Area Under the ROC curve (AUC) under the multinormal assumption of the feature vector. Therefore, the message of the article is not to avoid the normal assumption of either the input feature vector or the output scoring function; however, a prudence is needed when adopting either of both. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
38. An Attribute-Weighted Bayes Classifier Based on Asymmetric Correlation Coefficient.
- Author
-
Liu, Jingxian and Zhang, Yulin
- Subjects
- *
STATISTICAL correlation , *BAYES' estimation , *ALGORITHMS , *FORECASTING , *LOGISTIC regression analysis , *INTRACLASS correlation - Abstract
In this research, an attribute-weighted one-dependence Bayes estimation algorithm based on the asymmetric correlation coefficient is proposed. The asymmetric correlation coefficients Tau_y and Lambda_y, respectively, are used to calculate the correlation between parent attributes and category labels, then the result of calculation is regarded as weight to the parent attribute. The algorithm is applied to eight types of different datasets including binary classification and multiple classification from the UCI database. By comparing the time complexity and classification accuracy, experimental results show that the algorithm can significantly improve the classification performance with less prediction error. In addition, several baseline methods such as KNN, ANN, logistic regression and SVM are used for comparison with the proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
39. Automated Assessment of Hematoma Volume of Rodents Subjected to Experimental Intracerebral Hemorrhagic Stroke by Bayes Segmentation Approach.
- Author
-
Zhang, Zhexuan, Cho, Sunjoo, Rehni, Ashish K., Quero, Hever Navarro, Dave, Kunjan R., and Zhao, Weizhao
- Abstract
Simulating a clinical condition of intracerebral hemorrhage (ICH) in animals is key to research on the development and testing of diagnostic or treatment strategies for this high-mortality disease. In order to study the mechanism, pathology, and treatment for hemorrhagic stroke, various animal models have been developed. Measurement of hematoma volume is an important assessment parameter to evaluate post-ICH outcomes. However, due to tissue preservation conditions and variables in digitization, quantification of hematoma volume is usually labor intensive and sometimes even subjective. The objective of this study is to develop an automated method that can accurately and efficiently obtain unbiased cerebral hematoma volume. We developed an application (MATLAB program) that can delineate the brain slice from the background and use the Hue information in the Hue/Saturation/Value (HSV) color space to segment the hematoma region. The segmentation threshold of Hue is calculated based on the Bayes classifier theorem so that the minimum error is mathematically ensured and automated processing is enabled. To validate the developed method, we compared the outcomes from the developed method with the hemoglobin content by the spectrophotometric assay method. The results were linearly correlated with statistical significance. The method was also validated by digital phantoms with an error less than 5% compared with the ground truth from the phantoms. Hematoma volumes yielded by the automated processing and those obtained by the operator's manual operation are highly correlated. This automated segmentation approach can be potentially used to quantify hemorrhagic outcomes in rodent stroke models in an unbiased and efficient way. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
40. 改进卷积神经网络的手写试卷分数识别方法.
- Author
-
仝梦园, 金守峰, 陈 阳, 李 毅, and 尹加杰
- Subjects
- *
CONVOLUTIONAL neural networks , *ARTIFICIAL neural networks , *ALGORITHMS , *HANDWRITING recognition (Computer science) , *CLASSIFICATION algorithms , *ERROR rates - Abstract
Aiming at the problems of time-consuming and high error rate in handwriting test scores statistics a handwriting test paper recognition method based on improved convolution neural network algorithm was proposed. In order to simplify the classification of the score» the score column of the handwritten test paper was divided to obtain ten types of numbers from 0 to 9. In order to improve the efficiency of handwritten score recognition, a classification and recognition algorithm of convolution neural network and Bayesian was proposed. The constructed convolution neural network model was used to extract the characteristics of handwritten digits. The PCA algorithm was used to reduce the dimensionality of the features.The hayes classifier was used to distinguish ten kinds of numbers from 0 to 9. the auuracy and efficiency of the algorithm were verified in the MNIST database.The paper score summation model was established and automatic summation is performed after recognition. The experimental results show that for the recognition uf the handwritten scores of 1 188 lest papers in 3 courses♦ the algorithm in this paper has a recognition rate of 98. 23% compared with other algorithms, the average recognition time per test paper is 7. 5 s, and verified its practicality. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
41. Freshness Classification of Horse Mackerels with E-Nose System Using Hybrid Binary Decision Tree Structure.
- Author
-
Güney, Selda and Atasoy, Ayten
- Subjects
- *
DECISION trees , *FISH spoilage , *MACKERELS , *ELECTRONIC noses , *FISHER discriminant analysis , *METALLIC oxides , *HYBRID systems - Abstract
The aim of this study is to test the freshness of horse mackerels by using a low cost electronic nose system composed of eight different metal oxide sensors. The process of freshness evaluation covers a scala of seven different classes corresponding to 1, 3, 5, 7, 9, 11, and 13 storage days. These seven classes are categorized according to six different classifiers in the proposed binary decision tree structure. Classifiers at each particular node of the tree are individually trained with the training dataset. To increase success in determining the level of fish freshness, one of the k-Nearest Neighbors (k-NN), Support Vector Machines (SVM), Linear Discriminant Analysis (LDA) and Bayes methods is selected for every classifier and the feature spaces change in every node. The significance of this study among the others in the literature is that this proposed decision tree structure has never been applied to determine fish freshness before. Because the freshness of fish is observed under actual market storage conditions, the classification is more difficult. The results show that the electronic nose designed with the proposed decision tree structure is able to determine the freshness of horse mackerels with 85.71% accuracy for the test data obtained one year after the training process. Also, the performances of the proposed methods are compared against conventional methods such as Bayes, k-NN, and LDA. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
42. A Semi-supervised method for tumor segmentation in mammogram images.
- Author
-
Azary, Hanie and Abdoos, Monireh
- Subjects
- *
FISHER discriminant analysis , *IMAGE segmentation , *SUPERVISED learning , *SUPPORT vector machines , *TUMORS , *MACHINE learning - Abstract
Background: Breast cancer is one of the most common cancers in women. Mammogram images have an important role in the treatment of various states of this cancer. In recent years, machine learning methods have been widely used for tumor segmentation in mammogram images. Pixel-based segmentation methods have been presented using both supervised and unsupervised learning approaches. Supervised learning methods are usually fast and accurate, but they usually use a large number of labeled data. Besides, providing these samples is very hard and usually expensive. Unsupervised learning methods do not require the labels of the training data for decision making and they completely ignore the prior knowledge that may lead to a low performance. Semi-supervised learning methods which use a small number of labeled data solve the problem of providing the high number of samples in supervised methods, while they usually result in a higher accuracy in comparison to the unsupervised methods. Methods: In this study, we used a semisupervised method for tumor segmentation in which the pixel information is used for the classification. The static and gray level run length matrix features for each pixel are considered as the features, and Fisher discriminant analysis (FDA) is used for feature reduction. A cotraining algorithm based on support vector machine and Bayes classifiers is proposed for tumor segmentation on MIAS data set. Results and Conclusion: The results show that the proposed method outperforms both supervised methods. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
43. A Bayes Classifier Based on Multiscale Features in Brain Death Diagnosis
- Author
-
Ni, Li, Cao, Jianting, Wang, Rubin, Rubin, Wang, Series editor, Wang, Rubin, editor, and Pan, Xiaochuan, editor
- Published
- 2016
- Full Text
- View/download PDF
44. Preliminary Investigations on the spam filtering using statistical classification techniques
- Author
-
Rajeswari, A.V.
- Published
- 2017
45. On the efficient evaluation of probabilistic similarity functions for image retrieval
- Author
-
Vasconcelos, N
- Subjects
Bayes classifier ,Gauss mixture (GM) ,image databases ,Kullback-Leibler (KL) divergence ,maximum a posteriori probability (MAP) similarity ,probabilistic image retrieval ,vector quantizer (VQ) - Abstract
Probabilistic approaches are a promising solution to the image retrieval problem that, when compared to standard retrieval methods, can lead to a significant gain in retrieval accuracy. However, this occurs at the cost of a significant increase in computational complexity. In fact, closed-form solutions for probabilistic retrieval are currently available only for simple probabilistic models such as the Gaussian or the histogram. We analyze the case of mixture densities and exploit the asymptotic equivalence between likelihood and Kullback-Leibler (KL) divergence to derive solutions for these models. In particular, 1) we show that the divergence can be computed exactly for vector quantizers (VQs) and 2) has an approximate solution for Gauss mixtures (GMs) that, in high-dimensional feature spaces, introduces no significant degradation of the resulting similarity judgments. In both cases, the new solutions have closed-form and computational complexity equivalent to that of standard retrieval approaches.
- Published
- 2004
46. An Integrated Computational Schema for Analysis, Prediction and Visualization of piRNA Sequences
- Author
-
Rahiman, Anusha Abdul, Ajitha, Jithin, Chandra, Vinod, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Huang, De-Shuang, editor, Bevilacqua, Vitoantonio, editor, and Premaratne, Prashan, editor
- Published
- 2015
- Full Text
- View/download PDF
47. Yellowfin Tuna (Thunnusalbacares) Fishing Ground Forecasting Model Based On Bayes Classifier In The South China Sea
- Author
-
Zhou Wei-feng, Li An-zhou, Ji Shi-jian, and Qiu Yong-song
- Subjects
bayes classifier ,south china sea,yellowfin tuna ,fishing ground forecasting ,Naval architecture. Shipbuilding. Marine engineering ,VM1-989 - Abstract
Using the yellowfin tuna (Thunnusalbacares,YFT)longline fishing catch data in the open South China Sea (SCS) provided by WCPFC, the optimum interpolation sea surface temperature (OISST) from CPC/NOAA and multi-satellites altimetric monthly averaged product sea surface height (SSH) released by CNES, eight alternative options based on Bayes classifier were made in this paper according to different strategies on the choice of environment factors and the levels of fishing zones to classify the YFT fishing ground in the open SCS. The classification results were compared with the actual ones for validation and analyzed to know how different plans impact on classification results and precision. The results of validation showed that the precision of the eight options were 71.4%, 75%, 70.8%, 74.4%, 66.7%, 68.5%, 57.7% and 63.7% in sequence, the first to sixth among them above 65% would meet the practical application needs basically. The alternatives which use SST and SSH simultaneously as the environmental factors have higher precision than which only use single SST environmental factor, and the consideration of adding SSH can improve the model precision to a certain extent. The options which use CPUE’s mean ± standard deviation as threshold have higher precision than which use CPUE’s 33.3%-quantile and 66.7%-quantile as the threshold
- Published
- 2017
- Full Text
- View/download PDF
48. An evolutionary classifier for steel surface defects with small sample set
- Author
-
Mang Xiao, Mingming Jiang, Guangyao Li, Li Xie, and Li Yi
- Subjects
Surface defect ,Support vector machine ,Random subspace ,Bayes classifier ,Electronics ,TK7800-8360 - Abstract
Abstract Nowadays, surface defect detection systems for steel strip have replaced traditional artificial inspection systems, and automatic defect detection systems offer good performance when the sample set is large and the model is stable. However, the trained model does work well when a new production line is initiated with different equipment, processes, or detection devices. These variables make just tiny changes to the real-world model but have a significant impact on the classification result. To overcome these problems, we propose an evolutionary classifier with a Bayes kernel (BYEC) that can be adjusted with a small sample set to better adapt the model for a new production line. First, abundant features were introduced to cover detailed information about the defects. Second, we constructed a series of support vector machines (SVMs) with a random subspace of the features. Then, a Bayes classifier was trained as an evolutionary kernel fused with the results from the sub-SVM to form an integrated classifier. Finally, we proposed a method to adjust the Bayes evolutionary kernel with a small sample set. We compared the performance of this method to various algorithms; experimental results demonstrate that the proposed method can be adjusted with a small sample set to fit the changed model. Experimental evaluations were conducted to demonstrate the robustness, low requirement for samples, and adaptiveness of the proposed method.
- Published
- 2017
- Full Text
- View/download PDF
49. Bayes Imbalance Impact Index: A Measure of Class Imbalanced Data Set for Classification Problem.
- Author
-
Lu, Yang, Cheung, Yiu-Ming, and Tang, Yuan Yan
- Subjects
- *
CLASSIFICATION , *LATENT class analysis (Statistics) , *FORECASTING , *DATA , *SAMPLING methods - Abstract
Recent studies of imbalanced data classification have shown that the imbalance ratio (IR) is not the only cause of performance loss in a classifier, as other data factors, such as small disjuncts, noise, and overlapping, can also make the problem difficult. The relationship between the IR and other data factors has been demonstrated, but to the best of our knowledge, there is no measurement of the extent to which class imbalance influences the classification performance of imbalanced data. In addition, it is also unknown which data factor serves as the main barrier for classification in a data set. In this article, we focus on the Bayes optimal classifier and examine the influence of class imbalance from a theoretical perspective. We propose an instance measure called the Individual Bayes Imbalance Impact Index (IBI3) and a data measure called the Bayes Imbalance Impact Index (BI3). IBI3 and BI3 reflect the extent of influence using only the imbalance factor, in terms of each minority class sample and the whole data set, respectively. Therefore, IBI3 can be used as an instance complexity measure of imbalance and BI3 as a criterion to demonstrate the degree to which imbalance deteriorates the classification of a data set. We can, therefore, use BI3 to access whether it is worth using imbalance recovery methods, such as sampling or cost-sensitive methods, to recover the performance loss of a classifier. The experiments show that IBI3 is highly consistent with the increase of the prediction score obtained by the imbalance recovery methods and that BI3 is highly consistent with the improvement in the F1 score obtained by the imbalance recovery methods on both synthetic and real benchmark data sets. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
50. Mandarin Phonetic Symbol Combination Recogniztion in Visioned Based Input Systems
- Author
-
Yu, Chih-Chang, Cheng, Hsu-Yung, Huang, Yueh-Min, editor, Chao, Han-Chieh, editor, Deng, Der-Jiunn, editor, and Park, James J. (Jong Hyuk), editor
- Published
- 2014
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.