16 results on '"Douglas O. Cardoso"'
Search Results
2. Online evaluation of the Kolmogorov–Smirnov test on arbitrarily large samples
- Author
-
Douglas O. Cardoso and Thalis D. Galeno
- Subjects
General Computer Science ,Modeling and Simulation ,Theoretical Computer Science - Published
- 2023
3. Sentiment Analysis Applied to IBOVESPA Prediction
- Author
-
Yngwi Guimarães Vieira Souza, Luís Tarrataca, Douglas O. Cardoso, and Laura Silva de Assis
- Published
- 2022
4. Long-Term Person Reidentification: Challenges and Outlook
- Author
-
Anderson Manhães, Gabriel Matos, Douglas O. Cardoso, Milena F. Pinto, Jeferson Colares, Paulo Leitão, Diego Brandão, and Diego Haddad
- Subjects
Multimodal retrieval ,Long-term person ReID ,Deep learning ,Computer vision - Abstract
Person reidentification, i.e., retrieving a person of interest across several non-overlapping cameras, is a task that is far from trivial. Despite its great commercial value and wide range of applications (e.g., surveillance, intelligent environments, forensics, service robotics, marketing), it remains unsolved, even when the individuals do not change clothes during the recognition period. This paper provides an outlook on long-term person reidentification, an emerging research topic regarding when consecutive acquisitions of an individual can be found apart for days or even months, making such a task even more challenging. A long-term reidentification system using face recognition is presented to emphasize current techniques’ limitations. FCT (Fundação para a Ciência e a Tecnologia), under the project UIDB/05567/2020. info:eu-repo/semantics/publishedVersion
- Published
- 2022
5. Text documents streams with improved incremental similarity
- Author
-
Rui Portocarrero Sarmento, Kemmily Dearo, Pavel Brazdil, Douglas O. Cardoso, and João Gama
- Subjects
Information retrieval ,Computer science ,business.industry ,Communication ,Text document ,Subject (documents) ,STREAMS ,Computer Science Applications ,Human-Computer Interaction ,Documentation ,Text mining ,Similarity (psychology) ,ComputingMethodologies_DOCUMENTANDTEXTPROCESSING ,Media Technology ,General knowledge ,Architecture ,business ,Information Systems - Abstract
There has been a significant effort by the research community to address the problem of providing methods to organize documentation, with the help of Information Retrieval methods. In this paper, we present several experiments with stream analysis methods to explore streams of text documents. This paper also presents possible architectures of the Text Document Stream Organization, with the use of incremental algorithms like Incremental Sparse TF-IDF and Incremental Similarity. Our results show that with this architecture, significant improvements are achieved, regarding efficiency in grouping of similar documents. These improvements are important since it is of general knowledge that great amounts of text analysis are a high dimensional and complex subject of study, in the data analysis area.
- Published
- 2021
6. A sketch for the KS test for Big Data
- Author
-
Thalis D. Galeno, Douglas O. Cardoso, and João Gama
- Subjects
Information retrieval ,Computer science ,business.industry ,Big data ,business ,Sketch ,Test (assessment) - Abstract
Motivated by the challenges of Big Data, this paper presents an approximative algorithm to assess the Kolmogorov-Smirnov test. This goodness of fit statistical test is extensively used because it is non-parametric. This work focuses on the one-sample test, which considers the hypothesis that a given univariate sample follows some reference distribution. The method allows to evaluate the departure from such a distribution of a input stream, being space and time efficient. We show the accuracy of our algorithm by making several experiments in different scenarios: varying reference distribution and its parameters, sample size, and available memory. The performance of rival methods, some of which are considered the state-of-the-art, were compared. It is demonstrated that our algorithm is superior in most of the cases, considering the absolute error of the test statistic.
- Published
- 2021
7. Multi-Armed Bandits for Minesweeper: Profiting from Exploration-Exploitation Synergy
- Author
-
Diego B. Haddad, Igor Q. Lordeiro, and Douglas O. Cardoso
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer science ,Management science ,media_common.quotation_subject ,Machine Learning (stat.ML) ,Machine Learning (cs.LG) ,Artificial Intelligence (cs.AI) ,Work (electrical) ,Luck ,Statistics - Machine Learning ,Artificial Intelligence ,Control and Systems Engineering ,Order (business) ,Learning theory ,Reinforcement learning ,Electrical and Electronic Engineering ,Software ,media_common - Abstract
A popular computer puzzle, the game of Minesweeper requires its human players to have a mix of both luck and strategy to succeed. Analyzing these aspects more formally, in our research we assessed the feasibility of a novel methodology based on Reinforcement Learning as an adequate approach to tackle the problem presented by this game. For this purpose we employed Multi-Armed Bandit algorithms which were carefully adapted in order to enable their use to define autonomous computational players, targeting to make the best use of some game peculiarities. After experimental evaluation, results showed that this approach was indeed successful, especially in smaller game boards, such as the standard beginner level. Despite this fact the main contribution of this work is a detailed examination of Minesweeper from a learning perspective, which led to various original insights which are thoroughly discussed., Comment: To be published in IEEE Transactions on Games (ISSN 2475-1510 / 2475-1502)
- Published
- 2020
- Full Text
- View/download PDF
8. Weightless neuro-symbolic GPS trajectory classification
- Author
-
Douglas O. Cardoso, Felipe M. G. França, Raul Barbosa, and Diego Carvalho
- Subjects
Computer science ,business.industry ,Cognitive Neuroscience ,02 engineering and technology ,Machine learning ,computer.software_genre ,Computer Science Applications ,Gps trajectory ,Artificial Intelligence ,020204 information systems ,Public transport ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Classifier (UML) ,computer - Abstract
This paper presents a framework for dealing with the problem of GPS trajectory classification in the context of the Rio de Janeiro’s public transit system (with hundreds or more classes). Such framework combines the versatile WiSARD classifier with a set of rules defined a priori, resulting in a neuro-symbolic learning system with very interesting characteristics and cutting-edge performance. We also verified the influence of different binarization methods in order to adapt raw data to WiSARD, which feeds from binary data only. These ideas were tested against a large data set of trajectories of buses from the city of Rio de Janeiro. The results confirm the practical applicability of those, since the accomplished performance was as good as that of other state-of-the-art rival methods in most test scenarios.
- Published
- 2018
9. Weightless neural networks for open set recognition
- Author
-
Felipe M. G. França, Douglas O. Cardoso, and João Gama
- Subjects
business.industry ,Weightless neural networks ,Computer science ,020209 energy ,Computation ,Probabilistic logic ,Open set ,02 engineering and technology ,Machine learning ,computer.software_genre ,Artificial Intelligence ,Weightless ,Problem domain ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Anomaly detection ,Artificial intelligence ,business ,computer ,Classifier (UML) ,Software - Abstract
Open set recognition is a classification-like task. It is accomplished not only by the identification of observations which belong to targeted classes (i.e., the classes among those represented in the training sample which should be later recognized) but also by the rejection of inputs from other classes in the problem domain. The need for proper handling of elements of classes beyond those of interest is frequently ignored, even in works found in the literature. This leads to the improper development of learning systems, which may obtain misleading results when evaluated in their test beds, consequently failing to keep the performance level while facing some real challenge. The adaptation of a classifier for open set recognition is not always possible: the probabilistic premises most of them are built upon are not valid in a open-set setting. Still, this paper details how this was realized for WiSARD a weightless artificial neural network model. Such achievement was based on an elaborate distance-like computation this model provides and the definition of rejection thresholds during training. The proposed methodology was tested through a collection of experiments, with distinct backgrounds and goals. The results obtained confirm the usefulness of this tool for open set recognition.
- Published
- 2017
10. WCDS: A Two-Phase Weightless Neural System for Data Stream Clustering
- Author
-
Douglas O. Cardoso, João Gama, and Felipe M. G. França
- Subjects
Computer Networks and Communications ,Computer science ,media_common.quotation_subject ,02 engineering and technology ,Machine learning ,computer.software_genre ,Theoretical Computer Science ,Task (project management) ,Knowledge extraction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Quality (business) ,Cluster analysis ,media_common ,Measure (data warehouse) ,business.industry ,Data stream clustering ,Hardware and Architecture ,Data analysis ,020201 artificial intelligence & image processing ,Data mining ,Artificial intelligence ,business ,computer ,Software ,Countermeasure (computer) - Abstract
Clustering is a powerful and versatile tool for knowledge discovery, able to provide a valuable information for data analysis in various domains. To perform this task based on streaming data is quite challenging: outdated knowledge needs to be disposed while the current knowledge is obtained from fresh data; since data are continuously flowing, strict efficiency constraints have to be met. This paper presents WCDS, an approach to this problem based on the WiSARD artificial neural network model. This model already had useful characteristics as inherent incremental learning capability and patent functioning speed. These were combined with novel features as an adaptive countermeasure to cluster imbalance, a mechanism to discard expired data, and offline clustering based on a pairwise similarity measure for WiSARD discriminators. In an insightful experimental evaluation, the proposed system had an excellent performance according to multiple quality standards. This supports its applicability for the analysis of data streams.
- Published
- 2017
11. Financial credit analysis via a clustering weightless neural classifier
- Author
-
Priscila M. V. Lima, Hugo C. C. Carneiro, Felipe M. G. França, Douglas O. Cardoso, Daniel S. F. Alves, Danilo S. Carvalho, Diego Fonseca Pereira de Souza, and Carlos E. Pedreira
- Subjects
Credit analysis ,0209 industrial biotechnology ,Artificial neural network ,Computer science ,business.industry ,Cognitive Neuroscience ,02 engineering and technology ,Machine learning ,computer.software_genre ,Computer Science Applications ,Support vector machine ,020901 industrial engineering & automation ,Artificial Intelligence ,Robustness (computer science) ,Weightless ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Cluster analysis ,computer ,Classifier (UML) - Abstract
Credit analysis is a real-world classification problem where it is quite common to find datasets with a large amount of noisy data. State-of-the-art classifiers that employ error minimisation techniques, on the other hand, require a long time to converge, in order to achieve robustness. This paper explores ClusWiSARD, a clustering customisation of the WiSARD weightless neural network model, applied to two different credit analysis real-world problems. Experimental evidence shows that ClusWiSARD is very competitive with Support Vector Machine (SVM) w.r.t. accuracy, with the advantage of being capable of online learning. ClusWiSARD outperforms SVM in training time, by two orders of magnitude, and is slightly faster in test time.
- Published
- 2016
12. Clustering data streams using a forgetful neural model
- Author
-
Douglas O. Cardoso, João Gama, and Felipe M. G. França
- Subjects
Data stream ,Artificial neural network ,Computer science ,business.industry ,Data stream mining ,02 engineering and technology ,computer.software_genre ,Machine learning ,Task (project management) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Data mining ,Artificial intelligence ,Cluster analysis ,business ,computer - Abstract
To cluster a data stream is a more challenging task than its regular batch version, having stricter performance constraints. In this paper an approach to this problem is presented, based on WiSARD, a memory-based artificial neural network (ANN) model. This model functioning was reviewed and improved, in order to adapt it to this task. The experimental results obtained support the use of this system for the analysis of data streams in an informative way.
- Published
- 2016
13. A bounded neural network for open set recognition
- Author
-
Douglas O. Cardoso, João Gama, and Felipe M. G. França
- Subjects
Training set ,Artificial neural network ,business.industry ,Time delay neural network ,Computer science ,Feature vector ,Open set ,Machine learning ,computer.software_genre ,Support vector machine ,Component (UML) ,Bounded function ,Artificial intelligence ,business ,computer - Abstract
Open set recognition is, more than an interesting research subject, a component of various machine learning applications which is sometimes neglected: it is not unusual the existence of learning systems developed on the top of closed-set assumptions, ignoring the error risk involved in a prediction. This risk is strictly related to the location in feature space where the prediction has to be made, compared to the location of the training data: the more distant the training observations are, less is known, higher is the risk. Proper handling of this risk can be necessary in various situation where classification and its variants are employed. This paper presents an approach to open set recognition based on an elaborate distance-like computation provided by a weightless neural network model. The results obtained in the proposed test scenarios are quite interesting, placing the proposed method among the current best ones.
- Published
- 2015
14. An Empirical Study of the Influence of Data Structures on the Performance of VG-RAM Classifiers
- Author
-
Priscila M. V. Lima, Hugo C. C. Carneiro, Felipe M. G. França, Daniel S. F. Alves, and Douglas O. Cardoso
- Subjects
Empirical research ,Computer science ,Weightless neural networks ,business.industry ,Artificial intelligence ,Data mining ,Data structure ,Machine learning ,computer.software_genre ,business ,computer - Abstract
This work investigates the effect of different data structures on the performance and accuracy of VG-RAM-based classifiers. This weightless neural model is based on RAM nodes having very large address input, what suggests the use of special data structures in order to deal with space and time computational costs. Four different data structures are explored, including the classical one used in recent VG-RAM related literature, resulting in a novel and accurate yet fast setup.
- Published
- 2013
15. A Weightless Neural Network-Based Approach for Stream Data Clustering
- Author
-
Douglas O. Cardoso, João Gama, Felipe M. G. França, Priscila M. V. Lima, and Massimo De Gregorio
- Subjects
Artificial neural network ,Data stream mining ,business.industry ,Weightless neural networks ,Computer science ,computer.software_genre ,Machine learning ,Data stream clustering ,Sliding window protocol ,Artificial intelligence ,Data mining ,Raw data ,business ,Stream data ,Cluster analysis ,computer - Abstract
One of the major data mining tasks is to cluster similar data, because of its usefulness, providing means of summarizing large ammounts of raw data into handy information. Clustering data streams is particularly challenging, because of the constraints imposed when dealing with this kind of input. Here we report our work, in which it was investigated the use of WiSARD discriminators as primary data synthesizing units. An analysis of StreamWiSARD, a new sliding-window stream data clustering system, the benefits and the drawbacks of its use and a comparison to other approaches are all presented.
- Published
- 2012
16. Weightless neural modeling for mining data streams
- Author
-
Felipe M. G. França, Douglas O. Cardoso, and João Gama
- Subjects
Computer science ,Data stream mining ,Weightless ,Data mining ,computer.software_genre ,computer
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.