10 results on '"Jefrey Lijffijt"'
Search Results
2. Machine Learning and Knowledge Discovery in Databases
- Author
-
Jefrey Lijffijt
- Published
- 2021
3. Artificial Intelligence and Machine Learning
- Author
-
Mitra Baratchi, Lu Cao, Frank W. Takes, Walter A. Kosters, Jefrey Lijffijt, and Jan N. van Rijn
- Subjects
Coronavirus disease 2019 (COVID-19) ,business.industry ,Computer science ,Volume (computing) ,Artificial intelligence ,business ,Machine learning ,computer.software_genre ,computer ,Game theory ,Selection (genetic algorithm) - Abstract
This book contains a selection of the best papers of the 32nd Benelux Conference on Artificial Intelligence, BNAIC/Benelearn 2020, held in Leiden, The Netherlands, in November 2020. Due to the COVID-19 pandemic the conference was held online. The 12 papers presented in this volume were carefully reviewed and selected from 41 regular submissions. They address various aspects of artificial intelligence such as natural language processing, agent technology, game theory, problem solving, machine learning, human-agent interaction, AI and education, and data analysis.
- Published
- 2021
4. Machine Learning and Knowledge Discovery in Databases
- Author
-
Frank Hutter, Isabel Valera, Kristian Kersting, and Jefrey Lijffijt
- Subjects
Artificial architecture ,Commonsense knowledge ,Computer science ,business.industry ,Open Knowledge Base Connectivity ,Rule-based system ,Marketing and artificial intelligence ,Mathematical knowledge management ,Machine learning ,computer.software_genre ,Data science ,Knowledge extraction ,Software mining ,Artificial intelligence ,business ,computer - Published
- 2021
5. Machine Learning and Knowledge Discovery in Databases
- Author
-
Isabel Valera, Kristian Kersting, Jefrey Lijffijt, and Frank Hutter
- Subjects
World Wide Web ,Knowledge extraction ,Computer science - Published
- 2021
6. Gibbs Sampling Subjectively Interesting Tiles
- Author
-
Tijl De Bie, Jefrey Lijffijt, Anes Bendimerad, Céline Robardet, Marc Plantevit, Berthold, MR, Feelders, A, Krempl, G, Data Mining and Machine Learning (DM2L), Laboratoire d'InfoRmatique en Image et Systèmes d'information (LIRIS), Institut National des Sciences Appliquées de Lyon (INSA Lyon), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-École Centrale de Lyon (ECL), Université de Lyon-Université Lumière - Lyon 2 (UL2)-Institut National des Sciences Appliquées de Lyon (INSA Lyon), Université de Lyon-Université Lumière - Lyon 2 (UL2), Internet Technology and Data Science Lab (IDLab), and Universiteit Antwerpen [Antwerpen]-Universiteit Gent = Ghent University [Belgium] (UGENT)
- Subjects
Technology and Engineering ,Computer science ,02 engineering and technology ,KNOWLEDGE DISCOVERY ,computer.software_genre ,[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] ,Interpretation (model theory) ,Local pattern ,Set (abstract data type) ,symbols.namesake ,[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG] ,Gibbs sampling ,Pattern mining ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Pattern sampling ,[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB] ,Subjective interestingness ,business.industry ,Trawling ,Usability ,Mathematics and Statistics ,Large set (Ramsey theory) ,symbols ,020201 artificial intelligence & image processing ,Data mining ,Computational problem ,business ,computer - Abstract
International audience; The local pattern mining literature has long struggled with the so-called pattern explosion problem: the size of the set of patterns found exceeds the size of the original data. This causes computational problems (enumerating a large set of patterns will inevitably take a substantial amount of time) as well as problems for interpretation and usabil-ity (trawling through a large set of patterns is often impractical). Two complementary research lines aim to address this problem. The first aims to develop better measures of interestingness, in order to reduce the number of uninteresting patterns that are returned [6, 10]. The second aims to avoid an exhaustive enumeration of all 'interesting' patterns (where interestingness is quantified in a more traditional way, e.g. frequency), by directly sampling from this set in a way that more 'interest-ing' patterns are sampled with higher probability [2]. Unfortunately, the first research line does not reduce computational cost, while the second may miss out on the most interesting patterns. In this paper, we combine the best of both worlds for mining interesting tiles [8] from binary databases. Specifically, we propose a new pattern sampling approach based on Gibbs sampling, where the probability of sampling a pattern is proportional to their subjective interest-ingness [6]-an interestingness measure reported to better represent true interestingness. The experimental evaluation confirms the theory, but also reveals an important weakness of the proposed approach which we speculate is shared with any other pattern sampling approach. We thus conclude with a broader discussion of this issue, and a forward look.
- Published
- 2020
7. Subjectively Interesting Connecting Trees
- Author
-
Tijl De Bie, Florian Adriaens, Jefrey Lijffijt, Ceci, Michelangelo, Hollm{\'e}n, Jaakko, Todorovski, Ljup{\v{c}}o, and Vens, Celine
- Subjects
Technology and Engineering ,Information theory ,Theoretical computer science ,Subjective interestingness ,Exploratory Data Mining ,business.industry ,Computer science ,Heuristic ,Node (networking) ,02 engineering and technology ,Telecommunications network ,Set (abstract data type) ,Graph pattern mining ,Tree (data structure) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,The Internet ,Heuristics ,business ,Graphs - Abstract
Consider a large network, and a user-provided set of query nodes between which the user wishes to explore relations. For example, a researcher may want to connect research papers in a citation network, an analyst may wish to connect organized crime suspects in a communication network, or an internet user may want to organize their bookmarks given their location in the world wide web. A natural way to show how query nodes are related is in the form of a tree in the network that connects them. However, in sufficiently dense networks, most such trees will be large or somehow trivial (e.g. involving high degree nodes) and thus not insightful. In this paper, we define and investigate the new problem of mining subjectively interesting trees connecting a set of query nodes in a network, i.e., trees that are highly surprising to the specific user at hand. Using information theoretic principles, we formalize the notion of interestingness of such trees mathematically, taking in account any prior beliefs the user has specified about the network. We then propose heuristic algorithms to find the best trees efficiently, given a specified prior belief model. Modeling the user's prior belief state is however not necessarily computationally tractable. Yet, we show how a highly generic class of prior beliefs, namely about individual node degrees in combination with the density of particular sub-networks, can be dealt with in a tractable manner. Such types of beliefs can be used to model knowledge of a partial or total order of the network nodes, e.g. where the nodes represent events in time (such as papers in a citation network). An empirical validation of our methods on a large real network evaluates the different heuristics and validates the interestingness of the given trees.
- Published
- 2017
8. Hierarchical Novelty Detection
- Author
-
Matthew J McVicar, Raul Santos-Rodriguez, Tijl De Bie, Paolo Simeone, and Jefrey Lijffijt
- Subjects
Optimization ,Point (typography) ,Hierarchy (mathematics) ,Music genre classification ,Computer science ,business.industry ,Music information retrieval ,Node (networking) ,Novelty ,Pattern recognition ,02 engineering and technology ,Hierarchical classification ,Novelty detection ,ComputingMethodologies_PATTERNRECOGNITION ,Data point ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Layer (object-oriented design) ,business - Abstract
Hierarchical classification is commonly defined as multi-class classification where the classes are hierarchically nested. Many practical hierarchical classification problems also share features with multi-label classification (i.e., each data point can have any number of labels, even non-hierarchically related) and novelty detection (i.e., some data points are novelties at some level of the hierarchy). A further complication is that it is common for training data to be incompletely labelled, e.g. the most specific labels are not always provided. In music genre classification for example, there are numerous music genres (multi-class) which are hierarchically related. Songs can belong to different (even non-nested) genres (multi-label), and a song labelled as Rock may not belong to any of its sub-genres, such that it is a novelty within this genre (novelty-detection). Finally, the training data may label a song as Rock whereas it really could be labelled correctly as the more specific genre Blues Rock. In this paper we develop a new method for hierarchical classification that naturally accommodates every one of these properties. To achieve this we develop a novel approach, modelling it as a Hierarchical Novelty Detection problem that can be trained through a single convex second-order cone programming problem. This contrasts with most existing approaches that typically require a model to be trained for each layer or internal node in the label hierarchy. Empirical results on a music genre classification problem are reported, comparing with a state-of-the-art method as well as simple benchmarks.
- Published
- 2017
9. Interactive visual data exploration with subjective feedback
- Author
-
Tijl De Bie, Bo Kang, Jefrey Lijffijt, and Kai Puolamäki
- Subjects
Information retrieval ,Technology and Engineering ,Process (engineering) ,business.industry ,Computer science ,Exploratory Data Mining ,02 engineering and technology ,Machine learning ,computer.software_genre ,Synthetic data ,Visualization ,Domain (software engineering) ,Software ,Data visualization ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Zoom ,Projection (set theory) ,business ,computer ,Dimensionality Reduction ,Data Visualisation ,Subjective Interestingness - Abstract
Data visualization and iterative/interactive data mining are growing rapidly in attention, both in research as well as in industry. However, integrated methods and tools that combine advanced visualization and data mining techniques are rare, and those that exist are often specialized to a single problem or domain. In this paper, we introduce a novel generic method for interactive visual exploration of high-dimensional data. In contrast to most visualization tools, it is not based on the traditional dogma of manually zooming and rotating data. Instead, the tool initially presents the user with an ‘interesting’ projection of the data and then employs data randomization with constraints to allow users to flexibly and intuitively express their interests or beliefs using visual interactions that correspond to exactly defined constraints. These constraints expressed by the user are then taken into account by a projection-finding algorithm to compute a new ‘interesting’ projection, a process that can be iterated until the user runs out of time or finds that constraints explain everything she needs to find from the data. We present the tool by means of two case studies, one controlled study on synthetic data and another on real census data. The data and software related to this paper are available at http://www.interesting-patterns.net/forsied/interactive-visual-data-exploration-with-subjective-feedback/.
- Published
- 2016
10. A Tool for Subjective and Interactive Visual Data Exploration
- Author
-
Jefrey Lijffijt, Bo Kang, Kai Puolamäki, and Tijl De Bie
- Subjects
Clustering high-dimensional data ,Visual analytics ,Technology and Engineering ,Data exploration ,Exploratory Data Mining ,Computer science ,business.industry ,Dimensionality reduction ,Contrast (statistics) ,02 engineering and technology ,computer.software_genre ,Set (abstract data type) ,Data visualization ,Human–computer interaction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Data mining ,business ,Projection (set theory) ,computer ,Subjective Interestingness ,Dimensionality Reduction ,Data Visualisation - Abstract
We present SIDE, a tool for Subjective and Interactive Visual Data Exploration, which lets users explore high dimensional data via subjectively informative 2D data visualizations. Many existing visual analytics tools are either restricted to specific problems and domains or they aim to find visualizations that align with user’s belief about the data. In contrast, our generic tool computes data visualizations that are surprising given a user’s current understanding of the data. The user’s belief state is represented as a set of projection tiles. Hence, this user-awareness offers users an efficient way to interactively explore yet-unknown features of complex high dimensional datasets.
- Published
- 2016
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.