Author: "Latré, Steven" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Latré, Steven"' showing total 411 results

1. Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization

Author: Deckers, Lucas, Vandersmissen, Benjamin, Tsang, Ing Jyh, Van Leekwijck, Werner, and Latré, Steven
Subjects: Computer Science - Neural and Evolutionary Computing
Abstract: The proliferation of Artificial Neural Networks (ANNs) has led to increased energy consumption, raising concerns about their sustainability. Spiking Neural Networks (SNNs), which are inspired by biological neural systems and operate using sparse, event-driven spikes to communicate information between neurons, offer a potential solution due to their lower energy requirements. An alternative technique for reducing a neural network's footprint is quantization, which compresses weight representations to decrease memory usage and energy consumption. In this study, we present Twin Network Augmentation (TNA), a novel training framework aimed at improving the performance of SNNs while also facilitating enhanced compression through low-precision quantization of weights. TNA involves co-training an SNN with a twin network, optimizing both networks to minimize their cross-entropy losses and the mean squared error between their output logits. We demonstrate that TNA significantly enhances classification performance across various vision datasets and is particularly effective when reducing SNNs to ternary weight precision. Notably, during inference, only the ternary SNN is retained, significantly reducing the network in number of neurons, connectivity and weight size representation. Our results show that TNA outperforms traditional knowledge distillation methods and achieves state-of-the-art performance for the evaluated network architecture on benchmark datasets, including CIFAR-10, CIFAR-100, and CIFAR-10-DVS. This paper underscores the effectiveness of TNA in bridging the performance gap between SNNs and ANNs and suggests further exploration into the application of TNA in different network architectures and datasets.
Published: 2024

2. Designing, Developing, and Validating Network Intelligence for Scaling in Service-Based Architectures based on Deep Reinforcement Learning

Author: Soto, Paola, Camelo, Miguel, De Vleeschauwer, Danny, De Bock, Yorick, Slamnik-Kriještorac, Nina, Chang, Chia-Yu, Gaviria, Natalia, Mannens, Erik, Botero, Juan F., and Latré, Steven
Subjects: Computer Science - Networking and Internet Architecture
Abstract: Automating network processes without human intervention is crucial for the complex Sixth Generation (6G) environment. Thus, 6G networks must advance beyond basic automation, relying on Artificial Intelligence (AI) and Machine Learning (ML) for self-optimizing and autonomous operation. This requires zero-touch management and orchestration, the integration of Network Intelligence (NI) into the network architecture, and the efficient lifecycle management of intelligent functions. Despite its potential, integrating NI poses challenges in model development and application. To tackle those issues, this paper presents a novel methodology to manage the complete lifecycle of Reinforcement Learning (RL) applications in networking, thereby enhancing existing Machine Learning Operations (MLOps) frameworks to accommodate RL-specific tasks. We focus on scaling computing resources in service-based architectures, modeling the problem as a Markov Decision Process (MDP). Two RL algorithms, guided by distinct Reward Functions (RFns), are proposed to autonomously determine the number of service replicas in dynamic environments. Our proposed methodology is anchored on a dual approach: firstly, it evaluates the training performance of these algorithms under varying RFns, and secondly, it validates their performance after being trained to discern the practical applicability in real-world settings. We show that, despite significant progress, the development stage of RL techniques for networking applications, particularly in scaling scenarios, still leaves room for significant improvements. This study underscores the importance of ongoing research and development to enhance the practicality and resilience of RL techniques in real-world networking environments.
Published: 2024

3. Inferring the relationship between soil temperature and the normalized difference vegetation index with machine learning

Author: Mortier, Steven, Hamedpour, Amir, Bussmann, Bart, Wandji, Ruth Phoebe Tchana, Latré, Steven, Sigurdsson, Bjarni D., De Schepper, Tom, and Verdonck, Tim
Subjects: Computer Science - Machine Learning
Abstract: Changes in climate can greatly affect the phenology of plants, which can have important feedback effects, such as altering the carbon cycle. These phenological feedback effects are often induced by a shift in the start or end dates of the growing season of plants. The normalized difference vegetation index (NDVI) serves as a straightforward indicator for assessing the presence of green vegetation and can also provide an estimation of the plants' growing season. In this study, we investigated the effect of soil temperature on the timing of the start of the season (SOS), timing of the peak of the season (POS), and the maximum annual NDVI value (PEAK) in subarctic grassland ecosystems between 2014 and 2019. We also explored the impact of other meteorological variables, including air temperature, precipitation, and irradiance, on the inter-annual variation in vegetation phenology. Using machine learning (ML) techniques and SHapley Additive exPlanations (SHAP) values, we analyzed the relative importance and contribution of each variable to the phenological predictions. Our results reveal a significant relationship between soil temperature and SOS and POS, indicating that higher soil temperatures lead to an earlier start and peak of the growing season. However, the Peak NDVI values showed just a slight increase with higher soil temperatures. The analysis of other meteorological variables demonstrated their impacts on the inter-annual variation of the vegetation phenology. Ultimately, this study contributes to our knowledge of the relationships between soil temperature, meteorological variables, and vegetation phenology, providing valuable insights for predicting vegetation phenology characteristics and managing subarctic grasslands in the face of climate change. Additionally, this work provides a solid foundation for future ML-based vegetation phenology studies.
Comment: 31 pages, 7 figures, 5 tables
Published: 2023
Full Text: View/download PDF

4. An Encoding Framework for Binarized Images using HyperDimensional Computing

Author: Smets, Laura, Van Leekwijck, Werner, Tsang, Ing Jyh, and Latré, Steven
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Hyperdimensional Computing (HDC) is a brain-inspired and light-weight machine learning method. It has received significant attention in the literature as a candidate to be applied in the wearable internet of things, near-sensor artificial intelligence applications and on-device processing. HDC is computationally less complex than traditional deep learning algorithms and typically achieves moderate to good classification performance. A key aspect that determines the performance of HDC is the encoding of the input data to the hyperdimensional (HD) space. This article proposes a novel light-weight approach relying only on native HD arithmetic vector operations to encode binarized images that preserves similarity of patterns at nearby locations by using point of interest selection and local linear mapping. The method reaches an accuracy of 97.35% on the test set for the MNIST data set and 84.12% for the Fashion-MNIST data set. These results outperform other studies using baseline HDC with different encoding approaches and are on par with more complex hybrid HDC models. The proposed encoding approach also demonstrates a higher robustness to noise and blur compared to the baseline encoding.
Published: 2023
Full Text: View/download PDF

5. Training a HyperDimensional Computing Classifier using a Threshold on its Confidence

Author: Smets, Laura, Van Leekwijck, Werner, Tsang, Ing Jyh, and Latré, Steven
Subjects: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing
Abstract: Hyperdimensional computing (HDC) has become popular for light-weight and energy-efficient machine learning, suitable for wearable Internet-of-Things (IoT) devices and near-sensor or on-device processing. HDC is computationally less complex than traditional deep learning algorithms and achieves moderate to good classification performance. This article proposes to extend the training procedure in HDC by taking into account not only wrongly classified samples, but also samples that are correctly classified by the HDC model but with low confidence. As such, a confidence threshold is introduced that can be tuned for each dataset to achieve the best classification accuracy. The proposed training procedure is tested on the UCIHAR, CTG, ISOLET and HAND datasets, for which the performance consistently improves compared to the baseline across a range of confidence threshold values. The extended training procedure also results in a shift towards higher confidence values of the correctly classified samples, making the classifier not only more accurate but also more confident about its predictions.
Published: 2023

6. Deep set conditioned latent representations for action recognition

Author: Singh, Akash, De Schepper, Tom, Mets, Kevin, Hellinckx, Peter, Oramas, José, and Latré, Steven
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent years, multi-label, multi-class video action recognition has gained significant popularity. While reasoning over temporally connected atomic actions is mundane for intelligent species, standard artificial neural networks (ANNs) still struggle to classify them. In the real world, atomic actions often temporally connect to form more complex composite actions. The challenge lies in recognising composite actions of varying durations while other distinct composite or atomic actions occur in the background. Drawing upon the success of relational networks, we propose methods that learn to reason over the semantic concept of objects and actions. We empirically show how ANNs benefit from pretraining, relational inductive biases and unordered set-based latent representations. In this paper we propose deep set conditioned I3D (SCI3D), a two stream relational network that employs latent representation of state and visual representation for reasoning over events and actions. They learn to reason about temporally connected actions in order to identify all of them in the video. The proposed method achieves an improvement of around 1.49% mAP in atomic action recognition and 17.57% mAP in composite action recognition, over an I3D-NL baseline, on the CATER dataset.
Comment: Conference VISAPP 2022, 11 pages, 5 figures, 2 tables, 6 plots
Published: 2022
Full Text: View/download PDF

7. Structured Exploration Through Instruction Enhancement for Object Navigation

Author: Hutsebaut-Buysse, Matthias, Mets, Kevin, De Schepper, Tom, and Latré, Steven
Subjects: Computer Science - Artificial Intelligence
Abstract: Finding an object of a specific class in an unseen environment remains an unsolved navigation problem. Hence, we propose a hierarchical learning-based method for object navigation. The top-level is capable of high-level planning, and building a memory on a floorplan-level (e.g., which room makes the most sense for the agent to visit next, where has the agent already been?), while the lower-level is tasked with efficiently navigating between rooms and looking for objects in them. Instructions can be provided to the agent using a simple synthetic language. The top-level intelligently enhances the instructions in order to make the overall task more tractable. Language grounding, mapping instructions to visual observations, is performed by utilizing an additional separate supervised trained goal assessment module. We demonstrate the effectiveness of our method on a dynamic configurable domestic environment.
Comment: Paper accepted to the BNAIC/BeNeLearn 2022 conference
Published: 2022

8. An Analysis of Discretization Methods for Communication Learning with Multi-Agent Reinforcement Learning

Author: Vanneste, Astrid, Vanneste, Simon, Mets, Kevin, De Schepper, Tom, Mercelis, Siegfried, Latré, Steven, and Hellinckx, Peter
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: Communication is crucial in multi-agent reinforcement learning when agents are not able to observe the full state of the environment. The most common approach to allow learned communication between agents is the use of a differentiable communication channel that allows gradients to flow between agents as a form of feedback. However, this is challenging when we want to use discrete messages to reduce the message size since gradients cannot flow through a discrete communication channel. Previous work proposed methods to deal with this problem. However, these methods are tested in different communication learning architectures and environments, making it hard to compare them. In this paper, we compare several state-of-the-art discretization methods as well as two methods that have not been used for communication learning before. We do this comparison in the context of communication learning using gradients from other agents and perform tests on several environments. Our results show that none of the methods is best in all environments. The best choice in discretization method greatly depends on the environment. However, the discretize regularize unit (DRU), straight through DRU and the straight through Gumbel softmax show the most consistent results across all the tested environments. Therefore, these methods prove to be the best choice for general use while the straight through estimator and the Gumbel softmax may provide better results in specific environments but fail completely in others.
Comment: Accepted at Adaptive and Learning Agents Workshop (ALA 2022) https://ala2022.github.io/
Published: 2022

9. Inferring the relationship between soil temperature and the normalized difference vegetation index with machine learning

Author: Mortier, Steven, Hamedpour, Amir, Bussmann, Bart, Wandji, Ruth Phoebe Tchana, Latré, Steven, Sigurdsson, Bjarni D., De Schepper, Tom, and Verdonck, Tim
Published: 2024
Full Text: View/download PDF

10. Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System

Author: Vanneste, Simon, de Borrekens, Gauthier, Bosmans, Stig, Vanneste, Astrid, Mets, Kevin, Mercelis, Siegfried, Latré, Steven, and Hellinckx, Peter
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: Recent work in multi-agent reinforcement learning has investigated inter-agent communication which is learned simultaneously with the action policy in order to improve the team reward. In this paper, we investigate independent Q-learning (IQL) without communication and differentiable inter-agent learning (DIAL) with learned communication on an adaptive traffic control system (ATCS). In real-world ATCS, it is impossible to present the full state of the environment to every agent, so in our simulation the individual agents will only have a limited observation of the full state of the environment. The ATCS will be simulated using the Simulation of Urban MObility (SUMO) traffic simulator in which two connected intersections are simulated. Every intersection is controlled by an agent which has the ability to change the direction of the traffic flow. Our results show that a DIAL agent outperforms an independent Q-learner on both training time and on maximum achieved reward as it is able to share relevant information with the other agents.
Published: 2021
Full Text: View/download PDF

11. Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning

Author: Vanneste, Astrid, Van Wijnsberghe, Wesley, Vanneste, Simon, Mets, Kevin, Mercelis, Siegfried, Latré, Steven, and Hellinckx, Peter
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: By using communication between multiple agents in multi-agent environments, one can reduce the effects of partial observability by combining one agent's observation with that of others in the same dynamic environment. While a lot of successful research has been done towards communication learning in cooperative settings, communication learning in mixed cooperative-competitive settings is also important and brings its own complexities such as the opposing team overhearing the communication. In this paper, we apply differentiable inter-agent learning (DIAL), designed for cooperative settings, to a mixed cooperative-competitive setting. We look at the difference in performance between communication that is private for a team and communication that can be overheard by the other team. Our research shows that communicating agents are able to achieve similar performance to fully observable agents after a given training period in our chosen environment. Overall, we find that sharing communication across teams results in decreased performance for the communicating team in comparison to results achieved with private communication.
Published: 2021
Full Text: View/download PDF

12. A Review of the Deep Sea Treasure problem as a Multi-Objective Reinforcement Learning Benchmark

Author: Cassimon, Amber, Eyckerman, Reinout, Mercelis, Siegfried, Latré, Steven, and Hellinckx, Peter
Subjects: Computer Science - Machine Learning
Abstract: In this paper, the authors investigate the Deep Sea Treasure (DST) problem as proposed by Vamplew et al. Through a number of proofs, the authors show the original DST problem to be quite basic, and not always representative of practical Multi-Objective Optimization problems. In an attempt to bring theory closer to practice, the authors propose an alternative, improved version of the DST problem, and prove that some of the properties that simplify the original DST problem no longer hold. The authors also provide a reference implementation and perform a comparison between their implementation, and other existing open-source implementations of the problem. Finally, the authors also provide a complete Pareto-front for their new DST problem.
Comment: 10 pages, 4 figures; Fixed Supplementary Materials PDF
Published: 2021

13. An optimized application-context relocation approach for Connected and Automated Mobility (CAM)

Author: Slamnik-Kriještorac, Nina, Latré, Steven, and Marquez-Barja, Johann M.
Subjects: Computer Science - Networking and Internet Architecture
Abstract: In this paper, we study and present a management and orchestration framework for vehicular communications, which enables service continuity for the vehicle via an optimized application-context relocation approach. To optimize the transfer of the application-context for Connected and Automated Mobility (CAM) services, our MEC orchestrator performs prediction of resource availability in the edge infrastructure based on the Long Short-Term Memory (LSTM) model, and it makes a final decision on relocation by calculating the outcome of a Multi-Criteria Decision-Making (MCDM) algorithm, taking into account the i) resource prediction, ii) latency and bandwidth on the communication links, and iii) geographical locations of the vehicle and edge hosts in the network infrastructure. Furthermore, we have built a proof-of-concept for the orchestration framework in a real-life distributed testbed environment, to showcase the efficiency in optimizing the edge host selection and application-context relocation towards achieving continuity of a service that informs the vehicle about the driving conditions on the road.
Published: 2021

14. Noise robustness of persistent homology on greyscale images, across filtrations and signatures

Author: Turkeš, Renata, Nys, Jannes, Verdonck, Tim, and Latré, Steven
Subjects: Mathematics - Algebraic Topology
Abstract: Topological data analysis is a recent and fast growing field that approaches the analysis of datasets using techniques from (algebraic) topology. Its main tool, persistent homology (PH), has seen a notable increase in applications in the last decade. Often cited as the most favourable property of PH and the main reason for practical success are the stability theorems that give theoretical results about noise robustness, since real data is typically contaminated with noise or measurement errors. However, little attention has been paid to what these stability theorems mean in practice. To gain some insight into this question, we evaluate the noise robustness of PH on the MNIST dataset of greyscale images. More precisely, we investigate to what extent PH changes under typical forms of image noise, and quantify the loss of performance in classifying the MNIST handwritten digits when noise is added to the data. The results show that the sensitivity to noise of PH is influenced by the choice of filtrations and persistence signatures (respectively the input and output of PH), and in particular, that PH features are often not robust to noise in a classification task.
Comment: 24 pages, 7 figures, 4 tables
Published: 2021
Full Text: View/download PDF

15. Towards Personalised Performance Prediction in Road Cycling Through Machine Learning

Author: Kholkine, Leonid, de Leeuw, Arie-Willem, Verdonck, Tim, Latré, Steven, Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Baca, Arnold, editor, and Exel, Juliana, editor
Published: 2023
Full Text: View/download PDF

16. Exploiting sensor data in professional road cycling: personalized data-driven approach for frequent fitness monitoring

Author: de Leeuw, Arie-Willem, Heijboer, Mathieu, Verdonck, Tim, Knobbe, Arno, and Latré, Steven
Published: 2023
Full Text: View/download PDF

17. Neural Additive Vector Autoregression Models for Causal Discovery in Time Series

Author: Bussmann, Bart, Nys, Jannes, and Latré, Steven
Subjects: Computer Science - Machine Learning
Abstract: Causal structure discovery in complex dynamical systems is an important challenge for many scientific domains. Although data from (interventional) experiments is usually limited, large amounts of observational time series data sets are usually available. Current methods that learn causal structure from time series often assume linear relationships. Hence, they may fail in realistic settings that contain nonlinear relations between the variables. We propose Neural Additive Vector Autoregression (NAVAR) models, a neural approach to causal structure learning that can discover nonlinear relationships. We train deep neural networks that extract the (additive) Granger causal influences from the time evolution in multi-variate time series. The method achieves state-of-the-art results on various benchmark data sets for causal discovery, while providing clear interpretations of the mapped causal relations.
Comment: 11 pages, 5 figures
Published: 2020
Full Text: View/download PDF

18. HTMRL: Biologically Plausible Reinforcement Learning with Hierarchical Temporal Memory

Author: Struye, Jakob, Mets, Kevin, and Latré, Steven
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Building Reinforcement Learning (RL) algorithms which are able to adapt to continuously evolving tasks is an open research challenge. One technology that is known to inherently handle such non-stationary input patterns well is Hierarchical Temporal Memory (HTM), a general and biologically plausible computational model for the human neocortex. As the RL paradigm is inspired by human learning, HTM is a natural framework for an RL algorithm supporting non-stationary environments. In this paper, we present HTMRL, the first strictly HTM-based RL algorithm. We empirically and statistically show that HTMRL scales to many states and actions, and demonstrate that HTM's ability for adapting to changing patterns extends to RL. Specifically, HTMRL performs well on a 10-armed bandit after 750 steps, but only needs a third of that to adapt to the bandit suddenly shuffling its arms. HTMRL is the first iteration of a novel RL approach, with the potential of extending to a capable algorithm for Meta-RL.
Published: 2020

19. Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Author: Hutsebaut-Buysse, Matthias, Mets, Kevin, and Latré, Steven
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Reinforcement learning (RL) algorithms typically start tabula rasa, without any prior knowledge of the environment, and without any prior skills. This however often leads to low sample efficiency, requiring a large amount of interaction with the environment. This is especially true in a lifelong learning setting, in which the agent needs to continually extend its capabilities. In this paper, we examine how a pre-trained task-independent language model can make a goal-conditional RL agent more sample efficient. We do this by facilitating transfer learning between different related tasks. We experimentally demonstrate our approach on a set of object navigation tasks.
Comment: Paper accepted to the ICML 2020 Language in Reinforcement Learning (LaReL) Workshop
Published: 2020

20. Learning to Communicate Using Counterfactual Reasoning

Author: Vanneste, Simon, Vanneste, Astrid, Mets, Kevin, De Schepper, Tom, Anwar, Ali, Mercelis, Siegfried, Latré, Steven, and Hellinckx, Peter
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems, Statistics - Machine Learning
Abstract: Learning to communicate in order to share state information is an active problem in the area of multi-agent reinforcement learning (MARL). The credit assignment problem, the non-stationarity of the communication environment and the creation of influenceable agents are major challenges within this research field which need to be overcome in order to learn a valid communication protocol. This paper introduces the novel multi-agent counterfactual communication learning (MACC) method which adapts counterfactual reasoning in order to overcome the credit assignment problem for communicating agents. Secondly, the non-stationarity of the communication environment while learning the communication Q-function is overcome by creating the communication Q-function using the action policy of the other agents and the Q-function of the action environment. Additionally, a social loss function is introduced in order to create influenceable agents which is required to learn a valid communication protocol. Our experiments show that MACC is able to outperform the state-of-the-art baselines in four different scenarios in the Particle environment.
Comment: Accepted at Adaptive and Learning Agents Workshop (ALA 2022) https://ala2022.github.io/
Published: 2020

21. Neurosciences and 6G: Lessons from and Needs of Communicative Brains

Author: Moioli, Renan C., Nardelli, Pedro H. J., Barros, Michael Taynnan, Saad, Walid, Hekmatmanesh, Amin, Gória, Pedro, de Sena, Arthur S., Dzaferagic, Merim, Siljak, Harun, van Leekwijck, Werner, Carrillo, Dick, and Latré, Steven
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Emerging Technologies, Computer Science - Information Theory, Quantitative Biology - Neurons and Cognition
Abstract: This paper presents the first comprehensive tutorial on a promising research field located at the frontier of two well-established domains: Neurosciences and wireless communications, motivated by the ongoing efforts to define what the sixth generation of mobile networks (6G) will be. In particular, this tutorial first provides a novel integrative approach that bridges the gap between these two seemingly disparate fields. Then, we present the state-of-the-art and key challenges of these two topics. In particular, we propose a novel systematization that divides the contributions into two groups, one focused on what neurosciences will offer to 6G in terms of new applications and systems architecture (Neurosciences for Wireless), and the other focused on how wireless communication theory and 6G systems can provide new ways to study the brain (Wireless for Neurosciences). For the first group, we concretely explain how current scientific understanding of the brain would enable new applications for 6G within the context of a new type of service that we dub brain-type communications and that has more stringent requirements than human- and machine-type communication. In this regard, we expose the key requirements of brain-type communication services and we discuss how future wireless networks can be equipped to deal with such services. Meanwhile, for the second group, we thoroughly explore modern communication system paradigms, including Internet of Bio-nano Things and chaos-based communications, in addition to highlighting how complex systems tools can help bridge 6G and neuroscience applications. Brain-controlled vehicles are then presented as our case study. All in all, this tutorial is expected to provide a largely missing articulation between these two emerging fields while delineating concrete ways to move forward in such an interdisciplinary endeavor.
Published: 2020

22. Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning

Author: Hutsebaut-Buysse, Matthias, Mets, Kevin, and Latré, Steven
Subjects: Computer Science - Artificial Intelligence
Abstract: Over its lifetime, a reinforcement learning agent is often tasked with different tasks. How to efficiently adapt a previously learned control policy from one task to another, remains an open research question. In this paper, we investigate how instructions formulated in natural language can enable faster and more effective task adaptation. This can serve as the basis for developing language instructed skills, which can be used in a lifelong learning setting. Our method is capable of assessing, given a set of developed base control policies, which policy will adapt best to a new unseen task.
Published: 2019

23. Low Cost Player Tracking in Field Hockey

Author: Duarte Moura, Henrique, Kholkine, Leonid, Van Damme, Laurens, Mets, Kevin, Leysen, Christiaan, De Schepper, Tom, Hellinckx, Peter, Latré, Steven, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Brefeld, Ulf, editor, Davis, Jesse, editor, Van Haaren, Jan, editor, and Zimmermann, Albrecht, editor
Published: 2022
Full Text: View/download PDF

24. Object Detection with Semi-supervised Adversarial Domain Adaptation for Real-Time Edge Devices

Author: Billast, Mattias, De Schepper, Tom, Mets, Kevin, Hellinckx, Peter, Oramas, José, Latré, Steven, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Leiva, Luis A., editor, Pruski, Cédric, editor, Markovich, Réka, editor, Najjar, Amro, editor, and Schommer, Christoph, editor
Published: 2022
Full Text: View/download PDF

25. Task Independent Capsule-Based Agents for Deep Q-Learning

Author: Singh, Akash, De Schepper, Tom, Mets, Kevin, Hellinckx, Peter, Oramas, José, Latré, Steven, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Leiva, Luis A., editor, Pruski, Cédric, editor, Markovich, Réka, editor, Najjar, Amro, editor, and Schommer, Christoph, editor
Published: 2022
Full Text: View/download PDF

26. Quality-Aware Compression of Point Clouds with Google Draco

Author: de Hoog, Jens, Ahmed, Ahmed N., Anwar, Ali, Latré, Steven, Hellinckx, Peter, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, and Barolli, Leonard, editor
Published: 2022
Full Text: View/download PDF

27. Autonomous Building Control Using Offline Reinforcement Learning

Author: Schepers, Jorren, Eyckerman, Reinout, Elmaz, Furkan, Casteels, Wim, Latré, Steven, Hellinckx, Peter, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, and Barolli, Leonard, editor
Published: 2022
Full Text: View/download PDF

28. Lane Marking Detection Techniques for Autonomous Driving

Author: Ahmed, Ahmed N., Anwar, Ali, Eckelmann, Sven, Trautmann, Toralf, Latré, Steven, Hellinckx, Peter, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, and Barolli, Leonard, editor
Published: 2022
Full Text: View/download PDF

29. Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System

Author: Vanneste, Simon, de Borrekens, Gauthier, Bosmans, Stig, Vanneste, Astrid, Mets, Kevin, Mercelis, Siegfried, Latré, Steven, Hellinckx, Peter, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, and Barolli, Leonard, editor
Published: 2022
Full Text: View/download PDF

30. Transfer Learning in Autonomous Driving Using Real-World Samples

Author: Troch, Arne, Hoog, Jens de, Vanneste, Simon, Balemans, Dieter, Latré, Steven, Hellinckx, Peter, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, and Barolli, Leonard, editor
Published: 2022
Full Text: View/download PDF

31. Mixed Cooperative-Competitive Communication Using Multi-agent Reinforcement Learning

Author: Vanneste, Astrid, Van Wijnsberghe, Wesley, Vanneste, Simon, Mets, Kevin, Mercelis, Siegfried, Latré, Steven, Hellinckx, Peter, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, and Barolli, Leonard, editor
Published: 2022
Full Text: View/download PDF

32. Neural Additive Vector Autoregression Models for Causal Discovery in Time Series

Author: Bussmann, Bart, Nys, Jannes, Latré, Steven, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Soares, Carlos, editor, and Torgo, Luis, editor
Published: 2021
Full Text: View/download PDF

33. Disagreement Options: Task Adaptation Through Temporally Extended Actions

Author: Hutsebaut-Buysse, Matthias, Schepper, Tom De, Mets, Kevin, Latré, Steven, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Oliver, Nuria, editor, Pérez-Cruz, Fernando, editor, Kramer, Stefan, editor, Read, Jesse, editor, and Lozano, Jose A., editor
Published: 2021
Full Text: View/download PDF

34. CNN-LSTM architecture for predictive indoor temperature modeling

Author: Elmaz, Furkan, Eyckerman, Reinout, Casteels, Wim, Latré, Steven, and Hellinckx, Peter
Published: 2021
Full Text: View/download PDF

35. Co-learning synaptic delays, weights and adaptation in spiking neural networks

Author: Deckers, Lucas, primary, Van Damme, Laurens, additional, Van Leekwijck, Werner, additional, Tsang, Ing Jyh, additional, and Latré, Steven, additional
Published: 2024
Full Text: View/download PDF

36. Missing Value Imputation of Wireless Sensor Data for Environmental Monitoring

Author: Decorte, Thomas, primary, Mortier, Steven, additional, Lembrechts, Jonas J., additional, Meysman, Filip J. R., additional, Latré, Steven, additional, Mannens, Erik, additional, and Verdonck, Tim, additional
Published: 2024
Full Text: View/download PDF

37. A Machine Learning Approach for Road Cycling Race Performance Prediction

Author: Kholkine, Leonid, De Schepper, Tom, Verdonck, Tim, Latré, Steven, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Brefeld, Ulf, editor, Davis, Jesse, editor, Van Haaren, Jan, editor, and Zimmermann, Albrecht, editor
Published: 2020
Full Text: View/download PDF

38. Reliability Models for Multi-Objective Design Problems

Author: Papadimitriou, Dimitri, Latre, Steven, Tomaszewski, Artur, Rak, Jacek, Series Editor, Sammes, A. J., Series Editor, Kantarci, Burak, Editorial Board Member, Oki, Eiji, Editorial Board Member, Popescu, Adrian, Editorial Board Member, Shen, Gangxiang, Editorial Board Member, and Hutchison, David, editor
Published: 2020
Full Text: View/download PDF

39. Autonomous Building Control Using Offline Reinforcement Learning

Author: Schepers, Jorren, primary, Eyckerman, Reinout, additional, Elmaz, Furkan, additional, Casteels, Wim, additional, Latré, Steven, additional, and Hellinckx, Peter, additional
Published: 2021
Full Text: View/download PDF

40. Mixed Cooperative-Competitive Communication Using Multi-agent Reinforcement Learning

Author: Vanneste, Astrid, primary, Van Wijnsberghe, Wesley, additional, Vanneste, Simon, additional, Mets, Kevin, additional, Mercelis, Siegfried, additional, Latré, Steven, additional, and Hellinckx, Peter, additional
Published: 2021
Full Text: View/download PDF

41. Lane Marking Detection Techniques for Autonomous Driving

Author: Ahmed, Ahmed N., primary, Anwar, Ali, additional, Eckelmann, Sven, additional, Trautmann, Toralf, additional, Latré, Steven, additional, and Hellinckx, Peter, additional
Published: 2021
Full Text: View/download PDF

42. Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System

Author: Vanneste, Simon, primary, de Borrekens, Gauthier, additional, Bosmans, Stig, additional, Vanneste, Astrid, additional, Mets, Kevin, additional, Mercelis, Siegfried, additional, Latré, Steven, additional, and Hellinckx, Peter, additional
Published: 2021
Full Text: View/download PDF

43. Transfer Learning in Autonomous Driving Using Real-World Samples

Author: Troch, Arne, primary, Hoog, Jens de, additional, Vanneste, Simon, additional, Balemans, Dieter, additional, Latré, Steven, additional, and Hellinckx, Peter, additional
Published: 2021
Full Text: View/download PDF

44. An encoding framework for binarized images using hyperdimensional computing

Author: Smets, Laura, Van Leekwijck, Werner, Tsang, Ing Jyh, and Latré, Steven
Published: 2024
Full Text: View/download PDF

45. Evaluating Deep Neural Networks to Classify Modulated and Coded Radio Signals

Author: Cheong, Phui San, Camelo, Miguel, Latré, Steven, Akan, Ozgur, Series Editor, Bellavista, Paolo, Series Editor, Cao, Jiannong, Series Editor, Coulson, Geoffrey, Series Editor, Dressler, Falko, Series Editor, Ferrari, Domenico, Series Editor, Gerla, Mario, Series Editor, Kobayashi, Hisashi, Series Editor, Palazzo, Sergio, Series Editor, Sahni, Sartaj, Series Editor, Shen, Xuemin (Sherman), Series Editor, Stan, Mircea, Series Editor, Xiaohua, Jia, Series Editor, Zomaya, Albert Y., Series Editor, Moerman, Ingrid, editor, Marquez-Barja, Johann, editor, Shahid, Adnan, editor, Liu, Wei, editor, Giannoulis, Spilios, editor, and Jiao, Xianjun, editor
Published: 2019
Full Text: View/download PDF

46. IEEE 802.11 Latency Modeling with Non-IEEE 802.11 Interfering Source

Author: Bosch, Patrick, Latré, Steven, Blondia, Chris, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Di Felice, Marco, editor, Natalizio, Enrico, editor, Bruno, Raffaele, editor, and Kassler, Andreas, editor
Published: 2019
Full Text: View/download PDF

47. ALPACA: A PCI Assignment Algorithm Taking Advantage of Weighted ANR

Author: Zeljković, Ensar, Gogos, Dimitris, Dox, Gerwin, Latré, Steven, and Marquez-Barja, Johann M.
Published: 2022
Full Text: View/download PDF

48. Delay-constrained NFV orchestration for heterogeneous cloud networks

Author: Spinnewyn, Bart, Latré, Steven, and Botero, Juan Felipe
Published: 2020
Full Text: View/download PDF

49. Hierarchical temporal memory and recurrent neural networks for time series prediction: An empirical validation and reduction to multilayer perceptrons

Author: Struye, Jakob and Latré, Steven
Published: 2020
Full Text: View/download PDF

50. Comparative study of a cubic, Kelvin and Weaire-Phelan unit cell for the prediction of the thermal conductivity of low density silica aerogels

Author: Latré, Steven K., De Pooter, Steve, Buffel, Bart, Brabazon, Dermot, Seveno, David, and Desplentere, Frederik
Published: 2020
Full Text: View/download PDF
