Author: "Hügle, Maria" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Hügle, Maria"' showing total 15 results

1. Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways

Author: Mirchevska, Branka, Hügle, Maria, Kalweit, Gabriel, Werling, Moritz, and Boedecker, Joschka
Subjects: Computer Science - Machine Learning, Computer Science - Robotics
Abstract: Well-established optimization-based methods can guarantee an optimal trajectory for a short optimization horizon, typically no longer than a few seconds. As a result, choosing the optimal trajectory for this short horizon may still result in a sub-optimal long-term solution. At the same time, the resulting short-term trajectories allow for effective, comfortable and provably safe maneuvers in a dynamic traffic environment. In this work, we address the question of how to ensure an optimal long-term driving strategy, while keeping the benefits of classical trajectory planning. We introduce a Reinforcement Learning based approach that, coupled with a trajectory planner, learns an optimal long-term decision-making strategy for driving on highways. By online generating locally optimal maneuvers as actions, we balance between the infinite low-level continuous action space, and the limited flexibility of a fixed number of predefined standard lane-change actions. We evaluated our method on realistic scenarios in the open-source traffic simulator SUMO and were able to achieve better performance than the 4 benchmark approaches we compared against, including a random action selecting agent, greedy agent, high-level, discrete actions agent and an IDM-based SUMO-controlled agent.
Published: 2020

2. A Dynamic Deep Neural Network For Multimodal Clinical Data Analysis

Author: Hügle, Maria, Kalweit, Gabriel, Huegle, Thomas, and Boedecker, Joschka
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Clinical data from electronic medical records, registries or trials provide a large source of information to apply machine learning methods in order to foster precision medicine, e.g. by finding new disease phenotypes or performing individual disease prediction. However, to take full advantage of deep learning methods on clinical data, architectures are necessary that 1) are robust with respect to missing and wrong values, and 2) can deal with highly variable-sized lists and long-term dependencies of individual diagnosis, procedures, measurements and medication prescriptions. In this work, we elaborate limitations of fully-connected neural networks and classical machine learning methods in this context and propose AdaptiveNet, a novel recurrent neural network architecture, which can deal with multiple lists of different events, alleviating the aforementioned limitations. We employ the architecture to the problem of disease progression prediction in rheumatoid arthritis using the Swiss Clinical Quality Management registry, which contains over 10,000 patients and more than 65,000 patient visits. Our proposed approach leads to more compact representations and outperforms the classical baselines.
Comment: Accepted at the AAAI 2020 International Workshop on Health Intelligence
Published: 2020
Full Text: View/download PDF

3. Deep Inverse Q-learning with Constraints

Author: Kalweit, Gabriel, Huegle, Maria, Werling, Moritz, and Boedecker, Joschka
Subjects: Computer Science - Machine Learning, Computer Science - Robotics, Statistics - Machine Learning
Abstract: Popular Maximum Entropy Inverse Reinforcement Learning approaches require the computation of expected state visitation frequencies for the optimal policy under an estimate of the reward function. This usually requires intermediate value estimation in the inner loop of the algorithm, slowing down convergence considerably. In this work, we introduce a novel class of algorithms that only needs to solve the MDP underlying the demonstrated behavior once to recover the expert policy. This is possible through a formulation that exploits a probabilistic behavior assumption for the demonstrations within the structure of Q-learning. We propose Inverse Action-value Iteration which is able to fully recover an underlying reward of an external agent in closed-form analytically. We further provide an accompanying class of sampling-based variants which do not depend on a model of the environment. We show how to extend this class of algorithms to continuous state-spaces via function approximation and how to estimate a corresponding action-value function, leading to a policy as close as possible to the policy of the external agent, while optionally satisfying a list of predefined hard constraints. We evaluate the resulting algorithms called Inverse Action-value Iteration, Inverse Q-learning and Deep Inverse Q-learning on the Objectworld benchmark, showing a speedup of up to several orders of magnitude compared to (Deep) Max-Entropy algorithms. We further apply Deep Constrained Inverse Q-learning on the task of learning autonomous lane-changes in the open-source simulator SUMO achieving competent driving after training on data corresponding to 30 minutes of demonstrations.
Published: 2020

4. Deep Constrained Q-learning

Author: Kalweit, Gabriel, Huegle, Maria, Werling, Moritz, and Boedecker, Joschka
Subjects: Computer Science - Machine Learning, Computer Science - Robotics, Statistics - Machine Learning
Abstract: In many real world applications, reinforcement learning agents have to optimize multiple objectives while following certain rules or satisfying a list of constraints. Classical methods based on reward shaping, i.e. a weighted combination of different objectives in the reward signal, or Lagrangian methods, including constraints in the loss function, have no guarantees that the agent satisfies the constraints at all points in time and can lead to undesired behavior. When a discrete policy is extracted from an action-value function, safe actions can be ensured by restricting the action space at maximization, but can lead to sub-optimal solutions among feasible alternatives. In this work, we propose Constrained Q-learning, a novel off-policy reinforcement learning framework restricting the action space directly in the Q-update to learn the optimal Q-function for the induced constrained MDP and the corresponding safe policy. In addition to single-step constraints referring only to the next action, we introduce a formulation for approximate multi-step constraints under the current target policy based on truncated value-functions. We analyze the advantages of Constrained Q-learning in the tabular case and compare Constrained DQN to reward shaping and Lagrangian methods in the application of high-level decision making in autonomous driving, considering constraints for safety, keeping right and comfort. We train our agent in the open-source simulator SUMO and on the real HighD data set.
Published: 2020

5. Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving

Author: Huegle, Maria, Kalweit, Gabriel, Werling, Moritz, and Boedecker, Joschka
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Robotics, Statistics - Machine Learning
Abstract: The common pipeline in autonomous driving systems is highly modular and includes a perception component which extracts lists of surrounding objects and passes these lists to a high-level decision component. In this case, leveraging the benefits of deep reinforcement learning for high-level decision making requires special architectures to deal with multiple variable-length sequences of different object types, such as vehicles, lanes or traffic signs. At the same time, the architecture has to be able to cover interactions between traffic participants in order to find the optimal action to be taken. In this work, we propose the novel Deep Scenes architecture, that can learn complex interaction-aware scene representations based on extensions of either 1) Deep Sets or 2) Graph Convolutional Networks. We present the Graph-Q and DeepScene-Q off-policy reinforcement learning algorithms, both outperforming state-of-the-art methods in evaluations with the publicly available traffic simulator SUMO.
Published: 2019

6. Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization

Author: Kalweit, Gabriel, Huegle, Maria, and Boedecker, Joschka
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: In the past few years, off-policy reinforcement learning methods have shown promising results in their application for robot control. Deep Q-learning, however, still suffers from poor data-efficiency and is susceptible to stochasticity in the environment or reward functions which is limiting with regard to real-world applications. We alleviate these problems by proposing two novel off-policy Temporal-Difference formulations: (1) Truncated Q-functions which represent the return for the first n steps of a target-policy rollout w.r.t. the full action-value and (2) Shifted Q-functions, acting as the farsighted return after this truncated rollout. This decomposition allows us to optimize both parts with their individual learning rates, achieving significant learning speedup. We prove that the combination of these short- and long-term predictions is a representation of the full return, leading to the Composite Q-learning algorithm. We show the efficacy of Composite Q-learning in the tabular case and compare Deep Composite Q-learning with TD3 and TD3(Delta), which we introduce as an off-policy variant of TD(Delta). Moreover, we show that Composite TD3 outperforms TD3 as well as state-of-the-art compositional Q-learning approaches significantly in terms of data-efficiency in multiple simulated robot tasks and that Composite Q-learning is robust to stochastic environments and reward functions.
Published: 2019

7. Dynamic Input for Deep Reinforcement Learning in Autonomous Driving

Author: Hügle, Maria, Kalweit, Gabriel, Mirchevska, Branka, Werling, Moritz, and Boedecker, Joschka
Subjects: Computer Science - Machine Learning, Computer Science - Robotics, Statistics - Machine Learning
Abstract: In many real-world decision making problems, reaching an optimal decision requires taking into account a variable number of objects around the agent. Autonomous driving is a domain in which this is especially relevant, since the number of cars surrounding the agent varies considerably over time and affects the optimal action to be taken. Classical methods that process object lists can deal with this requirement. However, to take advantage of recent high-performing methods based on deep reinforcement learning in modular pipelines, special architectures are necessary. For these, a number of options exist, but a thorough comparison of the different possibilities is missing. In this paper, we elaborate limitations of fully-connected neural networks and other established approaches like convolutional and recurrent neural networks in the context of reinforcement learning problems that have to deal with variable sized inputs. We employ the structure of Deep Sets in off-policy reinforcement learning for high-level decision making, highlight their capabilities to alleviate these limitations, and show that Deep Sets not only yield the best overall performance but also offer better generalization to unseen situations than the other approaches.
Comment: Accepted at IROS 2019
Published: 2019
Full Text: View/download PDF

8. Early Seizure Detection with an Energy-Efficient Convolutional Neural Network on an Implantable Microcontroller

Author: Hügle, Maria, Heller, Simon, Watter, Manuel, Blum, Manuel, Manzouri, Farrokh, Dümpelmann, Matthias, Schulze-Bonhage, Andreas, Woias, Peter, and Boedecker, Joschka
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning, Statistics - Applications
Abstract: Implantable, closed-loop devices for automated early detection and stimulation of epileptic seizures are promising treatment options for patients with severe epilepsy that cannot be treated with traditional means. Most approaches for early seizure detection in the literature are, however, not optimized for implementation on ultra-low power microcontrollers required for long-term implantation. In this paper we present a convolutional neural network for the early detection of seizures from intracranial EEG signals, designed specifically for this purpose. In addition, we investigate approximations to comply with hardware limits while preserving accuracy. We compare our approach to three previously proposed convolutional neural networks and a feature-based SVM classifier with respect to detection accuracy, latency and computational needs. Evaluation is based on a comprehensive database with long-term EEG recordings. The proposed method outperforms the other detectors with a median sensitivity of 0.96, false detection rate of 10.1 per hour and median detection delay of 3.7 seconds, while being the only approach suited to be realized on a low power microcontroller due to its parsimonious use of computational and memory resources.
Comment: Accepted at IJCNN 2018
Published: 2018
Full Text: View/download PDF

9. A Dynamic Deep Neural Network for Multimodal Clinical Data Analysis

Author: Hügle, Maria, Kalweit, Gabriel, Hügle, Thomas, Boedecker, Joschka, Kacprzyk, Janusz, Series Editor, Shaban-Nejad, Arash, editor, Michalowski, Martin, editor, and Buckeridge, David L., editor
Published: 2021
Full Text: View/download PDF

10. A Dynamic Deep Neural Network for Multimodal Clinical Data Analysis

Author: Hügle, Maria, primary, Kalweit, Gabriel, additional, Hügle, Thomas, additional, and Boedecker, Joschka, additional
Published: 2020
Full Text: View/download PDF

11. Applied machine learning and artificial intelligence in rheumatology

Author: MS Reumatologie/Immunologie/Infectie, Infection & Immunity, Regenerative Medicine and Stem Cells, Hügle, Maria, Omoumi, Patrick, van Laar, Jacob M, Boedecker, Joschka, and Hügle, Thomas
Published: 2020

12. Personalized Prediction of Disease Activity in Patients with Rheumatoid Arthritis Using an Adaptive Deep Neural Network

Author: Hügle, Maria, primary, Walker, Ulrich A., additional, Finckh, Axel, additional, Müller, Rüdiger, additional, Kalweit, Gabriel, additional, Scherer, Almut, additional, Boedecker, Joschka, additional, and Hügle, Thomas, additional
Published: 2020
Full Text: View/download PDF

13. Applied machine learning and artificial intelligence in rheumatology

Author: Hügle, Maria, primary, Omoumi, Patrick, primary, van Laar, Jacob M, primary, Boedecker, Joschka, primary, and Hügle, Thomas, primary
Published: 2020
Full Text: View/download PDF

14. Reduction in ulnar pressure distribution when walking with forearm crutches with a novel cuff design: Cross-sectional intervention study on the biomechanical efficacy of an ulnar recess

Author: Molteni, Pascal, primary, Hügle, Thomas, additional, Hügle, Maria, additional, Nüesch, Corina, additional, and Mündermann, Annegret, additional
Published: 2016
Full Text: View/download PDF

15. Reduction in ulnar pressure distribution when walking with forearm crutches with a novel cuff design: Cross-sectional intervention study on the biomechanical efficacy of an ulnar recess.

Author: Molteni, Pascal, Hügle, Thomas, Hügle, Maria, Nüesch, Corina, and Mündermann, Annegret
Abstract: Walking with crutches is an effective way of reducing the load on the lower extremity and is often indicated after injury or surgery. However, walking with forearm crutches with conventional cuffs can trigger symptoms including tenosynovitis in the biceps tendon, ulnar neuropraxia at the wrist, pain, or skin hematoma. The purpose of this study was to test the hypothesis that a crutch cuff design with an ulnar recess reduces ulnar pressure during walking with forearm crutches. The pressure distribution between the forearm and crutch cuff was measured in 15 healthy participants for crutch walking with conventional and novel cuffs, respectively. Relative peak pressure in the proximal medial region compared to the overall peak pressure was reduced by 8.6% when walking with crutches with the novel cuff design compared to conventional cuffs (p < 0.001). Relative peak pressure in the distal intermediate and lateral regions was increased by 3.3% and 3.7% for the novel compared with conventional cuffs, respectively (p < 0.001 for both). Hence, the novel crutch cuffs shifted regions of high pressure away from the proximal ulnar region towards more distal regions that are covered by more soft tissue.
Published: 2018
Full Text: View/download PDF
