13 results on '"Rojas, Cristian R."'
Search Results
2. Optimal Transport for Correctional Learning
- Author
-
Winqvist, Rebecka, Lourenco, Inês, Quinzan, Francesco, Rojas, Cristian R., and Wahlberg, Bo
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,FOS: Electrical engineering, electronic engineering, information engineering ,Systems and Control (eess.SY) ,Electrical Engineering and Systems Science - Systems and Control ,Machine Learning (cs.LG) - Abstract
The contribution of this paper is a generalized formulation of correctional learning using optimal transport, which is about how to optimally transport one mass distribution to another. Correctional learning is a framework developed to enhance the accuracy of parameter estimation processes by means of a teacher-student approach. In this framework, an expert agent, referred to as the teacher, modifies the data used by a learning agent, known as the student, to improve its estimation process. The objective of the teacher is to alter the data such that the student's estimation error is minimized, subject to a fixed intervention budget. Compared to existing formulations of correctional learning, our novel optimal transport approach provides several benefits. It allows for the estimation of more complex characteristics as well as the consideration of multiple intervention policies for the teacher. We evaluate our approach on two theoretical examples, and on a human-robot interaction application in which the teacher's role is to improve the robots performance in an inverse reinforcement learning setting.
- Published
- 2023
- Full Text
- View/download PDF
3. Parsimonious Identification of Continuous-Time Systems: A Block-Coordinate Descent Approach
- Author
-
González, Rodrigo A., Rojas, Cristian R., Pan, Siqi, and Welsh, James S.
- Subjects
FOS: Electrical engineering, electronic engineering, information engineering ,Systems and Control (eess.SY) ,Electrical Engineering and Systems Science - Systems and Control - Abstract
The identification of electrical, mechanical, and biological systems using data can benefit greatly from prior knowledge extracted from physical modeling. Parametric continuous-time identification methods can naturally incorporate this knowledge, which leads to interpretable and parsimonious models. However, some applications lead to model structures that lack parsimonious descriptions using unfactored transfer functions, which are commonly used in standard direct approaches for continuous-time system identification. In this paper we characterize this parsimony problem, and develop a block-coordinate descent algorithm that delivers parsimonious models by sequentially estimating an additive decomposition of the transfer function of interest. Numerical simulations show the efficacy of the proposed approach., Comment: 6 pages, 3 figures
- Published
- 2023
- Full Text
- View/download PDF
4. Diagnosing and Augmenting Feature Representations in Correctional Inverse Reinforcement Learning
- Author
-
Lourenço, Inês, Bobu, Andreea, Rojas, Cristian R., and Wahlberg, Bo
- Subjects
FOS: Computer and information sciences ,Computer Science - Robotics ,FOS: Electrical engineering, electronic engineering, information engineering ,Systems and Control (eess.SY) ,Electrical Engineering and Systems Science - Systems and Control ,Robotics (cs.RO) - Abstract
Robots have been increasingly better at doing tasks for humans by learning from their feedback, but still often suffer from model misalignment due to missing or incorrectly learned features. When the features the robot needs to learn to perform its task are missing or do not generalize well to new settings, the robot will not be able to learn the task the human wants and, even worse, may learn a completely different and undesired behavior. Prior work shows how the robot can detect when its representation is missing some feature and can, thus, ask the human to be taught about the new feature; however, these works do not differentiate between features that are completely missing and those that exist but do not generalize to new environments. In the latter case, the robot would detect misalignment and simply learn a new feature, leading to an arbitrarily growing feature representation that can, in turn, lead to spurious correlations and incorrect learning down the line. In this work, we propose separating the two sources of misalignment: we propose a framework for determining whether a feature the robot needs is incorrectly learned and does not generalize to new environment setups vs. is entirely missing from the robot's representation. Once we detect the source of error, we show how the human can initiate the realignment process for the model: if the feature is missing, we follow prior work for learning new features; however, if the feature exists but does not generalize, we use data augmentation to expand its training and, thus, complete the correction. We demonstrate the proposed approach in experiments with a simulated 7DoF robot manipulator and physical human corrections., Comment: 8 pages, 4 figures
- Published
- 2023
- Full Text
- View/download PDF
5. A Statistical Decision-Theoretical Perspective on the Two-Stage Approach to Parameter Estimation
- Author
-
Lakshminarayanan, Braghadeesh and Rojas, Cristian R.
- Subjects
Methodology (stat.ME) ,FOS: Computer and information sciences ,Statistics - Machine Learning ,FOS: Electrical engineering, electronic engineering, information engineering ,Machine Learning (stat.ML) ,Systems and Control (eess.SY) ,Electrical Engineering and Systems Science - Systems and Control ,Statistics - Methodology - Abstract
One of the most important problems in system identification and statistics is how to estimate the unknown parameters of a given model. Optimization methods and specialized procedures, such as Empirical Minimization (EM) can be used in case the likelihood function can be computed. For situations where one can only simulate from a parametric model, but the likelihood is difficult or impossible to evaluate, a technique known as the Two-Stage (TS) Approach can be applied to obtain reliable parametric estimates. Unfortunately, there is currently a lack of theoretical justification for TS. In this paper, we propose a statistical decision-theoretical derivation of TS, which leads to Bayesian and Minimax estimators. We also show how to apply the TS approach on models for independent and identically distributed samples, by computing quantiles of the data as a first step, and using a linear function as the second stage. The proposed method is illustrated via numerical simulations., Comment: 7 pages, 6 figures, 1 table
- Published
- 2022
- Full Text
- View/download PDF
6. A teacher-student framework for online correctional learning
- Author
-
Louren��o, In��s, Winqvist, Rebecka, Rojas, Cristian R., and Wahlberg, Bo
- Subjects
FOS: Computer and information sciences ,ComputingMilieux_COMPUTERSANDEDUCATION ,FOS: Electrical engineering, electronic engineering, information engineering ,Systems and Control (eess.SY) ,Machine Learning (cs.LG) - Abstract
A classical learning setting is one in which a student collects data, or observations, about a system, and estimates a certain quantity of interest about it. Correctional learning is a type of cooperative teacher-student framework where a teacher, who has knowledge about the system, has the possibility to observe and alter (correct) the observations received by the student in order to improve its estimation. In this paper, we show that the variance of the estimate of the student is reduced with the help of the teacher. We further formulate the online problem - where the teacher has to decide at each time instant whether or not to change the observations - as a Markov decision process, from which the optimal policy is derived using dynamic programming. We validate the framework in numerical experiments, and compare the optimal online policy with the one from the batch setting.
- Published
- 2021
- Full Text
- View/download PDF
7. Computing monotone policies for Markov decision processes: a nearly-isotonic penalty approach
- Author
-
Mattila, Robert, Rojas, Cristian R., Krishnamurthy, Vikram, and Wahlberg, Bo
- Subjects
FOS: Electrical engineering, electronic engineering, information engineering ,Computer Science - Systems and Control ,Systems and Control (eess.SY) - Abstract
This paper discusses algorithms for solving Markov decision processes (MDPs) that have monotone optimal policies. We propose a two-stage alternating convex optimization scheme that can accelerate the search for an optimal policy by exploiting the monotone property. The first stage is a linear program formulated in terms of the joint state-action probabilities. The second stage is a regularized problem formulated in terms of the conditional probabilities of actions given states. The regularization uses techniques from nearly-isotonic regression. While a variety of iterative method can be used in the first formulation of the problem, we show in numerical simulations that, in particular, the alternating method of multipliers (ADMM) can be significantly accelerated using the regularization step., Comment: This work has been accepted for presentation at the 20th World Congress of the International Federation of Automatic Control, 9-14 July 2017
- Published
- 2017
- Full Text
- View/download PDF
8. Bayesian Learning for Low-Rank matrix reconstruction
- Author
-
Sundin, Martin, Rojas, Cristian R., Jansson, Magnus, and Chatterjee, Saikat
- Subjects
FOS: Computer and information sciences ,Computer Science - Learning ,Statistics - Machine Learning ,Computer Science - Numerical Analysis ,FOS: Mathematics ,Machine Learning (stat.ML) ,Numerical Analysis (math.NA) ,Machine Learning (cs.LG) - Abstract
We develop latent variable models for Bayesian learning based low-rank matrix completion and reconstruction from linear measurements. For under-determined systems, the developed methods are shown to reconstruct low-rank matrices when neither the rank nor the noise power is known a-priori. We derive relations between the latent variable models and several low-rank promoting penalty functions. The relations justify the use of Kronecker structured covariance matrices in a Gaussian based prior. In the methods, we use evidence approximation and expectation-maximization to learn the model parameters. The performance of the methods is evaluated through extensive numerical simulations., Comment: Submitted to IEEE Transactions on Signal Processing
- Published
- 2015
- Full Text
- View/download PDF
9. Reweighted nuclear norm regularization: A SPARSEVA approach
- Author
-
Ha, Huong, Welsh, James S., Blomberg, Niclas, Rojas, Cristian R., and Wahlberg, Bo
- Subjects
Optimization and Control (math.OC) ,FOS: Mathematics ,Mathematics - Optimization and Control - Abstract
The aim of this paper is to develop a method to estimate high order FIR and ARX models using least squares with re-weighted nuclear norm regularization. Typically, the choice of the tuning parameter in the reweighting scheme is computationally expensive, hence we propose the use of the SPARSEVA (SPARSe Estimation based on a VAlidation criterion) framework to overcome this problem. Furthermore, we suggest the use of the prediction error criterion (PEC) to select the tuning parameter in the SPARSEVA algorithm. Numerical examples demonstrate the veracity of this method which has close ties with the traditional technique of cross validation, but using much less computations., Comment: This paper is accepted and will be published in The Proceedings of the 17th IFAC Symposium on System Identification (SYSID 2015), Beijing, China, 2015
- Published
- 2015
- Full Text
- View/download PDF
10. On change point detection using the fused lasso method
- Author
-
Rojas, Cristian R. and Wahlberg, Bo
- Subjects
FOS: Computer and information sciences ,62G08, 62G20 ,Statistics - Machine Learning ,FOS: Mathematics ,Mathematics - Statistics Theory ,Machine Learning (stat.ML) ,Statistics Theory (math.ST) - Abstract
In this paper we analyze the asymptotic properties of l1 penalized maximum likelihood estimation of signals with piece-wise constant mean values and/or variances. The focus is on segmentation of a non-stationary time series with respect to changes in these model parameters. This change point detection and estimation problem is also referred to as total variation denoising or l1 -mean filtering and has many important applications in most fields of science and engineering. We establish the (approximate) sparse consistency properties, including rate of convergence, of the so-called fused lasso signal approximator (FLSA). We show that this only holds if the sign of the corresponding consecutive changes are all different, and that this estimator is otherwise incapable of correctly detecting the underlying sparsity pattern. The key idea is to notice that the optimality conditions for this problem can be analyzed using techniques related to brownian bridge theory.
- Published
- 2014
- Full Text
- View/download PDF
11. Alternating Strategies Are Good For Low-Rank Matrix Reconstruction
- Author
-
Li, Kezhi, Sundin, Martin, Rojas, Cristian R., Chatterjee, Saikat, and Jansson, Magnus
- Subjects
FOS: Computer and information sciences ,Computer Science - Information Theory ,Information Theory (cs.IT) ,FOS: Mathematics ,Mathematics - Statistics Theory ,Statistics Theory (math.ST) - Abstract
This article focuses on the problem of reconstructing low-rank matrices from underdetermined measurements using alternating optimization strategies. We endeavour to combine an alternating least-squares based estimation strategy with ideas from the alternating direction method of multipliers (ADMM) to recover structured low-rank matrices, such as Hankel structure. We show that merging these two alternating strategies leads to a better performance than the existing alternating least squares (ALS) strategy. The performance is evaluated via numerical simulations.
- Published
- 2014
- Full Text
- View/download PDF
12. Application Set Approximation in Optimal Input Design for Model Predictive Control
- Author
-
Ebadat, Afrooz, Annergren, Mariette, Larsson, Christian A., Rojas, Cristian R., and Wahlberg, Bo
- Subjects
Optimization and Control (math.OC) ,FOS: Electrical engineering, electronic engineering, information engineering ,FOS: Mathematics ,Computer Science - Systems and Control ,Systems and Control (eess.SY) ,Mathematics - Optimization and Control - Abstract
This contribution considers one central aspect of experiment design in system identification. When a control design is based on an estimated model, the achievable performance is related to the quality of the estimate. The degradation in control performance due to errors in the estimated model is measured by an application cost function. In order to use an optimization based input design method, a convex approximation of the set of models that atisfies the control specification is required. The standard approach is to use a quadratic approximation of the application cost function, where the main computational effort is to find the corresponding Hessian matrix. Our main contribution is an alternative approach for this problem, which uses the structure of the underlying optimal control problem to considerably reduce the computations needed to find the application set. This technique allows the use of applications oriented input design for MPC on much more complex plants. The approach is numerically evaluated on a distillation control problem.
- Published
- 2013
- Full Text
- View/download PDF
13. On the Design of Channel Estimators for given Signal Estimators and Detectors
- Author
-
Katselis, Dimitrios, Rojas, Cristian R., Hjalmarsson, Håkan, Bengtsson, Mats, and Skoglund, Mikael
- Subjects
FOS: Computer and information sciences ,Computer Science - Information Theory ,Information Theory (cs.IT) ,Data_CODINGANDINFORMATIONTHEORY ,Computer Science::Information Theory - Abstract
The fundamental task of a digital receiver is to decide the transmitted symbols in the best possible way, i.e., with respect to an appropriately defined performance metric. Examples of usual performance metrics are the probability of error and the Mean Square Error (MSE) of a symbol estimator. In a coherent receiver, the symbol decisions are made based on the use of a channel estimate. This paper focuses on examining the optimality of usual estimators such as the minimum variance unbiased (MVU) and the minimum mean square error (MMSE) estimators for these metrics and on proposing better estimators whenever it is necessary. For illustration purposes, this study is performed on a toy channel model, namely a single input single output (SISO) flat fading channel with additive white Gaussian noise (AWGN). In this way, this paper highlights the design dependencies of channel estimators on target performance metrics.
- Published
- 2013
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.