Author: "Ranftl, Rene" - Searchworks@Jio Institute Digital Library Search Results

Your search for author "Ranftl, Rene" returned 95 results.

1. Monocular Visual-Inertial Depth Estimation

Author: Wofk, Diana, Ranftl, René, Müller, Matthias, and Koltun, Vladlen
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics
Abstract: We present a visual-inertial depth estimation pipeline that integrates monocular depth estimation and visual-inertial odometry to produce dense depth estimates with metric scale. Our approach performs global scale and shift alignment against sparse metric depth, followed by learning-based dense alignment. We evaluate on the TartanAir and VOID datasets, observing up to 30% reduction in inverse RMSE with dense scale alignment relative to performing global alignment alone. Our approach is especially competitive at low density; with just 150 sparse metric depth points, our dense-to-dense depth alignment method achieves over 50% lower iRMSE than sparse-to-dense depth completion by KBNet, currently the state of the art on VOID. We demonstrate successful zero-shot transfer from synthetic TartanAir to real-world VOID data and perform generalization tests on NYUv2 and VCU-RVI. Our approach is modular and is compatible with a variety of monocular depth estimation models. Video: https://youtu.be/IMwiKwSpshQ Code: https://github.com/isl-org/VI-Depth, Comment: Accepted for publication at ICRA'23
Published: 2023
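
Note on the abstract above: the "global scale and shift alignment against sparse metric depth" step can be illustrated with a small least-squares fit. This is only a sketch under assumptions, not the authors' implementation: the abstract does not say which solver or depth parameterization is used, so the closed-form ordinary least squares below, the function name `fit_scale_shift`, and the sample values are all hypothetical.

```python
def fit_scale_shift(pred, target):
    """Return (scale, shift) minimising sum((scale * p + shift - z) ** 2).

    `pred` holds relative depth values at the sparse locations and
    `target` the metric values at the same locations (hypothetical inputs).
    """
    n = len(pred)
    if n < 2 or n != len(target):
        raise ValueError("need at least two matching point pairs")

    mean_p = sum(pred) / n
    mean_z = sum(target) / n

    # Closed-form solution of the 1-D linear regression z ~ scale * p + shift.
    var_p = sum((p - mean_p) ** 2 for p in pred)
    if var_p == 0:
        raise ValueError("predictions are constant; scale is undetermined")
    cov = sum((p - mean_p) * (z - mean_z) for p, z in zip(pred, target))

    scale = cov / var_p
    shift = mean_z - scale * mean_p
    return scale, shift


# Usage: align a handful of relative values to made-up metric measurements.
relative = [0.2, 0.4, 0.5, 0.8]
metric = [2.0 * r + 0.1 for r in relative]
scale, shift = fit_scale_shift(relative, metric)
aligned = [scale * r + shift for r in relative]
```

A single global scale and shift cannot correct errors that vary across the image, which is presumably why the abstract describes a learned dense alignment stage after this step.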

2. Unsupervised Contrastive Domain Adaptation for Semantic Segmentation

Author: Zhang, Feihu, Koltun, Vladlen, Torr, Philip, Ranftl, René, and Richter, Stephan R.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Semantic segmentation models struggle to generalize in the presence of domain shift. In this paper, we introduce contrastive learning for feature alignment in cross-domain adaptation. We assemble both in-domain contrastive pairs and cross-domain contrastive pairs to learn discriminative features that align across domains. Based on the resulting well-aligned feature representations we introduce a label expansion approach that is able to discover samples from hard classes during the adaptation process to further boost performance. The proposed approach consistently outperforms state-of-the-art methods for domain adaptation. It achieves 60.2% mIoU on the Cityscapes dataset when training on the synthetic GTA5 dataset together with unlabeled Cityscapes images.
Published: 2022

3. Language-driven Semantic Segmentation

Author: Li, Boyi, Weinberger, Kilian Q., Belongie, Serge, Koltun, Vladlen, and Ranftl, René
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We present LSeg, a novel model for language-driven semantic image segmentation. LSeg uses a text encoder to compute embeddings of descriptive input labels (e.g., "grass" or "building") together with a transformer-based image encoder that computes dense per-pixel embeddings of the input image. The image encoder is trained with a contrastive objective to align pixel embeddings to the text embedding of the corresponding semantic class. The text embeddings provide a flexible label representation in which semantically similar labels map to similar regions in the embedding space (e.g., "cat" and "furry"). This allows LSeg to generalize to previously unseen categories at test time, without retraining or even requiring a single additional training sample. We demonstrate that our approach achieves highly competitive zero-shot performance compared to existing zero- and few-shot semantic segmentation methods, and even matches the accuracy of traditional segmentation algorithms when a fixed label set is provided. Code and demo are available at https://github.com/isl-org/lang-seg, Comment: ICLR 2022
Published: 2022

4. Transferable End-to-end Room Layout Estimation via Implicit Encoding

Author: Zhao, Hao, Ranftl, Rene, Chen, Yurong, and Zha, Hongbin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We study the problem of estimating room layouts from a single panorama image. Most previous works have two stages: feature extraction and parametric model fitting. Here we propose an end-to-end method that directly predicts parametric layouts from an input panorama image. It exploits an implicit encoding procedure that embeds parametric layouts into a latent space. Learning a mapping from images to this latent space then makes end-to-end room layout estimation possible. However, end-to-end methods have several notorious drawbacks despite many intriguing properties. A widely raised criticism is that they suffer from dataset bias and do not transfer to unfamiliar domains. Our study echoes this common belief. To address this, we propose to use semantic boundary prediction maps as an intermediate domain. This brings a significant performance boost on four benchmarks (Structured3D, PanoContext, S3DIS, and Matterport3D), notably in the zero-shot transfer setting. Code, data, and models will be released., Comment: Project: https://sites.google.com/view/transferrl/
Published: 2021

5. Learning High-Speed Flight in the Wild

Author: Loquercio, Antonio, Kaufmann, Elia, Ranftl, René, Müller, Matthias, Koltun, Vladlen, and Scaramuzza, Davide
Subjects: Computer Science - Robotics, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control
Abstract: Quadrotors are agile. Unlike most other machines, they can traverse extremely complex environments at high speeds. To date, only expert human pilots have been able to fully exploit their capabilities. Autonomous operation with on-board sensing and computation has been limited to low speeds. State-of-the-art methods generally separate the navigation problem into subtasks: sensing, mapping, and planning. While this approach has proven successful at low speeds, the separation it builds upon can be problematic for high-speed navigation in cluttered environments. Indeed, the subtasks are executed sequentially, leading to increased processing latency and a compounding of errors through the pipeline. Here we propose an end-to-end approach that can autonomously fly quadrotors through complex natural and man-made environments at high speeds, with purely onboard sensing and computation. The key principle is to directly map noisy sensory observations to collision-free trajectories in a receding-horizon fashion. This direct mapping drastically reduces processing latency and increases robustness to noisy and incomplete perception. The sensorimotor mapping is performed by a convolutional network that is trained exclusively in simulation via privileged learning: imitating an expert with access to privileged information. By simulating realistic sensor noise, our approach achieves zero-shot transfer from simulation to challenging real-world environments that were never experienced during training: dense forests, snow-covered terrain, derailed trains, and collapsed buildings. Our work demonstrates that end-to-end policies trained in simulation enable high-speed autonomous flight through challenging environments, outperforming traditional obstacle avoidance pipelines., Comment: 16 pages (+7 supplementary)
Published: 2021

6. An Analysis of Super-Net Heuristics in Weight-Sharing NAS

Author: Yu, Kaicheng, Ranftl, René, and Salzmann, Mathieu
Subjects: Computer Science - Machine Learning
Abstract: Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware. Existing methods in this space rely on a diverse set of heuristics to design and train the shared-weight backbone network, a.k.a. the super-net. Since heuristics substantially vary across different methods and have not been carefully studied, it is unclear to which extent they impact super-net training and hence the weight-sharing NAS algorithms. In this paper, we disentangle super-net training from the search algorithm, isolate 14 frequently-used training heuristics, and evaluate them over three benchmark search spaces. Our analysis uncovers that several commonly-used heuristics negatively impact the correlation between super-net and stand-alone performance, whereas simple, but often overlooked factors, such as proper hyper-parameter settings, are key to achieve strong performance. Equipped with this knowledge, we show that simple random search achieves competitive performance to complex state-of-the-art NAS algorithms when the super-net is properly trained., Comment: Accepted to T-PAMI
Published: 2021

7. Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search

Author: Yu, Kaicheng, Ranftl, Rene, and Salzmann, Mathieu
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Weight sharing has become a de facto standard in neural architecture search because it enables the search to be done on commodity hardware. However, recent works have empirically shown a ranking disorder between the performance of stand-alone architectures and that of the corresponding shared-weight networks. This violates the main assumption of weight-sharing NAS algorithms, thus limiting their effectiveness. We tackle this issue by proposing a regularization term that aims to maximize the correlation between the performance rankings of the shared-weight network and that of the standalone architectures using a small set of landmark architectures. We incorporate our regularization term into three different NAS algorithms and show that it consistently improves performance across algorithms, search-spaces, and tasks., Comment: Accepted to CVPR 2021
Published: 2021
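
Note on the abstract above: the "correlation between the performance rankings" of the shared-weight network and the stand-alone architectures can be made concrete with a rank correlation coefficient. The sketch below computes Kendall's tau (the tau-a variant, which ignores ties). Choosing this particular measure is an assumption, since this abstract does not name one, and the function name and accuracy values are hypothetical.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """Kendall's tau-a between two equally long score lists.

    Counts pairs of items ordered the same way in both lists (concordant)
    versus the opposite way (discordant); tied pairs count as neither.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length >= 2")

    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1

    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Usage: made-up accuracies of four architectures under both evaluations.
supernet_acc = [0.61, 0.58, 0.70, 0.65]
standalone_acc = [0.92, 0.93, 0.95, 0.94]
tau = kendall_tau(supernet_acc, standalone_acc)
```

A value near 1 means the shared-weight network ranks architectures almost as stand-alone training would, while a value near 0 corresponds to the "ranking disorder" the abstract describes.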

8. Vision Transformers for Dense Prediction

Author: Ranftl, René, Bochkovskiy, Alexey, and Koltun, Vladlen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We introduce dense vision transformers, an architecture that leverages vision transformers in place of convolutional networks as a backbone for dense prediction tasks. We assemble tokens from various stages of the vision transformer into image-like representations at various resolutions and progressively combine them into full-resolution predictions using a convolutional decoder. The transformer backbone processes representations at a constant and relatively high resolution and has a global receptive field at every stage. These properties allow the dense vision transformer to provide finer-grained and more globally coherent predictions when compared to fully-convolutional networks. Our experiments show that this architecture yields substantial improvements on dense prediction tasks, especially when a large amount of training data is available. For monocular depth estimation, we observe an improvement of up to 28% in relative performance when compared to a state-of-the-art fully-convolutional network. When applied to semantic segmentation, dense vision transformers set a new state of the art on ADE20K with 49.02% mIoU. We further show that the architecture can be fine-tuned on smaller datasets such as NYUv2, KITTI, and Pascal Context where it also sets the new state of the art. Our models are available at https://github.com/intel-isl/DPT., Comment: 15 pages
Published: 2021

9. Deep Drone Acrobatics

Author: Kaufmann, Elia, Loquercio, Antonio, Ranftl, René, Müller, Matthias, Koltun, Vladlen, and Scaramuzza, Davide
Subjects: Computer Science - Robotics
Abstract: Performing acrobatic maneuvers with quadrotors is extremely challenging. Acrobatic flight requires high thrust and extreme angular accelerations that push the platform to its physical limits. Professional drone pilots often measure their level of mastery by flying such maneuvers in competitions. In this paper, we propose to learn a sensorimotor policy that enables an autonomous quadrotor to fly extreme acrobatic maneuvers with only onboard sensing and computation. We train the policy entirely in simulation by leveraging demonstrations from an optimal controller that has access to privileged information. We use appropriate abstractions of the visual input to enable transfer to a real quadrotor. We show that the resulting policy can be directly deployed in the physical world without any fine-tuning on real data. Our methodology has several favorable properties: it does not require a human expert to provide demonstrations, it cannot harm the physical system during training, and it can be used to learn maneuvers that are challenging even for the best human pilots. Our approach enables a physical quadrotor to fly maneuvers such as the Power Loop, the Barrel Roll, and the Matty Flip, during which it incurs accelerations of up to 3g., Comment: 8 pages + 2 pages references. Video: https://youtu.be/2N_wKXQ6MXA. Code: https://github.com/uzh-rpg/deep_drone_acrobatics
Published: 2020

10. High-dimensional Convolutional Networks for Geometric Pattern Recognition

Author: Choy, Christopher, Lee, Junha, Ranftl, Rene, Park, Jaesik, and Koltun, Vladlen
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Many problems in science and engineering can be formulated in terms of geometric patterns in high-dimensional spaces. We present high-dimensional convolutional networks (ConvNets) for pattern recognition problems that arise in the context of geometric registration. We first study the effectiveness of convolutional networks in detecting linear subspaces in high-dimensional spaces with up to 32 dimensions: much higher dimensionality than prior applications of ConvNets. We then apply high-dimensional ConvNets to 3D registration under rigid motions and image correspondence estimation. Experiments indicate that our high-dimensional ConvNets outperform prior approaches that relied on deep networks based on global pooling operators., Comment: Accepted for CVPR 2020 oral presentation
Published: 2020

11. How to Train Your Super-Net: An Analysis of Training Heuristics in Weight-Sharing NAS

Author: Yu, Kaicheng, Ranftl, Rene, and Salzmann, Mathieu
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware. Existing methods in this space rely on a diverse set of heuristics to design and train the shared-weight backbone network, a.k.a. the super-net. Since heuristics and hyperparameters substantially vary across different methods, a fair comparison between them can only be achieved by systematically analyzing the influence of these factors. In this paper, we therefore provide a systematic evaluation of the heuristics and hyperparameters that are frequently employed by weight-sharing NAS algorithms. Our analysis uncovers that some commonly-used heuristics for super-net training negatively impact the correlation between super-net and stand-alone performance, and evidences the strong influence of certain hyperparameters and architectural choices. Our code and experiments set a strong and reproducible baseline that future works can build on., Comment: Updated with latest results on NASBench-101, now we achieve 0.48 sparse Kendall-Tau on this space
Published: 2020

12. Safe Robot Navigation via Multi-Modal Anomaly Detection

Author: Wellhausen, Lorenz, Ranftl, René, and Hutter, Marco
Subjects: Computer Science - Robotics
Abstract: Navigation in natural outdoor environments requires a robust and reliable traversability classification method to handle the plethora of situations a robot can encounter. Binary classification algorithms perform well in their native domain but tend to provide overconfident predictions when presented with out-of-distribution samples, which can lead to catastrophic failure when navigating unknown environments. We propose to overcome this issue by using anomaly detection on multi-modal images for traversability classification, which is easily scalable by training in a self-supervised fashion from robot experience. In this work, we evaluate multiple anomaly detection methods with a combination of uni- and multi-modal images in their performance on data from different environmental conditions. Our results show that an approach using a feature extractor and normalizing flow with an input of RGB, depth and surface normals performs best. It achieves over 95% area under the ROC curve and is robust to out-of-distribution samples.
Published: 2020

13. Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

Author: Ranftl, René, Lasinger, Katrin, Hafner, David, Schindler, Konrad, and Koltun, Vladlen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The success of monocular depth estimation relies on large and diverse training sets. Due to the challenges associated with acquiring dense ground-truth depth across different environments at scale, a number of datasets with distinct characteristics and biases have emerged. We develop tools that enable mixing multiple datasets during training, even if their annotations are incompatible. In particular, we propose a robust training objective that is invariant to changes in depth range and scale, advocate the use of principled multi-objective learning to combine data from different sources, and highlight the importance of pretraining encoders on auxiliary tasks. Armed with these tools, we experiment with five diverse training datasets, including a new, massive data source: 3D films. To demonstrate the generalization power of our approach we use zero-shot cross-dataset transfer, i.e. we evaluate on datasets that were not seen during training. The experiments confirm that mixing data from complementary sources greatly improves monocular depth estimation. Our approach clearly outperforms competing methods across diverse datasets, setting a new state of the art for monocular depth estimation. Some results are shown in the supplementary video at https://youtu.be/D46FzVyL9I8, Comment: To appear in TPAMI (accepted August 2020)
Published: 2019

14. High Speed and High Dynamic Range Video with an Event Camera

Author: Rebecq, Henri, Ranftl, René, Koltun, Vladlen, and Scaramuzza, Davide
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Event cameras are novel sensors that report brightness changes in the form of a stream of asynchronous "events" instead of intensity frames. They offer significant advantages with respect to conventional cameras: high temporal resolution, high dynamic range, and no motion blur. While the stream of events encodes in principle the complete visual signal, the reconstruction of an intensity image from a stream of events is an ill-posed problem in practice. Existing reconstruction approaches are based on hand-crafted priors and strong assumptions about the imaging process as well as the statistics of natural images. In this work we propose to learn to reconstruct intensity images from event streams directly from data instead of relying on any hand-crafted priors. We propose a novel recurrent network to reconstruct videos from a stream of events, and train it on a large amount of simulated event data. During training we propose to use a perceptual loss to encourage reconstructions to follow natural image statistics. We further extend our approach to synthesize color images from color event streams. Our network surpasses state-of-the-art reconstruction methods by a large margin in terms of image quality (> 20%), while comfortably running in real-time. We show that the network is able to synthesize high framerate videos (> 5,000 frames per second) of high-speed phenomena (e.g. a bullet hitting an object) and is able to provide high dynamic range reconstructions in challenging lighting conditions. We also demonstrate the effectiveness of our reconstructions as an intermediate representation for event data. We show that off-the-shelf computer vision algorithms can be applied to our reconstructions for tasks such as object classification and visual-inertial odometry and that this strategy consistently outperforms algorithms that were specifically designed for event data., Comment: arXiv admin note: substantial text overlap with arXiv:1904.08298
Published: 2019

15. Deep Drone Racing: From Simulation to Reality with Domain Randomization

Author: Loquercio, Antonio, Kaufmann, Elia, Ranftl, René, Dosovitskiy, Alexey, Koltun, Vladlen, and Scaramuzza, Davide
Subjects: Computer Science - Robotics
Abstract: Dynamically changing environments, unreliable state estimation, and operation under severe resource constraints are fundamental challenges that limit the deployment of small autonomous drones. We address these challenges in the context of autonomous, vision-based drone racing in dynamic environments. A racing drone must traverse a track with possibly moving gates at high speed. We enable this functionality by combining the performance of a state-of-the-art planning and control system with the perceptual awareness of a convolutional neural network (CNN). The resulting modular system is both platform- and domain-independent: it is trained in simulation and deployed on a physical quadrotor without any fine-tuning. The abundance of simulated data, generated via domain randomization, makes our system robust to changes of illumination and gate appearance. To the best of our knowledge, our approach is the first to demonstrate zero-shot sim-to-real transfer on the task of agile drone flight. We extensively test the precision and robustness of our system, both in simulation and on a physical platform, and show significant improvements over the state of the art., Comment: Accepted as a Regular Paper to the IEEE Transactions on Robotics Journal. arXiv admin note: substantial text overlap with arXiv:1806.08548
Published: 2019

16. Feedback MPC for Torque-Controlled Legged Robots

Author: Grandia, Ruben, Farshidian, Farbod, Ranftl, René, and Hutter, Marco
Subjects: Computer Science - Robotics
Abstract: The computational power of mobile robots is currently insufficient to achieve torque level whole-body Model Predictive Control (MPC) at the update rates required for complex dynamic systems such as legged robots. This problem is commonly circumvented by using a fast tracking controller to compensate for model errors between updates. In this work, we show that the feedback policy from a Differential Dynamic Programming (DDP) based MPC algorithm is a viable alternative to bridge the gap between the low MPC update rate and the actuation command rate. We propose to augment the DDP approach with a relaxed barrier function to address inequality constraints arising from the friction cone. A frequency-dependent cost function is used to reduce the sensitivity to high-frequency model errors and actuator bandwidth limits. We demonstrate that our approach can find stable locomotion policies for the torque-controlled quadruped, ANYmal, both in simulation and on hardware., Comment: Paper accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)
Published: 2019

17. What Do Single-view 3D Reconstruction Networks Learn?

Author: Tatarchenko, Maxim, Richter, Stephan R., Ranftl, René, Li, Zhuwen, Koltun, Vladlen, and Brox, Thomas
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Convolutional networks for single-view object reconstruction have shown impressive performance and have become a popular subject of research. All existing techniques are united by the idea of having an encoder-decoder network that performs non-trivial reasoning about the 3D structure of the output space. In this work, we set up two alternative approaches that perform image classification and retrieval respectively. These simple baselines yield better results than state-of-the-art methods, both qualitatively and quantitatively. We show that encoder-decoder methods are statistically indistinguishable from these baselines, thus indicating that the current state of the art in single-view object reconstruction does not actually perform reconstruction but image classification. We identify aspects of popular experimental procedures that elicit this behavior and discuss ways to improve the current state of research.
Published: 2019

18. Events-to-Video: Bringing Modern Computer Vision to Event Cameras

Author: Rebecq, Henri, Ranftl, René, Koltun, Vladlen, and Scaramuzza, Davide
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Event cameras are novel sensors that report brightness changes in the form of asynchronous "events" instead of intensity frames. They have significant advantages over conventional cameras: high temporal resolution, high dynamic range, and no motion blur. Since the output of event cameras is fundamentally different from conventional cameras, it is commonly accepted that they require the development of specialized algorithms to accommodate the particular nature of events. In this work, we take a different view and propose to apply existing, mature computer vision techniques to videos reconstructed from event data. We propose a novel recurrent network to reconstruct videos from a stream of events, and train it on a large amount of simulated event data. Our experiments show that our approach surpasses state-of-the-art reconstruction methods by a large margin (> 20%) in terms of image quality. We further apply off-the-shelf computer vision algorithms to videos reconstructed from event data on tasks such as object classification and visual-inertial odometry, and show that this strategy consistently outperforms algorithms that were specifically designed for event data. We believe that our approach opens the door to bringing the outstanding properties of event cameras to an entirely new range of tasks. A video of the experiments is available at https://youtu.be/IdYrC4cUO0I
Published: 2019

19. Beauty and the Beast: Optimal Methods Meet Learning for Drone Racing

Author: Kaufmann, Elia, Gehrig, Mathias, Foehn, Philipp, Ranftl, René, Dosovitskiy, Alexey, Koltun, Vladlen, and Scaramuzza, Davide
Subjects: Computer Science - Robotics
Abstract: Autonomous micro aerial vehicles still struggle with fast and agile maneuvers, dynamic environments, imperfect sensing, and state estimation drift. Autonomous drone racing brings these challenges to the fore. Human pilots can fly a previously unseen track after a handful of practice runs. In contrast, state-of-the-art autonomous navigation algorithms require either a precise metric map of the environment or a large amount of training data collected in the track of interest. To bridge this gap, we propose an approach that can fly a new track in a previously unseen environment without a precise map or expensive data collection. Our approach represents the global track layout with coarse gate locations, which can be easily estimated from a single demonstration flight. At test time, a convolutional network predicts the poses of the closest gates along with their uncertainty. These predictions are incorporated by an extended Kalman filter to maintain optimal maximum-a-posteriori estimates of gate locations. This allows the framework to cope with misleading high-variance estimates that could stem from poor observability or lack of visible gates. Given the estimated gate poses, we use model predictive control to quickly and accurately navigate through the track. We conduct extensive experiments in the physical world, demonstrating agile and robust flight through complex and diverse previously-unseen race tracks. The presented approach was used to win the IROS 2018 Autonomous Drone Race Competition, outracing the second-placing team by a factor of two., Comment: 6 pages (+1 references)
Published: 2018

20. Frequency-Aware Model Predictive Control

Author: Grandia, Ruben, Farshidian, Farbod, Dosovitskiy, Alexey, Ranftl, René, and Hutter, Marco
Subjects: Computer Science - Robotics, Computer Science - Systems and Control
Abstract: Transferring solutions found by trajectory optimization to robotic hardware remains a challenging task. When the optimization fully exploits the provided model to perform dynamic tasks, the presence of unmodeled dynamics renders the motion infeasible on the real system. Model errors can be a result of model simplifications, but also naturally arise when deploying the robot in unstructured and nondeterministic environments. Predominantly, compliant contacts and actuator dynamics lead to bandwidth limitations. While classical control methods provide tools to synthesize controllers that are robust to a class of model errors, such a notion is missing in modern trajectory optimization, which is solved in the time domain. We propose frequency-shaped cost functions to achieve robust solutions in the context of optimal control for legged robots. Through simulation and hardware experiments we show that motion plans can be made compatible with bandwidth limits set by actuators and contact dynamics. The smoothness of the model predictive solutions can be continuously tuned without compromising the feasibility of the problem. Experiments with the quadrupedal robot ANYmal, which is driven by highly-compliant series elastic actuators, showed significantly improved tracking performance of the planned motion, torque, and force trajectories and enabled the machine to walk robustly on terrain with unmodeled compliance.
Published: 2018

21. Deep Drone Racing: Learning Agile Flight in Dynamic Environments

Author: Kaufmann, Elia, Loquercio, Antonio, Ranftl, Rene, Dosovitskiy, Alexey, Koltun, Vladlen, and Scaramuzza, Davide
Subjects: Computer Science - Robotics
Abstract: Autonomous agile flight brings up fundamental challenges in robotics, such as coping with unreliable state estimation, reacting optimally to dynamically changing environments, and coupling perception and action in real time under severe resource constraints. In this paper, we consider these challenges in the context of autonomous, vision-based drone racing in dynamic environments. Our approach combines a convolutional neural network (CNN) with a state-of-the-art path-planning and control system. The CNN directly maps raw images into a robust representation in the form of a waypoint and desired speed. This information is then used by the planner to generate a short, minimum-jerk trajectory segment and corresponding motor commands to reach the desired goal. We demonstrate our method in autonomous agile flight scenarios, in which a vision-based quadrotor traverses drone-racing tracks with possibly moving gates. Our method does not require any explicit map of the environment and runs fully onboard. We extensively test the precision and robustness of the approach in simulation and in the physical world. We also evaluate our method against state-of-the-art navigation approaches and professional human drone pilots., Comment: Accepted for publication in the Conference on Robotic Learning (CoRL) 2018, Zurich. 10 pages (+3 supplementary)
Published: 2018

22. Accurate Optical Flow via Direct Cost Volume Processing

Author: Xu, Jia, Ranftl, René, and Koltun, Vladlen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We present an optical flow estimation approach that operates on the full four-dimensional cost volume. This direct approach shares the structural benefits of leading stereo matching pipelines, which are known to yield high accuracy. To this day, such approaches have been considered impractical due to the size of the cost volume. We show that the full four-dimensional cost volume can be constructed in a fraction of a second due to its regularity. We then exploit this regularity further by adapting semi-global matching to the four-dimensional setting. This yields a pipeline that achieves significantly higher accuracy than state-of-the-art optical flow methods while being faster than most. Our approach outperforms all published general-purpose optical flow methods on both Sintel and KITTI 2015 benchmarks., Comment: Published at the Conference on Computer Vision and Pattern Recognition (CVPR 2017)
Published: 2017

23. Techniques for Gradient Based Bilevel Optimization with Nonsmooth Lower Level Problems

Author: Ochs, Peter, Ranftl, René, Brox, Thomas, and Pock, Thomas
Subjects: Mathematics - Optimization and Control
Abstract: We propose techniques for approximating bilevel optimization problems with non-smooth lower level problems that can have a non-unique solution. To this end, we substitute the expression of a minimizer of the lower level minimization problem with an iterative algorithm that is guaranteed to converge to a minimizer of the problem. Using suitable non-linear proximal distance functions, the update mappings of such an iterative algorithm can be differentiable, notwithstanding the fact that the minimization problem is non-smooth.
Published: 2016

24. A higher-order MRF based variational model for multiplicative noise reduction

Author: Chen, Yunjin, Feng, Wensen, Ranftl, René, Qiao, Hong, and Pock, Thomas
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The Fields of Experts (FoE) image prior model, a filter-based higher-order Markov Random Fields (MRF) model, has been shown to be effective for many image restoration problems. Motivated by the successes of FoE-based approaches, in this letter, we propose a novel variational model for multiplicative noise reduction based on the FoE image prior model. The resulting model corresponds to a non-convex minimization problem, which can be solved by a recently published non-convex optimization algorithm. Experimental results based on synthetic speckle noise and real synthetic aperture radar (SAR) images suggest that the performance of our proposed method is on par with the best published despeckling algorithm. Besides, our proposed model comes with the additional advantage that inference is extremely efficient. Our GPU-based implementation takes less than 1s to produce state-of-the-art despeckling performance., Comment: 5 pages, 5 figures, to appear in IEEE Signal Processing Letters
Published: 2014
Full Text: View/download PDF

25. Revisiting loss-specific training of filter-based MRFs for image restoration

Author: Chen, Yunjin, Pock, Thomas, Ranftl, René, and Bischof, Horst
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: It is now well known that Markov random fields (MRFs) are particularly effective for modeling image priors in low-level vision. Recent years have seen the emergence of two main approaches for learning the parameters in MRFs: (1) probabilistic learning using sampling-based algorithms and (2) loss-specific training based on MAP estimate. After investigating existing training approaches, it turns out that the performance of the loss-specific training has been significantly underestimated in existing work. In this paper, we revisit this approach and use techniques from bi-level optimization to solve it. We show that we can get a substantial gain in the final performance by solving the lower-level problem in the bi-level framework with high accuracy using our newly proposed algorithm. As a result, our trained model is on par with highly specialized image denoising algorithms and clearly outperforms probabilistically trained MRF models. Our findings suggest that for the loss-specific training scheme, solving the lower-level problem with higher accuracy is beneficial. Our trained model comes along with the additional advantage, that inference is extremely efficient. Our GPU-based implementation takes less than 1s to produce state-of-the-art performance., Comment: 10 pages, 2 figures, appear at 35th German Conference, GCPR 2013, Saarbrücken, Germany, September 3-6, 2013. Proceedings
Published: 2014
Full Text: View/download PDF

26. A bi-level view of inpainting-based image compression

Author: Chen, Yunjin, Ranftl, René, and Pock, Thomas
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Inpainting based image compression approaches, especially linear and non-linear diffusion models, are an active research topic for lossy image compression. The major challenge in these compression models is to find a small set of descriptive supporting points, which allow for an accurate reconstruction of the original image. It turns out in practice that this is a challenging problem even for the simplest Laplacian interpolation model. In this paper, we revisit the Laplacian interpolation compression model and introduce two fast algorithms, namely successive preconditioning primal dual algorithm and the recently proposed iPiano algorithm, to solve this problem efficiently. Furthermore, we extend the Laplacian interpolation based compression model to a more general form, which is based on principles from bi-level optimization. We investigate two different variants of the Laplacian model, namely biharmonic interpolation and smoothed Total Variation regularization. Our numerical results show that significant improvements can be obtained from the biharmonic interpolation model, and it can recover an image with very high quality from only 5% pixels., Comment: 8 pages, 4 figures, best paper award of CVWW 2014, Computer Vision Winter Workshop, Křtiny, Czech Republic, 3-5th February 2014
Published: 2014

27. Insights into analysis operator learning: From patch-based sparse models to higher-order MRFs

Author: Chen, Yunjin, Ranftl, René, and Pock, Thomas
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper addresses a new learning algorithm for the recently introduced co-sparse analysis model. First, we give new insights into the co-sparse analysis model by establishing connections to filter-based MRF models, such as the Field of Experts (FoE) model of Roth and Black. For training, we introduce a technique called bi-level optimization to learn the analysis operators. Compared to existing analysis operator learning approaches, our training procedure has the advantage that it is unconstrained with respect to the analysis operator. We investigate the effect of different aspects of the co-sparse analysis model and show that the sparsity promoting function (also called penalty function) is the most important factor in the model. In order to demonstrate the effectiveness of our training approach, we apply our trained models to various classical image restoration problems. Numerical experiments show that our trained models clearly outperform existing analysis operator learning approaches and are on par with state-of-the-art image denoising algorithms. Our approach develops a framework that is intuitive to understand and easy to implement., Comment: 13 pages, 10 figures, accepted to IEEE Image Processing
Published: 2014
Full Text: View/download PDF

28. Deep Fundamental Matrix Estimation

Author: Ranftl, René, Koltun, Vladlen, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Ferrari, Vittorio, editor, Hebert, Martial, editor, Sminchisescu, Cristian, editor, and Weiss, Yair, editor
Published: 2018
Full Text: View/download PDF

29. Bilevel Optimization with Nonsmooth Lower Level Problems

Author: Ochs, Peter, Ranftl, René, Brox, Thomas, Pock, Thomas, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Aujol, Jean-François, editor, Nikolova, Mila, editor, and Papadakis, Nicolas, editor
Published: 2015
Full Text: View/download PDF

30. A Deep Variational Model for Image Segmentation

Author: Ranftl, René, Pock, Thomas, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Jiang, Xiaoyi, editor, Hornegger, Joachim, editor, and Koch, Reinhard, editor
Published: 2014
Full Text: View/download PDF

31. Non-local Total Generalized Variation for Optical Flow Estimation

Author: Ranftl, René, Bredies, Kristian, Pock, Thomas, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Kobsa, Alfred, Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Fleet, David, editor, Pajdla, Tomas, editor, Schiele, Bernt, editor, and Tuytelaars, Tinne, editor
Published: 2014
Full Text: View/download PDF

32. Revisiting Loss-Specific Training of Filter-Based MRFs for Image Restoration

Author: Chen, Yunjin, Pock, Thomas, Ranftl, René, Bischof, Horst, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Weickert, Joachim, editor, Hein, Matthias, editor, and Schiele, Bernt, editor
Published: 2013
Full Text: View/download PDF

33. Variational Shape from Light Field

Author: Heber, Stefan, Ranftl, Rene, Pock, Thomas, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Heyden, Anders, editor, Kahl, Fredrik, editor, Olsson, Carl, editor, Oskarsson, Magnus, editor, and Tai, Xue-Cheng, editor
Published: 2013
Full Text: View/download PDF

34. Minimizing TGV-Based Variational Models with Non-convex Data Terms

Author: Ranftl, Rene, Pock, Thomas, Bischof, Horst, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Kuijper, Arjan, editor, Bredies, Kristian, editor, Pock, Thomas, editor, and Bischof, Horst, editor
Published: 2013
Full Text: View/download PDF

35. Approximate Envelope Minimization for Curvature Regularity

Author: Heber, Stefan, Ranftl, Rene, Pock, Thomas, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Fusiello, Andrea, editor, Murino, Vittorio, editor, and Cucchiara, Rita, editor
Published: 2012
Full Text: View/download PDF

36. Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer

Author: Ranftl, Rene, primary, Lasinger, Katrin, additional, Hafner, David, additional, Schindler, Konrad, additional, and Koltun, Vladlen, additional
Published: 2022
Full Text: View/download PDF

37. Vision Transformers for Dense Prediction

Author: Ranftl, Rene, primary, Bochkovskiy, Alexey, additional, and Koltun, Vladlen, additional
Published: 2021
Full Text: View/download PDF

38. An Analysis of Super-Net Heuristics in Weight-Sharing NAS.

Author: Yu, Kaicheng, Ranftl, Rene, and Salzmann, Mathieu
Subjects: *HEURISTIC, *SEARCH algorithms, *NETWORK-attached storage, *COMPUTER architecture, *TASK analysis
Abstract: Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware. Existing methods in this space rely on a diverse set of heuristics to design and train the shared-weight backbone network, a.k.a. the super-net. Since heuristics substantially vary across different methods and have not been carefully studied, it is unclear to which extent they impact super-net training and hence the weight-sharing NAS algorithms. In this paper, we disentangle super-net training from the search algorithm, isolate 14 frequently-used training heuristics, and evaluate them over three benchmark search spaces. Our analysis uncovers that several commonly-used heuristics negatively impact the correlation between super-net and stand-alone performance, whereas simple, but often overlooked factors, such as proper hyper-parameter settings, are key to achieve strong performance. Equipped with this knowledge, we show that simple random search achieves competitive performance to complex state-of-the-art NAS algorithms when the super-net is properly trained. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

39. Deep Drone Acrobatics (Extended Abstract)

Author: Kaufmann, Elia, primary, Loquercio, Antonio, additional, Ranftl, Rene, additional, Müller, Matthias, additional, Koltun, Vladlen, additional, and Scaramuzza, Davide, additional
Published: 2021
Full Text: View/download PDF

40. High Speed and High Dynamic Range Video with an Event Camera

Author: Rebecq, Henri, primary, Ranftl, Rene, additional, Koltun, Vladlen, additional, and Scaramuzza, Davide, additional
Published: 2021
Full Text: View/download PDF

41. Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search

Author: Yu, Kaicheng, primary, Ranftl, Rene, additional, and Salzmann, Mathieu, additional
Published: 2021
Full Text: View/download PDF

42. Deep Drone Racing: From Simulation to Reality With Domain Randomization

Author: Loquercio, Antonio, Kaufmann, Elia, Ranftl, Rene, Dosovitskiy, Alexey, Koltun, Vladlen, and Scaramuzza, Davide
Abstract: Dynamically changing environments, unreliable state estimation, and operation under severe resource constraints are fundamental challenges that limit the deployment of small autonomous drones. We address these challenges in the context of autonomous, vision-based drone racing in dynamic environments. A racing drone must traverse a track with possibly moving gates at high speed. We enable this functionality by combining the performance of a state-of-the-art planning and control system with the perceptual awareness of a convolutional neural network. The resulting modular system is both platform independent and domain independent: it is trained in simulation and deployed on a physical quadrotor without any fine-tuning. The abundance of simulated data, generated via domain randomization, makes our system robust to changes of illumination and gate appearance. To the best of our knowledge, our approach is the first to demonstrate zero-shot sim-to-real transfer on the task of agile drone flight. We extensively test the precision and robustness of our system, both in simulation and on a physical platform, and show significant improvements over the state of the art.
Published: 2020

43. An Analysis of Super-Net Heuristics in Weight-Sharing NAS

Author: Yu, Kaicheng, primary, Ranftl, Rene, additional, and Salzmann, Mathieu, additional
Published: 2021
Full Text: View/download PDF

44. Approximate Envelope Minimization for Curvature Regularity

Author: Heber, Stefan, primary, Ranftl, Rene, additional, and Pock, Thomas, additional
Published: 2012
Full Text: View/download PDF

45. Deep Drone Acrobatics

Author: Kaufmann, Elia, primary, Loquercio, Antonio, additional, Ranftl, Rene, additional, Müller, Matthias, additional, Koltun, Vladlen, additional, and Scaramuzza, Davide, additional
Published: 2020
Full Text: View/download PDF

46. High-Dimensional Convolutional Networks for Geometric Pattern Recognition

Author: Choy, Christopher, primary, Lee, Junha, additional, Ranftl, Rene, additional, Park, Jaesik, additional, and Koltun, Vladlen, additional
Published: 2020
Full Text: View/download PDF

47. Safe Robot Navigation Via Multi-Modal Anomaly Detection

Author: Wellhausen, Lorenz, primary, Ranftl, Rene, additional, and Hutter, Marco, additional
Published: 2020
Full Text: View/download PDF

48. Deep Drone Racing: From Simulation to Reality With Domain Randomization

Author: Loquercio, Antonio, primary, Kaufmann, Elia, additional, Ranftl, Rene, additional, Dosovitskiy, Alexey, additional, Koltun, Vladlen, additional, and Scaramuzza, Davide, additional
Published: 2020
Full Text: View/download PDF

49. Beauty and the Beast: Optimal Methods Meet Learning for Drone Racing

Author: Kaufmann, Elia, Gehrig, Mathias, Foehn, Philipp, Ranftl, Rene, Dosovitskiy, Alexey, Koltun, Vladlen, and Scaramuzza, Davide
Abstract: Autonomous micro aerial vehicles still struggle with fast and agile maneuvers, dynamic environments, imperfect sensing, and state estimation drift. Autonomous drone racing brings these challenges to the fore. Human pilots can fly a previously unseen track after a handful of practice runs. In contrast, state-of-the-art autonomous navigation algorithms require either a precise metric map of the environment or a large amount of training data collected in the track of interest. To bridge this gap, we propose an approach that can fly a new track in a previously unseen environment without a precise map or expensive data collection. Our approach represents the global track layout with coarse gate locations, which can be easily estimated from a single demonstration flight. At test time, a convolutional network predicts the poses of the closest gates along with their uncertainty. These predictions are incorporated by an extended Kalman filter to maintain optimal maximum-a-posteriori estimates of gate locations. This allows the framework to cope with misleading high-variance estimates that could stem from poor observability or lack of visible gates. Given the estimated gate poses, we use model predictive control to quickly and accurately navigate through the track. We conduct extensive experiments in the physical world, demonstrating agile and robust flight through complex and diverse previously-unseen race tracks. The presented approach was used to win the IROS 2018 Autonomous Drone Race Competition, outracing the second-placing team by a factor of two.
Published: 2019

50. Feedback MPC for Torque-Controlled Legged Robots

Author: Grandia, Ruben, primary, Farshidian, Farbod, additional, Ranftl, Rene, additional, and Hutter, Marco, additional
Published: 2019
Full Text: View/download PDF
