Author: "Welling M" / Topic: fos: computer and information sciences - Searchworks@Jio Institute Digital Library Search Results

1. Complex-Valued Autoencoders for Object Discovery

Author: Löwe, S., Lippe, P., Rudolph, M., Welling, M., Amsterdam Machine Learning lab (IVI, FNWI), IvI Research (FNWI), Video & Image Sense Lab (IvI, FNWI), and Information Retrieval Lab (IvI, FNWI)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Object-centric representations form the basis of human perception, and enable us to reason about the world and to systematically generalize to new settings. Currently, most works on unsupervised object discovery focus on slot-based approaches, which explicitly separate the latent representations of individual objects. While the result is easily interpretable, it usually requires the design of involved architectures. In contrast to this, we propose a comparatively simple approach - the Complex AutoEncoder (CAE) - that creates distributed object-centric representations. Following a coding scheme theorized to underlie object representations in biological neurons, its complex-valued activations represent two messages: their magnitudes express the presence of a feature, while the relative phase differences between neurons express which features should be bound together to create joint object representations. In contrast to previous approaches using complex-valued activations for object discovery, we present a fully unsupervised approach that is trained end-to-end - resulting in significant improvements in performance and efficiency. Further, we show that the CAE achieves competitive or better unsupervised object discovery performance on simple multi-object datasets compared to a state-of-the-art slot-based approach while being up to 100 times faster to train., Comment: Published in Transactions on Machine Learning Research (TMLR)
Published: 2022

2. SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud Representation

Author: Su, Z. (Zhuo), Welling, M. (Max), Pietikainen, M. (Matti), and Liu, L. (Li)
Subjects: FOS: Computer and information sciences, Equivariant-networks, Network-binarization, Point-cloud-representation, Computer Vision and Pattern Recognition (cs.CV), Efficient-representation, Computer Science - Computer Vision and Pattern Recognition
Abstract: Efficiency and robustness are increasingly needed for applications on 3D point clouds, with the ubiquitous use of edge devices in scenarios like autonomous driving and robotics, which often demand real-time and reliable responses. The paper tackles the challenge by designing a general framework to construct 3D learning architectures with SO(3) equivariance and network binarization. However, a naive combination of equivariant networks and binarization either causes sub-optimal computational efficiency or geometric ambiguity. We propose to locate both scalar and vector features in our networks to avoid both cases. Precisely, the presence of scalar features makes the major part of the network binarizable, while vector features serve to retain rich structural information and ensure SO(3) equivariance. The proposed approach can be applied to general backbones like PointNet and DGCNN. Meanwhile, experiments on ModelNet40, ShapeNet, and the real-world dataset ScanObjectNN, demonstrated that the method achieves a great trade-off between efficiency, rotation robustness, and accuracy. The codes are available at https://github.com/zhuoinoulu/svnet., Accepted in 3DV 2022. 11 pages including the appendix
Published: 2022

3. Topographic VAEs learn Equivariant Capsules

Author: Keller, T.A., Welling, M., Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P.S., Wortman Vaughan, J., and Amsterdam Machine Learning lab (IVI, FNWI)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Science - Neural and Evolutionary Computing, Neural and Evolutionary Computing (cs.NE), Machine Learning (cs.LG)
Abstract: In this work we seek to bridge the concepts of topographic organization and equivariance in neural networks. To accomplish this, we introduce the Topographic VAE: a novel method for efficiently training deep generative models with topographically organized latent variables. We show that such a model indeed learns to organize its activations according to salient characteristics such as digit class, width, and style on MNIST. Furthermore, through topographic organization over time (i.e. temporal coherence), we demonstrate how predefined latent space transformation operators can be encouraged for observed transformed input sequences -- a primitive form of unsupervised learned equivariance. We demonstrate that this model successfully learns sets of approximately equivariant features (i.e. "capsules") directly from sequences and achieves higher likelihood on correspondingly transforming test sequences. Equivariance is verified quantitatively by measuring the approximate commutativity of the inference network and the sequence transformations. Finally, we demonstrate approximate equivariance to complex transformations, expanding upon the capabilities of existing group equivariant neural networks.
Published: 2022

4. Deep Policy Dynamic Programming for Vehicle Routing Problems

Author: Kool, W., van Hoof, H., Gromicho, J., Welling, M., Schaus, P., Amsterdam Machine Learning lab (IVI, FNWI), Business Analytics (ABS, FEB), and Operations Management (ABS, FEB)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical dynamic programming (DP) algorithms guarantee optimal solutions, but scale badly with the problem size. We propose Deep Policy Dynamic Programming (DPDP), which aims to combine the strengths of learned neural heuristics with those of DP algorithms. DPDP prioritizes and restricts the DP state space using a policy derived from a deep neural network, which is trained to predict edges from example solutions. We evaluate our framework on the travelling salesman problem (TSP), the vehicle routing problem (VRP) and TSP with time windows (TSPTW) and show that the neural policy improves the performance of (restricted) DP algorithms, making them competitive to strong alternatives such as LKH, while also outperforming most other 'neural approaches' for solving TSPs, VRPs and TSPTWs with 100 nodes., 21 pages
Published: 2022

5. E(n) Equivariant Graph Neural Networks

Author: Garcia Satorras, V., Hoogeboom, E., Welling, M., Amsterdam Machine Learning lab (IVI, FNWI), and Intelligent Sensory Information Systems (IVI, FNWI)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: This paper introduces a new model to learn graph neural networks equivariant to rotations, translations, reflections and permutations called E(n)-Equivariant Graph Neural Networks (EGNNs). In contrast with existing methods, our work does not require computationally expensive higher-order representations in intermediate layers while it still achieves competitive or better performance. In addition, whereas existing methods are limited to equivariance on 3 dimensional spaces, our model is easily scaled to higher-dimensional spaces. We demonstrate the effectiveness of our method on dynamical systems modelling, representation learning in graph autoencoders and predicting molecular properties.
Published: 2021

6. E(n) Equivariant Normalizing Flows

Author: Satorras, VG, Hoogeboom, E, Fuchs, FB, Posner, I, and Welling, M
Subjects: Chemical Physics (physics.chem-ph), FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Physics - Chemical Physics, FOS: Physical sciences, Machine Learning (stat.ML), Computer Science::Operating Systems, Machine Learning (cs.LG)
Abstract: This paper introduces a generative model equivariant to Euclidean symmetries: E(n) Equivariant Normalizing Flows (E-NFs). To construct E-NFs, we take the discriminative E(n) graph neural networks and integrate them as a differential equation to obtain an invertible equivariant function: a continuous-time normalizing flow. We demonstrate that E-NFs considerably outperform baselines and existing methods from the literature on particle systems such as DW4 and LJ13, and on molecules from QM9 in terms of log-likelihood. To the best of our knowledge, this is the first flow that jointly generates molecule features and positions in 3D., Comment: Accepted at Neural Information Processing Systems (NeurIPS 2021)
Published: 2021
Full Text: View/download PDF

7. Self Normalizing Flows

Author: Keller, T.A., Peters, J.W.T., Jaini, P., Hoogeboom, E., Forré, P., Welling, M., Amsterdam Machine Learning lab (IVI, FNWI), and Intelligent Sensory Information Systems (IVI, FNWI)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Computer Science - Neural and Evolutionary Computing, Machine Learning (stat.ML), Neural and Evolutionary Computing (cs.NE), Machine Learning (cs.LG)
Abstract: Efficient gradient computation of the Jacobian determinant term is a core problem in many machine learning settings, and especially so in the normalizing flow framework. Most proposed flow models therefore either restrict to a function class with easy evaluation of the Jacobian determinant, or an efficient estimator thereof. However, these restrictions limit the performance of such density models, frequently requiring significant depth to reach desired performance levels. In this work, we propose Self Normalizing Flows, a flexible framework for training normalizing flows by replacing expensive terms in the gradient by learned approximate inverses at each layer. This reduces the computational complexity of each layer's exact update from $\mathcal{O}(D^3)$ to $\mathcal{O}(D^2)$, allowing for the training of flow architectures which were otherwise computationally infeasible, while also providing efficient sampling. We show experimentally that such models are remarkably stable and optimize to similar data likelihood values as their exact gradient counterparts, while training more quickly and surpassing the performance of functionally constrained counterparts.
Published: 2020

8. Relaxed Quantization for Discretized Neural Networks

Author: Louizos, C., Reisser, M., Blankevoort, T., Gavves, E., Welling, M., Amsterdam Machine Learning lab (IVI, FNWI), and Intelligent Sensory Information Systems (IVI, FNWI)
Subjects: Optimization, FOS: Computer and information sciences, Computer Science - Machine Learning, Stochastic systems, Gradient descent, Differentiability, Large models, Machine Learning (stat.ML), ComputerApplications_COMPUTERSINOTHERSYSTEMS, Machine Learning (cs.LG), Gradient methods, Loss of performance, Statistics - Machine Learning, Continuous distribution, Resourceconstrained devices, Computer Science::Databases, Gradient-based optimization
Abstract: Neural network quantization has become an important research area due to its great impact on deployment of large models on resource constrained devices. In order to train networks that can be effectively discretized without loss of performance, we introduce a differentiable quantization procedure. Differentiability can be achieved by transforming continuous distributions over the weights and activations of the network to categorical distributions over the quantization grid. These are subsequently relaxed to continuous surrogates that can allow for efficient gradient-based optimization. We further show that stochastic rounding can be seen as a special case of the proposed approach and that under this formulation the quantization grid itself can also be optimized with gradient descent. We experimentally validate the performance of our method on MNIST, CIFAR 10 and Imagenet classification. © 7th International Conference on Learning Representations, ICLR 2019. All Rights Reserved.
Published: 2019

9. DIVA: Domain Invariant Variational Autoencoders

Author: Ilse, M., Tomczak, J.M., Louizos, C., and Welling, M.
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Learning systems, Auto encoders, Machine Learning (stat.ML), Machine Learning (cs.LG), Benchmarking, ComputingMethodologies_PATTERNRECOGNITION, Statistics - Machine Learning, In-field, Medical imaging, Unlabeled data, Generative model, Labeled data
Abstract: We consider the problem of domain generalization, namely, how to learn representations given data from a set of domains that generalize to data from a previously unseen domain. We propose the Domain Invariant Variational Autoencoder (DIVA), a generative model that tackles this problem by learning three independent latent subspaces, one for the domain, one for the class, and one for any residual variations. We highlight that due to the generative nature of our model we can also incorporate unlabeled data from known or previously unseen domains. To the best of our knowledge this has not been done before in a domain generalization setting. This property is highly desirable in fields like medical imaging where labeled data is scarce. We experimentally evaluate our model on the rotated MNIST benchmark and a malaria cell images dataset where we show that (i) the learned subspaces are indeed complementary to each other, (ii) we improve upon recent works on this task and (iii) incorporating unlabelled data can boost the performance even further., Comment: Code available at https://github.com/AMLab-Amsterdam/DIVA
Published: 2019
Full Text: View/download PDF

10. Emerging Convolutions for Generative Normalizing Flows

Author: Hoogeboom, E., van den Berg, R., Welling, M., and Amsterdam Machine Learning lab (IVI, FNWI)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Generative flows are attractive because they admit exact likelihood optimization and efficient image synthesis. Recently, Kingma & Dhariwal (2018) demonstrated with Glow that generative flows are capable of generating high quality images. We generalize the 1 x 1 convolutions proposed in Glow to invertible d x d convolutions, which are more flexible since they operate on both channel and spatial axes. We propose two methods to produce invertible convolutions that have receptive fields identical to standard convolutions: Emerging convolutions are obtained by chaining specific autoregressive convolutions, and periodic convolutions are decoupled in the frequency domain. Our experiments show that the flexibility of d x d convolutions significantly improves the performance of generative flow models on galaxy images, CIFAR10 and ImageNet., Accepted at International Conference on Machine Learning (ICML) 2019
Published: 2019
Full Text: View/download PDF

11. Sinkhorn AutoEncoders

Author: Patrini, G., Bhargav, S., Den Berg, R., Welling, M., Forré, P., Genewein, T., Carioni, M., and Frank Nielsen
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Optimal transport offers an alternative to maximum likelihood for learning generative autoencoding models. We show that minimizing the p-Wasserstein distance between the generator and the true data distribution is equivalent to the unconstrained min-min optimization of the p-Wasserstein distance between the encoder aggregated posterior and the prior in latent space, plus a reconstruction error. We also identify the role of its trade-off hyperparameter as the capacity of the generator: its Lipschitz constant. Moreover, we prove that optimizing the encoder over any class of universal approximators, such as deterministic neural networks, is enough to come arbitrarily close to the optimum. We therefore advertise this framework, which holds for any metric space and prior, as a sweet-spot of current generative autoencoding objectives. We then introduce the Sinkhorn auto-encoder (SAE), which approximates and minimizes the p-Wasserstein distance in latent space via backprogation through the Sinkhorn algorithm. SAE directly works on samples, i.e. it models the aggregated posterior as an implicit distribution, with no need for a reparameterization trick for gradients estimations. SAE is thus able to work with different metric spaces and priors with minimal adaptations. We demonstrate the flexibility of SAE on latent spaces with different geometries and priors and compare with other methods on benchmark data sets., Comment: Accepted for oral presentation at UAI19
Published: 2018
Full Text: View/download PDF

12. Multiplicative Normalizing Flows for Variational Bayesian Neural Networks

Author: Louizos, C, Welling, M., and Amsterdam Machine Learning lab (IVI, FNWI)
Subjects: FOS: Computer and information sciences, TS - Technical Sciences, Learning systems, Variational bayesian, Bayesian neural networks, Multiplicative noise, Predictive accuracy, Defence Research, Machine Learning (stat.ML), Lower bounds, Defence, Safety and Security, Auxiliary random variable, Machine Learning (cs.LG), Predictive uncertainty, Computer Science - Learning, Mean field, Statistics - Machine Learning, II - Intelligent Imaging, 2015 Observation, Weapon & Protection Systems, Neural networks
Abstract: We reinterpret multiplicative noise in neural networks as auxiliary random variables that augment the approximate posterior in a variational setting for Bayesian neural networks. We show that through this interpretation it is both efficient and straightforward to improve the approximation by employing normalizing flows while still allowing for local reparametrizations and a tractable lower bound. In experiments we show that with this new approximation we can significantly improve upon classical mean field for Bayesian neural networks on both predictive accuracy as well as predictive uncertainty., Comment: Appearing at the International Conference on Machine Learning (ICML) 2017
Published: 2017
Full Text: View/download PDF

13. Private Topic Modeling

Author: Park, M., Foulds, J., Chaudhuri, K., Welling, M., and Amsterdam Machine Learning lab (IVI, FNWI)
Subjects: FOS: Computer and information sciences, Computer Science - Cryptography and Security, Statistics - Machine Learning, Machine Learning (stat.ML), Cryptography and Security (cs.CR)
Abstract: We develop a privatised stochastic variational inference method for Latent Dirichlet Allocation (LDA). The iterative nature of stochastic variational inference presents challenges: multiple iterations are required to obtain accurate posterior distributions, yet each iteration increases the amount of noise that must be added to achieve a reasonable degree of privacy. We propose a practical algorithm that overcomes this challenge by combining: (1) an improved composition method for differential privacy, called the moments accountant, which provides a tight bound on the privacy cost of multiple variational inference iterations and thus significantly decreases the amount of additive noise; and (2) privacy amplification resulting from subsampling of large-scale data. Focusing on conjugate exponential family models, in our private variational inference, all the posterior distributions will be privatised by simply perturbing expected sufficient statistics. Using Wikipedia data, we illustrate the effectiveness of our algorithm for large-scale data.
Published: 2016

14. On smoothing and inference for topic models

Author: Asuncion, A., Welling, M., Padhraic Smyth, and Teh, Y. W.
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling high-dimensional sparse count data. Various learning algorithms have been developed in recent years, including collapsed Gibbs sampling, variational inference, and maximum a posteriori estimation, and this variety motivates the need for careful empirical comparisons. In this paper, we highlight the close connections between these approaches. We find that the main differences are attributable to the amount of smoothing applied to the counts. When the hyperparameters are optimized, the differences in performance among the algorithms diminish significantly. The ability of these algorithms to achieve solutions of comparable accuracy gives us the freedom to select computationally efficient approaches. Using the insights gained from this comparative study, we show how accurate topic models can be learned in several seconds on text corpora with thousands of documents., Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)
Published: 2016

15. Scalable MCMC for Mixed Membership Stochastic Blockmodels

Author: Li, W., Ahn, S., Welling, M., and Amsterdam Machine Learning lab (IVI, FNWI)
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: We propose a stochastic gradient Markov chain Monte Carlo (SG-MCMC) algorithm for scalable inference in mixed-membership stochastic blockmodels (MMSB). Our algorithm is based on the stochastic gradient Riemannian Langevin sampler and achieves both faster speed and higher accuracy at every iteration than the current state-of-the-art algorithm based on stochastic variational inference. In addition we develop an approximation that can handle models that entertain a very large number of communities. The experimental results show that SG-MCMC strictly dominates competing algorithms in all cases., Comment: 9 pages, 18 figures
Published: 2015
Full Text: View/download PDF

16. Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation

Author: Foulds, J., Boyles, L., DuBois, C., Smyth, P., Welling, M., Dhillon, I.S., Koren, Y., Ghani, R., Senator, T.E., Bradley, P., Parekh, R., He, J., Grossman, R.L., Uthurusamy, R., and Amsterdam Machine Learning lab (IVI, FNWI)
Subjects: Text corpus, Topic model, FOS: Computer and information sciences, business.industry, Computer science, Inference, Bayesian inference, Machine learning, computer.software_genre, Latent Dirichlet allocation, Dynamic topic model, Machine Learning (cs.LG), Computer Science - Learning, symbols.namesake, ComputingMethodologies_PATTERNRECOGNITION, Variational message passing, symbols, Artificial intelligence, business, Representation (mathematics), computer
Abstract: In the internet era there has been an explosion in the amount of digital text information available, leading to difficulties of scale for traditional inference algorithms for topic models. Recent advances in stochastic variational inference algorithms for latent Dirichlet allocation (LDA) have made it feasible to learn topic models on large-scale corpora, but these methods do not currently take full advantage of the collapsed representation of the model. We propose a stochastic algorithm for collapsed variational Bayesian inference for LDA, which is simpler and more efficient than the state of the art method. We show connections between collapsed variational Bayesian inference and MAP estimation for LDA, and leverage these connections to prove convergence properties of the proposed algorithm. In experiments on large-scale text corpora, the algorithm was found to converge faster and often to a better solution than the previous method. Human-subject experiments also demonstrated that the method can learn coherent topics in seconds on small corpora, facilitating the use of topic models in interactive document analysis software.
Published: 2013
Full Text: View/download PDF

17. Herded Gibbs Sampling

Author: Chen, Y., Bornn, L., de Freitas, N., Eskelin, M., Fang, J., Welling, M., and Amsterdam Machine Learning lab (IVI, FNWI)
Subjects: Computer Science::Multiagent Systems, Physics::Physics and Society, FOS: Computer and information sciences, Computer Science - Learning, Nonlinear Sciences::Adaptation and Self-Organizing Systems, Statistics - Machine Learning, Machine Learning (stat.ML), Statistics - Computation, Computation (stat.CO), Statistics::Computation, Machine Learning (cs.LG)
Abstract: The Gibbs sampler is one of the most popular algorithms for inference in statistical models. In this paper, we introduce a herding variant of this algorithm, called herded Gibbs, that is entirely deterministic. We prove that herded Gibbs has an $O(1/T)$ convergence rate for models with independent variables and for fully connected probabilistic graphical models. Herded Gibbs is shown to outperform Gibbs in the tasks of image denoising with MRFs and named entity recognition with CRFs. However, the convergence for herded Gibbs for sparsely connected probabilistic graphical models is still an open problem., Comment: 19 pages, including the appendix. Submission for ICLR 2013
Published: 2013
Full Text: View/download PDF

18. Structured Region Graphs: Morphing EP into GBP

Author: Welling, M, Minka, TP, and Teh, YW
Subjects: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence
Abstract: GBP and EP are two successful algorithms for approximate probabilistic inference, which are based on different approximation strategies. An open problem in both algorithms has been how to choose an appropriate approximation structure. We introduce 'structured region graphs', a formalism which marries these two strategies, reveals a deep connection between them, and suggests how to choose good approximation structures. In this formalism, each region has an internal structure which defines an exponential family, whose sufficient statistics must be matched by the parent region. Reduction operators on these structures allow conversion between EP and GBP free energies. Thus it is revealed that all EP approximations on discrete variables are special cases of GBP, and conversely that some wellknown GBP approximations, such as overlapping squares, are special cases of EP. Furthermore, region graphs derived from EP have a number of good structural properties, including maxent-normality and overall counting number of one. The result is a convenient framework for producing high-quality approximations with a user-adjustable level of complexity, Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)
Published: 2012

19. Gibbs Sampling for (Coupled) Infinite Mixture Models in the Stick Breaking Representation

Author: Porteous, I., Alexander Ihler, Smyth, P., and Welling, M.
Subjects: Methodology (stat.ME), FOS: Computer and information sciences, Computer Science - Learning, Statistics - Machine Learning, stat.ME, cs.LG, Machine Learning (stat.ML), stat.ML, Statistics - Methodology, Machine Learning (cs.LG)
Abstract: Nonparametric Bayesian approaches to clustering, information retrieval, language modeling and object recognition have recently shown great promise as a new paradigm for unsupervised data analysis. Most contributions have focused on the Dirichlet process mixture models or extensions thereof for which efficient Gibbs samplers exist. In this paper we explore Gibbs samplers for infinite complexity mixture models in the stick breaking representation. The advantage of this representation is improved modeling flexibility. For instance, one can design the prior distribution over cluster sizes or couple multiple infinite mixture models (e.g. over time) at the level of their parameters (i.e. the dependent Dirichlet process model). However, Gibbs samplers for infinite mixture models (as recently introduced in the statistics literature) seem to mix poorly over cluster labels. Among others issues, this can have the adverse effect that labels for the same cluster in coupled mixture models are mixed up. We introduce additional moves in these samplers to improve mixing over cluster labels and to bring clusters into correspondence. An application to modeling of storm trajectories is used to illustrate these ideas., Comment: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)
Published: 2012

20. Fast Optical Flow Using Dense Inverse Search

Author: Luc Van Gool, Radu Timofte, Dengxin Dai, Till Kroeger, Leibe, B, Matas, J, Sebe, N, and Welling, M
Subjects: FOS: Computer and information sciences, Machine vision, business.industry, Computer Vision and Pattern Recognition (cs.CV), Computation, Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Optical flow, Inverse, 020207 software engineering, 02 engineering and technology, PSI_VISICS, Computer Science - Robotics, Temporal resolution, Displacement field, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, Focus (optics), business, Robotics (cs.RO), Algorithm, Time complexity, Mathematics
Abstract: Most recent works in optical flow extraction focus on the accuracy and neglect the time complexity. However, in real-life visual applications, such as tracking, activity detection and recognition, the time complexity is critical. We propose a solution with very low time complexity and competitive accuracy for the computation of dense optical flow. It consists of three parts: 1) inverse search for patch correspondences; 2) dense displacement field creation through patch aggregation along multiple scales; 3) variational refinement. At the core of our Dense Inverse Search-based method (DIS) is the efficient search of correspondences inspired by the inverse compositional image alignment proposed by Baker and Matthews in 2001. DIS is competitive on standard optical flow benchmarks with large displacements. DIS runs at 300Hz up to 600Hz on a single CPU core, reaching the temporal resolution of human's biological vision system. It is order(s) of magnitude faster than state-of-the-art methods in the same range of accuracy, making DIS ideal for visual applications., 9 pages main paper + 16 pages supplementary material
Published: 2016

21. Convolutional Oriented Boundaries

Author: Kevis-Kokitsi Maninis, Pablo Arbeláez, Luc Van Gool, Jordi Pont-Tuset, Leibe, B, Matas, J, Sebe, N, and Welling, M
Subjects: FOS: Computer and information sciences, Contextual image classification, Computer science, business.industry, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020207 software engineering, Pattern recognition, 02 engineering and technology, Pascal (programming language), PSI_VISICS, Convolutional neural network, Computer Science::Computer Vision and Pattern Recognition, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Segmentation, Artificial intelligence, business, computer, computer.programming_language
Abstract: We present Convolutional Oriented Boundaries (COB), which produces multiscale oriented contours and region hierarchies starting from generic image classification Convolutional Neural Networks (CNNs). COB is computationally efficient, because it requires a single CNN forward pass for contour detection and it uses a novel sparse boundary representation for hierarchical segmentation; it gives a significant leap in performance over the state-of-the-art, and it generalizes very well to unseen categories and datasets. Particularly, we show that learning to estimate not only contour strength but also orientation provides more accurate results. We perform extensive experiments on BSDS, PASCAL Context, PASCAL Segmentation, and MS-COCO, showing that COB provides state-of-the-art contours, region hierarchies, and object proposals in all datasets., Comment: ECCV 2016 Camera Ready
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

21 results on '"Welling M"'

1. Complex-Valued Autoencoders for Object Discovery

2. SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud Representation

3. Topographic VAEs learn Equivariant Capsules

4. Deep Policy Dynamic Programming for Vehicle Routing Problems

5. E(n) Equivariant Graph Neural Networks

6. E(n) Equivariant Normalizing Flows

7. Self Normalizing Flows

8. Relaxed Quantization for Discretized Neural Networks

9. DIVA: Domain Invariant Variational Autoencoders

10. Emerging Convolutions for Generative Normalizing Flows

11. Sinkhorn AutoEncoders

12. Multiplicative Normalizing Flows for Variational Bayesian Neural Networks

13. Private Topic Modeling

14. On smoothing and inference for topic models

15. Scalable MCMC for Mixed Membership Stochastic Blockmodels

16. Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation

17. Herded Gibbs Sampling

18. Structured Region Graphs: Morphing EP into GBP

19. Gibbs Sampling for (Coupled) Infinite Mixture Models in the Stick Breaking Representation

20. Fast Optical Flow Using Dense Inverse Search

21. Convolutional Oriented Boundaries

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

Publisher

21 results on '"Welling M"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources