Author: "Hengel, A. van den" / Language: undetermined - Searchworks@Jio Institute Digital Library Search Results

1. RanPAC: Random Projections and Pre-trained Models for Continual Learning

Author: McDonnell, Mark D., Gong, Dong, Parvaneh, Amin, Abbasnejad, Ehsan, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Continual learning (CL) aims to incrementally learn different tasks (such as classification) in a non-stationary data stream without forgetting old ones. Most CL works focus on tackling catastrophic forgetting under a learning-from-scratch paradigm. However, with the increasing prominence of foundation models, pre-trained models equipped with informative representations have become available for various downstream requirements. Several CL methods based on pre-trained models have been explored, either utilizing pre-extracted features directly (which makes bridging distribution gaps challenging) or incorporating adaptors (which may be subject to forgetting). In this paper, we propose a concise and effective approach for CL with pre-trained models. Given that forgetting occurs during parameter updating, we contemplate an alternative approach that exploits training-free random projectors and class-prototype accumulation, which thus bypasses the issue. Specifically, we inject a frozen Random Projection layer with nonlinear activation between the pre-trained model's feature representations and output head, which captures interactions between features with expanded dimensionality, providing enhanced linear separability for class-prototype-based CL. We also demonstrate the importance of decorrelating the class-prototypes to reduce the distribution disparity when using pre-trained representations. These techniques prove to be effective and circumvent the problem of forgetting for both class- and domain-incremental continual learning. Compared to previous methods applied to pre-trained ViT-B/16 models, we reduce final error rates by between 10% and 62% on seven class-incremental benchmark datasets, despite not using any rehearsal memory. We conclude that the full potential of pre-trained models for simple, effective, and fast continual learning has not hitherto been fully tapped. Comment: 30 pages, 11 figures
Published: 2023
Full Text: View/download PDF

2. Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems

Author: Shu, Yangyang, Hengel, Anton van den, and Liu, Lingqiao
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Self-supervised learning (SSL) strategies have demonstrated remarkable performance in various recognition tasks. However, both our preliminary investigation and recent studies suggest that they may be less effective in learning representations for fine-grained visual recognition (FGVR) since many features helpful for optimizing SSL objectives are not suitable for characterizing the subtle differences in FGVR. To overcome this issue, we propose learning an additional screening mechanism to identify discriminative clues commonly seen across instances and classes, dubbed common rationales in this paper. Intuitively, common rationales tend to correspond to the discriminative patterns from the key parts of foreground objects. We show that a common rationale detector can be learned by simply exploiting the GradCAM induced from the SSL objective without using any pre-trained object parts or saliency detectors, so that it can be seamlessly integrated into the existing SSL process. Specifically, we fit the GradCAM with a branch with limited fitting capacity, which allows the branch to capture the common rationales and discard the less common discriminative patterns. At the test stage, the branch generates a set of spatial weights to selectively aggregate features representing an instance. Extensive experimental results on four visual tasks demonstrate that the proposed method can lead to a significant improvement in different evaluation settings. Comment: To Appear at CVPR 2023
Published: 2023
Full Text: View/download PDF

3. BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling

Author: Ramasinghe, Sameera, Shevchenko, Violetta, Avraham, Gil, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: Reasoning about the 3D structure of a non-rigid dynamic scene from a single moving camera is an under-constrained problem. Inspired by the remarkable progress of neural radiance fields (NeRFs) in photo-realistic novel view synthesis of static scenes, extensions have been proposed for dynamic settings. These methods heavily rely on neural priors in order to regularize the problem. In this work, we take a step back and reinvestigate how current implementations may entail deleterious effects, including limited expressiveness, entanglement of light and density fields, and sub-optimal motion localization. As a remedy, we advocate for a bridge between classic non-rigid structure-from-motion (NRSfM) and NeRF, enabling the well-studied priors of the former to constrain the latter. To this end, we propose a framework that factorizes time and space by formulating a scene as a composition of bandlimited, high-dimensional signals. We demonstrate compelling results across complex dynamic scenes that involve changes in lighting, texture and long-range dynamics.
Published: 2023
Full Text: View/download PDF

4. Retrieval Augmented Classification for Long-Tail Visual Recognition

Author: Long, Alexander, Yin, Wei, Ajanthan, Thalaiyasingam, Nguyen, Vu, Purkait, Pulak, Garg, Ravi, Blair, Alan, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: We introduce Retrieval Augmented Classification (RAC), a generic approach to augmenting standard image classification pipelines with an explicit retrieval module. RAC consists of a standard base image encoder fused with a parallel retrieval branch that queries a non-parametric external memory of pre-encoded images and associated text snippets. We apply RAC to the problem of long-tail classification and demonstrate a significant improvement over previous state-of-the-art on Places365-LT and iNaturalist-2018 (14.5% and 6.7% respectively), despite using only the training datasets themselves as the external information source. We demonstrate that RAC's retrieval module, without prompting, learns a high level of accuracy on tail classes. This, in turn, frees the base encoder to focus on common classes, and improve its performance thereon. RAC represents an alternative approach to utilizing large, pretrained models without requiring fine-tuning, as well as a first step towards more effectively making use of external memory within common computer vision architectures.
Published: 2022
Full Text: View/download PDF

5. Identifying Weight-Variant Latent Causal Models

Author: Liu, Yuhang, Zhang, Zhen, Gong, Dong, Gong, Mingming, Huang, Biwei, Hengel, Anton van den, Zhang, Kun, and Shi, Javen Qinfeng
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: The task of causal representation learning aims to uncover latent higher-level causal representations that affect lower-level observations. Identifying true latent causal representations from observed data, while allowing instantaneous causal relations among latent variables, remains a challenge, however. To this end, we start from the analysis of three intrinsic properties in identifying latent space from observations: transitivity, permutation indeterminacy, and scaling indeterminacy. We find that transitivity plays a key role in impeding the identifiability of latent causal representations. To address the unidentifiability issue due to transitivity, we introduce a novel identifiability condition where the underlying latent causal model satisfies a linear-Gaussian model, in which the causal coefficients and the distribution of Gaussian noise are modulated by an additional observed variable. Under some mild assumptions, we can show that the latent causal representations can be identified up to trivial permutation and scaling. Furthermore, based on this theoretical result, we propose a novel method, termed Structural caUsAl Variational autoEncoder, which directly learns latent causal representations and causal relationships among them, together with the mapping from the latent causal variables to the observed ones. We show that the proposed method learns the true parameters asymptotically. Experimental results on synthetic and real data demonstrate the identifiability and consistency results and the efficacy of the proposed method in learning latent causal representations.
Published: 2022
Full Text: View/download PDF

6. Confident Sinkhorn Allocation for Pseudo-Labeling

Author: Nguyen, Vu, Husain, Hisham, Farfade, Sachin, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Machine Learning (cs.LG)
Abstract: Semi-supervised learning is a critical tool in reducing machine learning's dependence on labeled data. It has been successfully applied to structured data, such as images and natural language, by exploiting the inherent spatial and semantic structure therein with pretrained models or data augmentation. These methods are not applicable, however, when the data does not have the appropriate structure, or invariances. Due to their simplicity, pseudo-labeling (PL) methods can be widely used without any domain assumptions. However, PL is sensitive to a threshold and can perform poorly if wrong assignments are made due to overconfidence. This paper studies theoretically the role of uncertainty in pseudo-labeling and proposes Confident Sinkhorn Allocation (CSA), which identifies the best pseudo-label allocation via optimal transport to only samples with high confidence scores. CSA outperforms the current state-of-the-art in this practically important area of semi-supervised learning. Additionally, we propose to use the Integral Probability Metrics to extend and improve the existing PAC-Bayes bound which relies on the Kullback-Leibler (KL) divergence, for ensemble models. Our code is publicly available at https://github.com/amzn/confident-sinkhorn-allocation. Comment: Code https://github.com/amzn/confident-sinkhorn-allocation
Published: 2022
Full Text: View/download PDF

7. Poseur: Direct Human Pose Regression with Transformers

Author: Mao, Weian, Ge, Yongtao, Shen, Chunhua, Tian, Zhi, Wang, Xinlong, Wang, Zhibin, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a direct, regression-based approach to 2D human pose estimation from single images. We formulate the problem as a sequence prediction task, which we solve using a Transformer network. This network directly learns a regression mapping from images to the keypoint coordinates, without resorting to intermediate representations such as heatmaps. This approach avoids much of the complexity associated with heatmap-based approaches. To overcome the feature misalignment issues of previous regression-based methods, we propose an attention mechanism that adaptively attends to the features that are most relevant to the target keypoints, considerably improving the accuracy. Importantly, our framework is end-to-end differentiable, and naturally learns to exploit the dependencies between keypoints. Experiments on MS-COCO and MPII, two predominant pose-estimation datasets, demonstrate that our method significantly improves upon the state-of-the-art in regression-based pose estimation. More notably, ours is the first regression-based approach to perform favorably compared to the best heatmap-based pose estimation methods. Comment: Accepted to Proc. Eur. Conf. Comp. Vision (ECCV) 2022
Published: 2022
Full Text: View/download PDF

8. PointInst3D: Segmenting 3D Instances by Points

Author: He, Tong, Yin, Wei, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: The current state-of-the-art methods in 3D instance segmentation typically involve a clustering step, despite the tendency towards heuristics, greedy algorithms, and a lack of robustness to the changes in data statistics. In contrast, we propose a fully-convolutional 3D point cloud instance segmentation method that works in a per-point prediction fashion. In doing so it avoids the challenges that clustering-based methods face: introducing dependencies among different tasks of the model. We find the key to its success is assigning a suitable target to each sampled point. Instead of the commonly used static or distance-based assignment strategies, we propose to use an Optimal Transport approach to optimally assign target masks to the sampled points according to the dynamic matching costs. Our approach achieves promising results on both ScanNet and S3DIS benchmarks. The proposed approach removes intertask dependencies and thus represents a simpler and more flexible 3D instance segmentation framework than other competing methods, while achieving improved segmentation accuracy. Comment: Accepted by ECCV22. Code and model will be released at https://github.com/tonghe90/PointInst3D
Published: 2022
Full Text: View/download PDF

9. Deep Learning for Hate Speech Detection: A Comparative Study

Author: Malik, Jitendra Singh, Pang, Guansong, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computation and Language (cs.CL), Information Retrieval (cs.IR), Computer Science - Information Retrieval, Machine Learning (cs.LG)
Abstract: Automated hate speech detection is an important tool in combating the spread of hate speech, particularly in social media. Numerous methods have been developed for the task, including a recent proliferation of deep-learning based approaches. A variety of datasets have also been developed, exemplifying various manifestations of the hate-speech detection problem. We present here a large-scale empirical comparison of deep and shallow hate-speech detection methods, mediated through the three most commonly used datasets. Our goal is to illuminate progress in the area, and identify strengths and weaknesses in the current state-of-the-art. We particularly focus our analysis on measures of practical performance, including detection accuracy, computational efficiency, capability in using pre-trained models, and domain generalization. In doing so we aim to provide guidance as to the use of hate-speech detection in practice, quantify the state-of-the-art, and identify future research directions. Code and dataset are available at https://github.com/jmjmalik22/Hate-Speech-Detection. Comment: 17 pages, 4 figures, and 6 tables
Published: 2022
Full Text: View/download PDF

10. Distributionally Robust Bayesian Optimization with $ϕ$-divergences

Author: Husain, Hisham, Nguyen, Vu, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Optimization and Control (math.OC), FOS: Mathematics, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: The study of robustness has received much attention due to its inevitability in data-driven settings where many systems face uncertainty. One such example of concern is Bayesian Optimization (BO), where uncertainty is multi-faceted, yet there only exists a limited number of works dedicated to this direction. In particular, there is the work of Kirschner et al. (2020), which bridges the existing literature of Distributionally Robust Optimization (DRO) by casting the BO problem from the lens of DRO. While this work is pioneering, it admittedly suffers from various practical shortcomings such as finite contexts assumptions, leaving behind the main question: can one devise a computationally tractable algorithm for solving this DRO-BO problem? In this work, we tackle this question to a large degree of generality by considering robustness against data-shift in $ϕ$-divergences, which subsumes many popular choices, such as the $χ^2$-divergence, Total Variation, and the extant Kullback-Leibler (KL) divergence. We show that the DRO-BO problem in this setting is equivalent to a finite-dimensional optimization problem which, even in the continuous context setting, can be easily implemented with provable sublinear regret bounds. We then show experimentally that our method surpasses existing methods, attesting to the theoretical results. Comment: 21 pages
Published: 2022
Full Text: View/download PDF

11. Active Learning by Feature Mixing

Author: Parvaneh, Amin, Abbasnejad, Ehsan, Teney, Damien, Haffari, Reza, Hengel, Anton van den, and Shi, Javen Qinfeng
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: The promise of active learning (AL) is to reduce labelling costs by selecting the most valuable examples to annotate from a pool of unlabelled data. Identifying these examples is especially challenging with high-dimensional data (e.g. images, videos) and in low-data regimes. In this paper, we propose a novel method for batch AL called ALFA-Mix. We identify unlabelled instances with sufficiently-distinct features by seeking inconsistencies in predictions resulting from interventions on their representations. We construct interpolations between representations of labelled and unlabelled instances then examine the predicted labels. We show that inconsistencies in these predictions help discover features that the model is unable to recognise in the unlabelled instances. We derive an efficient implementation based on a closed-form solution to the optimal interpolation causing changes in predictions. Our method outperforms all recent AL approaches in 30 different settings on 12 benchmarks of images, videos, and non-visual data. The improvements are especially significant in low-data regimes and on self-trained vision transformers, where ALFA-Mix outperforms the state-of-the-art in 59% and 43% of the experiments respectively. Comment: CVPR 2022
Published: 2022
Full Text: View/download PDF

12. The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation

Author: Qi, Yuankai, Pan, Zizheng, Hong, Yicong, Yang, Ming-Hsuan, Hengel, Anton van den, and Wu, Qi
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Computation and Language (cs.CL)
Abstract: Vision-and-Language Navigation (VLN) requires an agent to find a path to a remote location on the basis of natural-language instructions and a set of photo-realistic panoramas. Most existing methods take the words in the instructions and the discrete views of each panorama as the minimal unit of encoding. However, this requires a model to match different nouns (e.g., TV, table) against the same input view feature. In this work, we propose an object-informed sequential BERT to encode visual perceptions and linguistic instructions at the same fine-grained level, namely objects and words. Our sequential BERT also enables the visual-textual clues to be interpreted in light of the temporal context, which is crucial to multi-round VLN tasks. Additionally, we enable the model to identify the relative direction (e.g., left/right/front/back) of each navigable location and the room type (e.g., bedroom, kitchen) of its current and final navigation goal, as such information is widely mentioned in instructions implying the desired next and final locations. We thus enable the model to know-where the objects lie in the images, and to know-where they stand in the scene. Extensive experiments demonstrate the effectiveness compared against several state-of-the-art methods on three indoor VLN tasks: REVERIE, NDH, and R2R. Project repository: https://github.com/YuankaiQi/ORIST. Comment: Original title: Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Published: 2021
Full Text: View/download PDF

13. Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization

Author: Teney, Damien, Abbasnejad, Ehsan, Lucey, Simon, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Neural networks trained with SGD were recently shown to rely preferentially on linearly-predictive features and can ignore complex, equally-predictive ones. This simplicity bias can explain their lack of robustness out of distribution (OOD). The more complex the task to learn, the more likely it is that statistical artifacts (i.e. selection biases, spurious correlations) are simpler than the mechanisms to learn. We demonstrate that the simplicity bias can be mitigated and OOD generalization improved. We train a set of similar models to fit the data in different ways using a penalty on the alignment of their input gradients. We show theoretically and empirically that this induces the learning of more complex predictive patterns. OOD generalization fundamentally requires information beyond i.i.d. examples, such as multiple training environments, counterfactual examples, or other side information. Our approach shows that we can defer this requirement to an independent model selection stage. We obtain SOTA results in visual recognition on biased data and generalization across visual domains. The method - the first to evade the simplicity bias - highlights the need for a better understanding and control of inductive biases in deep learning. Comment: CVPR 2022
Published: 2021
Full Text: View/download PDF

14. Learning for Visual Navigation by Imagining the Success

Author: Moghaddam, Mahdi Kazemi, Abbasnejad, Ehsan, Wu, Qi, Shi, Javen, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: Visual navigation is often cast as a reinforcement learning (RL) problem. Current methods typically result in a suboptimal policy that learns general obstacle avoidance and search behaviours. For example, in the target-object navigation setting, the policies learnt by traditional methods often fail to complete the task, even when the target is clearly within reach from a human perspective. In order to address this issue, we propose to learn to imagine a latent representation of the successful (sub-)goal state. To do so, we have developed a module which we call Foresight Imagination (ForeSIT). ForeSIT is trained to imagine the recurrent latent representation of a future state that leads to success, e.g. either a sub-goal state that is important to reach before the target, or the goal state itself. By conditioning the policy on the generated imagination during training, our agent learns how to use this imagination to achieve its goal robustly. Our agent is able to imagine what the (sub-)goal state may look like (in the latent space) and can learn to navigate towards that state. We develop an efficient learning algorithm to train ForeSIT in an on-policy manner and integrate it into our RL objective. The integration is not trivial due to the constantly evolving state representation shared between the imagination and the policy. We observe empirically that our method outperforms the state-of-the-art methods by a large margin in the commonly accepted benchmark AI2THOR environment. Our method can be readily integrated or added to other model-free RL navigation frameworks.
Published: 2021
Full Text: View/download PDF

15. Deep Depression Prediction on Longitudinal Data via Joint Anomaly Ranking and Classification

Author: Pang, Guansong, Pham, Ngoc Thien Anh, Baker, Emma, Bentley, Rebecca, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computers and Society, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computers and Society (cs.CY), Machine Learning (cs.LG)
Abstract: A wide variety of methods have been developed for identifying depression, but they focus primarily on measuring the degree to which individuals are suffering from depression currently. In this work we explore the possibility of predicting future depression using machine learning applied to longitudinal socio-demographic data. In doing so we show that data such as housing status, and the details of the family environment, can provide cues for predicting future psychiatric disorders. To this end, we introduce a novel deep multi-task recurrent neural network to learn time-dependent depression cues. The depression prediction task is jointly optimized with two auxiliary anomaly ranking tasks, including contrastive one-class feature ranking and deviation ranking. The auxiliary tasks address two key challenges of the problem: 1) the high within-class variance of depression samples: they enable the learning of representations that are robust to the highly variant in-class distribution of the depression samples; and 2) the small labeled data volume: they significantly enhance the sample efficiency of the prediction model, which reduces the reliance on large depression-labeled datasets that are difficult to collect in practice. Extensive empirical results on large-scale child depression data show that our model is sample-efficient and can accurately predict depression 2-4 years before the illness occurs, substantially outperforming eight representative comparators. Comment: Accepted to PAKDD 2022
Published: 2020
Full Text: View/download PDF

16. Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision

Author: Teney, Damien, Abbasnedjad, Ehsan, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: One of the primary challenges limiting the applicability of deep learning is its susceptibility to learning spurious correlations rather than the underlying mechanisms of the task of interest. The resulting failure to generalise cannot be addressed by simply using more data from the same distribution. We propose an auxiliary training objective that improves the generalization capabilities of neural networks by leveraging an overlooked supervisory signal found in existing datasets. We use pairs of minimally-different examples with different labels, a.k.a. counterfactual or contrasting examples, which provide a signal indicative of the underlying causal structure of the task. We show that such pairs can be identified in a number of existing datasets in computer vision (visual question answering, multi-label image classification) and natural language processing (sentiment analysis, natural language inference). The new training objective orients the gradient of a model's decision function with pairs of counterfactual examples. Models trained with this technique demonstrate improved performance on out-of-distribution test sets.
Published: 2020
Full Text: View/download PDF

17. Visual Question Answering with Prior Class Semantics

Author: Shevchenko, Violetta, Teney, Damien, Dick, Anthony, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: We present a novel mechanism to embed prior knowledge in a model for visual question answering. The open-set nature of the task is at odds with the ubiquitous approach of training a fixed classifier. We show how to exploit additional information pertaining to the semantics of candidate answers. We extend the answer prediction process with a regression objective in a semantic space, in which we project candidate answers using prior knowledge derived from word embeddings. We perform an extensive study of learned representations with the GQA dataset, revealing that important semantic information is captured in the relations between embeddings in the answer space. Our method brings improvements in consistency and accuracy over a range of question types. Experiments with novel answers, unseen during training, indicate the method's potential for open-set prediction.
Published: 2020
Full Text: View/download PDF

18. REFUGE Challenge: A unified framework for evaluating automated methods for glaucoma assessment from fundus photographs

Author: Orlando, José Ignacio, Fu, Huazhu, Breda, João Barbossa, van Keer, Karel, Bathula, Deepti R., Diaz-Pinto, Andrés, Fang, Ruogu, Heng, Pheng-Ann, Kim, Jeyoung, Lee, JoonHo, Lee, Joonseok, Li, Xiaoxiao, Liu, Peng, Lu, Shuai, Murugesan, Balamurali, Naranjo, Valery, Phaye, Sai Samarth R., Shankaranarayana, Sharath M., Sikka, Apoorva, Son, Jaemin, Hengel, Anton van den, Wang, Shujun, Wu, Junyan, Wu, Zifeng, Xu, Guanghui, Xu, Yongli, Yin, Pengshuai, Li, Fei, Zhang, Xiulan, Xu, Yanwu, and Bogunović, Hrvoje
Subjects: FOS: Computer and information sciences, genetic structures, Computer science, Image classification, Fundus Oculi, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Glaucoma, Datasets as Topic, Health Informatics, Fundus (eye), Diagnostic Techniques, Ophthalmological, Machine learning, computer.software_genre, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, 0302 clinical medicine, Deep Learning, Signal Theory and Communications, medicine, Photography, Humans, Radiology, Nuclear Medicine and imaging, Segmentation, Ground truth, Image segmentation, Modality (human–computer interaction), Radiological and Ultrasound Technology, Contextual image classification, medicine.diagnostic_test, business.industry, Fundus photography, Deep learning, medicine.disease, Computer Graphics and Computer-Aided Design, eye diseases, Computer Vision and Pattern Recognition, Artificial intelligence, sense organs, business, computer, 030217 neurology & neurosurgery
Abstract: Glaucoma is one of the leading causes of irreversible but preventable blindness in working age populations. Color fundus photography (CFP) is the most cost-effective imaging modality to screen for retinal disorders. However, its application to glaucoma has been limited to the computation of a few related biomarkers such as the vertical cup-to-disc ratio. Deep learning approaches, although widely applied for medical image analysis, have not been extensively used for glaucoma assessment due to the limited size of the available data sets. Furthermore, the lack of a standardized benchmark strategy makes it difficult to compare existing methods in a uniform way. In order to overcome these issues we set up the Retinal Fundus Glaucoma Challenge, REFUGE (https://refuge.grand-challenge.org), held in conjunction with MICCAI 2018. The challenge consisted of two primary tasks, namely optic disc/cup segmentation and glaucoma classification. As part of REFUGE, we have publicly released a data set of 1200 fundus images with ground truth segmentations and clinical glaucoma labels, currently the largest existing one. We have also built an evaluation framework to ease and ensure fairness in the comparison of different models, encouraging the development of novel techniques in the field. 12 teams qualified and participated in the online challenge. This paper summarizes their methods and analyzes their corresponding results. In particular, we observed that two of the top-ranked teams outperformed two human experts in the glaucoma classification task. Furthermore, the segmentation results were in general consistent with the ground truth annotations, with complementary outcomes that can be further exploited by ensembling the results. Comment: Accepted for publication in Medical Image Analysis
Published: 2019

19. On Incorporating Semantic Prior Knowledge in Deep Learning Through Embedding-Space Constraints

Author: Teney, Damien, Abbasnejad, Ehsan, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: The knowledge that humans hold about a problem often extends far beyond a set of training data and output labels. While the success of deep learning mostly relies on supervised training, important properties cannot be inferred efficiently from end-to-end annotations alone, for example causal relations or domain-specific invariances. We present a general technique to supplement supervised training with prior knowledge expressed as relations between training instances. We illustrate the method on the task of visual question answering to exploit various auxiliary annotations, including relations of equivalence and of logical entailment between questions. Existing methods to use these annotations, including auxiliary losses and data augmentation, cannot guarantee the strict inclusion of these relations into the model since they require a careful balancing against the end-to-end objective. Our method uses these relations to shape the embedding space of the model, and treats them as strict constraints on its learned representations. In the context of VQA, this approach brings significant improvements in accuracy and robustness, in particular over the common practice of incorporating the constraints as a soft regularizer. We also show that incorporating this type of prior knowledge with our method brings consistent improvements, independently from the amount of supervised data used. It demonstrates the value of an additional training signal that is otherwise difficult to extract from end-to-end annotations alone.
Published: 2019
Full Text: View/download PDF

20. Bayesian Conditional Generative Adverserial Networks

Author: Abbasnejad, M. Ehsan, Shi, Qinfeng, Abbasnejad, Iman, Hengel, Anton van den, and Dick, Anthony
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Traditional GANs use a deterministic generator function (typically a neural network) to transform a random noise input $z$ to a sample $\mathbf{x}$ that the discriminator seeks to distinguish. We propose a new GAN called Bayesian Conditional Generative Adversarial Networks (BC-GANs) that use a random generator function to transform a deterministic input $y'$ to a sample $\mathbf{x}$. Our BC-GANs extend traditional GANs to a Bayesian framework, and naturally handle unsupervised learning, supervised learning, and semi-supervised learning problems. Experiments show that the proposed BC-GANs outperform the state of the art.
Published: 2017
Full Text: View/download PDF

21. High-performance Semantic Segmentation Using Very Deep Fully Convolutional Networks

Author: Wu, Zifeng, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a method for high-performance semantic image segmentation (or semantic pixel labelling) based on very deep residual networks, which achieves state-of-the-art performance. A few design factors are carefully considered to this end. We make the following contributions. (i) First, we evaluate different variations of a fully convolutional residual network so as to find the best configuration, including the number of layers, the resolution of feature maps, and the size of field-of-view. Our experiments show that further enlarging the field-of-view and increasing the resolution of feature maps are typically beneficial, which however inevitably leads to a higher demand for GPU memory. To work around this limitation, we propose a new method to simulate a high resolution network with a low resolution network, which can be applied during training and/or testing. (ii) Second, we propose an online bootstrapping method for training. We demonstrate that online bootstrapping is critically important for achieving good accuracy. (iii) Third, we apply the traditional dropout to some of the residual blocks, which further improves the performance. (iv) Finally, our method achieves the currently best mean intersection-over-union of 78.3% on the PASCAL VOC 2012 dataset, as well as on the recent Cityscapes dataset., Comment: 11 pages
Published: 2016
Full Text: View/download PDF

22. Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

Author: Wu, Lin, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we present an end-to-end approach to simultaneously learn spatio-temporal features and a corresponding similarity metric for video-based person re-identification. Given the video sequence of a person, features from each frame that are extracted from all levels of a deep convolutional network can preserve a higher spatial resolution from which we can model finer motion patterns. These low-level visual percepts are leveraged into a variant of a recurrent model to characterize the temporal variation between time-steps. Features from all time-steps are then summarized using temporal pooling to produce an overall feature representation for the complete sequence. The deep convolutional network, recurrent layer, and the temporal pooling are jointly trained to extract comparable hidden-unit representations from an input pair of time series to compute their corresponding similarity value. The proposed framework combines time series modeling and metric learning to jointly learn relevant features and a good similarity measure between time sequences of a person. Experiments demonstrate that our approach achieves state-of-the-art performance for video-based person re-identification on iLIDS-VID and PRID 2011, the two primary public datasets for this purpose., Comment: 11 pages
Published: 2016
Full Text: View/download PDF

23. Zero-Shot Visual Question Answering

Author: Teney, Damien and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Computation and Language (cs.CL)
Abstract: Part of the appeal of Visual Question Answering (VQA) is its promise to answer new questions about previously unseen images. Most current methods demand training questions that illustrate every possible concept, and will therefore never achieve this capability, since the volume of required training data would be prohibitive. Answering general questions about images requires methods capable of Zero-Shot VQA, that is, methods able to answer questions beyond the scope of the training questions. We propose a new evaluation protocol for VQA methods which measures their ability to perform Zero-Shot VQA, and in doing so highlights significant practical deficiencies of current approaches, some of which are masked by the biases in current datasets. We propose and evaluate several strategies for achieving Zero-Shot VQA, including methods based on pretrained word embeddings, object classifiers with semantic embeddings, and test-time retrieval of example images. Our extensive experiments are intended to serve as baselines for Zero-Shot VQA, and they also achieve state-of-the-art performance in the standard VQA evaluation setting.
Published: 2016
Full Text: View/download PDF

24. PersonNet: Person Re-identification with Deep Convolutional Neural Networks

Author: Wu, Lin, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we propose a deep end-to-end neural network to simultaneously learn high-level features and a corresponding similarity metric for person re-identification. The network takes a pair of raw RGB images as input, and outputs a similarity value indicating whether the two input images depict the same person. A layer that computes neighborhood range differences across the two input images is employed to capture local relationships between patches. This operation seeks a robust feature from the input images. By increasing the depth to 10 weight layers and using very small (3$\times$3) convolution filters, our architecture achieves a remarkable improvement on the prior-art configurations. Meanwhile, an adaptive Root-Mean-Square (RMSProp) gradient descent algorithm is integrated into our architecture, which is beneficial to deep nets. Our method consistently outperforms state-of-the-art on two large datasets (CUHK03 and Market-1501), and a medium-sized data set (CUHK01)., Comment: 7 pages. Fixed Figure 4 (a)
Published: 2016
Full Text: View/download PDF

25. Crisis within modernity: Léon Dehon and the social reign of the Sacred Heart

Author: Hengel, John van den
Published: 2016
Full Text: View/download PDF

26. Bridging Category-level and Instance-level Semantic Image Segmentation

Author: Wu, Zifeng, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Abstract: We propose an approach to instance-level image segmentation that is built on top of category-level segmentation. Specifically, for each pixel in a semantic category mask, its corresponding instance bounding box is predicted using a deep fully convolutional regression network. Thus it follows a different pipeline to the popular detect-then-segment approaches that first predict instances' bounding boxes, which are the current state-of-the-art in instance segmentation. We show that, by leveraging the strength of our state-of-the-art semantic segmentation models, the proposed method can achieve results comparable to, or even better than, those of detect-then-segment approaches. We make the following contributions. (i) First, we propose a simple yet effective approach to semantic instance segmentation. (ii) Second, we propose an online bootstrapping method during training, which is critically important for achieving good performance for both semantic category segmentation and instance-level segmentation. (iii) As the performance of semantic category segmentation has a significant impact on the instance-level segmentation, which is the second step of our approach, we train fully convolutional residual networks to achieve the best semantic category segmentation accuracy. On the PASCAL VOC 2012 dataset, we obtain the currently best mean intersection-over-union score of 79.1%. (iv) We also achieve state-of-the-art results for instance-level segmentation., Comment: 14 pages. arXiv admin note: substantial text overlap with arXiv:1604.04339
Published: 2016
Full Text: View/download PDF

27. A model-based approach to recovering the structure of a plant from images

Author: Ward, Ben, Bastian, John, Hengel, Anton van den, Pooley, Daniel, Bari, Rajendra, Berger, Bettina, and Tester, Mark
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, food and beverages
Abstract: We present a method for recovering the structure of a plant directly from a small set of widely-spaced images. Structure recovery is more complex than shape estimation, but the resulting structure estimate is more closely related to phenotype than is a 3D geometric model. The method we propose is applicable to a wide variety of plants, but is demonstrated on wheat. Wheat is made up of thin elements with few identifiable features, making it difficult to analyse using standard feature matching techniques. Our method instead analyses the structure of plants using only their silhouettes. We employ a generate-and-test method, using a database of manually modelled leaves and a model for their composition to synthesise plausible plant structures which are evaluated against the images. The method is capable of efficiently recovering accurate estimates of plant structure in a wide variety of imaging scenarios, with no manual intervention.
Published: 2015
Full Text: View/download PDF

28. Great Divides. Een cartografie van de archeologische wetenschap

Author: Hengel, L.B.N. van den
Subjects: De antieke wereld, Dynamics of gender, The Ancient World
Abstract: Item does not contain fulltext 16 p.
Published: 2006

29. De stem van de pijn. De politieke kunst van Diamanda Galás

Author: Hengel, L.B.N. van den
Subjects: Dynamics of gender
Abstract: Item does not contain fulltext Composer and vocal artist Galás has a range of almost four octaves. She uses the 'bel canto' technique as a political instrument on behalf of the weaker members of society.
Published: 2006

30. Het beeld van de keizer. Seksualiteit, schoonheid en de politiek van het lichaam

Author: Hengel, L.B.N. van den
Subjects: De antieke wereld, Dynamics of gender
Abstract: Item does not contain fulltext A portrait is a 'representation' of a person, and therefore to some extent a fiction. On the other hand, it is a very concrete, material thing that makes the person in question physically present. In art, this interweaving of person and image, fiction and reality, seems to occur for the first time in the portraits of the Roman emperors. But how should we look at these images? Is it possible to discover the 'true' person of the emperor in them? And how is our own gaze coloured by, for example, gender, ethnicity, social status and academic tradition? 4 p.
Published: 2004

31. Deconstruction of compound objects from image sets

Author: Hengel, Anton van den, Bastian, John, Dick, Anthony, and Fleming, Lachlan
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a method to recover the structure of a compound object from multiple silhouettes. Structure is expressed as a collection of 3D primitives chosen from a pre-defined library, each with an associated pose. This has several advantages over a volume or mesh representation both for estimation and the utility of the recovered model. The main challenge in recovering such a model is the combinatorial number of possible arrangements of parts. We address this issue by exploiting the sparse nature of the problem, and show that our method scales to objects constructed from large libraries of parts.
Published: 2014
Full Text: View/download PDF

32. Constraint Reduction using Marginal Polytope Diagrams for MAP LP Relaxations

Author: Zhang, Zhen, Shi, Qinfeng, Zhang, Yanning, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: LP relaxation-based message passing algorithms provide an effective tool for MAP inference over Probabilistic Graphical Models. However, different LP relaxations often have different objective functions and variables of differing dimensions, which presents a barrier to effective comparison and analysis. In addition, the computational complexity of LP relaxation-based methods grows quickly with the number of constraints. Reducing the number of constraints without sacrificing the quality of the solutions is thus desirable. We propose a unified formulation under which existing MAP LP relaxations may be compared and analysed. Furthermore, we propose a new tool called Marginal Polytope Diagrams. Some properties of Marginal Polytope Diagrams are exploited such as node redundancy and edge equivalence. We show that using Marginal Polytope Diagrams allows the number of constraints to be reduced without loosening the LP relaxations. Then, using Marginal Polytope Diagrams and constraint reduction, we develop three novel message passing algorithms, and demonstrate that two of these show a significant improvement in speed over state-of-the-art algorithms while delivering a competitive, and sometimes higher, quality of solution.
Published: 2013
Full Text: View/download PDF

33. Efficient pedestrian detection by directly optimize the partial area under the ROC curve

Author: Paisitkriangkrai, Sakrapee, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Many typical applications of object detection operate within a prescribed false-positive range. In this situation the performance of a detector should be assessed on the basis of the area under the ROC curve over that range, rather than over the full curve, as the performance outside the range is irrelevant. This measure is labelled as the partial area under the ROC curve (pAUC). Effective cascade-based classification, for example, depends on training node classifiers that achieve the maximal detection rate at a moderate false positive rate, e.g., around 40% to 50%. We propose a novel ensemble learning method which achieves a maximal detection rate at a user-defined range of false positive rates by directly optimizing the partial AUC using structured learning. By optimizing for different ranges of false positive rates, the proposed method can be used to train either a single strong classifier or a node classifier forming part of a cascade classifier. Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of our approach, and we show that it is possible to train state-of-the-art pedestrian detectors using the proposed structured ensemble learning method., Comment: 10 pages. Appearing in Int. Conf. Computer Vision (ICCV) 2013
Published: 2013
Full Text: View/download PDF
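
The partial AUC described in the abstract above can be illustrated with a small sketch. This is not the authors' structured ensemble learning method; it only evaluates a normalized pAUC for given scores, using a pairwise-ranking formulation. The function name `partial_auc`, its arguments, and the assumption that the false-positive bounds fall on whole numbers of negatives are all hypothetical choices made for illustration.

```python
def partial_auc(pos_scores, neg_scores, fpr_lo, fpr_hi):
    """Normalized partial AUC over the false-positive range [fpr_lo, fpr_hi].

    Illustrative assumption: fpr_lo * n and fpr_hi * n are (close to) whole
    numbers, where n is the number of negatives, so the range selects a
    contiguous block of the highest-scoring negatives.
    """
    neg_sorted = sorted(neg_scores, reverse=True)  # highest-scoring negatives first
    n = len(neg_sorted)
    j_lo = round(fpr_lo * n)
    j_hi = round(fpr_hi * n)
    if not 0 <= j_lo < j_hi <= n:
        raise ValueError("false-positive range selects no negatives")
    selected = neg_sorted[j_lo:j_hi]  # negatives whose rank falls in the range
    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = sum(
        1.0 if p > q else 0.5 if p == q else 0.0
        for p in pos_scores
        for q in selected
    )
    return wins / (len(pos_scores) * len(selected))


pos = [0.9, 0.8, 0.4]
neg = [0.7, 0.5, 0.3, 0.1]
restricted = partial_auc(pos, neg, 0.0, 0.5)  # only the two hardest negatives
full = partial_auc(pos, neg, 0.0, 1.0)        # reduces to the ordinary AUC
```

With the full range the function reduces to the usual AUC, while a narrow range looks only at the negatives that would produce false positives within the prescribed operating region.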

34. A scalable stage-wise approach to large-margin multi-class loss based boosting

Author: Paisitkriangkrai, Sakrapee, Shen, Chunhua, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Machine Learning (cs.LG)
Abstract: We present a scalable and effective classification model to train multi-class boosting for multi-class classification problems. Shen and Hao introduced a direct formulation of multi-class boosting in the sense that it directly maximizes the multi-class margin [C. Shen and Z. Hao, "A direct formulation for totally-corrective multi-class boosting", in Proc. IEEE Conf. Comp. Vis. Patt. Recogn., 2011]. The major problem of their approach is its high computational complexity for training, which hampers its application on real-world problems. In this work, we propose a scalable and simple stage-wise multi-class boosting method, which also directly maximizes the multi-class margin. Our approach offers a few advantages: 1) it is simple and computationally efficient to train. The approach can speed up the training time by more than two orders of magnitude without sacrificing the classification accuracy. 2) Like traditional AdaBoost, it is less sensitive to the choice of parameters and empirically demonstrates excellent generalization performance. Experimental results on challenging multi-class machine learning and vision tasks demonstrate that the proposed approach substantially improves the convergence rate and accuracy of the final visual detector at no additional computational cost compared to existing multi-class boosting., Comment: 12 pages
Published: 2013
Full Text: View/download PDF

35. Fast Training of Effective Multi-class Boosting Using Coordinate Descent Optimization

Author: Lin, Guosheng, Shen, Chunhua, Hengel, Anton van den, and Suter, David
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Statistics - Computation, Computation (stat.CO), Machine Learning (cs.LG)
Abstract: We present a novel column generation based boosting method for multi-class classification. Our multi-class boosting is formulated in a single optimization problem as in Shen and Hao (2011). Different from most existing multi-class boosting methods, which use the same set of weak learners for all the classes, we train class specified weak learners (i.e., each class has a different set of weak learners). We show that using separate weak learner sets for each class leads to fast convergence, without introducing additional computational overhead in the training procedure. To further make the training more efficient and scalable, we also propose a fast coordinate descent method for solving the optimization problem at each boosting iteration. The proposed coordinate descent method is conceptually simple and easy to implement in that it is a closed-form solution for each coordinate update. Experimental results on a variety of datasets show that, compared to a range of existing multi-class boosting methods, the proposed method has much faster convergence rate and better generalization performance in most cases. We also empirically show that the proposed fast coordinate descent algorithm needs less training time than the MultiBoost algorithm in Shen and Hao (2011)., Comment: Appeared in Proc. Asian Conf. Computer Vision 2012. Code can be downloaded at http://goo.gl/WluhrQ
Published: 2013
Full Text: View/download PDF

36. An Efficient Dual Approach to Distance Metric Learning

Author: Shen, Chunhua, Kim, Junae, Liu, Fayao, Wang, Lei, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Machine Learning (cs.LG)
Abstract: Distance metric learning is of fundamental interest in machine learning because the distance metric employed can significantly affect the performance of many learning methods. Quadratic Mahalanobis metric learning is a popular approach to the problem, but typically requires solving a semidefinite programming (SDP) problem, which is computationally expensive. Standard interior-point SDP solvers typically have a complexity of $O(D^{6.5})$ (with $D$ the dimension of input data), and can thus only practically solve problems exhibiting less than a few thousand variables. Since the number of variables is $D(D+1)/2$, this implies a limit upon the size of problem that can practically be solved of around a few hundred dimensions. The complexity of the popular quadratic Mahalanobis metric learning approach thus limits the size of problem to which metric learning can be applied. Here we propose a significantly more efficient approach to the metric learning problem based on the Lagrange dual formulation of the problem. The proposed formulation is much simpler to implement, and therefore allows much larger Mahalanobis metric learning problems to be solved. The time complexity of the proposed method is $O(D^3)$, which is significantly lower than that of the SDP approach. Experiments on a variety of datasets demonstrate that the proposed method achieves an accuracy comparable to the state-of-the-art, but is applicable to significantly larger problems. We also show that the proposed method can be applied to solve more general Frobenius-norm regularized SDP problems approximately.
Published: 2013
Full Text: View/download PDF
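
To make the quantities in the abstract above concrete, the following minimal sketch counts the free variables of a symmetric $D \times D$ metric matrix, $D(D+1)/2$, and evaluates a Mahalanobis distance for a given matrix. It does not implement the dual solver proposed in the paper; the function names and the plain-list matrix representation are assumptions made only for illustration.

```python
import math


def num_metric_variables(dim):
    """Number of free entries in a symmetric dim x dim matrix: D(D+1)/2."""
    return dim * (dim + 1) // 2


def mahalanobis_distance(x, y, M):
    """sqrt((x - y)^T M (x - y)) for a positive semidefinite matrix M.

    M is given as a list of rows; no check of positive semidefiniteness is
    performed in this sketch.
    """
    diff = [a - b for a, b in zip(x, y)]
    size = len(diff)
    quad = sum(
        diff[i] * M[i][j] * diff[j]
        for i in range(size)
        for j in range(size)
    )
    return math.sqrt(quad)


identity = [[1.0, 0.0], [0.0, 1.0]]
euclidean = mahalanobis_distance([0.0, 0.0], [3.0, 4.0], identity)  # Euclidean case
stretched = mahalanobis_distance([1.0, 0.0], [0.0, 0.0], [[4.0, 0.0], [0.0, 1.0]])
```

With the identity matrix the distance reduces to the Euclidean distance; learning the metric amounts to choosing the entries of `M`, which is why the variable count grows quadratically with the input dimension.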

37. Magie, macht en castratie. De verbeelding van heksen in de vroege 16e eeuw

Author: Hengel, L.B.N. van den
Subjects: De antieke wereld, Dynamics of gender
Abstract: Item does not contain fulltext 3 p.
Published: 2004

38. A Direct Approach to Multi-class Boosting and Extensions

Author: Shen, Chunhua, Paisitkriangkrai, Sakrapee, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Machine Learning (cs.LG)
Abstract: Boosting methods combine a set of moderately accurate weak learners to form a highly accurate predictor. Despite the practical importance of multi-class boosting, it has received far less attention than its binary counterpart. In this work, we propose a fully-corrective multi-class boosting formulation which directly solves the multi-class problem without dividing it into multiple binary classification problems. In contrast, most previous multi-class boosting algorithms decompose a multi-boost problem into multiple binary boosting problems. By explicitly deriving the Lagrange dual of the primal optimization problem, we are able to construct a column generation-based fully-corrective approach to boosting which directly optimizes multi-class classification performance. The new approach not only updates all weak learners' coefficients at every iteration, but does so in a manner flexible enough to accommodate various loss functions and regularizations. For example, it enables us to introduce structural sparsity through mixed-norm regularization to promote group sparsity and feature sharing. Boosting with shared features is particularly beneficial in complex prediction problems where features can be expensive to compute. Our experiments on various data sets demonstrate that our direct multi-class boosting generalizes as well as, or better than, a range of competing multi-class boosting methods. The end result is a highly effective and compact ensemble classifier which can be trained in a distributed fashion., Comment: 34 pages
Published: 2012
Full Text: View/download PDF

39. Introduction

Author: Hekster, O.J., Hengel, L.B.N. van den, Mols, S.T.A.M., Hekster, Olivier, and Mols, Stephan
Subjects: GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries)
Published: 2010

40. Optimally Training a Cascade Classifier

Author: Shen, Chunhua, Wang, Peng, and Hengel, Anton van den
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: Cascade classifiers are widely used in real-time object detection. Different from conventional classifiers that are designed for a low overall classification error rate, a classifier in each node of the cascade is required to achieve an extremely high detection rate and moderate false positive rate. Although there are a few reported methods addressing this requirement in the context of object detection, there is no principled feature selection method that explicitly takes into account this asymmetric node learning objective. We provide such an algorithm here. We show a special case of the biased minimax probability machine has the same formulation as the linear asymmetric classifier (LAC) of Wu et al. (2005). We then design a new boosting algorithm that directly optimizes the cost function of LAC. The resulting totally-corrective boosting algorithm is implemented by the column generation technique in convex optimization. Experimental results on object detection verify the effectiveness of the proposed boosting algorithm as a node classifier in cascade object detection, and show performance better than that of the current state-of-the-art., Comment: 16 pages
Published: 2010
Full Text: View/download PDF

41. Introduction

Author: Hekster, O.J., Hengel, L.B.N. van den, Mols, S.T.A.M., Hekster, Olivier, and Mols, Stephan
Subjects: The Ancient World
Abstract: Contains fulltext : 302742.pdf (Publisher’s version ) (Open Access) BABesch 80th Anniversary Workshop, 8 September 2006 101 p.
Published: 2010

42. Redactioneel

Author: Brink, M.C.L. van den, Hengel, L.B.N. van den, and Mens-Verhulst, J. van
Published: 2009

43. Redactioneel

Author: Brink, M.C.L. van den, Hengel, L.B.N. van den, and Mens-Verhulst, J. van
Published: 2009

44. Imago. Romeinse keizerbeelden en de belichaming van gender

Author: Hengel, L.B.N. van den, Jansen, W.H.M., Moormann, E.M., Hoogland, R.C., and Radboud University Nijmegen
Subjects: De antieke wereld, Dynamics of gender, Anthropology and Development Studies, The Ancient World
Abstract: Contains fulltext : mmubn000001_511410085.pdf (Publisher’s version ) (Open Access) Images of the Roman emperors were omnipresent in the ancient world. Portrait sculpture in particular was an established means of conveying the ruler's authority to all layers of society. This study addresses the meaning of the Roman imperial portrait as an embodiment of power in the most literal sense of the word; that is, the image is placed centre stage not only as a carrier of political meaning but also, and above all, as a bodily artefact. As a body image, the imperial portrait functioned not only as a model of power but also as an embodiment of gender: its meaning is directly connected with ancient views of masculinity and femininity. At the interface of Roman archaeology and contemporary gender studies, this study sheds a critical new light on one of the most traditional parts of classical art and cultural history. RU Radboud Universiteit Nijmegen, 13 March 2009 Supervisors : Moormann, E.M., Jansen, W.H.M. Co-supervisor : Hoogland, R.C. 503 p.
Published: 2009

45. Redactioneel

Author: Brink, M.C.L. van den, Hengel, L.B.N. van den, and Mens-Verhulst, J. van
Subjects: Dynamics of gender
Abstract: Contains fulltext : 127999pub.pdf (Publisher’s version ) (Open Access) 1 p.
Published: 2009

46. Het theater van de wreedheid. Gladiatorenspelen in de Romeinse keizertijd

Author: Hengel, L.B.N. van den, Mols, S.T.A.M., Hekster, O.J., Moormann, E.M., Mols, Stephan, Hekster, Olivier, and Moormann, Eric
Subjects: De antieke wereld, Dynamics of gender, The Ancient World
Abstract: Item does not contain fulltext
Published: 2008

47. Mannen van marmer. De lichamelijkheid van het Romeinse keizerportret

Author: Hengel, L.B.N. van den
Subjects: De antieke wereld, The Ancient World
Abstract: Item does not contain fulltext Guest lecture in classical archaeology, Greek and Roman portraits, Radboud Universiteit Nijmegen, Greek and Latin Language and Culture
Published: 2007

48. Emperors and Gladiators. Images of Masculinity in Ancient Rome

Author: Hengel, L.B.N. van den
Subjects: De antieke wereld, The Ancient World
Abstract: Item does not contain fulltext University College Maastricht, guest lecture, Cultural Studies II (BA 200
Published: 2007

49. Verzwegen verlangen. Foucault, feminisme en de geschiedenis van de seksualiteit

Author: Hengel, L.B.N. van den
Subjects: Dynamics of gender
Abstract: Contains fulltext : 54789.pdf (Publisher’s version ) (Open Access)
Published: 2006

50. Safety testing of ammonium nitrate products

Author: Kersten, R.J.A., Hengel, E.I.V. van den, Steen, A.C. van der, and TNO Defensie en Veiligheid
Published: 2006

65 results on '"Hengel, A. van den"'