Author: "Hospedales, Timothy M." / Database: Academic Search Index - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Hospedales, Timothy M."' showing total 17 results

Start Over Author "Hospedales, Timothy M." Database Academic Search Index

17 results on '"Hospedales, Timothy M."'

1. Discovery of Shared Semantic Spaces for Multiscene Video Query and Summarization.

Author: Xun Xu, Hospedales, Timothy M., and Shaogang Gong
Subjects: *VIDEO recording, *SEMANTICS, *VIDEO surveillance, *PIXELS, *AUTOMATION
Abstract: The growing rate of public space closed-circuit television (CCTV) installations has generated a need for automated methods for exploiting video surveillance data, including scene understanding, query, behavior annotation, and summarization. For this reason, extensive research has been performed on surveillance scene understanding and analysis. However, most studies have considered single scenes or groups of adjacent scenes. The semantic similarity between different but related scenes (e.g., many different traffic scenes of a similar layout) is not generally exploited to improve any automated surveillance tasks and reduce manual effort. Exploiting commonality and sharing any supervised annotations between different scenes is, however, challenging due to the following reason: some scenes are totally unrelated and thus any information sharing between them would be detrimental, whereas others may share only a subset of common activities and thus information sharing is only useful if it is selective. Moreover, semantically similar activities that should be modeled together and shared across scenes may have quite different pixel-level appearances in each scene. To address these issues, we develop a new framework for distributed multiple-scene global understanding that clusters surveillance scenes by their ability to explain each other's behaviors and further discovers which subset of activities are shared versus scene specific within each cluster. We show how to use this structured representation of multiple scenes to improve common surveillance tasks, including scene activity understanding, cross-scene query-by-example, behavior classification with reduced supervised labeling requirements, and video summarization. In each case, we demonstrate how our multiscene model improves on a collection of standard single-scene models and a flat model of all scenes. [ABSTRACT FROM AUTHOR]
Published: 2017
Full Text: View/download PDF

2. When and where to transfer for Bayesian network parameter learning.

Author: Zhou, Yun, Hospedales, Timothy M., and Fenton, Norman
Subjects: *COGNITIVE structures, *MENTAL arithmetic, *MATHEMATICAL analysis, *BAYESIAN analysis, *STATISTICAL decision making
Abstract: Learning Bayesian networks from scarce data is a major challenge in real-world applications where data are hard to acquire. Transfer learning techniques attempt to address this by leveraging data from different but related problems. For example, it may be possible to exploit medical diagnosis data from a different country. A challenge with this approach is heterogeneous relatedness to the target, both within and across source networks. In this paper we introduce the Bayesian network parameter transfer learning (BNPTL) algorithm to reason about both network and fragment (sub-graph) relatedness. BNPTL addresses (i) how to find the most relevant source network and network fragments to transfer, and (ii) how to fuse source and target parameters in a robust way. In addition to improving target task performance, explicit reasoning allows us to diagnose network and fragment relatedness across Bayesian networks, even if latent variables are present, or if their state space is heterogeneous. This is important in some applications where relatedness itself is an output of interest. Experimental results demonstrate the superiority of BNPTL at various scarcities and source relevance levels compared to single task learning and other state-of-the-art parameter transfer methods. Moreover, we demonstrate successful application to real-world medical case studies. [ABSTRACT FROM AUTHOR]
Published: 2016
Full Text: View/download PDF

3. Robust Subjective Visual Property Prediction from Crowdsourced Pairwise Labels.

Author: Fu, Yanwei, Hospedales, Timothy M., Xiang, Tao, Xiong, Jiechao, Gong, Shaogang, Wang, Yizhou, and Yao, Yuan
Subjects: *IMAGE recognition (Computer vision), *CROWDSOURCING, *OUTLIERS (Statistics), *ROBUST statistics, *SPARSE approximations
Abstract: The problem of estimating subjective visual properties from image and video has attracted increasing interest. A subjective visual property is useful either on its own (e.g. image and video interestingness) or as an intermediate representation for visual recognition (e.g. a relative attribute). Due to its ambiguous nature, annotating the value of a subjective visual property for learning a prediction model is challenging. To make the annotation more reliable, recent studies employ crowdsourcing tools to collect pairwise comparison labels. However, using crowdsourced data also introduces outliers. Existing methods rely on majority voting to prune the annotation outliers/errors. They thus require a large amount of pairwise labels to be collected. More importantly as a local outlier detection method, majority voting is ineffective in identifying outliers that can cause global ranking inconsistencies. In this paper, we propose a more principled way to identify annotation outliers by formulating the subjective visual property prediction task as a unified robust learning to rank problem, tackling both the outlier detection and learning to rank jointly. This differs from existing methods in that (1) the proposed method integrates local pairwise comparison labels together to minimise a cost that corresponds to global inconsistency of ranking order, and (2) the outlier detection and learning to rank problems are solved jointly. This not only leads to better detection of annotation outliers but also enables learning with extremely sparse annotations. [ABSTRACT FROM PUBLISHER]
Published: 2016
Full Text: View/download PDF

4. Transductive Multi-View Zero-Shot Learning.

Author: Fu, Yanwei, Hospedales, Timothy M., Xiang, Tao, and Gong, Shaogang
Subjects: *OBJECT recognition (Computer vision), *COMPUTER vision, *HYPERGRAPHS, *MACHINE learning, *COMPUTATIONAL learning theory
Abstract: Most existing zero-shot learning approaches exploit transfer learning via an intermediate semantic representation shared between an annotated auxiliary dataset and a target dataset with different classes and no annotation. A projection from a low-level feature space to the semantic representation space is learned from the auxiliary dataset and applied without adaptation to the target dataset. In this paper we identify two inherent limitations with these approaches. First, due to having disjoint and potentially unrelated classes, the projection functions learned from the auxiliary dataset/domain are biased when applied directly to the target dataset/domain. We call this problem the projection domain shift problem and propose a novel framework, transductive multi-view embedding, to solve it. The second limitation is the prototype sparsity problem which refers to the fact that for each target class, only a single prototype is available for zero-shot learning given a semantic representation. To overcome this problem, a novel heterogeneous multi-view hypergraph label propagation method is formulated for zero-shot learning in the transductive embedding space. It effectively exploits the complementary information offered by different semantic representations and takes advantage of the manifold structures of multiple representation spaces in a coherent manner. We demonstrate through extensive experiments that the proposed approach (1) rectifies the projection shift between the auxiliary and target domains, (2) exploits the complementarity of multiple semantic representations, (3) significantly outperforms existing methods for both zero-shot and N-shot recognition on three image and video benchmark datasets, and (4) enables novel cross-view annotation tasks. [ABSTRACT FROM PUBLISHER]
Published: 2015
Full Text: View/download PDF

5. Bayesian Joint Modelling for Object Localisation in Weakly Labelled Images.

Author: Shi, Zhiyuan, Hospedales, Timothy M., and Xiang, Tao
Subjects: *OBJECT recognition (Computer vision), *LOCALIZATION theory, *SUPERVISED learning, *IMAGE processing, *BAYESIAN analysis
Abstract: We address the problem of localisation of objects as bounding boxes in images and videos with weak labels. This weakly supervised object localisation problem has been tackled in the past using discriminative models where each object class is localised independently from other classes. In this paper, a novel framework based on Bayesian joint topic modelling is proposed, which differs significantly from the existing ones in that: (1) All foreground object classes are modelled jointly in a single generative model that encodes multiple object co-existence so that “explaining away” inference can resolve ambiguity and lead to better learning and localisation. (2) Image backgrounds are shared across classes to better learn varying surroundings and “push out” objects of interest. (3) Our model can be learned with a mixture of weakly labelled and unlabelled data, allowing the large volume of unlabelled images on the Internet to be exploited for learning. Moreover, the Bayesian formulation enables the exploitation of various types of prior knowledge to compensate for the limited supervision offered by weakly labelled data, as well as Bayesian domain adaptation for transfer learning. Extensive experiments on the PASCAL VOC, ImageNet and YouTube-Object videos datasets demonstrate the effectiveness of our Bayesian joint model for weakly supervised object localisation. [ABSTRACT FROM PUBLISHER]
Published: 2015
Full Text: View/download PDF

6. Learning Multimodal Latent Attributes.

Author: Fu, Yanwei, Hospedales, Timothy M., Xiang, Tao, and Gong, Shaogang
Subjects: *COMPUTER multitasking, *SOCIAL media research, *OBJECT recognition (Computer vision), *SOCIAL groups, *LATENT functions (Social sciences), *PSYCHOLOGY
Abstract: The rapid development of social media sharing has created a huge demand for automatic media classification and annotation techniques. Attribute learning has emerged as a promising paradigm for bridging the semantic gap and addressing data sparsity via transferring attribute knowledge in object recognition and relatively simple action classification. In this paper, we address the task of attribute learning for understanding multimedia data with sparse and incomplete labels. In particular, we focus on videos of social group activities, which are particularly challenging and topical examples of this task because of their multimodal content and complex and unstructured nature relative to the density of annotations. To solve this problem, we 1) introduce a concept of semilatent attribute space, expressing user-defined and latent attributes in a unified framework, and 2) propose a novel scalable probabilistic topic model for learning multimodal semilatent attributes, which dramatically reduces requirements for an exhaustive accurate attribute ontology and expensive annotation effort. We show that our framework is able to exploit latent attributes to outperform contemporary approaches for addressing a variety of realistic multimedia sparse data learning tasks including: multitask learning, learning with label noise, N-shot transfer learning, and importantly zero-shot learning. [ABSTRACT FROM PUBLISHER]
Published: 2014
Full Text: View/download PDF

7. Finding Rare Classes: Active Learning with Generative and Discriminative Models.

Author: Hospedales, Timothy M., Gong, Shaogang, and Xiang, Tao
Subjects: *DATA mining, *MACHINE learning, *EDUCATIONAL technology, *PROGRAMMED instruction, *ACTIVE learning, *EXPERIENTIAL learning
Abstract: Discovering rare categories and classifying new instances of them are important data mining issues in many fields, but fully supervised learning of a rare class classifier is prohibitively costly in labeling effort. There has therefore been increasing interest both in active discovery: to identify new classes quickly, and active learning: to train classifiers with minimal supervision. These goals occur together in practice and are intrinsically related because examples of each class are required to train a classifier. Nevertheless, very few studies have tried to optimise them together, meaning that data mining for rare classes in new domains makes inefficient use of human supervision. Developing active learning algorithms to optimise both rare class discovery and classification simultaneously is challenging because discovery and classification have conflicting requirements in query criteria. In this paper, we address these issues with two contributions: a unified active learning model to jointly discover new categories and learn to classify them by adapting query criteria online; and a classifier combination algorithm that switches generative and discriminative classifiers as learning progresses. Extensive evaluation on a batch of standard UCI and vision data sets demonstrates the superiority of this approach over existing methods. [ABSTRACT FROM AUTHOR]
Published: 2013
Full Text: View/download PDF

8. Structure Inference for Bayesian Multisensory Scene Understanding.

Author: Hospedales, Timothy M. and Vijayakumar, Sethu
Subjects: *SIGNAL processing, *SENSOR networks, *MULTISENSOR data fusion, *BAYESIAN analysis, *INTEGRATION (Theory of knowledge), *PSYCHOPHYSIOLOGY
Abstract: We investigate a solution to the problem of multisensor scene understanding by formulating it in the framework of Bayesian model selection and structure inference. Humans robustly associate multimodal data as appropriate, but previous modeling work has focused largely on optimal fusion, leaving segregation unaccounted for and unexploited by machine perception systems. We illustrate a unifying Bayesian solution to multisensory perception and tracking, which accounts for both integration and segregation by explicit probabilistic reasoning about data association in a temporal context. Such an explicit inference of multimodal data association is also of intrinsic interest for higher level understanding of multisensory data. We illustrate this by using a probabilistic implementation of data association in a multiparty audiovisual scenario, where unsupervised learning and structure inference is used to automatically segment, associate, and track individual subjects in audiovisual sequences. Indeed, the structure-inference-based framework introduced in this work provides the theoretical foundation needed to satisfactorily explain many confounding results in human psychophysics experiments involving multimodal cue integration and association. [ABSTRACT FROM AUTHOR]
Published: 2008
Full Text: View/download PDF

9. Implications of Noise and Neural Heterogeneity for Vestibulo-Ocular Reflex Fidelity.

Author: Hospedales, Timothy M., van Rossum, Mark C. W., Graham, Bruce P., and Mayank B. Dutia
Subjects: *NOISE, *NEURONS, *CELLS, *ELECTROPHYSIOLOGY, *NEUROSCIENCES
Abstract: The vestibulo-ocular reflex (VOR) is characterized by a short-latency, high-fidelity eye movement response to head rotations at frequencies up to 20 Hz. Electrophysiological studies of medial vestibular nucleus (MVN) neurons, however, show that their response to sinusoidal currents above 10 to 12Hz is highly nonlinear and distorted by aliasing for all but very small current amplitudes. How can this system function in vivo when single cell response cannot explain its operation? Here we show that the necessary wide VOR frequency response may be achieved not by firing rate encoding of head velocity in single neurons, but in the integrated population response of asynchronously firing, intrinsically active neurons. Diffusive synaptic noise and the pacemaker-driven, intrinsic firing of MVN cells synergistically maintain asynchronous, spontaneous spiking in a population of model MVN neurons over a wide range of input signal amplitudes and frequencies. Response fidelity is further improved by a reciprocal inhibitory link between two MVN populations, mimicking the vestibular commissural system in vivo, but only if asynchrony is maintained by noise and pacemaker inputs. These results provide a previously missing explanation for the full range of VOR function and a novel account of the role of the intrinsic pacemaker conductances in MVN cells. The values of diffusive noise and pacemaker currents that give optimal response fidelity yield firing statistics similar to those in vivo, suggesting that the in vivo network is tuned to optimal performance. While theoretical studies have argued that noise and population heterogeneity can improve coding, to our knowledge this is the first evidence indicating that these parameters are indeed tuned to optimize coding fidelity in a neural control system in vivo. [ABSTRACT FROM AUTHOR]
Published: 2008
Full Text: View/download PDF

10. Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool.

Author: Liu, Feng, Xiang, Tao, Hospedales, Timothy M., Yang, Wankou, and Sun, Changyin
Subjects: *REINFORCEMENT learning, *QUESTION answering systems, *ARTIFICIAL intelligence, *IMAGE color analysis, *INVERSE problems
Abstract: In recent years, visual question answering (VQA) has become topical. The premise of VQA's significance as a benchmark in AI, is that both the image and textual question need to be well understood and mutually grounded in order to infer the correct answer. However, current VQA models perhaps ‘understand’ less than initially hoped, and instead master the easier task of exploiting cues given away in the question and biases in the answer distribution. In this paper we propose the inverse problem of VQA (iVQA). The iVQA task is to generate a question that corresponds to a given image and answer pair. We propose a variational iVQA model that can generate diverse, grammatically correct and content correlated questions that match the given answer. Based on this model, we show that iVQA is an interesting benchmark for visuo-linguistic understanding, and a more challenging alternative to VQA because an iVQA model needs to understand the image better to be successful. As a second contribution, we show how to use iVQA in a novel reinforcement learning framework to diagnose any existing VQA model by way of exposing its belief set: the set of question-answer pairs that the VQA model would predict true for a given image. This provides a completely new window into what VQA models ‘believe’ about images. We show that existing VQA models have more erroneous beliefs than previously thought, revealing their intrinsic weaknesses. Suggestions are then made on how to address these weaknesses going forward. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

11. Weakly-Supervised Image Annotation and Segmentation with Objects and Attributes.

Author: Shi, Zhiyuan, Yang, Yongxin, Hospedales, Timothy M., and Xiang, Tao
Subjects: *IMAGE processing, *PATTERN recognition systems, *BAYESIAN analysis, *ANNOTATIONS, *SEMANTICS
Abstract: We propose to model complex visual scenes using a non-parametric Bayesian model learned from weakly labelled images abundant on media sharing sites such as Flickr. Given weak image-level annotations of objects and attributes without locations or associations between them, our model aims to learn the appearance of object and attribute classes as well as their association on each object instance. Once learned, given an image, our model can be deployed to tackle a number of vision problems in a joint and coherent manner, including recognising objects in the scene (automatic object annotation), describing objects using their attributes (attribute prediction and association), and localising and delineating the objects (object detection and semantic segmentation). This is achieved by developing a novel Weakly Supervised Markov Random Field Stacked Indian Buffet Process (WS-MRF-SIBP) that models objects and attributes as latent factors and explicitly captures their correlations within and across superpixels. Extensive experiments on benchmark datasets demonstrate that our weakly supervised model significantly outperforms weakly supervised alternatives and is often comparable with existing strongly supervised models on a variety of tasks including semantic segmentation, automatic image annotation and retrieval based on object-attribute associations. [ABSTRACT FROM PUBLISHER]
Published: 2017
Full Text: View/download PDF

12. Identifying Rare and Subtle Behaviors: A Weakly Supervised Joint Topic Model.

Author: Hospedales, Timothy M., Li, Jian, Gong, Shaogang, and Xiang, Tao
Subjects: *SUPERVISED learning, *HIDDEN Markov models, *DATA modeling, *ELECTRONIC surveillance, *ALGORITHMS, *MACHINE learning
Abstract: One of the most interesting and desired capabilities for automated video behavior analysis is the identification of rarely occurring and subtle behaviors. This is of practical value because dangerous or illegal activities often have few or possibly only one prior example to learn from and are often subtle. Rare and subtle behavior learning is challenging for two reasons: 1) Contemporary modeling approaches require more data and supervision than may be available and 2) the most interesting and potentially critical rare behaviors are often visually subtle—occurring among more obvious typical behaviors or being defined by only small spatio-temporal deviations from typical behaviors. In this paper, we introduce a novel weakly supervised joint topic model which addresses these issues. Specifically, we introduce a multiclass topic model with partially shared latent structure and associated learning and inference algorithms. These contributions will permit modeling of behaviors from as few as one example, even without localization by the user and when occurring in clutter, and subsequent classification and localization of such behaviors online and in real time. We extensively validate our approach on two standard public-space data sets, where it clearly outperforms a batch of contemporary alternatives. [ABSTRACT FROM PUBLISHER]
Published: 2011
Full Text: View/download PDF

13. On Learning Semantic Representations for Large-Scale Abstract Sketches.

Author: Xu, Peng, Huang, Yongye, Yuan, Tongtong, Xiang, Tao, Hospedales, Timothy M., Song, Yi-Zhe, and Wang, Liang
Subjects: *VIDEO games, *SPEECH perception, *BINARY codes, *FEATURE extraction, *TASK analysis
Abstract: In this paper, we focus on learning semantic representations for large-scale highly abstract sketches that were produced by the practical sketch-based application rather than the excessively well dawn sketches obtained by crowd-sourcing. We propose a dual-branch CNN-RNN network architecture to represent sketches, which simultaneously encodes both the static and temporal patterns of sketch strokes. Based on this architecture, we further explore learning the sketch-oriented semantic representations in two practical settings, i.e., hashing retrieval and zero-shot recognition on million-scale highly abstract sketches produced by practical online interactions. Specifically, we use our dual-branch architecture as a universal representation framework to design two sketch-specific deep models: (i) We propose a deep hashing model for sketch retrieval, where a novel hashing loss is specifically designed to further accommodate both the abstract and messy traits of sketches. (ii) We propose a deep embedding model for sketch zero-shot recognition, via collecting a large-scale edge-map dataset and proposing to extract a set of semantic vectors from edge-maps as the semantic knowledge for sketch zero-shot domain alignment. Both deep models are evaluated by comprehensive experiments on million-scale abstract sketches produced by a global online game QuickDraw and outperform state-of-the-art competitors. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

14. Fine-Grained Instance-Level Sketch-Based Video Retrieval.

Author: Xu, Peng, Liu, Kun, Xiang, Tao, Hospedales, Timothy M., Ma, Zhanyu, Guo, Jun, and Song, Yi-Zhe
Subjects: *IMAGE retrieval, *VIDEOS, *MOTION detectors, *STREAMING video & television
Abstract: Existing sketch-analysis work studies sketches depicting static objects or scenes. In this work, we propose a novel cross-modal retrieval problem of fine-grained instance-level sketch-based video retrieval (FG-SBVR), where a sketch sequence is used as a query to retrieve a specific target video instance. Compared with sketch-based still image retrieval, and coarse-grained category-level video retrieval, this is more challenging as both visual appearance and motion need to be simultaneously matched at a fine-grained level. We contribute the first FG-SBVR dataset with rich annotations. We then introduce a novel multi-stream multi-modality deep network to perform FG-SBVR under both strong and weakly supervised settings. The key component of the network is a relation module, designed to prevent model overfitting given scarce training data. We show that this model significantly outperforms a number of existing state-of-the-art models designed for video analysis. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

15. Fine-Grained Instance-Level Sketch-Based Image Retrieval.

Author: Yu, Qian, Song, Jifei, Song, Yi-Zhe, Xiang, Tao, and Hospedales, Timothy M.
Subjects: *IMAGE retrieval, *DEEP learning, *MACHINE learning
Abstract: The problem of fine-grained sketch-based image retrieval (FG-SBIR) is defined and investigated in this paper. In FG-SBIR, free-hand human sketch images are used as queries to retrieve photo images containing the same object instances. It is thus a cross-domain (sketch to photo) instance-level retrieval task. It is an extremely challenging problem because (i) visual comparisons and matching need to be executed under large domain gap, i.e., from black and white line drawing sketches to colour photos; (ii) it requires to capture the fine-grained (dis)similarities of sketches and photo images while free-hand sketches drawn by different people present different levels of deformation and expressive interpretation; and (iii) annotated cross-domain fine-grained SBIR datasets are scarce, challenging many state-of-the-art machine learning techniques, particularly those based on deep learning. In this paper, for the first time, we address all these challenges, providing a step towards the capabilities that would underpin a commercial sketch-based object instance retrieval application. Specifically, a new large-scale FG-SBIR database is introduced which is carefully designed to reflect the real-world application scenarios. A deep cross-domain matching model is then formulated to solve the intrinsic drawing style variability, large domain gap issues, and capture instance-level discriminative features. It distinguishes itself by a carefully designed attention module. Extensive experiments on the new dataset demonstrate the effectiveness of the proposed model and validate the need for a rigorous definition of the FG-SBIR problem and collecting suitable datasets. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

16. Sketch-a-Segmenter: Sketch-Based Photo Segmenter Generation.

Author: Hu, Conghui, Li, Da, Yang, Yongxin, Hospedales, Timothy M., and Song, Yi-Zhe
Subjects: *IMAGE segmentation, *PHOTOGRAPHS
Abstract: Given pixel-level annotated data, traditional photo segmentation techniques have achieved promising results. However, these photo segmentation models can only identify objects in categories for which data annotation and training have been carried out. This limitation has inspired recent work on few-shot and zero-shot learning for image segmentation. In this article, we show the value of sketch for photo segmentation, in particular as a transferable representation to describe a concept to be segmented. We show, for the first time, that it is possible to generate a photo-segmentation model of a novel category using just a single sketch and furthermore exploit the unique fine-grained characteristics of sketch to produce more detailed segmentation. More specifically, we propose a sketch-based photo segmentation method that takes sketch as input and synthesizes the weights required for a neural network to segment the corresponding region of a given photo. Our framework can be applied at both the category-level and the instance-level, and fine-grained input sketches provide more accurate segmentation in the latter. This framework generalizes across categories via sketch and thus provides an alternative to zero-shot learning when segmenting a photo from a category without annotated training data. To investigate the instance-level relationship across sketch and photo, we create the SketchySeg dataset which contains segmentation annotations for photos corresponding to paired sketches in the Sketchy Dataset. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

17. Toward Deep Universal Sketch Perceptual Grouper.

Author: Li, Ke, Pang, Kaiyue, Song, Yi-Zhe, Xiang, Tao, Hospedales, Timothy M., and Zhang, Honggang
Subjects: *GROUPERS, *DRAWING, *IMAGE retrieval, *TASK analysis, *IMAGE segmentation
Abstract: Human free-hand sketches provide the useful data for studying human perceptual grouping, where the grouping principles such as the Gestalt laws of grouping are naturally in play during both the perception and sketching stages. In this paper, we make the first attempt to develop a universal sketch perceptual grouper. That is, a grouper that can be applied to sketches of any category created with any drawing style and ability, to group constituent strokes/segments into semantically meaningful object parts. The first obstacle to achieving this goal is the lack of large-scale datasets with grouping annotation. To overcome this, we contribute the largest sketch perceptual grouping dataset to date, consisting of 20 000 unique sketches evenly distributed over 25 object categories. Furthermore, we propose a novel deep perceptual grouping model learned with both generative and discriminative losses. The generative loss improves the generalization ability of the model, while the discriminative loss guarantees both local and global grouping consistency. Extensive experiments demonstrate that the proposed grouper significantly outperforms the state-of-the-art competitors. In addition, we show that our grouper is useful for a number of sketch analysis tasks, including sketch semantic segmentation, synthesis, and fine-grained sketch-based image retrieval. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

17 results on '"Hospedales, Timothy M."'

1. Discovery of Shared Semantic Spaces for Multiscene Video Query and Summarization.

2. When and where to transfer for Bayesian network parameter learning.

3. Robust Subjective Visual Property Prediction from Crowdsourced Pairwise Labels.

4. Transductive Multi-View Zero-Shot Learning.

5. Bayesian Joint Modelling for Object Localisation in Weakly Labelled Images.

6. Learning Multimodal Latent Attributes.

7. Finding Rare Classes: Active Learning with Generative and Discriminative Models.

8. Structure Inference for Bayesian Multisensory Scene Understanding.

9. Implications of Noise and Neural Heterogeneity for Vestibulo-Ocular Reflex Fidelity.

10. Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool.

11. Weakly-Supervised Image Annotation and Segmentation with Objects and Attributes.

12. Identifying Rare and Subtle Behaviors: A Weakly Supervised Joint Topic Model.

13. On Learning Semantic Representations for Large-Scale Abstract Sketches.

14. Fine-Grained Instance-Level Sketch-Based Video Retrieval.

15. Fine-Grained Instance-Level Sketch-Based Image Retrieval.

16. Sketch-a-Segmenter: Sketch-Based Photo Segmenter Generation.

17. Toward Deep Universal Sketch Perceptual Grouper.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

17 results on '"Hospedales, Timothy M."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources