Author: "Andrew Zisserman" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Andrew Zisserman"' showing total 1,018 results

Start Over Author "Andrew Zisserman"

1,018 results on '"Andrew Zisserman"'

1. Audio-visual modelling in a clinical setting

Author: Jianbo Jiao, Mohammad Alsharid, Lior Drukker, Aris T. Papageorghiou, Andrew Zisserman, and J. Alison Noble
Subjects: Medicine, Science
Abstract: Abstract Auditory and visual signals are two primary perception modalities that are usually present together and correlate with each other, not only in natural environments but also in clinical settings. However, audio-visual modelling in the latter case can be more challenging, due to the different sources of audio/video signals and the noise (both signal-level and semantic-level) in auditory signals—usually speech audio. In this study, we consider audio-visual modelling in a clinical setting, providing a solution to learn medical representations that benefit various clinical tasks, without relying on dense supervisory annotations from human experts for the model training. A simple yet effective multi-modal self-supervised learning framework is presented for this purpose. The proposed approach is able to help find standard anatomical planes, predict the focusing position of sonographer’s eyes, and localise anatomical regions of interest during ultrasound imaging. Experimental analysis on a large-scale clinical multi-modal ultrasound video dataset show that the proposed novel representation learning method provides good transferable anatomical representations that boost the performance of automated downstream clinical tasks, even outperforming fully-supervised solutions. Being able to learn such medical representations in a self-supervised manner will contribute to several aspects including a better understanding of obstetric imaging, training new sonographers, more effective assistive tools for human experts, and enhancement of the clinical workflow.
Published: 2024
Full Text: View/download PDF

2. Automated detection, labelling and radiological grading of clinical spinal MRIs

Author: Rhydian Windsor, Amir Jamaludin, Timor Kadir, and Andrew Zisserman
Subjects: Medicine, Science
Abstract: Abstract Spinal magnetic resonance (MR) scans are a vital tool for diagnosing the cause of back pain for many diseases and conditions. However, interpreting clinically useful information from these scans can be challenging, time-consuming and hard to reproduce across different radiologists. In this paper, we alleviate these problems by introducing a multi-stage automated pipeline for analysing spinal MR scans. This pipeline first detects and labels vertebral bodies across several commonly used sequences (e.g. T1w, T2w and STIR) and fields of view (e.g. lumbar, cervical, whole spine). Using these detections it then performs automated diagnosis for several spinal disorders, including intervertebral disc degenerative changes in T1w and T2w lumbar scans, and spinal metastases, cord compression and vertebral fractures. To achieve this, we propose a new method of vertebrae detection and labelling, using vector fields to group together detected vertebral landmarks and a language-modelling inspired beam search to determine the corresponding levels of the detections. We also employ a new transformer-based architecture to perform radiological grading which incorporates context from multiple vertebrae and sequences, as a real radiologist would. The performance of each stage of the pipeline is tested in isolation on several clinical datasets, each consisting of 66 to 421 scans. The outputs are compared to manual annotations of expert radiologists, demonstrating accurate vertebrae detection across a range of scan parameters. Similarly, the model’s grading predictions for various types of disc degeneration and detection of spinal metastases closely match those of an expert radiologist. To aid future research, our code and trained models are made publicly available.
Published: 2024
Full Text: View/download PDF

3. AutoAD III: The Prequel - Back to the Pixels.

Author: Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

4. Separating the 'Chirp' from the 'Chat': Self-supervised Visual Grounding of Sound and Language.

Author: Mark Hamilton, Andrew Zisserman, John R. Hershey, and William T. Freeman
Published: 2024
Full Text: View/download PDF

5. The Manga Whisperer: Automatically Generating Transcriptions for Comics.

Author: Ragav Sachdeva and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

6. Learning from One Continuous Video Stream.

Author: João Carreira 0001, Michael King, Viorica Patraucean, Dilara Gokay, Catalin Ionescu, Yi Yang 0007, Daniel Zoran, Joseph Heyward, Carl Doersch, Yusuf Aytar, Dima Damen, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

7. A Simple Recipe for Contrastively Pre-Training Video-First Encoders Beyond 16 Frames.

Author: Pinelopi Papalampidi, Skanda Koppula, Shreya Pathak, Justin Chiu, Joe Heyward, Viorica Patraucean, Jiajun Shen, Antoine Miech, Andrew Zisserman, and Aida Nematzadeh
Published: 2024
Full Text: View/download PDF

8. Amodal Ground Truth and Completion in the Wild.

Author: Guanqi Zhan, Chuanxia Zheng, Weidi Xie, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

9. TIM: A Time Interval Machine for Audio-Visual Action Recognition.

Author: Jacob Chalk, Jaesung Huh, Evangelos Kazakos, Andrew Zisserman, and Dima Damen
Published: 2024
Full Text: View/download PDF

10. Appearance-Based Refinement for Object-Centric Motion Segmentation.

Author: Junyu Xie, Weidi Xie, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

11. 3D Spine Shape Estimation from Single 2D DXA.

Author: Emmanuelle Bourigault, Amir Jamaludin, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

12. Automated Spinal MRI Labelling from Reports Using a Large Language Model.

Author: Robin Y. Park, Rhydian Windsor, Amir Jamaludin, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

13. Voicevector: Multimodal Enrolment Vectors for Speaker Separation.

Author: Akam Rahimi, Triantafyllos Afouras, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

14. A Sound Approach: Using Large Language Models to Generate Audio Descriptions for Egocentric Text-Audio Retrieval.

Author: Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, Samuel Albanie, and A. Sophia Koepke
Published: 2024
Full Text: View/download PDF

15. Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling.

Author: Bruno Korbar, Jaesung Huh, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

16. Synchformer: Efficient Synchronization From Sparse Cues.

Author: Vladimir Iashin, Weidi Xie, Esa Rahtu, and Andrew Zisserman
Published: 2024
Full Text: View/download PDF

17. Diagnostically relevant facial gestalt information from ordinary photos

Author: Quentin Ferry, Julia Steinberg, Caleb Webber, David R FitzPatrick, Chris P Ponting, Andrew Zisserman, and Christoffer Nellåker
Subjects: phenotyping, computer vision, clinical genetics, computational biology, Medicine, Science, Biology (General), QH301-705.5
Abstract: Craniofacial characteristics are highly informative for clinical geneticists when diagnosing genetic diseases. As a first step towards the high-throughput diagnosis of ultra-rare developmental diseases we introduce an automatic approach that implements recent developments in computer vision. This algorithm extracts phenotypic information from ordinary non-clinical photographs and, using machine learning, models human facial dysmorphisms in a multidimensional 'Clinical Face Phenotype Space'. The space locates patients in the context of known syndromes and thereby facilitates the generation of diagnostic hypotheses. Consequently, the approach will aid clinicians by greatly narrowing (by 27.6-fold) the search space of potential diagnoses for patients with suspected developmental disorders. Furthermore, this Clinical Face Phenotype Space allows the clustering of patients by phenotype even when no known syndrome diagnosis exists, thereby aiding disease identification. We demonstrate that this approach provides a novel method for inferring causative genetic variants from clinical sequencing data through functional genetic pathway comparisons.
Published: 2014
Full Text: View/download PDF

18. Overcoming Registration Uncertainty in Image Super-Resolution: Maximize or Marginalize?

Author: Andrew Zisserman, Stephen J. Roberts, David P. Capel, and Lyndsey C. Pickup
Subjects: Telecommunication, TK5101-6720, Electronics, TK7800-8360
Abstract: In multiple-image super-resolution, a high-resolution image is estimated from a number of lower-resolution images. This usually involves computing the parameters of a generative imaging model (such as geometric and photometric registration, and blur) and obtaining a MAP estimate by minimizing a cost function including an appropriate prior. Two alternative approaches are examined. First, both registrations and the super-resolution image are found simultaneously using a joint MAP optimization. Second, we perform Bayesian integration over the unknown image registration parameters, deriving a cost function whose only variables of interest are the pixel values of the super-resolution image. We also introduce a scheme to learn the parameters of the image prior as part of the super-resolution algorithm. We show examples on a number of real sequences including multiple stills, digital video, and DVDs of movies.
Published: 2007
Full Text: View/download PDF

19. TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.

Author: Carl Doersch, Yi Yang 0007, Mel Vecerík, Dilara Gokay, Ankush Gupta 0001, Yusuf Aytar, João Carreira 0001, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

20. AutoAD II: The Sequel - Who, When, and What in Movie Audio Description.

Author: Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

21. Helping Hands: An Object-Aware Ego-Centric Video Recognition Model.

Author: Chuhan Zhang, Ankush Gupta 0001, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

22. The Making and Breaking of Camouflage.

Author: Hala Lamdouar, Weidi Xie, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

23. Verbs in Action: Improving verb understanding in video-language models.

Author: Liliane Momeni, Mathilde Caron, Arsha Nagrani, Andrew Zisserman, and Cordelia Schmid
Published: 2023
Full Text: View/download PDF

24. GestSync: Determining who is speaking without a talking head.

Author: Sindhu B. Hegde and Andrew Zisserman
Published: 2023

25. The Change You Want to See (Now in 3D).

Author: Ragav Sachdeva and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

26. WhisperX: Time-Accurate Speech Transcription of Long-Form Audio.

Author: Max Bain, Jaesung Huh, Tengda Han, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

27. AutoAD: Movie Description in Context.

Author: Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

28. A Light Touch Approach to Teaching Transformers Multi-view Geometry.

Author: Yash Bhalgat, João F. Henriques, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

29. Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime.

Author: Rhydian Windsor, Amir Jamaludin, Timor Kadir, and Andrew Zisserman
Published: 2023

30. Deep Facial Phenotyping with Mixup Augmentation.

Author: Jonathan Campbell, Mitchell Dawson, Andrew Zisserman, Weidi Xie, and Christoffer Nellåker
Published: 2023
Full Text: View/download PDF

31. Epic-Sounds: A Large-Scale Dataset of Actions that Sound.

Author: Jaesung Huh, Jacob Chalk, Evangelos Kazakos, Dima Damen, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

32. 3D Shape Analysis of Scoliosis.

Author: Emmanuelle Bourigault, Amir Jamaludin, Emma Clark, Jeremy Fairbank, Timor Kadir, and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

33. Multi-Modal Classifiers for Open-Vocabulary Object Detection.

Author: Prannay Kaul, Weidi Xie, and Andrew Zisserman
Published: 2023

34. The Change You Want to See.

Author: Ragav Sachdeva and Andrew Zisserman
Published: 2023
Full Text: View/download PDF

35. It's About Time: Analog Clock Reading in the Wild.

Author: Charig Yang, Weidi Xie, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

36. Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation.

Author: Akam Rahimi, Triantafyllos Afouras, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

37. Generalized Category Discovery.

Author: Sagar Vaze, Kai Han 0001, Andrea Vedaldi, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

38. Sub-word Level Lip Reading With Visual Attention.

Author: K. R. Prajwal, Triantafyllos Afouras, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

39. Input-level Inductive Biases for 3D Reconstruction.

Author: Wang Yifan 0011, Carl Doersch, Relja Arandjelovic, João Carreira 0001, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

40. Label, Verify, Correct: A Simple Few Shot Object Detection Method.

Author: Prannay Kaul, Weidi Xie, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

41. Temporal Alignment Networks for Long-term Video.

Author: Tengda Han, Weidi Xie, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

42. Compressed Vision for Efficient Video Understanding.

Author: Olivia Wiles, João Carreira 0001, Iain Barr, Andrew Zisserman, and Mateusz Malinowski
Published: 2022
Full Text: View/download PDF

43. Is an Object-Centric Video Representation Beneficial for Transfer?

Author: Chuhan Zhang, Ankush Gupta 0001, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

44. Context-Aware Transformers for Spinal Cancer Detection and Radiological Grading.

Author: Rhydian Windsor, Amir Jamaludin, Timor Kadir, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

45. Automatic Dense Annotation of Large-Vocabulary Sign Language Videos.

Author: Liliane Momeni, Hannah Bull, K. R. Prajwal, Samuel Albanie, Gül Varol, and Andrew Zisserman
Published: 2022
Full Text: View/download PDF

46. Object Discovery and Representation Networks.

Author: Olivier J. Hénaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira 0001, and Relja Arandjelovic
Published: 2022
Full Text: View/download PDF

47. No Representation Rules Them All in Category Discovery.

Author: Sagar Vaze, Andrea Vedaldi, and Andrew Zisserman
Published: 2023

48. Perception Test: A Diagnostic Benchmark for Multimodal Video Models.

Author: Viorica Patraucean, Lucas Smaira, Ankush Gupta 0001, Adrià Recasens, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang 0007, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alexandre Fréchette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, and João Carreira 0001
Published: 2023

49. Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion.

Author: Yash Bhalgat, Iro Laina, João F. Henriques, Andrea Vedaldi, and Andrew Zisserman
Published: 2023

50. Self-supervised Video Object Segmentation by Motion Grouping.

Author: Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, and Weidi Xie
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Database

Publisher

1,018 results on '"Andrew Zisserman"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources