Author: "Johannes Totz" / Topic: computer vision - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Johannes Totz"' showing total 6 results

Start Over Author "Johannes Totz" Topic computer vision

6 results on '"Johannes Totz"'

1. Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

Author: Zehan Wang, Andrew Peter Aitken, Jose Caballero, Johannes Totz, Wenzhe Shi, Alejandro Acosta, and Christian Ledig
Subjects: FOS: Computer and information sciences, Image fusion, Motion compensation, Artificial neural network, Computer science, business.industry, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020206 networking & telecommunications, 02 engineering and technology, Iterative reconstruction, Convolutional neural network, Convolution, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, business, Image resolution
Abstract: Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency., Comment: Changes: * Uploaded Vid4 results (footnote 1). * Added references [14, 29] as spatial-transformer prior art. * Fixed typos
Published: 2017

2. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network

Author: Andrew Peter Aitken, Rob Bishop, Johannes Totz, Zehan Wang, Wenzhe Shi, Daniel Rueckert, Jose Caballero, and Ferenc Huszar
Subjects: FOS: Computer and information sciences, Computer science, Computer Vision and Pattern Recognition (cs.CV), Feature extraction, Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Machine Learning (stat.ML), 02 engineering and technology, Iterative reconstruction, Convolutional neural network, Convolution, Statistics - Machine Learning, 0202 electrical engineering, electronic engineering, information engineering, Computer vision, Image resolution, Pixel, Artificial neural network, business.industry, 020207 software engineering, Filter (signal processing), Feature (computer vision), Bicubic interpolation, 020201 artificial intelligence & image processing, Artificial intelligence, business, Interpolation
Abstract: Recently, several models based on deep neural networks have achieved great success in terms of both reconstruction accuracy and computational performance for single image super-resolution. In these methods, the low resolution (LR) input image is upscaled to the high resolution (HR) space using a single filter, commonly bicubic interpolation, before reconstruction. This means that the super-resolution (SR) operation is performed in HR space. We demonstrate that this is sub-optimal and adds computational complexity. In this paper, we present the first convolutional neural network (CNN) capable of real-time SR of 1080p videos on a single K2 GPU. To achieve this, we propose a novel CNN architecture where the feature maps are extracted in the LR space. In addition, we introduce an efficient sub-pixel convolution layer which learns an array of upscaling filters to upscale the final LR feature maps into the HR output. By doing so, we effectively replace the handcrafted bicubic filter in the SR pipeline with more complex upscaling filters specifically trained for each feature map, whilst also reducing the computational complexity of the overall SR operation. We evaluate the proposed approach using images and videos from publicly available datasets and show that it performs significantly better (+0.15dB on Images and +0.39dB on Videos) and is an order of magnitude faster than previous CNN-based methods., CVPR 2016 paper with updated affiliations and supplemental material, fixed typo in equation 4
Published: 2016

3. Accuracy validation of an image guided laparoscopy system for liver resection

Author: Stephen A. Thompson, Kurinchi Selvan Gurusamy, Crispin Schneider, Danail Stoyanov, Johannes Totz, Yi Song, Stian Flage Johnsen, Matthew J. Clarkson, Brian R. Davidson, Sebastien Ourselin, and David J. Hawkes
Subjects: Liver surgery, medicine.diagnostic_test, Computer science, business.industry, Iterative closest point, Imaging phantom, Resection, Image (mathematics), Image-guided surgery, medicine, Computer vision, Artificial intelligence, Laparoscopy, business, Surface reconstruction
Abstract: We present an analysis of the registration component of a proposed image guidance system for image guided liver surgery, using contrast enhanced CT. The analysis is performed on a visually realistic liver phantom and in-vivo porcine data. A robust registration process that can be deployed clinically is a key component of any image guided surgery system. It is also essential that the accuracy of the registration can be quantified and communicated to the surgeon. We summarise the proposed guidance system and discuss its clinical feasibility. The registration combines an intuitive manual alignment stage, surface reconstruction from a tracked stereo laparoscope and a rigid iterative closest point registration to register the intra-operative liver surface to the liver surface derived from CT. Testing of the system on a liver phantom shows that subsurface landmarks can be localised to an accuracy of 2.9 mm RMS. Testing during five porcine liver surgeries demonstrated that registration can be performed during surgery, with an error of less than 10 mm RMS for multiple surface landmarks.
Published: 2015

4. Visual Search Behaviour and Analysis of Augmented Visualisation for Minimally Invasive Surgery

Author: Johannes Totz, Guang-Zhong Yang, and Kenko Fujii
Subjects: Visual search, Computer science, business.industry, Region of interest, Orientation (computer vision), Trajectory, Eye movement, Eye tracking, Computer vision, Artificial intelligence, business, Sensory cue, Visualization
Abstract: Disorientation has been one of the key issues hampering natural orifice translumenal endoscopic surgery (NOTES) adoption. A new Dynamic View Expansion (DVE) technique was recently introduced as a method to increase the field-of-view, as well as to provide temporal visual cues to encode the camera motion trajectory. This paper presents a systematic analysis of visual search behaviour during the use of DVE for NOTES navigation. The study compares spatial orientation and latency with and without the use of the new DVE technique with motion trajectory encoding. Eye tracking data was recorded and modelled using Markov chains to characterise the visual search behaviour, where a new region of interest (ROI) definition was used to determine the states in the transition graphs. Resultant state transition graphs formed from the participants' eye movements showed a marked difference in visual search behaviour with increased cross-referencing between grey and less grey regions. The results demonstrate the advantages of using motion trajectory encoding for DVE.
Published: 2012

5. Enhanced visualisation for minimally invasive surgery

Author: Guang-Zhong Yang, Johannes Totz, Kenko Fujii, and Peter Mountney
Subjects: Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Biomedical Engineering, Video Recording, Health Informatics, Field of view, Imaging, Three-Dimensional, Image Interpretation, Computer-Assisted, Specular highlight, Humans, Radiology, Nuclear Medicine and imaging, Computer vision, Spatial contextual awareness, business.industry, Orientation (computer vision), Phantoms, Imaging, Reproducibility of Results, Endoscopy, General Medicine, Computer Graphics and Computer-Aided Design, Computer Science Applications, Visualization, Surgery, Computer-Assisted, Feature (computer vision), Peripheral vision, Surgery, Computer Vision and Pattern Recognition, Artificial intelligence, business, Algorithms, Texture synthesis
Abstract: Endoscopes used in minimally invasive surgery provide a limited field of view, thus requiring a high degree of spatial awareness and orientation. Attempts at expanding this small, restricted view with previously observed imagery have been made by researchers and is generally known as image mosaicing or dynamic view expansion. For minimally invasive endoscopy, SLAM-based methods have been shown to have potential values but have yet to address effective visualisation techniques.The live endoscopic video feed is expanded with previously observed footage. To this end, a method that highlights the difference between actual camera image and historic data observed earlier is proposed. Old video data is faded out to grey scale to mimic human peripheral vision. Specular highlights are removed with the help of texture synthesis to avoid distracting visual cues. The method is further evaluated on in vivo and phantom sequences by a detailed user study to examine the ability of the user in discerning temporal motion trajectories while visualising the expanded field of view, a feature that is of practical value for enhancing spatial awareness and orientation.The difference between historic data and live video is integrated effectively. The use of a single texture domain generated by planar parameterisation is demonstrated for view expansion. Specular highlights can be removed through texture synthesis without introducing noticeable artefacts. The implicit encoding of motion trajectory of the endoscopic camera visualised by the proposed method facilitates both global awareness and temporal evolution of the scene.Dynamic view expansion provides more context for navigation and orientation by establishing reference points beyond the camera's field of view. Effective integration of visual cues is paramount for concise visualisation.
Published: 2011

6. Dense Surface Reconstruction for Enhanced Navigation in MIS

Author: Guang-Zhong Yang, Johannes Totz, Danail Stoyanov, and Peter Mountney
Subjects: Ground truth, Computer science, business.industry, Invasive surgery, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Computer vision, Artificial intelligence, business, Surface reconstruction, ComputingMethodologies_COMPUTERGRAPHICS, Rendering (computer graphics)
Abstract: Recent introduction of dynamic view expansion has led to the development of computer vision methods for minimally invasive surgery to artificially expand the intra-operative field-of-view of the laparoscope. This provides improved awareness of the surrounding anatomical structures and minimises the effect of disorientation during surgical navigation. It permits the augmentation of live laparoscope images with information from previously captured views. Current approaches, however, can only represent the tissue geometry as planar surfaces or sparse 3D models, thus introducing noticeable visual artefacts in the final rendering results. This paper proposes a high-fidelity tissue geometry mapping by combining a sparse SLAM map with semi-dense surface reconstruction. The method is validated on phantom data with known ground truth, as well as in-vivo data captured during a robotic assisted MIS procedure. The derived results have shown that the method is able to effectively increase the coverage of the expanded surgical view without compromising mapping accuracy.
Published: 2011

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"Johannes Totz"'

1. Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

2. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network

3. Accuracy validation of an image guided laparoscopy system for liver resection

4. Visual Search Behaviour and Analysis of Augmented Visualisation for Minimally Invasive Surgery

5. Enhanced visualisation for minimally invasive surgery

6. Dense Surface Reconstruction for Enhanced Navigation in MIS

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

6 results on '"Johannes Totz"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources