Author: "Zanca, A." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zanca, A."' showing total 4,157 results

Start Over Author "Zanca, A."

4,157 results on '"Zanca, A."'

1. Caption-Driven Explorations: Aligning Image and Text Embeddings through Human-Inspired Foveated Vision

Author: Zanca, Dario, Zugarini, Andrea, Dietz, Simon, Altstidl, Thomas R., Ndjeuha, Mark A. Turban, Schwinn, Leo, and Eskofier, Bjoern
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Understanding human attention is crucial for vision science and AI. While many models exist for free-viewing, less is known about task-driven image exploration. To address this, we introduce CapMIT1003, a dataset with captions and click-contingent image explorations, to study human attention during the captioning task. We also present NevaClip, a zero-shot method for predicting visual scanpaths by combining CLIP models with NeVA algorithms. NevaClip generates fixations to align the representations of foveated visual stimuli and captions. The simulated scanpaths outperform existing human attention models in plausibility for captioning and free-viewing tasks. This research enhances the understanding of human attention and advances scanpath prediction models., Comment: arXiv admin note: substantial text overlap with arXiv:2305.12380
Published: 2024

2. Human-inspired Explanations for Vision Transformers and Convolutional Neural Networks

Author: Panda, Mahadev Prasad, Tiezzi, Matteo, Vilas, Martina, Roig, Gemma, Eskofier, Bjoern M., and Zanca, Dario
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: We introduce Foveation-based Explanations (FovEx), a novel human-inspired visual explainability (XAI) method for Deep Neural Networks. Our method achieves state-of-the-art performance on both transformer (on 4 out of 5 metrics) and convolutional models (on 3 out of 5 metrics), demonstrating its versatility. Furthermore, we show the alignment between the explanation map produced by FovEx and human gaze patterns (+14\% in NSS compared to RISE, +203\% in NSS compared to gradCAM), enhancing our confidence in FovEx's ability to close the interpretation gap between humans and machines., Comment: Accepted at the Human-inspired Computer Vision (HCV) ECCV 2024 Workshop as an extended abstract. A long version of the work can be found at arXiv:2408.02123v1
Published: 2024

3. How Intermodal Interaction Affects the Performance of Deep Multimodal Fusion for Mixed-Type Time Series

Author: Dietz, Simon, Altstidl, Thomas, Zanca, Dario, Eskofier, Björn, and Nguyen, An
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Mixed-type time series (MTTS) is a bimodal data type that is common in many domains, such as healthcare, finance, environmental monitoring, and social media. It consists of regularly sampled continuous time series and irregularly sampled categorical event sequences. The integration of both modalities through multimodal fusion is a promising approach for processing MTTS. However, the question of how to effectively fuse both modalities remains open. In this paper, we present a comprehensive evaluation of several deep multimodal fusion approaches for MTTS forecasting. Our comparison includes three fusion types (early, intermediate, and late) and five fusion methods (concatenation, weighted mean, weighted mean with correlation, gating, and feature sharing). We evaluate these fusion approaches on three distinct datasets, one of which was generated using a novel framework. This framework allows for the control of key data properties, such as the strength and direction of intermodal interactions, modality imbalance, and the degree of randomness in each modality, providing a more controlled environment for testing fusion approaches. Our findings show that the performance of different fusion approaches can be substantially influenced by the direction and strength of intermodal interactions. The study reveals that early and intermediate fusion approaches excel at capturing fine-grained and coarse-grained cross-modal features, respectively. These findings underscore the crucial role of intermodal interactions in determining the most effective fusion strategy for MTTS forecasting.
Published: 2024

4. Learning to direct one’s care: barriers and facilitators reported by people with tetraplegia

Author: Zanca, Jeanne M., Gibson-Gill, Carol M., Williams, Joyce, McMillion, Tomicka, Morris, John T., and Dijkers, Marcel P.
Published: 2024
Full Text: View/download PDF

5. Trends, Applications, and Challenges in Human Attention Modelling

Author: Cartella, Giuseppe, Cornia, Marcella, Cuculo, Vittorio, D'Amelio, Alessandro, Zanca, Dario, Boccignone, Giuseppe, and Cucchiara, Rita
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Human attention modelling has proven, in recent years, to be particularly useful not only for understanding the cognitive processes underlying visual exploration, but also for providing support to artificial intelligence models that aim to solve problems in various domains, including image and video processing, vision-and-language applications, and language modelling. This survey offers a reasoned overview of recent efforts to integrate human attention mechanisms into contemporary deep learning models and discusses future research directions and challenges. For a comprehensive overview on the ongoing research refer to our dedicated repository available at https://github.com/aimagelab/awesome-human-visual-attention., Comment: Accepted at IJCAI 2024 Survey Track
Published: 2024

6. Validation of a multi-parameter algorithm for personalized contrast injection protocol in liver CT

Author: Hugues G. Brat, Benoit Dufour, Natalie Heracleous, Pauline Sastre, Cyril Thouly, Benoit Rizk, and Federica Zanca
Subjects: Abdomen, Body composition, Contrast media, Liver, Multidetector computed tomography, Medical physics. Medical radiology. Nuclear medicine, R895-920
Abstract: Abstract Background In liver computed tomography (CT), tailoring the contrast injection to the patient’s specific characteristics is relevant for optimal imaging and patient safety. We evaluated a novel algorithm engineered for personalized contrast injection to achieve reproducible liver enhancement centered on 50 HU. Methods From September 2020 to August 31, 2022, CT data from consecutive adult patients were prospectively collected at our multicenter premises. Inclusion criteria consisted of an abdominal CT referral for cancer staging or follow-up. For all examinations, a web interface incorporating data from the radiology information system (patient details and examination information) and radiographer-inputted data (patient fat-free mass, imaging center, kVp, contrast agent details, and imaging phase) were used. Calculated contrast volume and injection rate were manually entered into the CT console controlling the injector. Iopamidol 370 mgI/mL or Iohexol 350 mgI/mL were used, and kVp varied (80, 100, or 120) based on patient habitus. Results We enrolled 384 patients (mean age 61.2 years, range 21.1–94.5). The amount of administered iodine dose (gI) was not significantly different across contrast agents (p = 0.700), while a significant increase in iodine dose was observed with increasing kVp (p
Published: 2024
Full Text: View/download PDF

7. Validation of a multi-parameter algorithm for personalized contrast injection protocol in liver CT

Author: Brat, Hugues G., Dufour, Benoit, Heracleous, Natalie, Sastre, Pauline, Thouly, Cyril, Rizk, Benoit, and Zanca, Federica
Published: 2024
Full Text: View/download PDF

8. Impact of deep learning image reconstruction on volumetric accuracy and image quality of pulmonary nodules with different morphologies in low-dose CT

Author: D’hondt, L., Franck, C., Kellens, P-J., Zanca, F., Buytaert, D., Van Hoyweghen, A., Addouli, H. El, Carpentier, K., Niekel, M., Spinhoven, M., Bacher, K., and Snoeckx, A.
Published: 2024
Full Text: View/download PDF

9. From Patches to Objects: Exploiting Spatial Reasoning for Better Visual Representations

Author: Albert, Toni, Eskofier, Bjoern, and Zanca, Dario
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: As the field of deep learning steadily transitions from the realm of academic research to practical application, the significance of self-supervised pretraining methods has become increasingly prominent. These methods, particularly in the image domain, offer a compelling strategy to effectively utilize the abundance of unlabeled image data, thereby enhancing downstream tasks' performance. In this paper, we propose a novel auxiliary pretraining method that is based on spatial reasoning. Our proposed method takes advantage of a more flexible formulation of contrastive learning by introducing spatial reasoning as an auxiliary task for discriminative self-supervised methods. Spatial Reasoning works by having the network predict the relative distances between sampled non-overlapping patches. We argue that this forces the network to learn more detailed and intricate internal representations of the objects and the relationships between their constituting parts. Our experiments demonstrate substantial improvement in downstream performance in linear evaluation compared to similar work and provide directions for further research into spatial reasoning.
Published: 2023

10. Contrastive Language-Image Pretrained Models are Zero-Shot Human Scanpath Predictors

Author: Zanca, Dario, Zugarini, Andrea, Dietz, Simon, Altstidl, Thomas R., Ndjeuha, Mark A. Turban, Schwinn, Leo, and Eskofier, Bjoern
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Understanding the mechanisms underlying human attention is a fundamental challenge for both vision science and artificial intelligence. While numerous computational models of free-viewing have been proposed, less is known about the mechanisms underlying task-driven image exploration. To address this gap, we present CapMIT1003, a database of captions and click-contingent image explorations collected during captioning tasks. CapMIT1003 is based on the same stimuli from the well-known MIT1003 benchmark, for which eye-tracking data under free-viewing conditions is available, which offers a promising opportunity to concurrently study human attention under both tasks. We make this dataset publicly available to facilitate future research in this field. In addition, we introduce NevaClip, a novel zero-shot method for predicting visual scanpaths that combines contrastive language-image pretrained (CLIP) models with biologically-inspired neural visual attention (NeVA) algorithms. NevaClip simulates human scanpaths by aligning the representation of the foveated visual stimulus and the representation of the associated caption, employing gradient-driven visual exploration to generate scanpaths. Our experimental results demonstrate that NevaClip outperforms existing unsupervised computational models of human visual attention in terms of scanpath plausibility, for both captioning and free-viewing tasks. Furthermore, we show that conditioning NevaClip with incorrect or misleading captions leads to random behavior, highlighting the significant impact of caption guidance in the decision-making process. These findings contribute to a better understanding of mechanisms that guide human attention and pave the way for more sophisticated computational approaches to scanpath prediction that can integrate direct top-down guidance of downstream tasks.
Published: 2023

11. FastAMI -- a Monte Carlo Approach to the Adjustment for Chance in Clustering Comparison Metrics

Author: Klede, Kai, Schwinn, Leo, Zanca, Dario, and Eskofier, Björn
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning, I.2.0
Abstract: Clustering is at the very core of machine learning, and its applications proliferate with the increasing availability of data. However, as datasets grow, comparing clusterings with an adjustment for chance becomes computationally difficult, preventing unbiased ground-truth comparisons and solution selection. We propose FastAMI, a Monte Carlo-based method to efficiently approximate the Adjusted Mutual Information (AMI) and extend it to the Standardized Mutual Information (SMI). The approach is compared with the exact calculation and a recently developed variant of the AMI based on pairwise permutations, using both synthetic and real data. In contrast to the exact calculation our method is fast enough to enable these adjusted information-theoretic comparisons for large datasets while maintaining considerably more accurate results than the pairwise approach., Comment: Accepted at AAAI 2023
Published: 2023
Full Text: View/download PDF

12. Data-Centric Benchmarking of Neural Network Architectures for the Univariate Time Series Forecasting Task

Author: Philipp Schlieper, Mischa Dombrowski, An Nguyen, Dario Zanca, and Bjoern Eskofier
Subjects: deep learning, time series, neural networks, model selection, data synthesis, univariate forecasting, Science (General), Q1-390, Mathematics, QA1-939
Abstract: Time series forecasting has witnessed a rapid proliferation of novel neural network approaches in recent times. However, performances in terms of benchmarking results are generally not consistent, and it is complicated to determine in which cases one approach fits better than another. Therefore, we propose adopting a data-centric perspective for benchmarking neural network architectures on time series forecasting by generating ad hoc synthetic datasets. In particular, we combine sinusoidal functions to synthesize univariate time series data for multi-input-multi-output prediction tasks. We compare the most popular architectures for time series, namely long short-term memory (LSTM) networks, convolutional neural networks (CNNs), and transformers, and directly connect their performance with different controlled data characteristics, such as the sequence length, noise and frequency, and delay length. Our findings suggest that transformers are the best architecture for dealing with different delay lengths. In contrast, for different noise and frequency levels and different sequence lengths, LSTM is the best-performing architecture by a significant amount. Based on our insights, we derive recommendations which allow machine learning (ML) practitioners to decide which architecture to apply, given the dataset’s characteristics.
Published: 2024
Full Text: View/download PDF

13. New data on spiders (Arachnida, Araneae) from the islands of the Strait of Sicily (Southern Italy) with taxonomic notes on Poecilochroa loricata Kritscher, 1996 (Araneae, Gnaphosidae) and eight new records for Europe

Author: G. Nicolosi, P. Pantini, U. Devincenzo, L. A. Guariento, V. Italiano, L. Zanca, M. Sarà, and M. Isaia
Subjects: Spider survey, Pelagie islands, Pantelleria, checklist, species-area relationship, SAR, Zoology, QL1-991
Abstract: We present a checklist of spiders inhabiting the Pelagie archipelago and Pantelleria island, in the Strait of Sicily (Southern Italy). Data were compiled from both literature sources and unpublished materials stored in museum collections. In total, we report new data on 100 species, bringing the total number of species documented for the islands of the Strait of Sicily to 148. Among these, 8 are new for Europe and 9 new for Italy. The island of Lampedusa hosts the highest number of species (107), followed by Pantelleria (63) and Linosa (25). The most represented families are Gnaphosidae in Lampedusa and Linosa, while Salticidae is dominant in Pantelleria. Most of the species present on the islands of the Strait of Sicily have a Holarctic distribution. Several rare species are recorded, including Haplodrassus crassipes (Lucas, 1846), Palliduphantes labilis (Simon, 1913) and Xysticus promiscuus O. Pickard-Cambridge, 1876. In addition, the female of Poecilochroa loricata Kritscher, 1996 (Gnaphosidae) is here described and illustrated for the first time. As revealed by the study of the species-area relationship, the islands of the Strait of Sicily host a higher number of species compared to other small islands in Italy. Accordingly, species richness recorded in Lampedusa and Linosa is higher than expected, while further investigations are suggested to increase knowledge of the local spider diversity.
Published: 2024
Full Text: View/download PDF

14. Bioluminescent Genetically Encoded Glutamate Indicators for Molecular Imaging of Neuronal Activity.

Author: Petersen, Eric, Lapan, Alexandra, Castellanos Franco, E, Fillion, Adam, Crespo, Emmanuel, Lambert, Gerard, Grady, Connor, Zanca, Albertina, Orcutt, Richard, Hochgeschwender, Ute, Shaner, Nathan, and Gilad, Assaf
Subjects: bioluminescence, bioluminescent, neuroimaging, neurotransmitter, optical sensors, optogenetics, Humans, Glutamic Acid, Biosensing Techniques, Brain, Neurotransmitter Agents, Molecular Imaging
Abstract: Genetically encoded optical sensors and advancements in microscopy instrumentation and techniques have revolutionized the scientific toolbox available for probing complex biological processes such as release of specific neurotransmitters. Most genetically encoded optical sensors currently used are based on fluorescence and have been highly successful tools for single-cell imaging in superficial brain regions. However, there remains a need to develop new tools for reporting neuronal activity in vivo within deeper structures without the need for hardware such as lenses or fibers to be implanted within the brain. Our approach to this problem is to replace the fluorescent elements of the existing biosensors with bioluminescent elements. This eliminates the need of external light sources to illuminate the sensor, thus allowing deeper brain regions to be imaged noninvasively. Here, we report the development of the first genetically encoded neurotransmitter indicators based on bioluminescent light emission. These probes were optimized by high-throughput screening of linker libraries. The selected probes exhibit robust changes in light output in response to the extracellular presence of the excitatory neurotransmitter glutamate. We expect this new approach to neurotransmitter indicator design to enable the engineering of specific bioluminescent probes for multiple additional neurotransmitters in the future, ultimately allowing neuroscientists to monitor activity associated with a specific neurotransmitter as it relates to behavior in a variety of neuronal and psychiatric disorders, among many other applications.
Published: 2023

15. Automated long-term monitoring of stereotypical movement in polar bears under human care using machine learning

Author: Matthias Zuerl, Philip Stoll, Ingrid Brehm, Jonas Sueskind, René Raab, Jan Petermann, Dario Zanca, Ralph Simon, Lorenzo von Fersen, and Bjoern Eskofier
Subjects: Animal welfare, Animal tracking, Behaviour classification, Deep learning, Computer vision, Coping, Information technology, T58.5-58.64, Ecology, QH540-549.5
Abstract: The welfare of animals under human care is often assessed by observing behaviours indicative of stress or discomfort, such as stereotypical behaviour (SB), which often shows as repetitive, invariant pacing. Traditional behaviour monitoring methods, however, are labour-intensive and subject to observer bias. Our study presents an innovative automated approach utilising computer vision and machine learning to non-invasively detect and analyse SB in managed populations, exemplified by a longitudinal study of two polar bears. We designed an animal tracking framework to localise and identify individual animals in the enclosure. After determining their position on the enclosure map via homographic transformation, we refined the resulting trajectories using a particle filter. Finally, we classified the trajectory patterns as SB or normal behaviour using a lightweight random forest approach with an accuracy of 94.9 %. The system not only allows for continuous, objective monitoring of animal behaviours but also provides insights into seasonal variations in SB, illustrating its potential for improving animal welfare in zoological settings. Ultimately, we analysed 607 days for the occurrence of SB, allowing us to discuss seasonal patterns of SB in both the male and female polar bear monitored. This work advances the field of animal welfare research by introducing a scalable, efficient method for the long-term, automated detection and monitoring of stereotypical behaviour, paving the way for its application across various settings and species that can be continuously monitored with cameras. We made the code publicly available at https://github.com/team-vera/stereotypy-detector.
Published: 2024
Full Text: View/download PDF

16. Trends, Applications, and Challenges in Human Attention Modelling.

Author: Giuseppe Cartella, Marcella Cornia, Vittorio Cuculo, Alessandro D'Amelio, Dario Zanca, Giuseppe Boccignone, and Rita Cucchiara
Published: 2024

17. How Intermodal Interaction Affects the Performance of Deep Multimodal Fusion for Mixed-Type Time Series.

Author: Simon Dietz, Thomas Altstidl, Dario Zanca, Björn M. Eskofier, and An Nguyen
Published: 2024
Full Text: View/download PDF

18. Efficient Training of Recurrent Neural Networks for Remaining Time Prediction in Predictive Process Monitoring.

Author: Johannes Roider, Dario Zanca, and Bjoern M. Eskofier
Published: 2024
Full Text: View/download PDF

19. Efficient Training of Recurrent Neural Networks for Remaining Time Prediction in Predictive Process Monitoring

Author: Roider, Johannes, Zanca, Dario, Eskofier, Bjoern M., Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Marrella, Andrea, editor, Resinas, Manuel, editor, Jans, Mieke, editor, and Rosemann, Michael, editor
Published: 2024
Full Text: View/download PDF

20. Multiphase Models for Moving Boundary Problems in Biology

Author: Ahmed, Ishraq U., Flegg, Jennifer A., Miller, Claire, Ruiz-Baier, Ricardo, Won, Joshua, Zanca, Adriana, Wood, David R., Editor-in-Chief, de Gier, Jan, Series Editor, Praeger, Cheryl E., Series Editor, and Tao, Terence, Series Editor
Published: 2024
Full Text: View/download PDF

21. Simulating Human Gaze with Neural Visual Attention

Author: Schwinn, Leo, Precup, Doina, Eskofier, Bjoern, and Zanca, Dario
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing models of human visual attention are generally unable to incorporate direct task guidance and therefore cannot model an intent or goal when exploring a scene. To integrate guidance of any downstream visual task into attention modeling, we propose the Neural Visual Attention (NeVA) algorithm. To this end, we impose to neural networks the biological constraint of foveated vision and train an attention mechanism to generate visual explorations that maximize the performance with respect to the downstream task. We observe that biologically constrained neural networks generate human-like scanpaths without being trained for this objective. Extensive experiments on three common benchmark datasets show that our method outperforms state-of-the-art unsupervised human attention models in generating human-like scanpaths.
Published: 2022

22. Just a Matter of Scale? Reevaluating Scale Equivariance in Convolutional Neural Networks

Author: Altstidl, Thomas, Nguyen, An, Schwinn, Leo, Köferl, Franz, Mutschler, Christopher, Eskofier, Björn, and Zanca, Dario
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The widespread success of convolutional neural networks may largely be attributed to their intrinsic property of translation equivariance. However, convolutions are not equivariant to variations in scale and fail to generalize to objects of different sizes. Despite recent advances in this field, it remains unclear how well current methods generalize to unobserved scales on real-world data and to what extent scale equivariance plays a role. To address this, we propose the novel Scaled and Translated Image Recognition (STIR) benchmark based on four different domains. Additionally, we introduce a new family of models that applies many re-scaled kernels with shared weights in parallel and then selects the most appropriate one. Our experimental results on STIR show that both the existing and proposed approaches can improve generalization across scales compared to standard convolutions. We also demonstrate that our family of models is able to generalize well towards larger scales and improve scale equivariance. Moreover, due to their unique design we can validate that kernel selection is consistent with input scale. Even so, none of the evaluated models maintain their performance for large differences in scale, demonstrating that a general understanding of how scale equivariance can improve generalization and robustness is still lacking.
Published: 2022

23. Argentine Catholic Intellectuals in the Twentieth Century: an Analysis in Perspective

Author: Zanca, José
Published: 2023
Full Text: View/download PDF

24. Impact of deep learning image reconstruction on volumetric accuracy and image quality of pulmonary nodules with different morphologies in low-dose CT

Author: L. D’hondt, C. Franck, P-J. Kellens, F. Zanca, D. Buytaert, A. Van Hoyweghen, H. El Addouli, K. Carpentier, M. Niekel, M. Spinhoven, K. Bacher, and A. Snoeckx
Subjects: Computed tomography, Deep learning image reconstruction, Iterative reconstruction, Lung cancer screening, Nodule volumetry, Nodule morphology, Medical physics. Medical radiology. Nuclear medicine, R895-920, Neoplasms. Tumors. Oncology. Including cancer and carcinogens, RC254-282
Abstract: Abstract Background This study systematically compares the impact of innovative deep learning image reconstruction (DLIR, TrueFidelity) to conventionally used iterative reconstruction (IR) on nodule volumetry and subjective image quality (IQ) at highly reduced radiation doses. This is essential in the context of low-dose CT lung cancer screening where accurate volumetry and characterization of pulmonary nodules in repeated CT scanning are indispensable. Materials and methods A standardized CT dataset was established using an anthropomorphic chest phantom (Lungman, Kyoto Kaguku Inc., Kyoto, Japan) containing a set of 3D-printed lung nodules including six diameters (4 to 9 mm) and three morphology classes (lobular, spiculated, smooth), with an established ground truth. Images were acquired at varying radiation doses (6.04, 3.03, 1.54, 0.77, 0.41 and 0.20 mGy) and reconstructed with combinations of reconstruction kernels (soft and hard kernel) and reconstruction algorithms (ASIR-V and DLIR at low, medium and high strength). Semi-automatic volumetry measurements and subjective image quality scores recorded by five radiologists were analyzed with multiple linear regression and mixed-effect ordinal logistic regression models. Results Volumetric errors of nodules imaged with DLIR are up to 50% lower compared to ASIR-V, especially at radiation doses below 1 mGy and when reconstructed with a hard kernel. Also, across all nodule diameters and morphologies, volumetric errors are commonly lower with DLIR. Furthermore, DLIR renders higher subjective IQ, especially at the sub-mGy doses. Radiologists were up to nine times more likely to score the highest IQ-score to these images compared to those reconstructed with ASIR-V. Lung nodules with irregular margins and small diameters also had an increased likelihood (up to five times more likely) to be ascribed the best IQ scores when reconstructed with DLIR. Conclusion We observed that DLIR performs as good as or even outperforms conventionally used reconstruction algorithms in terms of volumetric accuracy and subjective IQ of nodules in an anthropomorphic chest phantom. As such, DLIR potentially allows to lower the radiation dose to participants of lung cancer screening without compromising accurate measurement and characterization of lung nodules.
Published: 2024
Full Text: View/download PDF

25. Active Learning of Ordinal Embeddings: A User Study on Football Data

Author: Loeffler, Christoffer, Fallah, Kion, Fenu, Stefano, Zanca, Dario, Eskofier, Bjoern, Rozell, Christopher John, and Mutschler, Christopher
Subjects: Computer Science - Machine Learning
Abstract: Humans innately measure distance between instances in an unlabeled dataset using an unknown similarity function. Distance metrics can only serve as proxy for similarity in information retrieval of similar instances. Learning a good similarity function from human annotations improves the quality of retrievals. This work uses deep metric learning to learn these user-defined similarity functions from few annotations for a large football trajectory dataset. We adapt an entropy-based active learning method with recent work from triplet mining to collect easy-to-answer but still informative annotations from human participants and use them to train a deep convolutional network that generalizes to unseen samples. Our user study shows that our approach improves the quality of the information retrieval compared to a previous deep metric learning approach that relies on a Siamese network. Specifically, we shed light on the strengths and weaknesses of passive sampling heuristics and active learners alike by analyzing the participants' response efficacy. To this end, we collect accuracy, algorithmic time complexity, the participants' fatigue and time-to-response, qualitative self-assessment and statements, as well as the effects of mixed-expertise annotators and their consistency on model performance and transfer-learning., Comment: 23 pages, 17 figures
Published: 2022

26. From patches to objects: exploiting spatial reasoning for better visual representations

Author: Albert, Toni, Eskofier, Bjoern, and Zanca, Dario
Published: 2024
Full Text: View/download PDF

27. Tracking the Visitor

Author: Koeferl, Franz, primary, Zuerl, Matthias, additional, Jami, Jitin, additional, Li, Jindong, additional, Zanca, Dario, additional, and Eskofier, Bjoern, additional
Published: 2023
Full Text: View/download PDF

28. Improving Robustness against Real-World and Worst-Case Distribution Shifts through Decision Region Quantification

Author: Schwinn, Leo, Bungert, Leon, Nguyen, An, Raab, René, Pulsmeyer, Falk, Precup, Doina, Eskofier, Björn, and Zanca, Dario
Subjects: Computer Science - Machine Learning
Abstract: The reliability of neural networks is essential for their use in safety-critical applications. Existing approaches generally aim at improving the robustness of neural networks to either real-world distribution shifts (e.g., common corruptions and perturbations, spatial transformations, and natural adversarial examples) or worst-case distribution shifts (e.g., optimized adversarial examples). In this work, we propose the Decision Region Quantification (DRQ) algorithm to improve the robustness of any differentiable pre-trained model against both real-world and worst-case distribution shifts in the data. DRQ analyzes the robustness of local decision regions in the vicinity of a given data point to make more reliable predictions. We theoretically motivate the DRQ algorithm by showing that it effectively smooths spurious local extrema in the decision surface. Furthermore, we propose an implementation using targeted and untargeted adversarial attacks. An extensive empirical evaluation shows that DRQ increases the robustness of adversarially and non-adversarially trained models against real-world and worst-case distribution shifts on several computer vision benchmark datasets.
Published: 2022

29. Explain to Not Forget: Defending Against Catastrophic Forgetting with XAI

Author: Ede, Sami, Baghdadlian, Serop, Weber, Leander, Nguyen, An, Zanca, Dario, Samek, Wojciech, and Lapuschkin, Sebastian
Subjects: Computer Science - Machine Learning
Abstract: The ability to continuously process and retain new information like we do naturally as humans is a feat that is highly sought after when training neural networks. Unfortunately, the traditional optimization algorithms often require large amounts of data available during training time and updates wrt. new data are difficult after the training process has been completed. In fact, when new data or tasks arise, previous progress may be lost as neural networks are prone to catastrophic forgetting. Catastrophic forgetting describes the phenomenon when a neural network completely forgets previous knowledge when given new information. We propose a novel training algorithm called training by explaining in which we leverage Layer-wise Relevance Propagation in order to retain the information a neural network has already learned in previous tasks when training on new data. The method is evaluated on a range of benchmark datasets as well as more complex data. Our method not only successfully retains the knowledge of old tasks within the neural networks but does so more resource-efficiently than other state-of-the-art solutions., Comment: 14 pages including appendix, 5 figures, 2 tables, 1 algorithm listing. v2 update increases figure readability, updates Fig 5 caption, adds our collaborators Dario and An as co-authors v3 brings the preprint in line with the final version accepted for peer-reviewed publication at CD-MAKE 2022. v4 metadata update
Published: 2022

30. Behind the Machine's Gaze: Neural Networks with Biologically-inspired Constraints Exhibit Human-like Visual Attention

Author: Schwinn, Leo, Precup, Doina, Eskofier, Björn, and Zanca, Dario
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: By and large, existing computational models of visual attention tacitly assume perfect vision and full access to the stimulus and thereby deviate from foveated biological vision. Moreover, modeling top-down attention is generally reduced to the integration of semantic features without incorporating the signal of a high-level visual tasks that have been shown to partially guide human attention. We propose the Neural Visual Attention (NeVA) algorithm to generate visual scanpaths in a top-down manner. With our method, we explore the ability of neural networks on which we impose a biologically-inspired foveated vision constraint to generate human-like scanpaths without directly training for this objective. The loss of a neural network performing a downstream visual task (i.e., classification or reconstruction) flexibly provides top-down guidance to the scanpath. Extensive experiments show that our method outperforms state-of-the-art unsupervised human attention models in terms of similarity to human scanpaths. Additionally, the flexibility of the framework allows to quantitatively investigate the role of different tasks in the generated visual behaviors. Finally, we demonstrate the superiority of the approach in a novel experiment that investigates the utility of scanpaths in real-world applications, where imperfect viewing conditions are given., Comment: 31 pages, 14 figures, 4 tables
Published: 2022

31. Don't Get Me Wrong: How to Apply Deep Visual Interpretations to Time Series

Author: Loeffler, Christoffer, Lai, Wei-Cheng, Eskofier, Bjoern, Zanca, Dario, Schmidt, Lukas, and Mutschler, Christopher
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The correct interpretation and understanding of deep learning models are essential in many applications. Explanatory visual interpretation approaches for image, and natural language processing allow domain experts to validate and understand almost any deep learning model. However, they fall short when generalizing to arbitrary time series, which is inherently less intuitive and more diverse. Whether a visualization explains valid reasoning or captures the actual features is difficult to judge. Hence, instead of blind trust, we need an objective evaluation to obtain trustworthy quality metrics. We propose a framework of six orthogonal metrics for gradient-, propagation- or perturbation-based post-hoc visual interpretation methods for time series classification and segmentation tasks. An experimental study includes popular neural network architectures for time series and nine visual interpretation methods. We evaluate the visual interpretation methods with diverse datasets from the UCR repository and a complex, real-world dataset and study the influence of standard regularization techniques during training. We show that none of the methods consistently outperforms others on all metrics, while some are sometimes ahead. Our insights and recommendations allow experts to choose suitable visualization techniques for the model and task., Comment: 36 pages, 13 figues
Published: 2022

32. The Facets of Argentine Catholicism in the Twentieth Century

Author: Zanca, José and Mauro, Diego
Published: 2023
Full Text: View/download PDF

33. Automated long-term monitoring of stereotypical movement in polar bears under human care using machine learning

Author: Zuerl, Matthias, Stoll, Philip, Brehm, Ingrid, Sueskind, Jonas, Raab, René, Petermann, Jan, Zanca, Dario, Simon, Ralph, von Fersen, Lorenzo, and Eskofier, Bjoern
Published: 2024
Full Text: View/download PDF

34. Assessing the Performance of Remaining Time Prediction Methods for Business Processes

Author: Johannes Roider, An Nguyen, Dario Zanca, and Bjoern M. Eskofier
Subjects: Business process, graph neural network, LSTM, machine learning, predictive process monitoring, process mining, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: The prediction of the remaining time for business processes is a major task in predictive process monitoring (PPM). In the last years, various machine learning methods were introduced which reduced error levels steadily. However, the commonly applied metric for optimization and evaluation, the Mean Absolute Error (MAE), has limitations regarding its interpretability. In this work we introduce and evaluate the normalized Mean Absolute Error (nMAE) as an interpretable metric for model evaluation. It accounts for different kinds of label shifts, which are a special type of concept drift that can distort remaining time results. We investigate these concepts in a thorough benchmark study and use them to assess the current state of remaining time prediction for business processes. This includes the evaluation of four different baseline models, identifying the most accurate one. Furthermore, our study compares three different state-of-the-art methods, namely XGBoost, DA-LSTM, and PGT-Net. In contrary to prior studies we find that there is no significant difference in the performance between these models. Additionally, using the nMAE as evaluation metric we find that these models do not perform reasonably well on a range of event logs. Initial ideas for this behaviour are discussed and consolidated along with other findings from the case study into a comprehensive list motivating future research directions.
Published: 2024
Full Text: View/download PDF

35. Exploring misclassifications of robust neural networks to enhance adversarial attacks

Author: Schwinn, Leo, Raab, René, Nguyen, An, Zanca, Dario, and Eskofier, Bjoern
Published: 2023
Full Text: View/download PDF

36. Data quality in hospital information systems: Lessons learned from analyzing 30 years of patient data in a regional German hospital

Author: Förstel, Stefan, Förstel, Markus, Gallistl, Markus, Zanca, Dario, Eskofier, Bjoern M., and Rothgang, Eva M.
Published: 2024
Full Text: View/download PDF

37. HISTORIA Y MEMORIA DE LA UNIVERSIDAD DE BUENOS AIRES

Author: José Zanca
Published: 2024

38. SVC-onGoing: Signature Verification Competition

Author: Tolosana, Ruben, Vera-Rodriguez, Ruben, Gonzalez-Garcia, Carlos, Fierrez, Julian, Morales, Aythami, Ortega-Garcia, Javier, Ruiz-Garcia, Juan Carlos, Romero-Tapiador, Sergio, Rengifo, Santiago, Caruana, Miguel, Jiang, Jiajia, Lai, Songxuan, Jin, Lianwen, Zhu, Yecheng, Galbally, Javier, Diaz, Moises, Ferrer, Miguel Angel, Gomez-Barrero, Marta, Hodashinsky, Ilya, Sarin, Konstantin, Slezkin, Artem, Bardamova, Marina, Svetlakov, Mikhail, Saleem, Mohammad, Szucs, Cintia Lia, Kovari, Bence, Pulsmeyer, Falk, Wehbi, Mohamad, Zanca, Dario, Ahmad, Sumaiya, Mishra, Sarthak, and Jabin, Suraiya
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction
Abstract: This article presents SVC-onGoing, an on-going competition for on-line signature verification where researchers can easily benchmark their systems against the state of the art in an open common platform using large-scale public databases, such as DeepSignDB and SVC2021_EvalDB, and standard experimental protocols. SVC-onGoing is based on the ICDAR 2021 Competition on On-Line Signature Verification (SVC 2021), which has been extended to allow participants anytime. The goal of SVC-onGoing is to evaluate the limits of on-line signature verification systems on popular scenarios (office/mobile) and writing inputs (stylus/finger) through large-scale public databases. Three different tasks are considered in the competition, simulating realistic scenarios as both random and skilled forgeries are simultaneously considered on each task. The results obtained in SVC-onGoing prove the high potential of deep learning methods in comparison with traditional methods. In particular, the best signature verification system has obtained Equal Error Rate (EER) values of 3.33% (Task 1), 7.41% (Task 2), and 6.04% (Task 3). Future studies in the field should be oriented to improve the performance of signature verification systems on the challenging mobile scenarios of SVC-onGoing in which several mobile devices and the finger are used during the signature acquisition., Comment: arXiv admin note: substantial text overlap with arXiv:2106.00739
Published: 2021

39. ICDAR 2021 Competition on On-Line Signature Verification

Author: Tolosana, Ruben, Vera-Rodriguez, Ruben, Gonzalez-Garcia, Carlos, Fierrez, Julian, Rengifo, Santiago, Morales, Aythami, Ortega-Garcia, Javier, Ruiz-Garcia, Juan Carlos, Romero-Tapiador, Sergio, Jiang, Jiajia, Lai, Songxuan, Jin, Lianwen, Zhu, Yecheng, Galbally, Javier, Diaz, Moises, Ferrer, Miguel Angel, Gomez-Barrero, Marta, Hodashinsky, Ilya, Sarin, Konstantin, Slezkin, Artem, Bardamova, Marina, Svetlakov, Mikhail, Saleem, Mohammad, Szücs, Cintia Lia, Kovari, Bence, Pulsmeyer, Falk, Wehbi, Mohamad, Zanca, Dario, Ahmad, Sumaiya, Mishra, Sarthak, and Jabin, Suraiya
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction
Abstract: This paper describes the experimental framework and results of the ICDAR 2021 Competition on On-Line Signature Verification (SVC 2021). The goal of SVC 2021 is to evaluate the limits of on-line signature verification systems on popular scenarios (office/mobile) and writing inputs (stylus/finger) through large-scale public databases. Three different tasks are considered in the competition, simulating realistic scenarios as both random and skilled forgeries are simultaneously considered on each task. The results obtained in SVC 2021 prove the high potential of deep learning methods. In particular, the best on-line signature verification system of SVC 2021 obtained Equal Error Rate (EER) values of 3.33% (Task 1), 7.41% (Task 2), and 6.04% (Task 3). SVC 2021 will be established as an on-going competition, where researchers can easily benchmark their systems against the state of the art in an open common platform using large-scale public databases such as DeepSignDB and SVC2021_EvalDB, and standard experimental protocols.
Published: 2021

40. Towards an IMU-based Pen Online Handwriting Recognizer

Author: Wehbi, Mohamad, Hamann, Tim, Barth, Jens, Kaempf, Peter, Zanca, Dario, and Eskofier, Bjoern
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Most online handwriting recognition systems require the use of specific writing surfaces to extract positional data. In this paper we present a online handwriting recognition system for word recognition which is based on inertial measurement units (IMUs) for digitizing text written on paper. This is obtained by means of a sensor-equipped pen that provides acceleration, angular velocity, and magnetic forces streamed via Bluetooth. Our model combines convolutional and bidirectional LSTM networks, and is trained with the Connectionist Temporal Classification loss that allows the interpretation of raw sensor data into words without the need of sequence segmentation. We use a dataset of words collected using multiple sensor-enhanced pens and evaluate our model on distinct test sets of seen and unseen words achieving a character error rate of 17.97% and 17.08%, respectively, without the use of a dictionary or language model, Comment: Accepted at ICDAR 2021
Published: 2021

41. Exploring Misclassifications of Robust Neural Networks to Enhance Adversarial Attacks

Author: Schwinn, Leo, Raab, René, Nguyen, An, Zanca, Dario, and Eskofier, Bjoern
Subjects: Computer Science - Machine Learning
Abstract: Progress in making neural networks more robust against adversarial attacks is mostly marginal, despite the great efforts of the research community. Moreover, the robustness evaluation is often imprecise, making it difficult to identify promising approaches. We analyze the classification decisions of 19 different state-of-the-art neural networks trained to be robust against adversarial attacks. Our findings suggest that current untargeted adversarial attacks induce misclassification towards only a limited amount of different classes. Additionally, we observe that both over- and under-confidence in model predictions result in an inaccurate assessment of model robustness. Based on these observations, we propose a novel loss function for adversarial attacks that consistently improves attack success rate compared to prior loss functions for 19 out of 19 analyzed models.
Published: 2021

42. Editorial Expression of Concern: MicroRNA signatures of TRAIL resistance in human non-small cell lung cancer

Author: Garofalo, M., Quintavalle, C., Di Leva, G., Zanca, C., Romano, G., Taccioli, C., Liu, C. G., Croce, C. M., and Condorelli, G.
Published: 2024
Full Text: View/download PDF

43. Identifying Untrustworthy Predictions in Neural Networks by Geometric Gradient Analysis

Author: Schwinn, Leo, Nguyen, An, Raab, René, Bungert, Leon, Tenbrinck, Daniel, Zanca, Dario, Burger, Martin, and Eskofier, Bjoern
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: The susceptibility of deep neural networks to untrustworthy predictions, including out-of-distribution (OOD) data and adversarial examples, still prevent their widespread use in safety-critical applications. Most existing methods either require a re-training of a given model to achieve robust identification of adversarial attacks or are limited to out-of-distribution sample detection only. In this work, we propose a geometric gradient analysis (GGA) to improve the identification of untrustworthy predictions without retraining of a given model. GGA analyzes the geometry of the loss landscape of neural networks based on the saliency maps of their respective input. To motivate the proposed approach, we provide theoretical connections between gradients' geometrical properties and local minima of the loss function. Furthermore, we demonstrate that the proposed method outperforms prior approaches in detecting OOD data and adversarial attacks, including state-of-the-art and adaptive attacks.
Published: 2021

44. System Design for a Data-driven and Explainable Customer Sentiment Monitor

Author: Nguyen, An, Foerstel, Stefan, Kittler, Thomas, Kurzyukov, Andrey, Schwinn, Leo, Zanca, Dario, Hipp, Tobias, Sun, Da Jun, Schrapp, Michael, Rothgang, Eva, and Eskofier, Bjoern
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The most important goal of customer services is to keep the customer satisfied. However, service resources are always limited and must be prioritized. Therefore, it is important to identify customers who potentially become unsatisfied and might lead to escalations. Today this prioritization of customers is often done manually. Data science on IoT data (esp. log data) for machine health monitoring, as well as analytics on enterprise data for customer relationship management (CRM) have mainly been researched and applied independently. In this paper, we present a framework for a data-driven decision support system which combines IoT and enterprise data to model customer sentiment. Such decision support systems can help to prioritize customers and service resources to effectively troubleshoot problems or even avoid them. The framework is applied in a real-world case study with a major medical device manufacturer. This includes a fully automated and interpretable machine learning pipeline designed to meet the requirements defined with domain experts and end users. The overall framework is currently deployed, learns and evaluates predictive models from terabytes of IoT and enterprise data to actively monitor the customer sentiment for a fleet of thousands of high-end medical devices. Furthermore, we provide an anonymized industrial benchmark dataset for the research community.
Published: 2021
Full Text: View/download PDF

45. Just a Matter of Scale? Reevaluating Scale Equivariance in Convolutional Neural Networks.

Author: Thomas Altstidl, An Nguyen, Leo Schwinn, Franz Köferl, Christopher Mutschler, Björn M. Eskofier, and Dario Zanca
Published: 2023
Full Text: View/download PDF

46. FastAMI - a Monte Carlo Approach to the Adjustment for Chance in Clustering Comparison Metrics.

Author: Kai Klede, Leo Schwinn, Dario Zanca, and Björn M. Eskofier
Published: 2023
Full Text: View/download PDF

47. Hematologic Cancers

Author: Aghakhanyan, Gayane, Sollini, Martina, Galimberti, Sara, Zanca, Roberta, Bachi, Costanza, Bartoli, Francesco, Erba, Paola Anna, Neri, Emanuele, editor, and Erba, Paola Anna, editor
Published: 2023
Full Text: View/download PDF

48. Predation risk modifies habitat use and habitat selection of diving beetles (Coleoptera: Dytiscidae) in an Urban Pondscape

Author: Liao, Wenfei, Zanca, Tommaso, and Niemelä, Jari
Published: 2024
Full Text: View/download PDF

49. Transitioning to Online: A SWOT Analysis by First Time Online Business Faculty

Author: de los Santos, Esmeralda and Zanca, Nürsen A.
Abstract: Online education continues its growth trajectory benefitting public four-year institutions of higher education as well as private universities. Recently, private non-profits have experienced double-digit increases despite late entry into online education. Situated within the context of a private nonprofit institution and its recently developed asynchronous online program, the purpose of this paper is to examine the teaching experiences of first-time business professors as seen through the lens of a SWOT analysis. Designed to capture the views of three stakeholders: Students, Faculty, and the Institution, the SWOT analysis suggests that local and regional research may yield untapped sources of opportunity for online programs at non-profit institutions.
Published: 2018

50. Predation risk modifies habitat use and habitat selection of diving beetles (Coleoptera: Dytiscidae) in an Urban Pondscape

Author: Wenfei Liao, Tommaso Zanca, and Jari Niemelä
Subjects: Aquatic insect, Aquatic plant, Pondscape of fear, Non-consumptive effect, Species trait, Urban blue space, Ecology, QH540-549.5
Abstract: Urban freshwater ecosystems often involve the introduction of predator species that affect biodiversity via both direct and indirect effects of predation, altering the distribution of prey species. Yet, limited research has explored indirect effects on aquatic invertebrates in urban pondscapes. Here, we use Dytiscidae as our study taxon to investigate how predator-prey interaction modifies the habitat use and habitat selection of macroinvertebrates in ponds of an urban landscape. We sampled dytiscids in 11 ponds with, and 15 ponds without fish, in Helsinki, Finland, during 2018 – 2020, and emergent plant cover in pond margins as a proxy for the quantity of prey refuges. We found (i) at the pond scale, vegetation cover can mitigate the negative effects of predators on dytiscid species richness and abundance, and dytiscids prefer microhabitats with bulrush and sedges to microhabitats with no vegetation or common reeds, reflecting the importance of providing aquatic plants with high structural complexity as prey refuges. (ii) At the landscape scale, small-sized dytiscids favour fishless habitats, and the community-weighted mean body size of dytiscids has seasonal fluctuations, with smaller body sizes in May and June than in July in fishless ponds, indicating that dytiscids select habitats to regulate their investment in vigilance according to their life cycles. Our findings highlight that predation can alter the habitat use and habitat selection of aquatic invertebrates. Mitigating predation risk at both the habitat scale and the landscape scale is crucial to facilitate the fitness of aquatic invertebrates, especially small-sized species, to promote their diversity in urban pondscapes.
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

4,157 results on '"Zanca, A."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources