Author: "Petra Galuščáková" / Topic: computer science - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Petra Galuščáková"' showing total 14 results

Start Over Author "Petra Galuščáková" Topic computer science

14 results on '"Petra Galuščáková"'

1. Cross-language Sentence Selection via Data Augmentation and Rationale Training

Author: Petra Galuščáková, Douglas W. Oard, Yanda Chen, Suraj Nair, Kathleen R. McKeown, Rui Zhang, and Chris Kedzie
Subjects: FOS: Computer and information sciences, Phrase, Computer Science - Computation and Language, Machine translation, Computer science, business.industry, computer.software_genre, Variety (linguistics), Computer Science - Information Retrieval, Selection (linguistics), Embedding, Relevance (information retrieval), Artificial intelligence, business, Computation and Language (cs.CL), computer, Information Retrieval (cs.IR), Word (computer architecture), Sentence, Natural language processing
Abstract: This paper proposes an approach to cross-language sentence selection in a low-resource setting. It uses data augmentation and negative sampling techniques on noisy parallel sentence data to directly learn a cross-lingual embedding-based query relevance model. Results show that this approach performs as well as or better than multiple state-of-the-art machine translation + monolingual retrieval systems trained on the same parallel data. Moreover, when a rationale training secondary objective is applied to encourage the model to match word alignment hints from a phrase-based statistical machine translation model, consistent improvements are seen across three language pairs (English-Somali, English-Swahili and English-Tagalog) over a variety of state-of-the-art baselines., ACL 2021 main conference
Published: 2021

2. Segmenting Subtitles for Correcting ASR Segmentation Errors

Author: Zhengping Jiang, Chris Kedzie, Peter Bell, Elena Zotkina, Faisal Ladhak, Elsbeth Turcan, David Wan, Kathleen R. McKeown, and Petra Galuščáková
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Machine translation, Common error, Computer science, Speech recognition, InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, computer.software_genre, ComputingMethodologies_ARTIFICIALINTELLIGENCE, Market segmentation, Spoken language translation, Segmentation, Computation and Language (cs.CL), computer
Abstract: Typical ASR systems segment the input audio in toutterances using purely acoustic information, which may not resemble the sentence-like units that are expected by conventional machine translation (MT) systems for Spoken Language Translation. In this work, we propose a model for correcting the acoustic segmentation of ASR models for low-resource languages to improve performance on down-stream tasks. We propose the use of subtitles as a proxy dataset for correcting ASR acoustic segmentation, creating synthetic acoustic utterances by modeling common error modes. We train a neural tagging model for correcting ASR acoustic segmentation and show that it improves downstream performance on MT and audio-document cross-language information retrieval (CLIR).
Published: 2021

3. Combining Contextualized and Non-contextualized Query Translations to Improve CLIR

Author: Douglas W. Oard, Suraj Nair, and Petra Galuščáková
Subjects: Vocabulary, Basis (linear algebra), Machine translation, business.industry, Computer science, media_common.quotation_subject, InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, Probabilistic logic, Contrast (statistics), Context (language use), 02 engineering and technology, Translation (geometry), computer.software_genre, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Word (computer architecture), Natural language processing, media_common
Abstract: In cross-language information retrieval using probabilistic structured queries (PSQ), translation probabilities from statistical machine translation act as a bridge between the query and document vocabulary. These translation probabilities are typically estimated from a sentence-aligned corpus on a word to word basis without taking into account the context. Neural methods, by contrast, can learn to translate using the context around the words, and this can be used as a basis for estimating context-dependent translation probabilities. However, sparsity limits the accuracy of context-specific translation probabilities for rare words, which can be important in retrieval applications. This paper presents evidence that combining such context-dependent translation probabilities with context-independent translation probabilities learned from the same parallel corpus can yield improvements in the effectiveness of cross-language ranked retrieval.
Published: 2020
Full Text: View/download PDF

4. Experiments with Cross-Language Speech Retrieval for Lower-Resource Languages

Author: Ondrej Klejch, Douglas W. Oard, Petra Galuščáková, Anton Ragni, and Suraj Nair
Subjects: Swahili, Recall, Computer science, business.industry, media_common.quotation_subject, Speech retrieval, Ambiguity, computer.software_genre, language.human_language, Language speech, language, Artificial intelligence, Transcription (software), business, Limited resources, computer, Natural language processing, Tagalog, media_common
Abstract: Cross-language speech retrieval systems face a cascade of errors due to transcription and translation ambiguity. Using 1-best speech recognition and 1-best translation in such a scenario could adversely affect recall if those 1-best system guesses are not correct. Accurately representing transcription and translation probabilities could therefore improve recall, although possibly at some cost in precision. The difficulty of the task is exacerbated when working with languages for which limited resources are available, since both recognition and translation probabilities may be less accurate in such cases. This paper explores the combination of expected term counts from recognition with expected term counts from translation to perform cross-language speech retrieval in which the queries are in English and the spoken content to be retrieved is in Tagalog or Swahili. Experiments were conducted using two query types, one focused on term presence and the other focused on topical retrieval. Overall, the results show that significant improvements in ranking quality result from modeling transcription and recognition ambiguity, even in lower-resource settings, and that adapting the ranking model to specific query types can yield further improvements.
Published: 2020
Full Text: View/download PDF

5. Unsupervised System Combination for Set-Based Retrieval with Expectation Maximization

Author: Han-Chin Shing, Douglas W. Oard, Philip Resnik, Joe Barrow, and Petra Galuščáková
Subjects: System combination, Computer science, Result set, business.industry, 05 social sciences, Duality (optimization), 010501 environmental sciences, Machine learning, computer.software_genre, 01 natural sciences, Clef, 0502 economics and business, Expectation–maximization algorithm, Overall performance, Artificial intelligence, 050207 economics, business, computer, 0105 earth and related environmental sciences, Sparse matrix
Abstract: System combination has been shown to improve overall performance on many rank-based retrieval tasks, often by combining results from multiple systems into a single ranked list. In contrast, set-based retrieval tasks call for a technique to combine results in ways that require decisions on whether each document is in or out of the result set. This paper presents a set-generating unsupervised system combination framework that draws inspiration from evaluation techniques in sparse data settings. It argues for the existence of a duality between evaluation and system combination, and then capitalizes on this duality to perform unsupervised system combination. To do this, the framework relies on the consensus of the systems to estimate latent “goodness” for each system. An implementation of this framework using data programming is compared to other unsupervised system combination approaches to demonstrate its effectiveness on CLEF and MATERIAL collections.
Published: 2019
Full Text: View/download PDF

6. Towards Automatic Cataloging of Image and Textual Collections with Wikipedia

Author: Tokinori Suzuki, Douglas W. Oard, Petra Galuščáková, and Daisuke Ikeda
Subjects: Information retrieval, Computer science, Cataloging, 020207 software engineering, Entity Linking, 02 engineering and technology, Test Collection, Task (project management), Metadata, Entity linking, Annotation, Upload, Multimedia, 0202 electrical engineering, electronic engineering, information engineering, Mean reciprocal rank, 020201 artificial intelligence & image processing, Digitization, Wikipedia
Abstract: In recent years, a large amount of multimedia data consisting of images and text have been generated in libraries through the digitization of physical materials into data for their preservation. When they are archived, appropriate cataloging metadata are assigned to them by librarians. Automatic annotations are helpful for reducing the cost of manual annotations. To this end, we propose a mapping system that links images and the associated text to entries on Wikipedia as a replacement for annotation by targeting images and associated text from photo-sharing sites. The uploaded images are accompanied by descriptive labels of contents of the sites that can be indexed for the catalogue. However, because users freely tag images with labels, these user-assigned labels are often ambiguous. The label “albatross”, for example, may refer to a type of bird or aircraft. If the ambiguities are resolved, we can use Wikipedia entries for cataloging as an alternative to ontologies. To formalize this, we propose a task called image label disambiguation where, given an image and assigned target labels to be disambiguated, an appropriate Wikipedia page is selected for the given labels. We propose a hybrid approach for this task that makes use of both user tags as textual information and features of images generated through image recognition. To evaluate the proposed task, we develop a freely available test collection containing 450 images and 2,280 ambiguous labels. The proposed method outperformed prevalent text-based approaches in terms of the mean reciprocal rank, attaining a value of over 0.6 on both our collection and the ImageCLEF collection.
Published: 2019
Full Text: View/download PDF

7. Session details: Posters

Author: Petra Galuščáková
Subjects: Multimedia, Computer science, Session (computer science), computer.software_genre, computer
Published: 2017
Full Text: View/download PDF

8. Visual Descriptors in Methods for Video Hyperlinking

Author: Jiří Matas, David Novak, Michal Batko, Pavel Pecina, Jan Cech, and Petra Galuščáková
Subjects: Information retrieval, Similarity (geometry), Computer science, business.industry, InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, Visual descriptors, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020207 software engineering, Image processing, 02 engineering and technology, Hyperlink, computer.software_genre, Convolutional neural network, TRECVID, Task (project management), Visual processing, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, business, computer, Natural language processing
Abstract: In this paper, we survey different state-of-the-art visual processing methods and utilize them in hyperlinking. Visual information, calculated using Features Signatures, SIMILE descriptors and convolutional neural networks (CNN), is utilized as similarity between video frames and used to find similar faces, objects and setting. Visual concepts in frames are also automatically recognized and textual output of the recognition is combined with search based on subtitles and transcripts. All presented experiments were performed in the Search and Hyperlinking 2014 MediaEval task and Video Hyperlinking 2015 TRECVid task.
Published: 2017
Full Text: View/download PDF

9. SHAMUS: UFAL Search and Hyperlinking Multimedia System

Author: Shadi Saleh, Petra Galuščáková, and Pavel Pecina
Subjects: Multimedia, Computer science, Interface (computing), Component (UML), 0202 electrical engineering, electronic engineering, information engineering, 020207 software engineering, 020201 artificial intelligence & image processing, 02 engineering and technology, Multimedia system, Hyperlink, computer.software_genre, computer
Abstract: In this paper, we describe SHAMUS, our system for an easy search and navigation in multimedia archives. The system consists of three components. The Search component provides a text-based search in a multimedia collection, the Anchoring component determines the most important segments of videos, and segments topically related to the anchoring ones are retrieved by the Hyperlinking component. In the paper, we describe each component of the system as well as the online demo interface http://ufal.mff.cuni.cz/shamus which currently works with a collection of TED talks.
Published: 2016
Full Text: View/download PDF

10. Audio Information for Hyperlinking of TV Content

Author: Pavel Pecina and Petra Galuščáková
Subjects: Metadata, Focus (computing), Information retrieval, Computer science, InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, Content (measure theory), ComputingMethodologies_DOCUMENTANDTEXTPROCESSING, Context (language use), Hyperlink, Task (project management)
Abstract: In this paper, we explore the use of audio information in the retrieval of multimedia content. Specifically, we focus on linking similar segments in a collection consisting of 4,000 hours of BBC TV programmes. We provide a description of our system submitted to the Hyperlinking Sub-task of the Search and Hyperlinking Task in the MediaEval 2014 Benchmark, in which it scored best. We explore three automatic transcripts and compare them to available subtitles. We confirm the relationship between retrieval performance and transcript quality. The performance of the retrieval is further improved by extending transcripts by metadata and context, by combining different transcripts, using the highest confident words of the transcripts, and by utilizing acoustic similarity.
Published: 2015
Full Text: View/download PDF

11. Experiments with Segmentation Strategies for Passage Retrieval in Audio-Visual Documents

Author: Petra Galuščáková and Pavel Pecina
Subjects: Focus (computing), Information retrieval, business.industry, Computer science, Speech retrieval, Hyperlink, computer.software_genre, Task (project management), Audio visual, Benchmark (computing), Segmentation, Visual Word, Artificial intelligence, business, computer, Natural language processing
Abstract: This paper deals with Information Retrieval from audio-visual recordings. Such recordings are often quite long and users may want to find the exact starting points of relevant passages they search for. In Passage Retrieval, the recordings are automatically segmented into smaller parts, on which the standard retrieval techniques are applied. In this paper, we discuss various techniques for segmentation of audio-visual recordings and focus on machine learning approaches which decide on segment boundaries based on various features combined in a decision-tree model. Our experiments are carried out on the data used for the Search and Hyperlinking Task and Similar Segments in Social Speech Task of the MediaEval Benchmark 2013.
Published: 2014
Full Text: View/download PDF

12. Segmentation strategies for passage retrieval in audio-visual documents

Author: Petra Galuščáková
Subjects: Similarity (geometry), business.industry, Computer science, Decision tree, Scale-space segmentation, Pattern recognition, Hyperlink, Semantics, Feature (computer vision), Segmentation, Visual Word, Artificial intelligence, business, Sentence
Abstract: The importance of Information Retrieval (IR) in audio-visual recordings has been increasing with steeply growing numbers of audio-visual documents available on-line. Compared to traditional IR methods, this task requires specific techniques, such as Passage Retrieval which can accelerate the search process by retrieving the exact relevant passage of a recording instead of the full document. In Passage Retrieval, full recordings are divided into shorter segments which serve as individual documents for the further IR setup. This technique also allows normalizing document length and applying positional information. It was shown that it can even improve retrieval results. In this work, we examine two general strategies for Passage Retrieval: blind segmentation into overlapping regular-length passages and segmentation into variable-length passages based on semantics of their content. Time-based segmentation was already shown to improve retrieval of textual documents and audio-visual recordings. Our experiments performed on the test collection used in the Search subtask of the Search and Hyperlinking Task in MediaEval Benchmarking 2012 confirm those findings and show that parameters (segment length and shift) tuning for a specific test collection can further improve the results. Our best results on this collection were achieved by using 45-second long segments with 15-second shifts. Semantic-based segmentation can be divided into three types: similarity-based (producing segments with high intra-similarity and low inter-similarity), lexical-chain-based (producing segments with frequent lexically connected words), and feature-based (combining various features which signalize a segment break in a machine-learning setting). In this work, we mainly focus on feature-based segmentation which allows exploiting various features from all modalities of the data (including segment length) in a single trainable model and produces segments which can eventually overlap. Our preliminary results show that even simple semantic-based segmentation outperforms regular segmentation. Our model is a decision tree incorporating the following features: shot segments, output of TextTiling algorithm, cue words (well, thanks, so, I, now), sentence breaks, and the length of the silence after the previous word. In terms of the MASP, the relative improvement over regular segmentation is more than 19%.
Published: 2013
Full Text: View/download PDF

13. Multimedia Information Seeking through Search and Hyperlinking

Author: Danish Nadeem, Camille Guinaudeau, Shu Chen, Roeland Ordelman, Rik Van de Walle, Petra Galuščáková, Gareth J. F. Jones, Martha Larson, Tom De Nies, Pedro Debevere, Guillaume Gravier, Pascale Sébillot, Robin Aly, Pavel Pecina, Maria Eskevich, Centre for Next Generation Localisation (CNGL), Dublin City University [Dublin] (DCU), University of Twente, Multimedia content-based indexing (TEXMEX), Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique, Institut National de Recherche en Informatique et en Automatique (Inria), Department of Electronics and Information Systems - Ghent University (ELIS), Universiteit Gent = Ghent University (UGENT), Charles University [Prague] (CU), Delft Multimedia Information Retrieval Lab (DMIR), Delft University of Technology (TU Delft), University of Twente [Netherlands], Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique, and Universiteit Gent = Ghent University [Belgium] (UGENT)
Subjects: Computer science, InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, 02 engineering and technology, Multimedia information retrieval, Crowdsourcing, hyperlinking, Task (project management), World Wide Web, Image processing, Web page, 0202 electrical engineering, electronic engineering, information engineering, Information retrieval, Multimodal search, search, multimedia, business.industry, Information seeking, [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM], 020207 software engineering, Hyperlink, multimedia information retrieval, [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing, 020201 artificial intelligence & image processing, business, Internet video, Multimedia systems
Abstract: International audience; Searching for relevant webpages and following hyperlinks to related content is a widely accepted and effective approach to information seeking on the textual web. Existing work on multimedia information retrieval has focused on search for individual relevant items or on content linking without specific attention to search results. We describe our research exploring integrated multimodal search and hyperlinking for multimedia data. Our investigation is based on the MediaEval 2012 Search and Hyperlinking task. This includes a known-item search task using the Blip10000 internet video collection, where automatically created hyperlinks link each relevant item to related items within the collection. The search test queries and link assessment for this task was generated using the Amazon Mechanical Turk crowdsourcing platform. Our investigation examines a range of alternative methods which seek to address the challenges of search and hyperlinking using multimodal approaches. The results of our experiments are used to propose a research agenda for developing effective techniques for search and hyperlinking of multimedia content.
Published: 2013

14. Penalty Functions for Evaluation Measures of Unsegmented Speech Retrieval

Author: Petra Galuščáková, Jan Hajič, and Pavel Pecina
Subjects: Matching (statistics), Measure (data warehouse), Computer science, Speech recognition, Perception, media_common.quotation_subject, Penalty method, Relevance (information retrieval), Quality (business), Gold standard (test), Focus (optics), media_common
Abstract: This paper deals with evaluation of information retrieval from unsegmented speech. We focus on Mean Generalized Average Precision, the evaluation measure widely used for unsegmented speech retrieval. This measure is designed to allow certain tolerance in matching retrieval results (starting points of relevant segments) against a gold standard relevance assessment. It employs a Penalty Function which evaluates non-exact matches in the retrieval results based on their distance from the beginnings of their nearest true relevant segments. However, the choice of the Penalty Function is usually ad-hoc and does not necessary reflect users' perception of the speech retrieval quality. We perform a lab test to study satisfaction of users of a speech retrieval system to empirically estimate the optimal shape of the Penalty Function.
Published: 2012
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

14 results on '"Petra Galuščáková"'

1. Cross-language Sentence Selection via Data Augmentation and Rationale Training

2. Segmenting Subtitles for Correcting ASR Segmentation Errors

3. Combining Contextualized and Non-contextualized Query Translations to Improve CLIR

4. Experiments with Cross-Language Speech Retrieval for Lower-Resource Languages

5. Unsupervised System Combination for Set-Based Retrieval with Expectation Maximization

6. Towards Automatic Cataloging of Image and Textual Collections with Wikipedia

7. Session details: Posters

8. Visual Descriptors in Methods for Video Hyperlinking

9. SHAMUS: UFAL Search and Hyperlinking Multimedia System

10. Audio Information for Hyperlinking of TV Content

11. Experiments with Segmentation Strategies for Passage Retrieval in Audio-Visual Documents

12. Segmentation strategies for passage retrieval in audio-visual documents

13. Multimedia Information Seeking through Search and Hyperlinking

14. Penalty Functions for Evaluation Measures of Unsegmented Speech Retrieval

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

14 results on '"Petra Galuščáková"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources