Author: "Ruppik, Benjamin Matthias" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ruppik, Benjamin Matthias"' showing total 4 results

Start Over Author "Ruppik, Benjamin Matthias"

4 results on '"Ruppik, Benjamin Matthias"'

1. Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction

Author: Ruppik, Benjamin Matthias, Heck, Michael, van Niekerk, Carel, Vukovic, Renato, Lin, Hsien-chin, Feng, Shutong, Zibrowius, Marcus, and Gašić, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: A common approach for sequence tagging tasks based on contextual word representations is to train a machine learning classifier directly on these embedding vectors. This approach has two shortcomings. First, such methods consider single input sequences in isolation and are unable to put an individual embedding vector in relation to vectors outside the current local context of use. Second, the high performance of these models relies on fine-tuning the embedding model in conjunction with the classifier, which may not always be feasible due to the size or inaccessibility of the underlying feature-generation model. It is thus desirable, given a collection of embedding vectors of a corpus, i.e., a datastore, to find features of each vector that describe its relation to other, similar vectors in the datastore. With this in mind, we introduce complexity measures of the local topology of the latent space of a contextual language model with respect to a given datastore. The effectiveness of our features is demonstrated through their application to dialogue term extraction. Our work continues a line of research that explores the manifold hypothesis for word embeddings, demonstrating that local structure in the space carved out by word embeddings can be exploited to infer semantic properties., Comment: Accepted as a long paper to SIGDIAL 2024. 9 pages, 2 figures, 3 tables
Published: 2024

2. Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding

Author: Vukovic, Renato, Arps, David, van Niekerk, Carel, Ruppik, Benjamin Matthias, Lin, Hsien-Chin, Heck, Michael, and Gašić, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: State-of-the-art task-oriented dialogue systems typically rely on task-specific ontologies for fulfilling user queries. The majority of task-oriented dialogue data, such as customer service recordings, comes without ontology and annotation. Such ontologies are normally built manually, limiting the application of specialised systems. Dialogue ontology construction is an approach for automating that process and typically consists of two steps: term extraction and relation extraction. In this work, we focus on relation extraction in a transfer learning set-up. To improve the generalisation, we propose an extension to the decoding mechanism of large language models. We adapt Chain-of-Thought (CoT) decoding, recently developed for reasoning problems, to generative relation extraction. Here, we generate multiple branches in the decoding space and select the relations based on a confidence threshold. By constraining the decoding to ontology terms and relations, we aim to decrease the risk of hallucination. We conduct extensive experimentation on two widely used datasets and find improvements in performance on target ontology for source fine-tuned and one-shot prompted large language models., Comment: Accepted to appear at SIGDIAL 2024. 9 pages, 4 figures
Published: 2024

3. Unknotting via null-homologous twists and multi-twists

Author: Allen, Samantha, Ince, Kenan, Kim, Seungwon, Ruppik, Benjamin Matthias, and Turner, Hannah
Subjects: Mathematics - Geometric Topology, 57K40, 57K10
Abstract: The untwisting number of a knot K is the minimum number of null-homologous twists required to convert K to the unknot. Such a twist can be viewed as a generalization of a crossing change, since a classical crossing change can be effected by a null-homologous twist on 2 strands. While the unknotting number gives an upper bound on the smooth 4-genus, the untwisting number gives an upper bound on the topological 4-genus. The surgery description number, which allows multiple null-homologous twists in a single twisting region to count as one operation, lies between the topological 4-genus and the untwisting number. We show that the untwisting and surgery description numbers are different for infinitely many knots, though we also find that the untwisting number is at most twice the surgery description number plus 1., Comment: 14 pages, 6 figures
Published: 2022
Full Text: View/download PDF

4. Dialogue Term Extraction using Transfer Learning and Topological Data Analysis

Author: Vukovic, Renato, Heck, Michael, Ruppik, Benjamin Matthias, van Niekerk, Carel, Zibrowius, Marcus, and Gašić, Milica
Subjects: Computer Science - Computation and Language
Abstract: Goal oriented dialogue systems were originally designed as a natural language interface to a fixed data-set of entities that users might inquire about, further described by domain, slots, and values. As we move towards adaptable dialogue systems where knowledge about domains, slots, and values may change, there is an increasing need to automatically extract these terms from raw dialogues or related non-dialogue data on a large scale. In this paper, we take an important step in this direction by exploring different features that can enable systems to discover realizations of domains, slots, and values in dialogues in a purely data-driven fashion. The features that we examine stem from word embeddings, language modelling features, as well as topological features of the word embedding space. To examine the utility of each feature set, we train a seed model based on the widely used MultiWOZ data-set. Then, we apply this model to a different corpus, the Schema-Guided Dialogue data-set. Our method outperforms the previously proposed approach that relies solely on word embeddings. We also demonstrate that each of the features is responsible for discovering different kinds of content. We believe our results warrant further research towards ontology induction, and continued harnessing of topological data analysis for dialogue and natural language processing research., Comment: Accepted as a long paper to SIGDIAL 2022 (Edinburgh)
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

4 results on '"Ruppik, Benjamin Matthias"'

1. Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction

2. Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding

3. Unknotting via null-homologous twists and multi-twists

4. Dialogue Term Extraction using Transfer Learning and Topological Data Analysis

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

4 results on '"Ruppik, Benjamin Matthias"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources