53 results on '"Rayner, Manny"'
Search Results
2. Easy as ABC: Using LARA to Build Multimedia Alphabet Books
- Author
-
Akhlaghi-Baghoojari, Elham, Bédi, Branislav, Chua, Cathy, Horváthová, Ivana, Ivanova, Nedelina, Maizonniaux, Christèle, Mykhats, Marta, Chiaráin, Neasa Ní, Rayner, Manny, Orian Weiss, Catherine, and Zviel-Girshin, Rina
- Abstract
We present a study in which multimedia alphabet books were constructed for ten languages using the Learning And Reading Assistant (LARA) platform. We describe the alphabet books we built, the different design features they instantiate, and an initial evaluation using an anonymous online questionnaire. Links are provided to the books themselves, which are freely available on the web. [For the complete volume, "Intelligent CALL, Granular Systems and Learner Data: Short Papers from EUROCALL 2022 (30th, Reykjavik, Iceland, August 17-19, 2022)," see ED624779.]
- Published
- 2022
3. Assessing the Quality of TTS Audio in the LARA Learning-by-Reading Platform
- Author
-
Akhlaghi, Elham, Baczkowska, Anna, Berthelsen, Harald, Bédi, Branislav, Chua, Cathy, Cucchiarini, Catia, Habibi, Hanieh, Horváthová, Ivana, Hvalsøe, Pernille, Lotz, Roy, Maizonniaux, Christèle, Chiaráin, Neasa Ní, Rayner, Manny, Tsourakis, Nikos, and Yao, Chunlin
- Abstract
A popular idea in Computer Assisted Language Learning (CALL) is to use multimodal annotated texts, with annotations typically including embedded audio and translations, to support L2 learning through reading. An important question is how to create the audio, which can be done either through human recording or by a Text-To-Speech (TTS) synthesis engine. We may reasonably expect TTS to be quicker and easier, but humans to be of higher quality. Here, we report a study using the open-source LARA platform and ten languages. Samples of LARA audio totaling about three and a half minutes were provided for each language in both human and TTS form; subjects used a web form to compare different versions of the same item and rate the voices as a whole. Although human voice was more often preferred, TTS achieved higher ratings in some languages and was close in others. [For the complete volume, "CALL and Professionalisation: Short Papers from EUROCALL 2021 (29th, Online, August 26-27, 2021)," see ED616972.]
- Published
- 2021
4. LARA: An Extensible Open Source Platform for Learning Languages by Reading
- Author
-
Bédi, Branislav, Butterweck, Matt, Chua, Cathy, Gerlach, Johanna, Guðmarsdóttir, Birgitta Björg, Habibi, Hanieh, Jónsson, Bjartur Örn, Rayner, Manny, and Vigfússon, Sigurður
- Abstract
Learning and Reading Assistant (LARA) is an open source platform that enables conversion of plain texts into an interactive multimedia form designed to support second- and foreign-language (L2) learners. In this workshop, we illustrate the open source aspects using collaborative work carried out during a six-week summer project at the Árni Magnússon Institute for Icelandic Studies. Three undergraduate level students extended the platform in different directions in cooperation with other members of the international LARA team. The three subprojects were respectively concerned with adding automatically generated flashcards, adding multimedia versions of poetic texts in the archaic language Old Norse, and extending LARA to allow the inclusion of sign language content in Icelandic sign language -- Íslenskt TáknMál (ÍTM). All three reached successful conclusions. [For the complete volume, "CALL for Widening Participation: Short Papers from EUROCALL 2020 (28th, Online, August 20-21, 2020)," see ED610330.]
- Published
- 2020
5. Constructing an Interactive Old Norse Text with LARA
- Author
-
Bédi, Branislav, Bernharðsson, Haraldur, Chua, Cathy, Guðmarsdóttir, Birgitta Björg, Habibi, Hanieh, and Rayner, Manny
- Abstract
We describe how the open-source Learning and Reading Assistant (LARA) platform was used to convert a classic Old Norse text, the "Völuspá," into an interactive online form. The LARA version includes high-quality recorded audio, translations, notes on key words and phrases, an automatically generated concordance, and links to other online resources. The interactive text was created in two different editions, one with Modern Icelandic translations designed to support Icelandic school students who read the poem as a set text, and one with English translations designed for English readers with basic Old Norse who wish to able to appreciate the poem in the original as a piece of literature. Initial feedback from groups has been positive. [For the complete volume, "CALL for Widening Participation: Short Papers from EUROCALL 2020 (28th, Online, August 20-21, 2020)," see ED610330.]
- Published
- 2020
6. Using LARA for Language Learning: A Pilot Study for Icelandic
- Author
-
Bédi, Branislav, Chua, Cathy, Habibi, Hanieh, Martinez-Lopez, Ruth, and Rayner, Manny
- Abstract
This paper presents a brief overview of LARA (Learning And Reading Assistant), an open source online tool that has been under development since summer 2018. LARA currently contains a corpus of about 25 texts in ten languages and a crowdsourcing model used to expand the corpus. The central goal is to provide support for improving second language (L2) reading comprehension. The focus here is on the development of Icelandic content and its use during pilot testing amongst adult L2 learners of Icelandic. Preliminary feedback from users, while mostly positive, contained suggestions on how the tool might be improved. [For the complete proceedings, see ED600837.]
- Published
- 2019
7. ChatGPT-Based Learning And Reading Assistant: Initial Report *
- Author
-
Bédi, Branislav, Chiera, Belinda, Chua, Cathy, Cucchiarini, Catia, Chiaráin, Neasa Ní, Rayner, Manny, Simonsen, Annika, Zviel-Girshin, Rina, and Chatgpt
- Published
- 2023
- Full Text
- View/download PDF
8. Building High-Quality Online CALL Resources Can Be Fun: Using LARA to Construct Multimodal Plays
- Author
-
Anker, Kirsten, Chua, Cathy, Davies, Yann, Dejeu, Eric, Glasser, Gwyn, Glasser, Robert, Hanieh Habibi, Mykhats, Marta, Fard, Chadi Raheb, Rayner, Manny, and Ritchie, Rosa
- Published
- 2021
- Full Text
- View/download PDF
9. Using the LARA platform to crowdsource a multilingual, multimodal Little Prince.
- Author
-
AKHLAGHI, ELHAM, BĄCZKOWSKA, ANNA, BÉDI, BRANISLAV, BEEDAR, HAKEEM, CHUA, CATHY, CUCCHIARINI, CATIA, HABIBI, HANIEH, HORVÁTHOVÁ, IVANA, MAIZONNIAUX, CHRISTÈLE, CHIARÁIN, NEASA NÍ, PATERSON, NATHALIE, RAHEB, CHADI, RAYNER, MANNY, and CHUNLIN YAO
- Published
- 2022
- Full Text
- View/download PDF
10. Research on Spoken Dialogue Systems
- Author
-
Aist, Gregory, Hieronymus, James, Dowding, John, Hockey, Beth Ann, Rayner, Manny, Chatzichrisafis, Nikos, Farrell, Kim, and Renders, Jean-Michel
- Subjects
Computer Systems - Abstract
Research in the field of spoken dialogue systems has been performed with the goal of making such systems more robust and easier to use in demanding situations. The term "spoken dialogue systems" signifies unified software systems containing speech-recognition, speech-synthesis, dialogue management, and ancillary components that enable human users to communicate, using natural spoken language or nearly natural prescribed spoken language, with other software systems that provide information and/or services.
- Published
- 2010
11. Decentralising power: how we are trying to keep CALLector ethical
- Author
-
Chua, Cathy, Habibi, Hanieh, Rayner, Manny, and Tsourakis, Nikos
- Subjects
FOS: Computer and information sciences ,Computer Science - Computers and Society ,Computers and Society (cs.CY) ,Computer Science - Human-Computer Interaction ,Human-Computer Interaction (cs.HC) - Abstract
We present a brief overview of the CALLector project, and consider ethical questions arising from its overall goal of creating a social network to support creation and use of online CALL resources. We argue that these questions are best addressed in a decentralised, pluralistic open source architecture., 6 pages; based on talk presented at enetCollect WG3 & WG5 Meeting, Leiden, Holland, 2018
- Published
- 2019
12. A Voice Enabled Procedure Browser for the International Space Station
- Author
-
Rayner, Manny, Chatzichrisafis, Nikos, Hockey, Beth Ann, Farrell, Kim, and Renders, Jean-Michel
- Subjects
Space Sciences (General) - Abstract
Clarissa, an experimental voice enabled procedure browser that has recently been deployed on the International Space Station (ISS), is to the best of our knowledge the first spoken dialog system in space. This paper gives background on the system and the ISS procedures, then discusses the research developed to address three key problems: grammar-based speech recognition using the Regulus toolkit; SVM based methods for open microphone speech recognition; and robust side-effect free dialogue management for handling undos, corrections and confirmations.
- Published
- 2005
13. Corpus-Based Optimization of Language Models Derived from Unification Grammars
- Author
-
Rayner, Manny, Hockey, Beth Ann, James, Frankie, Bratt, Harry, Bratt, Elizabeth O, Gawron, Mark, Goldwater, Sharon, Dowding, John, and Bhagat, Amrita
- Subjects
Computer Programming And Software - Abstract
We describe a technique which makes it feasible to improve the performance of a language model derived from a manually constructed unification grammar, using low-quality untranscribed speech data and a minimum of human annotation. The method is on a medium-vocabulary spoken language command and control task.
- Published
- 2000
14. 'Do That Again': Evaluating Spoken Dialogue Interfaces
- Author
-
James, Frankie, Rayner, Manny, and Hockey, Beth Ann
- Subjects
Communications And Radar - Abstract
We present a new technique for evaluating spoken dialogue interfaces that allows us to separate the dialogue behavior from the rest of the speech system. By using a dialogue simulator that we have developed, we can gather usability data on the system s dialogue interaction and behaviors that can guide improvements to the speech interface. Preliminary testing has shown promising results, suggesting that it is possible to test properties of dialogue separately from other factors such as recognition quality.
- Published
- 2000
15. Translation and technology. The case of translation games for language learning
- Author
-
Bouillon, Pierrette, Cervini, Cristiana, Rayner, Manny, Université de Genève (UNIGE), Fiona Farr, Liam Murray, Bouillon, Pierrette, Cervini, Cristiana, and Manny, Rayner
- Subjects
spoken translation game, foreign language learning ,Spoken translation games ,CALL ,[SHS] Humanities and Social Sciences ,ddc:410.2 ,[SHS.LANGUE]Humanities and Social Sciences/Linguistics ,Speech recognition ,Computer-assisted language learning ,CALL-SLT ,[SHS.LANGUE] Humanities and Social Sciences/Linguistics ,ComputingMilieux_MISCELLANEOUS ,[SHS]Humanities and Social Sciences - Abstract
International audience
- Published
- 2016
16. Build Your Own Speech-Enabled Online CALL Course, No Experience Required
- Author
-
Rayner Manny, Baur Claudia, Bouillon Pierrette, Chua Cathy, and Tsourakis Nikos
- Subjects
CALL ,ddc:410.2 ,Speech recognition ,Open architectures ,Web - Abstract
We describe Open CALL-SLT, a framework that allows people who may only have modest computer skills to create interactive speech-enabled CALL courses on the web. User content is uploaded to a server and remotely compiled and deployed. Courses can be created at six levels of increasing sophistication, where the specification of the content ranges from simple prompt/response pairs at one end to multimedia-enabled gamified dialogues with multiple paths at the other. We briefly describe the overall framework and the different levels.
- Published
- 2015
17. A web-deployed Swedish spoken CALL systembased on a large shared English/Swedish feature grammar
- Author
-
Rayner, Manny, Gerlach, Johanna, Starlander, Marianne, Tsourakis, Nikos, Kruckenberg, Anita, Eklund, Robert, Jönsson, Arne, McAllister, Anita, and Chua, Cathy
- Subjects
Teknik och teknologier ,Engineering and Technology - Abstract
We describe a Swedish version of CALL-SLT,a web-deployed CALL system that allows beginner/intermediate students to practise generativespoken language skills. Speech recognitionis grammar-based, with language modelsderived, using the Regulus platform, fromsubstantial domain-independent feature grammars.The paper focusses on the Swedishgrammar resources, which were developedby generalising the existing English featuregrammar into a shared grammar for Englishand Swedish. It turns out that this can be donevery economically: all but a handful of rulesand features are shared, and English grammaressentially ends up being treated as a reducedform of Swedish. We conclude by presentinga simple evaluation which compares theSwedish and French versions of CALL-SLT.
- Published
- 2012
18. For A Fistful Of Dollars: Using Crowd-Sourcing To Evaluate A Spoken Language CALL Application
- Author
-
Rayner Manny, Frank Ian, Chua Cathy, Tsourakis Nikos, and Bouillon Pierrette
- Subjects
CALL ,Japanese ,ddc:410.2 ,Crowd-sourcing ,Speech recognition ,Evaluation - Abstract
We present an evaluation of a Web-deployed spoken language CALL system, carried out using crowd-sourcing methods. The system, “Survival Japanese”, is a crash course in tourist Japanese implemented within the platform CALL-SLT. The evaluation was carried out over one week using the Amazon Mechanical Turk. Although we found a high proportion of attempted scammers, there was a core of 23 subjects who used the system in a responsible manner. The evidence that these subjects learned from their 111 sessions and 9092 spoken interactions was significant at P=0.001. Our conclusion is that crowd-sourcing is a potentially valid method for evaluating spoken CALL systems.
- Published
- 2011
19. Bootstrapping a statistical speech translator from a rule-based one
- Author
-
Rayner, Manny, Estrella, Paula, Bouillon, Pierrette, and International Workshop on Free/Open-Source Rule-Based Machine Translation (2nd : 2011 : Barcelona)
- Subjects
Programari lliure ,Traducció automàtica ,Software libre ,Traducción automática ,bootstrapping ,Computational linguistics ,Lingüística computacional ,Traducció automàtica estadística ,Statistical ,Open source software ,Traducción automática estadística ,Machine translating - Abstract
We describe a series of experiments in which we start with English to French and English to Japanese versions of an Open Source rule-based speech translation system for a medical domain, and bootstrap correspondign statistical systems. Comparative evaluation reveals that the rule-based systems are still significantly better than the statistical ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; also, a hybrid system only marginally improved recall at the cost of a los in precision. The result suggests that rule-based architectures may still be preferable to statistical ones for safety-critical speech translation tasks. Describimos una serie de experimentos en los que comenzamos con las versiones de inglés a francés y de inglés a japonés de un sistema de traducción oral basado en reglas de código abierto para un dominio médico, e iniciamos los sistemas estadísticos correspondientes. La evaluación comparativa revela que los sistemas basados en reglas son significativamente mejores que los estadísticos, a pesar del hecho de que se ha dedicado un esfuerzo considerable en mejorar los componentes de reconocimiento y traducción; además, un sistema híbrido sólo mejoró de modo marginal la memoria a costa de una pérdida de precisión. Los resultados indican que las arquitecturas basadas en reglas podrían ser mejores para tareas de traducción oral críticas. Descrivim una sèrie d'experiments en què comencem amb les versions d'anglès a francès i d'anglès a japonès d'un sistema de traducció oral basat en regles en codi obert per a un domini mèdic, i iniciem els sistemes estadístics corresponents. L'avaluació comparativa revela que els sistemes basats en regles encara són significativament millors que els estadístics, malgrat el fet que s'ha dedicat un esforç important a millorar els components tant de reconeixement com de traducció; a més a més, un sistema híbrid només va millorar marginalment la memòria a costa d'una pèrdua de precisió. Els resultats indiquen que les arquitectures basades en regles podrien ser millors que les estadístiques per a tasques de traducció oral crítiques.
- Published
- 2010
20. Design Issues for a Bidirectional Mobile Medical Speech Translator
- Author
-
Tsourakis, Nikolaos, Bouillon, Pierrette, and Rayner, Manny
- Subjects
MedSLT ,ddc:410.2 ,Regulus ,Medical Speech Translation - Abstract
We argue that there is an urgent need for spoken dialogue translation systems in the medical domain. In this work we describe how an existing system of this kind, originally intended for a desktop PC, was adapted to run in a mobile environment. The different characteristics of the target device necessitate new interaction approaches and interface design. The new configuration uses two client applications running on two different devices, one for the doctor and one for the patient, together with a server; the components are connected over a wireless network. The system supports context-dependent translation in both directions. We give a general overview of the system, and discuss some of the relevant design issues pertaining to deployment of speech-enabled systems on mobile devices.
- Published
- 2009
21. Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
- Author
-
Bouillon, Pierrette, Halimi, Sonia, Nakao, Yukie, Kanzaki, Kyoko, Isahara, Hitoshi, Tsourakis, Nikos, Starlander, Marianne, Hockey, Beth Ann, Rayner, Manny, and Nakao, Yukie
- Subjects
[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL] ,ddc:410.2 ,ComputingMilieux_MISCELLANEOUS ,[INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD] - Abstract
We describe recent work on MedSLT, a medium-vocabulary interlingua-based medical speech translation system, focussing on issues that arise when handling languages of which the grammar engineer has little or no knowledge. We describe how we can systematically create and maintain multiple forms of grammars, lexica and interlingual representations, with some versions being used by language informants, and some by grammar engineers. In particular, we describe the advantages of structuring the interlingua definition as a simple semantic grammar, which includes a human-readable surface form. We show how this allows us to rationalise the process of evaluating translations between languages lacking common speakers. The grammar-based interlingua definition can also be used in other ways. We describe two applications: a simple generic tool for debugging to-interlingua translation rules, and a method for improving speech understanding performance by rescoring N-best speech hypothesis lists. Examples presented focus on the concrete case of translation between Japanese and Arabic in both directions.
- Published
- 2008
22. The 2008 MedSLT System
- Author
-
Rayner, Manny, Bouillon, Pierrette, Brotanek, Jane, Flores, Glenn, Halimi, Sonia, Hockey, Beth Ann, Isahara, Hitoshi, Kanzaki, Kyoko, Kron, Elisabeth, Nakao, Yukie, Santaholma, Marianne, Starlander, Marianne, Tsourakis, Nikos, and Nakao, Yukie
- Subjects
[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL] ,ComputingMilieux_MISCELLANEOUS ,[INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD] - Published
- 2008
23. Many-to-Many Multilingual Medical Speech Translation on a PDA
- Author
-
Bouillon, Pierrette, Flores, Glenn, Georgescul, Maria, Halimi, Sonia, Hockey, Beth Ann, Isahara, Hitoshi, Kanzaki, Kyoko, Nakao, Yukie, Rayner, Manny, Santaholma, Marianne, Starlander, Marianne, Tsourakis, Nikos, and Nakao, Yukie
- Subjects
ddc:410.2 ,ComputingMilieux_MISCELLANEOUS ,[INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD] ,[INFO.INFO-PL] Computer Science [cs]/Programming Languages [cs.PL] - Abstract
Particularly considering the requirement of high reliability, we argue that the most appropriate architecture for a medical speech translator that can be realised using today's technology combines unidirectional (doctor to patient) translation, medium-vocabulary controlled language coverage, interlingua-based translation, an embedded help component, and deployability on a hand-held hardware platform. We present an overview of the Open Source MedSLT prototype, which has been developed in accordance with these design principles. The system is implemented on top of the Regulus and Nuance 8.5 platforms, translates patient examination questions for all language pairs in the set {English, French, Japanese, Arabic, Catalan}, using vocabularies of about 400 to 1 100 words, and can be run in a distributed client/server environment, where the client application is hosted on a Nokia Internet Tablet device.
- Published
- 2008
24. Helping Domain Experts Build Phrasal Speech Translation Systems.
- Author
-
Rayner, Manny, Armando, Alejandro, Bouillon, Pierrette, Ebling, Sarah, Gerlach, Johanna, Halimi, Sonia, Strasly, Irene, and Tsourakis, Nikos
- Published
- 2016
- Full Text
- View/download PDF
25. Constructing Knowledge-Based Feedback in the Context of an Interactive Spoken CALL Application.
- Author
-
Tsourakis, Nikos, Baur, Claudia, and Rayner, Manny
- Published
- 2017
- Full Text
- View/download PDF
26. Representational and architectural issues in a limited-domain medical speech translator
- Author
-
Rayner, Manny, Bouillon, Pierrette, Santaholma, Marianne Elina, and Nakao, Yukie
- Subjects
ddc:410.2 ,Speech understanding ,Computer-aided diagnosis ,Speech translation - Abstract
We present an overview of MedSLT, a medium vocabulary medical speech translation system, focussing on the representational issues that arise when translating temporal and causal concepts. Although flat key/value structures are strongly preferred as semantic representations in speech understanding systems, we argue that it is infeasible to handle the necessary range of concepts using only flat structures. By exploiting the specific nature of the task, we show that it is possible to implement a solution which only slightly extends the representational complexity of the semantic representation language, by permitting an optional single nested level representing a subordinate clause construct. We sketch our solutions to the key problems of producing minimally nested representations using phrase-spotting methods, and writing cleanly structured rule-sets that map temporal and phrasal representations into a canonical interlingual form.
- Published
- 2005
27. A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation
- Author
-
Bouillon, Pierrette, Rayner, Manny, Chatzichrisafis, Nikos, Hockey, Beth Ann, Santaholma, Marianne Elina, Starlander, Marianne, Nakao, Yukie, Kanzaki, Kyoko, and Isahara, Hitoshi
- Subjects
ddc:410.2 - Abstract
We present an overview of MedSLT, an Open Source platform for developing limited-domain medical speech translation systems. We focus in particular on the speech understanding architecture, which uses grammar-based language models derived using corpus-based specialisation methods from a single linguistically motivated grammar, and summarise the results of two evaluations which investigate the appropriateness of these design choices. Other sections describe the interlingua and its relationship with the recognition architecture, and the current demo system.
- Published
- 2005
28. Comparing Rule-Based and Statistical Approaches to Speech Understanding in a Limited Domain Speech Translation System
- Author
-
Rayner, Manny, Bouillon, Pierrette, Hockey, Beth Ann, Chatzichrisafis, Nikos, and Starlander, Marianne
- Subjects
ddc:410.2 - Abstract
The paper directly compares two versions of a medical speech translation system, one with a grammar based language model (GLM) recognizer and the other with a statistical language model (SLM) recognizer. We construct the GLM using a corpus-based method, so that both the GLM and the SLM can be derived from the same corpus; evaluation is carried out with respect to performance on the speech translation task. Despite using a very small training set for both the GLM and the SLM, the SLM delivers much better word error rates on unseen test material. Nonetheless, evaluating both systems on translation performance rather than word error rates, the GLM-based version of the system outperforms the SLM on the actual translation task.
- Published
- 2004
29. Turning Speech Into Scripts
- Author
-
Rayner, Manny, Hockey, Beth Ann, and James, Frankie
- Subjects
FOS: Computer and information sciences ,H.5.2 ,Computer Science - Computation and Language ,I.2.7 ,Computation and Language (cs.CL) - Abstract
We describe an architecture for implementing spoken natural language dialogue interfaces to semi-autonomous systems, in which the central idea is to transform the input speech signal through successive levels of representation corresponding roughly to linguistic knowledge, dialogue knowledge, and domain knowledge. The final representation is an executable program in a simple scripting language equivalent to a subset of Cshell. At each stage of the translation process, an input is transformed into an output, producing as a byproduct a "meta-output" which describes the nature of the transformation performed. We show how consistent use of the output/meta-output distinction permits a simple and perspicuous treatment of apparently diverse topics including resolution of pronouns, correction of user misconceptions, and optimization of scripts. The methods described have been concretely realized in a prototype speech interface to a simulation of the Personal Satellite Assistant., Working notes from AAAI Spring Symposium
- Published
- 2000
30. Accuracy, Coverage, and Speed: What Do They Mean to Users?
- Author
-
James, Frankie, Rayner, Manny, and Hockey, Beth Ann
- Subjects
FOS: Computer and information sciences ,H.5.2 ,Computer Science - Computation and Language ,I.2.7 ,Computer Science - Human-Computer Interaction ,Computation and Language (cs.CL) ,Human-Computer Interaction (cs.HC) - Abstract
Speech is becoming increasingly popular as an interface modality, especially in hands- and eyes-busy situations where the use of a keyboard or mouse is difficult. However, despite the fact that many have hailed speech as being inherently usable (since everyone already knows how to talk), most users of speech input are left feeling disappointed by the quality of the interaction. Clearly, there is much work to be done on the design of usable spoken interfaces. We believe that there are two major problems in the design of speech interfaces, namely, (a) the people who are currently working on the design of speech interfaces are, for the most part, not interface designers and therefore do not have as much experience with usability issues as we in the CHI community do, and (b) speech, as an interface modality, has vastly different properties than other modalities, and therefore requires different usability measures., Position paper for CHI 2000 Workshop on Natural-Language Interaction
- Published
- 2000
31. Language-Processing Strategies and Mixed-Initiative Dialogues
- Author
-
Boye, Johan, Wirén, Mats, Rayner, Manny, Lewin, Ian, Carter, David, and Becket, Ralph
- Subjects
Språkteknologi (språkvetenskaplig databehandling) ,Language Technology (Computational Linguistics) - Abstract
We describe an implemented spoken-language dialogue system for a travel-planning domain, which accesses a commercially available travel-information web-server and supports a flexible mixed-initiative dialogue strategy. We argue, based on data from initial Wizard-of-Oz experiments, that mixed-initiative strategies are appropriate for many types of user, but require more sophisticated architectures for processing of language and dialogue; we then use these observations to motivate an architecture which combines parallel deep and shallow natural language analysis engines and an agenda-driven dialogue manager. We outline the top-level processing strategy used by the dialogue manager, and also a novel formalism, which we call Flat Utterance Description, that allows us to reduce the output of the deep and shallow language-processing engines to a common representation.
- Published
- 1999
32. Recycling Lingware in a Multilingual MT System
- Author
-
Rayner, Manny, Carter, David, Bretan, Ivan, Eklund, Robert, Wiren, Mats, Hansen, Steffen Leo, Kirchmeier-Andersen, Sabine, Philp, Christina, Sorensen, Finn, and Thomsen, Hanne Erdman
- Subjects
FOS: Computer and information sciences ,General Language Studies and Linguistics ,Computer Science - Computation and Language ,Jämförande språkvetenskap och allmän lingvistik ,Computation and Language (cs.CL) - Abstract
We describe two methods relevant to multi-lingual machine translation systems, which can be used to port linguistic data (grammars, lexicons and transfer rules) between systems used for processing related languages. The methods are fully implemented within the Spoken Language Translator system, and were used to create versions of the system for two new language pairs using only a month of expert effort., 6 pages, needs aclap.sty. To appear in "From Research to Commercial Applications" workshop at ACL-97, see also http://www.cam.sri.com
- Published
- 1997
33. Abductive Equivalential Translation and its application to Natural Language Database Interfacing
- Author
-
Rayner, Manny
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Computation and Language (cs.CL) - Abstract
The thesis describes a logical formalization of natural-language database interfacing. We assume the existence of a ``natural language engine'' capable of mediating between surface linguistic string and their representations as ``literal'' logical forms: the focus of interest will be the question of relating ``literal'' logical forms to representations in terms of primitives meaningful to the underlying database engine. We begin by describing the nature of the problem, and show how a variety of interface functionalities can be considered as instances of a type of formal inference task which we call ``Abductive Equivalential Translation'' (AET); functionalities which can be reduced to this form include answering questions, responding to commands, reasoning about the completeness of answers, answering meta-questions of type ``Do you know...'', and generating assertions and questions. In each case, a ``linguistic domain theory'' (LDT) $\Gamma$ and an input formula $F$ are given, and the goal is to construct a formula with certain properties which is equivalent to $F$, given $\Gamma$ and a set of permitted assumptions. If the LDT is of a certain specified type, whose formulas are either conditional equivalences or Horn-clauses, we show that the AET problem can be reduced to a goal-directed inference method. We present an abstract description of this method, and sketch its realization in Prolog. The relationship between AET and several problems previously discussed in the literature is discussed. In particular, we show how AET can provide a simple and elegant solution to the so-called ``Doctor on Board'' problem, and in effect allows a ``relativization'' of the Closed World Assumption. The ideas in the thesis have all been implemented concretely within the SRI CLARE project, using a real projects and payments database. The LDT for the example database is described in detail, and examples of the types of functionality that can be achieved within the example domain are presented., Comment: 162 pages, Latex source, PhD thesis (U Stockholm, 1993). Uses style-file ustockholm_thesis.sty
- Published
- 1994
34. The Swedish Core Language Engine
- Author
-
Gambäck, Björn and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
The paper describes a Swedish-language customization (S-CLE) of the SRI Core Language Engine, which has been developed at SICS from the original English-language version by replacing English-specific modules with corresponding Swedish-language versions. The S-CLE is intended to be used as a building block in a broad range of applications, such as data-base query system, machine translation systems, NL front-ends, speech-to-text/text-to-speech systems, and so on. Examples of the first two types of application already exist. The main part of the S-CLE is an extensive Swedish grammar that is compiled into parsing and generation modules. The grammar formalism is a type of unification grammar loosely based on Generalized Phrase Structure Grammar (GPSG). Generation is performed using the Semantic-Head-Driven algorithm. Analysis turns sentences into ``Quasi-Logical Form'' (QLF), a logical-form representation, while generation works in the opposite direction. Intermediate stages include processing of morphology, syntax and semantics. For knowledge-base applications, a separate module can convert QLFs into conventional scoped logical forms. After two-and-a-half years of work (approximately 45 person months), the first prototype system has a vocabulary of about 1900 words and covers a fairly broad range of possible grammatical constructions. Based on our experience in this project, we present in this paper detailed arguments to support the claim that customization of an English-language NLP system is a highly cost-effective way of constructing Swedish language systems with corresponding functionality. A shorter version of this paper appears in L. Ahrenberg (ed.): Papers from the Third Nordic Conference on Text Comprehension in Man and Machine , Link Sweden, 1992.
- Published
- 1992
35. Developing an EBL bypass for a large-scale natural language query interface to relational data bases
- Author
-
Samuelsson, Christer and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
The syntactic analysis component of a large-scale natural language query interface to relational data bases was greatly sped up by applying explanation-based learning(EBL), a machine learning technique. The idea is that one for most input queries can bypass the ordinary parser, instead using a set of learned rules. When no rule applies, one must pay the price of a small overhead. The set of learned rules is extracted automatically from sample queries posed by a user, thus tuning the system for that particular user. Several non-trivial problems, arising from the characteristics of the target system, were solved during the project. Measurements on a small test corpus indicated that the speed-ups when a learned rule could be used are on average a factor 30 and that the overhead, when no rule applied, is less than 3 percent. Extended version of a paper titled "Using Explanation-Based Learning to Increase Performance in a Large-Scale NL Query System" by the same authors, that was presented at the third DARPA Workshop on Speech and Natural Language, Hidden Valley, 1991. Original report number R91001.
- Published
- 1991
36. Bilingual conversation interpreter : a prototype interactive message translator. Final report
- Author
-
Alshawi, Hiyan, Brown, Carl, Carter, David, Gambäck, Björn, Pulman, Steve, and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
This document is the final report for a research project aimed at producing a prototype system for on-line translation of typed dialogues between speakers of different natural languages. The work was carried out jointly by SICS and SRI Cambridge. The resulting prototype system (called Billingual Conversation Interpreter, or BCI) translates between english and Swedish in both directions.The Major components of the BCI are two copies of the SRI Core Language Engine, equipped with English and Swedish grammars respectively. These are linked by the transfer and disambiguation components. Translation takes place by analyzing the source-language sentence into Quasi Logical Form ( QLF), a linguistically motivated logical representation, transferring this into a target-language QLF, and generating a target-language sentence. When ambiguities occur that cannot be resolved automatically, they are clarified by Querying the appropriate user. The clarification dialogue presupposes no knowledge of either linguistics or the other language. The prototype system has a broad grammatical coverage, a initial vocabulary of about 1000 words together with vocabulary expansion tools, and a set of English-Swedish transfer rules. The formalism developed for coding this linguistic information make it relatively easy to extend the system. We believe that the project was successful in demonstrating the feasibility of using these techniques for interactive translation applications, and provides a sound basis for development of a large scale message translator system with potential for commercial exploitation.The main sections of this report are the following: * A non-technical introduction, summarizing the BCI's design , and containing a sample session. * An overview of the Swedish version of the CLE. * A detailed discussion of the theory and practice of QLF transfer. * A description of the interactive disambiguation method. * Suggestions for possible follow-on projects aimed in the direction of practically usable commercial systems. This is a joint report by SICS and SRI, which is publ. by both institutes. SRI Cambridge Report CCSRC-018.
- Published
- 1991
37. Transfer through quasi logical form - A new approach to machine translation
- Author
-
Alshawi, Hiyan, Carter, David, Gambäck, Björn, Pulman, Steve, and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
This Document is an introduction to a research project aimed at producing a prototype system for on-line translation of typed dialogues between speakers of different natural languages. The work was carried out jointly by SICS and SRI Cambridge. The resulting prototype system (called Billingual Conversation Interpreter, or BCI) translates between English and Swedish in both Directions. The major components of the BCI are two copies of the SRI Core Language Engine, equipped with English and Swedish grammars respectively. These are linked by the transfer and disambiguation components. Translation takes place by analyzing the source-language sentence into Quasi Logical Form (QLF), a linguistically motivated logical representation, transferring this onto a target-language QLF, and generating a target-language sentence. We believe that the project was successful in demonstrating the feasibility of using these techniques for interactive translation applications, and provides a sound basis for development of a large scale message translator system. The final section of the paper points to several possible follow-on projects aimed in the direction of practically usable commercial systems.
- Published
- 1991
38. Contract bridge as a micro-world for reasoning about communication agents
- Author
-
Gambäck, Björn and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
We argue that bidding in the game of Contract Bridge can profitably be regarded as a micro-world suitable for experimenting with pragmatics. We sketch an analysis in which a "bidding system" is treated as the semantics of an artificial language, and show how this "language", despite its apparent simplicity, is capable of supporting a wide variety of common speech acts parallel to those in natural languages; we also argue that the reason for the relatively unsuccessful nature of previous attempts to write strong Bridge playing programs has been their failure to address the need to reason explicitly about knowledge, pragmatics, probabilities and plans. We give an overview of Pragma, a system currently under development at SICS, which embodies these ideas in concrete form, using a combination of rule-based inference, stochastic simulation, and "neural-net" learning. Examples are given illustrating the functionality of the system in its current form. Original report number R90011.
- Published
- 1990
39. Spoken Language Understanding via Supervised Learning and Linguistically Motivated Features.
- Author
-
Georgescul, Maria, Rayner, Manny, and Bouillon, Pierrette
- Abstract
In this paper, we reduce the rescoring problem in a spoken dialogue understanding task to a classification problem, by using the semantic error rate as the reranking target value. The classifiers we consider here are trained with linguistically motivated features. We present comparative experimental evaluation results of four supervised machine learning methods: Support Vector Machines, Weighted K-Nearest Neighbors, Naïve Bayes and Conditional Inference Trees. We provide a quantitative evaluation of learning and generalization during the classification supervised training, using cross validation and ROC analysis procedures. The reranking is derived using the posterior knowledge given by the classification algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
40. Training Statistical Language Models from Grammar-Generated Data: A Comparative Case-Study.
- Author
-
Hockey, Beth Ann, Rayner, Manny, and Christian, Gwen
- Abstract
Statistical language models (SLMs) for speech recognition have the advantage of robustness, and grammar-based models (GLMs) the advantage that they can be built even when little corpus data is available. A known way to attempt to combine these two methodologies is first to create a GLM, and then use that GLM to generate training data for an SLM. It has however been difficult to evaluate the true utility of the idea, since the corpus data used to create the GLM has not in general been explicitly available. We exploit the Open Source Regulus platform, which supports corpus-based construction of linguistically motivated GLMs, to perform a methodologically sound comparison: the same data is used both to create an SLM directly, and also to create a GLM, which is then used to generate data to train an SLM. An evaluation on a medium-vocabulary task showed that the indirect method of constructing the SLM is in fact only marginally better than the direct one. The method used to create the training data is critical, with PCFG generation heavily outscoring CFG generation. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
41. On the applicability of nonmonotonic logic to formal reasoning in continuous time.
- Author
-
Rayner, Manny
- Published
- 1992
42. Breaking the Language Barrier: Machine Assisted Diagnosis Using the Medical Speech Translator.
- Author
-
Starlander, Marianne, Bouillon, Pierrette, Rayner, Manny, Chatzichrisafis, Nikos, Hockey, Beth Ann, Isahara, Hitoshi, Kanzaki, Kyoko, Nakao, Yukie, and Santaholma, Marianne
- Abstract
In this paper, we describe and evaluate an Open Source medical speech translation system (MedSLT) intended for safety-critical applications. The aim of this system is to eliminate the language barriers in emergency situation. It translates spoken questions from English into French, Japanese and Finnish in three medical subdomains (headache, chest pain and abdominal pain), using a vocabulary of about 250-400 words per sub-domain. The architecture is a compromise between fixed-phrase translation on one hand and complex linguistically-based systems on the other. Recognition is guided by a Context Free Grammar Language Model compiled from a general unification grammar, automatically specialised for the domain. We present an evaluation of this initial prototype that shows the advantages of this grammar-based approach for this particular translation task in term of both reliability and use. [ABSTRACT FROM AUTHOR]
- Published
- 2005
43. On the applicability of default logic: two short papers
- Author
-
Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
Work by Yoav Shoham advocating the use of default logic is critically examined, and several serious inconsistencies are pointed out. Shoham's analysis of the frame problem is shown to depend on a misinterpretation of the formal basis for Newtonian mechanics, and strong reasons are given to doubt the correctness of his arguments for reducing auto-epistemic reasoning to default logic. Detailed counter-examples are provided for both cases. Original report number R88009. Related paper "Did Newton Solve the Extended Prediction problem?" by M. Rayner published in Proceedings 1st Intl. Conf. on the Foundations of Knowledge Representation, R. Brachman & H. Levesque, eds., Morgan Kaufman, 1989.
- Published
- 1988
44. Finding out = Achieving Decidability
- Author
-
Janson, Sverker and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
We present a framework for reasoning about the concepts of "knowing what" and "finding out", in which the key concept is to identify "finding out the answer to question Q" with "achieving a situation in which Q is decidable" . We give examples of how the framework can be used to formulate non-trivial problems involving the construction of plans to acquire and use information, and go on to demonstrate that these problems can often be solved by systematic application of a small set of goal-directed backward-chaining rules. In conclusion, it is suggested that systems of this kind are potentially implementable in l-Prolog, a logic programming language based on higher-order logic. Original report number R89016.
- Published
- 1989
45. Reasoning about procedural programs in a chess ending
- Author
-
Rayner, Manny and Hugosson, Åsa
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
We demonstrate how to reason about plans by treating them as programs expressed in a simple imperative language, and illustrate this using an example originally suggested by McCarthy and later discussed at length by McDermott. A detailed correctness proof is provided, which it is argued is greatly superior to McDermott's. Original report number R88003. Related paper, "Reasoning about Plans Represented as Procedural Programs" located in Artificial Intelligence III: Methodology, Systems, Applications; Proceedings of the Third International Conference on Artificial Intelligence: Methodology, Systems, Applications (AIMSA '88), Varna, Bulgaria, 1988, pp. 93-102. T. O'Shea and V. Sgurev, editors, North-Holland Press.
- Published
- 1988
46. An implementable semantics for comparative constructions
- Author
-
Rayner, Manny and Banks, Amelie
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
We describe a fairly comprehensive handling of the syntax and semantics of comparative constructions. The analysis is largely based on the theory developed by Pinkham, but we advance arguments to support a different handling of phrasal comparatives - in particular, we replace the use of C-ellipsis with a method of interpretation we call contrastive comparison. We explain the reasons for dividing comparative sentences into different categories, and for each category we give an example of the corresponding Montague semantics. The ideas have all been implemented within a large-scale grammar for Swedish, a "toy" version of which is presented, along with examples of the output. This report is a revised and extended version of two earlier papers: (1) "Comparatives in Logic Grammars-Two Viewpoints" located in Natural Language Understanding and Logic Programming II: Proceedings of the 2nd Int'l Workshop on Natural Language Understanding and Logic Programming, Vancouver, Canada, 1987, pp. 153-168, V. Dahl and P. Saint-Dizier, editors, North-Holland Press, 1988; and (2) "Parsing and Interpreting Comparatives" located in Proceedings of the 26th Annual Conference of the Ass. for Computational Linguistics (ACL), Buffalo, 1988, published by the ACL, 1988. Original report number R88018.
- Published
- 1988
47. On the applicability of non-monotonic logic to formal reasoning in continuous time
- Author
-
Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
The paper criticizes arguments recently advanced by Shoham, McDermott and Sandewall, which purport to demonstrate the relevance of non-monotonic logic to the formalization of reasoning about the evolution of mechanical systems in continuous time. The first half of the paper examines the "Extended Prediction Problem" of Shoham and McDermott; reasons are given to support the claim that the "problem" is the product of a mistaken understanding of the the formal basis of Newtonian mechanics, and has no real existence. An example is given showing how, contrary to Shoham and McDermott's arguments, it is possible to formalise reasoning about the evolution of physical systems in continuous time using only classical logic and differential calculus. The second half then reviews Sandewall's non-monotonic logic for almost-continuous systems. Here it is argued that the proposed framework offers only very marginal advantages in compactness of notation, and generally tends to collapse back into classical logic. In summary, I conclude that there is as yet no good reason to believe that non-monotonic logic will be a useful tool in this area. Original report number R98013.
- Published
- 1989
48. Applying explanation-based learning to natural-language processing (part 1)
- Author
-
Rayner, Manny
- Subjects
Computer and Information Sciences ,TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES ,Data- och informationsvetenskap - Abstract
It is shown how ideas adapted from recent work on explanation-based generalization can be used to allow a logic grammar to "learn" useful derived grammar rules by generalizing them from example sentences. The method is presented in the form of a small Prolog meta-interpreter, and its soundness is formally proved. Examples are given showing the application of the generalizer, first to a toy grammar with 40 rules and then to a largish independantly developed system which involves non-trivial syntactic and semantic analysis. Original report number R89014.
- Published
- 1989
49. Applying explanation-based learning to natural language processing (part 2)
- Author
-
Samuelsson, Christer and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
Explanation-based learning is a technique which attempts to optimize performance of a rule-based system by adding new rules constructed from generalizations of successfully-solved examples. The paper summarizes previous work showing how this idea can be used in natural language processing, and describes experiments in which the EBL method was applied to the CHAT-80 system of Pereira and Warren. In particular, we address the problem of assuring the utility of learning a rule, since the benefit of a learned rule may not outweigh the increased search time incurred in checking its applicability. We show that this problem can be overcome in the NL domain by indexing acquired rules by their lexical constraints, which in general vastly reduces the number of potentially applicable rules. Such an indexing method was implemented and timing studies were made comparing its access speed to that of normal linear search. The indexing scheme required an average access time of 35 - 40 ms independent of the number of learned rules. The results suggest that the overhead of the indexing scheme is small.
- Published
- 1989
50. On the applicability of nonmonotonic logic to formal reasoning in continuous time
- Author
-
Rayner, Manny
- Published
- 1991
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.