Author: "Rayner, Manny" / Search Limiters: Full Text - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Rayner, Manny"' showing total 79 results

Start Over Author "Rayner, Manny" Search Limiters Full Text

79 results on '"Rayner, Manny"'

1. Using LARA to Create Annotated Manuscripts and Inscriptions for Museums: An Initial Feasibility Study

Author: Bédi, Branislav, Chiera, Belinda, Chua, Cathy, Eyjólfsson, Brynjarr, Rayner, Manny, Orian Weiss, Catherine, and Zviel-Girshin, Rina
Abstract: We argue that museums presenting exhibits of ancient texts may be able to benefit from accompanying them with annotated electronic versions. We present a short study using two sample annotated texts built using the Learning And Reading Assistant (LARA) platform for a fragment of an Old Norse manuscript and an inscription in Ancient Egyptian hieroglyphs respectively. The results of an online questionnaire suggested that people already interested in ancient languages would view the idea very positively. [For the complete volume, "Intelligent CALL, Granular Systems and Learner Data: Short Papers from EUROCALL 2022 (30th, Reykjavik, Iceland, August 17-19, 2022)," see ED624779.]
Published: 2022

2. Easy as ABC: Using LARA to Build Multimedia Alphabet Books

Author: Akhlaghi-Baghoojari, Elham, Bédi, Branislav, Chua, Cathy, Horváthová, Ivana, Ivanova, Nedelina, Maizonniaux, Christèle, Mykhats, Marta, Chiaráin, Neasa Ní, Rayner, Manny, Orian Weiss, Catherine, and Zviel-Girshin, Rina
Abstract: We present a study in which multimedia alphabet books were constructed for ten languages using the Learning And Reading Assistant (LARA) platform. We describe the alphabet books we built, the different design features they instantiate, and an initial evaluation using an anonymous online questionnaire. Links are provided to the books themselves, which are freely available on the web. [For the complete volume, "Intelligent CALL, Granular Systems and Learner Data: Short Papers from EUROCALL 2022 (30th, Reykjavik, Iceland, August 17-19, 2022)," see ED624779.]
Published: 2022

3. Assessing the Quality of TTS Audio in the LARA Learning-by-Reading Platform

Author: Akhlaghi, Elham, Baczkowska, Anna, Berthelsen, Harald, Bédi, Branislav, Chua, Cathy, Cucchiarini, Catia, Habibi, Hanieh, Horváthová, Ivana, Hvalsøe, Pernille, Lotz, Roy, Maizonniaux, Christèle, Chiaráin, Neasa Ní, Rayner, Manny, Tsourakis, Nikos, and Yao, Chunlin
Abstract: A popular idea in Computer Assisted Language Learning (CALL) is to use multimodal annotated texts, with annotations typically including embedded audio and translations, to support L2 learning through reading. An important question is how to create the audio, which can be done either through human recording or by a Text-To-Speech (TTS) synthesis engine. We may reasonably expect TTS to be quicker and easier, but humans to be of higher quality. Here, we report a study using the open-source LARA platform and ten languages. Samples of LARA audio totaling about three and a half minutes were provided for each language in both human and TTS form; subjects used a web form to compare different versions of the same item and rate the voices as a whole. Although human voice was more often preferred, TTS achieved higher ratings in some languages and was close in others. [For the complete volume, "CALL and Professionalisation: Short Papers from EUROCALL 2021 (29th, Online, August 26-27, 2021)," see ED616972.]
Published: 2021

4. LARA: An Extensible Open Source Platform for Learning Languages by Reading

Author: Bédi, Branislav, Butterweck, Matt, Chua, Cathy, Gerlach, Johanna, Guðmarsdóttir, Birgitta Björg, Habibi, Hanieh, Jónsson, Bjartur Örn, Rayner, Manny, and Vigfússon, Sigurður
Abstract: Learning and Reading Assistant (LARA) is an open source platform that enables conversion of plain texts into an interactive multimedia form designed to support second- and foreign-language (L2) learners. In this workshop, we illustrate the open source aspects using collaborative work carried out during a six-week summer project at the Árni Magnússon Institute for Icelandic Studies. Three undergraduate level students extended the platform in different directions in cooperation with other members of the international LARA team. The three subprojects were respectively concerned with adding automatically generated flashcards, adding multimedia versions of poetic texts in the archaic language Old Norse, and extending LARA to allow the inclusion of sign language content in Icelandic sign language -- Íslenskt TáknMál (ÍTM). All three reached successful conclusions. [For the complete volume, "CALL for Widening Participation: Short Papers from EUROCALL 2020 (28th, Online, August 20-21, 2020)," see ED610330.]
Published: 2020

5. Constructing an Interactive Old Norse Text with LARA

Author: Bédi, Branislav, Bernharðsson, Haraldur, Chua, Cathy, Guðmarsdóttir, Birgitta Björg, Habibi, Hanieh, and Rayner, Manny
Abstract: We describe how the open-source Learning and Reading Assistant (LARA) platform was used to convert a classic Old Norse text, the "Völuspá," into an interactive online form. The LARA version includes high-quality recorded audio, translations, notes on key words and phrases, an automatically generated concordance, and links to other online resources. The interactive text was created in two different editions, one with Modern Icelandic translations designed to support Icelandic school students who read the poem as a set text, and one with English translations designed for English readers with basic Old Norse who wish to able to appreciate the poem in the original as a piece of literature. Initial feedback from groups has been positive. [For the complete volume, "CALL for Widening Participation: Short Papers from EUROCALL 2020 (28th, Online, August 20-21, 2020)," see ED610330.]
Published: 2020

6. Using LARA for Language Learning: A Pilot Study for Icelandic

Author: Bédi, Branislav, Chua, Cathy, Habibi, Hanieh, Martinez-Lopez, Ruth, and Rayner, Manny
Abstract: This paper presents a brief overview of LARA (Learning And Reading Assistant), an open source online tool that has been under development since summer 2018. LARA currently contains a corpus of about 25 texts in ten languages and a crowdsourcing model used to expand the corpus. The central goal is to provide support for improving second language (L2) reading comprehension. The focus here is on the development of Icelandic content and its use during pilot testing amongst adult L2 learners of Icelandic. Preliminary feedback from users, while mostly positive, contained suggestions on how the tool might be improved. [For the complete proceedings, see ED600837.]
Published: 2019

7. Alexa as a CALL platform for children: Where do we start?

Author: Tsourakis, Nikos, Rayner, Manny, Habibi, Hanieh, Gallais, Pierre-Emmanuel, Chua, Cathy, and Butterweck, Matt
Subjects: Computer Science - Human-Computer Interaction
Abstract: Amazon's Alexa is now widely available and shows interesting potential as a platform for hosting CALL games aimed at children. In this paper, we describe an initial informal experiment where we created some simple CALL games and made them available to a few child testers. We report the children's and parents' reactions. Our overall conclusion is that, although Alexa has many positive features, there are still fundamental platform issues in the current version that make it very difficult to build compelling CALL games for children. The games used will soon be freely available for download on the Alexa store., Comment: 4 pages. Based on talk given at enetCollect WG3 & WG5 Meeting, Leiden, Holland, 2018
Published: 2019

8. What do the founders of online communities owe to their users?

Author: Chua, Cathy and Rayner, Manny
Subjects: Computer Science - Computers and Society
Abstract: We discuss the organisation of internet communities, focusing on what we call the principle of "bait and switch": founders of internet communities often find it advantageous to recruit members by promising inducements which are later not honoured. We look at some of the dilemmas and ways of attempting to resolve them through two paradigmatic examples, Wikispaces and Wordpress. Our analysis is to a large extent motivated by the demands of CALLector, a university-centred social network we are in the process of establishing. We consider the question of what ethical standards are imposed on universities engaged in this type of activity., Comment: 6 pages. Paper based on talk at enetCollect WG3 & WG5 Meeting, Leiden 2018
Published: 2019

9. Decentralising power: how we are trying to keep CALLector ethical

Author: Chua, Cathy, Habibi, Hanieh, Rayner, Manny, and Tsourakis, Nikos
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Computers and Society
Abstract: We present a brief overview of the CALLector project, and consider ethical questions arising from its overall goal of creating a social network to support creation and use of online CALL resources. We argue that these questions are best addressed in a decentralised, pluralistic open source architecture., Comment: 6 pages; based on talk presented at enetCollect WG3 & WG5 Meeting, Leiden, Holland, 2018
Published: 2019

10. Helping Domain Experts Build Speech Translation Systems

Author: Rayner, Manny, Armando, Alejandro, Bouillon, Pierrette, Ebling, Sarah, Gerlach, Johanna, Halimi, Sonia, Strasly, Irene, and Tsourakis, Nikos
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Computation and Language
Abstract: We present a new platform, "Regulus Lite", which supports rapid development and web deployment of several types of phrasal speech translation systems using a minimal formalism. A distinguishing feature is that most development work can be performed directly by domain experts. We motivate the need for platforms of this type and discuss three specific cases: medical speech translation, speech-to-sign-language translation and voice questionnaires. We briefly describe initial experiences in developing practical systems., Comment: 12 pages, 1 figure, to appear in Proc. Future and Emerging Trends in Language Technology 2015, Seville, Spain
Published: 2015

11. A Compact Architecture for Dialogue Management Based on Scripts and Meta-Outputs

Author: Rayner, Manny, Hockey, Beth Ann, and James, Frankie
Subjects: Computer Science - Computation and Language, I.2.7, H.5.2
Abstract: We describe an architecture for spoken dialogue interfaces to semi-autonomous systems that transforms speech signals through successive representations of linguistic, dialogue, and domain knowledge. Each step produces an output, and a meta-output describing the transformation, with an executable program in a simple scripting language as the final result. The output/meta-output distinction permits perspicuous treatment of diverse tasks such as resolving pronouns, correcting user misconceptions, and optimizing scripts.
Published: 2000

12. A Comparison of the XTAG and CLE Grammars for English

Author: Hockey, Beth Ann, Rayner, Manny, and James, Frankie
Subjects: Computer Science - Computation and Language, I.2.7
Abstract: When people develop something intended as a large broad-coverage grammar, they usually have a more specific goal in mind. Sometimes this goal is covering a corpus; sometimes the developers have theoretical ideas they wish to investigate; most often, work is driven by a combination of these two main types of goal. What tends to happen after a while is that the community of people working with the grammar starts thinking of some phenomena as ``central'', and makes serious efforts to deal with them; other phenomena are labelled ``marginal'', and ignored. Before long, the distinction between ``central'' and ``marginal'' becomes so ingrained that it is automatic, and people virtually stop thinking about the ``marginal'' phenomena. In practice, the only way to bring the marginal things back into focus is to look at what other people are doing and compare it with one's own work. In this paper, we will take two large grammars, XTAG and the CLE, and examine each of them from the other's point of view. We will find in both cases not only that important things are missing, but that the perspective offered by the other grammar suggests simple and practical ways of filling in the holes. It turns out that there is a pleasing symmetry to the picture. XTAG has a very good treatment of complement structure, which the CLE to some extent lacks; conversely, the CLE offers a powerful and general account of adjuncts, which the XTAG grammar does not fully duplicate. If we examine the way in which each grammar does the thing it is good at, we find that the relevant methods are quite easy to port to the other framework, and in fact only involve generalization and systematization of existing mechanisms., Comment: 5th International Workshop on Tree Adjoining Grammars and Related Formalisms. 25-27 May 2000, Paris, France
Published: 2000

13. Accuracy, Coverage, and Speed: What Do They Mean to Users?

Author: James, Frankie, Rayner, Manny, and Hockey, Beth Ann
Subjects: Computer Science - Computation and Language, Computer Science - Human-Computer Interaction, H.5.2, I.2.7
Abstract: Speech is becoming increasingly popular as an interface modality, especially in hands- and eyes-busy situations where the use of a keyboard or mouse is difficult. However, despite the fact that many have hailed speech as being inherently usable (since everyone already knows how to talk), most users of speech input are left feeling disappointed by the quality of the interaction. Clearly, there is much work to be done on the design of usable spoken interfaces. We believe that there are two major problems in the design of speech interfaces, namely, (a) the people who are currently working on the design of speech interfaces are, for the most part, not interface designers and therefore do not have as much experience with usability issues as we in the CHI community do, and (b) speech, as an interface modality, has vastly different properties than other modalities, and therefore requires different usability measures., Comment: Position paper for CHI 2000 Workshop on Natural-Language Interaction
Published: 2000

14. Compiling Language Models from a Linguistically Motivated Unification Grammar

Author: Rayner, Manny, Hockey, Beth Ann, James, Frankie, Bratt, Elizabeth O., Goldwater, Sharon, and Gawron, Mark
Subjects: Computer Science - Computation and Language, I.2.7
Abstract: Systems now exist which are able to compile unification grammars into language models that can be included in a speech recognizer, but it is so far unclear whether non-trivial linguistically principled grammars can be used for this purpose. We describe a series of experiments which investigate the question empirically, by incrementally constructing a grammar and discovering what problems emerge when successively larger versions are compiled into finite state graph representations and used as language models for a medium-vocabulary recognition task., Comment: To be published in COLING 2000
Published: 2000

15. Turning Speech Into Scripts

Author: Rayner, Manny, Hockey, Beth Ann, and James, Frankie
Subjects: Computer Science - Computation and Language, H.5.2, I.2.7
Abstract: We describe an architecture for implementing spoken natural language dialogue interfaces to semi-autonomous systems, in which the central idea is to transform the input speech signal through successive levels of representation corresponding roughly to linguistic knowledge, dialogue knowledge, and domain knowledge. The final representation is an executable program in a simple scripting language equivalent to a subset of Cshell. At each stage of the translation process, an input is transformed into an output, producing as a byproduct a "meta-output" which describes the nature of the transformation performed. We show how consistent use of the output/meta-output distinction permits a simple and perspicuous treatment of apparently diverse topics including resolution of pronouns, correction of user misconceptions, and optimization of scripts. The methods described have been concretely realized in a prototype speech interface to a simulation of the Personal Satellite Assistant., Comment: Working notes from AAAI Spring Symposium
Published: 2000

16. Translation Methodology in the Spoken Language Translator: An Evaluation

Author: Carter, David, Becket, Ralph, Rayner, Manny, Eklund, ; Robert, MacDermid, Catriona, Wiren, Mats, Kirchmeier-Andersen, ; Sabine, and Philp, Christina
Subjects: Computer Science - Computation and Language
Abstract: In this paper we describe how the translation methodology adopted for the Spoken Language Translator (SLT) addresses the characteristics of the speech translation task in a context where it is essential to achieve easy customization to new languages and new domains. We then discuss the issues that arise in any attempt to evaluate a speech translator, and present the results of such an evaluation carried out on SLT for several language pairs., Comment: 10 pages, needs aclap.sty. To appear in Spoken Language Translation workshop at (E)ACL-97
Published: 1997

17. Recycling Lingware in a Multilingual MT System

Author: Rayner, Manny, Carter, David, Bretan, Ivan, Eklund, Robert, Wiren, Mats, Hansen, Steffen Leo, Kirchmeier-Andersen, Sabine, Philp, Christina, Sorensen, Finn, and Thomsen, Hanne Erdman
Subjects: Computer Science - Computation and Language
Abstract: We describe two methods relevant to multi-lingual machine translation systems, which can be used to port linguistic data (grammars, lexicons and transfer rules) between systems used for processing related languages. The methods are fully implemented within the Spoken Language Translator system, and were used to create versions of the system for two new language pairs using only a month of expert effort., Comment: 6 pages, needs aclap.sty. To appear in "From Research to Commercial Applications" workshop at ACL-97, see also http://www.cam.sri.com
Published: 1997

18. Hybrid language processing in the Spoken Language Translator

Author: Rayner, Manny and Carter, David
Subjects: Computer Science - Computation and Language
Abstract: The paper presents an overview of the Spoken Language Translator (SLT) system's hybrid language-processing architecture, focussing on the way in which rule-based and statistical methods are combined to achieve robust and efficient performance within a linguistically motivated framework. In general, we argue that rules are desirable in order to encode domain-independent linguistic constraints and achieve high-quality grammatical output, while corpus-derived statistics are needed if systems are to be efficient and robust; further, that hybrid architectures are superior from the point of view of portability to architectures which only make use of one type of information. We address the topics of ``multi-engine'' strategies for robust translation; robust bottom-up parsing using pruning and grammar specialization; rational development of linguistic rule-sets using balanced domain corpora; and efficient supervised training by interactive disambiguation. All work described is fully implemented in the current version of the SLT-2 system., Comment: 4 pages, uses icassp97.sty; to appear in ICASSP-97; see http://www.cam.sri.com for related material
Published: 1997

19. Adapting the Core Language Engine to French and Spanish

Author: Rayner, Manny, Carter, David, and Bouillon, Pierrette
Subjects: Computer Science - Computation and Language
Abstract: We describe how substantial domain-independent language-processing systems for French and Spanish were quickly developed by manually adapting an existing English-language system, the SRI Core Language Engine. We explain the adaptation process in detail, and argue that it provides a fairly general recipe for converting a grammar-based system for English into a corresponding one for a Romance language., Comment: 9 pages, aclap.sty; to appear in NLP+IA 96; see also http://www.cam.sri.com/
Published: 1996

20. Fast Parsing using Pruning and Grammar Specialization

Author: Rayner, Manny and Carter, David
Subjects: Computer Science - Computation and Language
Abstract: We show how a general grammar may be automatically adapted for fast parsing of utterances from a specific domain by means of constituent pruning and grammar specialization based on explanation-based learning. These methods together give an order of magnitude increase in speed, and the coverage loss entailed by grammar specialization is reduced to approximately half that reported in previous work. Experiments described here suggest that the loss of coverage has been reduced to the point where it no longer causes significant performance degradation in the context of a real application., Comment: 8 pages LaTeX, ACL-96, needs aclap.sty; see also http://www.cam.sri.com/
Published: 1996

21. Hybrid Transfer in an English-French Spoken Language Translator

Author: Rayner, Manny and Bouillon, Pierrette
Subjects: Computer Science - Computation and Language
Abstract: The paper argues the importance of high-quality translation for spoken language translation systems. It describes an architecture suitable for rapid development of high-quality limited-domain translation systems, which has been implemented within an advanced prototype English to French spoken language translator. The focus of the paper is the hybrid transfer model which combines unification-based rules and a set of trainable statistical preferences; roughly, rules encode domain-independent grammatical information and preferences encode domain-dependent distributional information. The preferences are trained from sets of examples produced by the system, which have been annotated by human judges as correct or incorrect. An experiment is described in which the model was tested on a 2000 utterance sample of previously unseen data., Comment: 7 pages, LaTeX (2.09 preferred); eaclap.sty; Procs of IA '95 (Montpellier, France)
Published: 1995

22. The Speech-Language Interface in the Spoken Language Translator

Author: Carter, David and Rayner, Manny
Subjects: Computer Science - Computation and Language
Abstract: The Spoken Language Translator is a prototype for practically useful systems capable of translating continuous spoken language within restricted domains. The prototype system translates air travel (ATIS) queries from spoken English to spoken Swedish and to French. It is constructed, with as few modifications as possible, from existing pieces of speech and language processing software. The speech recognizer and language understander are connected by a fairly conventional pipelined N-best interface. This paper focuses on the ways in which the language processor makes intelligent use of the sentence hypotheses delivered by the recognizer. These ways include (1) producing modified hypotheses to reflect the possible presence of repairs in the uttered word sequence; (2) fast parsing with a version of the grammar automatically specialized to the more frequent constructions in the training corpus; and (3) allowing syntactic and semantic factors to interact with acoustic ones in the choice of a meaning structure for translation, so that the acoustically preferred hypothesis is not always selected even if it is within linguistic coverage., Comment: 9 pages, LaTeX. Published: Proceedings of TWLT-8, December 1994
Published: 1994

23. CLARE: A Contextual Reasoning and Cooperative Response Framework for the Core Language Engine

Author: Alshawi, Hiyan, Carter, David, Crouch, Richard, Pulman, Steve, Rayner, Manny, and Smith, Arnold
Subjects: Computer Science - Computation and Language
Abstract: This report describes the research, design and implementation work carried out in building the CLARE system at SRI International, Cambridge, England. CLARE was designed as a natural language processing system with facilities for reasoning and understanding in context and for generating cooperative responses. The project involved both further development of SRI's Core Language Engine (Alshawi, 1992, MIT Press) natural language processor and the design and implementation of new components for reasoning and response generation. The CLARE system has advanced the state of the art in a wide variety of areas, both through the use of novel techniques developed on the project, and by extending the coverage or scale of known techniques. The language components are application-independent and provide interfaces for the development of new types of application., Comment: 250 pages, uuencoded compressed tar-ed LaTeX. Written 1992
Published: 1994

24. Estimating Performance of Pipelined Spoken Language Translation Systems

Author: Rayner, Manny, Carter, David, Price, Patti, and Lyberg, Bertil
Subjects: Computer Science - Computation and Language
Abstract: Most spoken language translation systems developed to date rely on a pipelined architecture, in which the main stages are speech recognition, linguistic analysis, transfer, generation and speech synthesis. When making projections of error rates for systems of this kind, it is natural to assume that the error rates for the individual components are independent, making the system accuracy the product of the component accuracies. The paper reports experiments carried out using the SRI-SICS-Telia Research Spoken Language Translator and a 1000-utterance sample of unseen data. The results suggest that the naive performance model leads to serious overestimates of system error rates, since there are in fact strong dependencies between the components. Predicting the system error rate on the independence assumption by simple multiplication resulted in a 16\% proportional overestimate for all utterances, and a 19\% overestimate when only utterances of length 1-10 words were considered., Comment: 10 pages, Latex source. To appear in Proc. ICSLP '94
Published: 1994

25. Combining Knowledge Sources to Reorder N-Best Speech Hypothesis Lists

Author: Rayner, Manny, Carter, David, Digalakis, Vassilios, and Price, Patti
Subjects: Computer Science - Computation and Language
Abstract: A simple and general method is described that can combine different knowledge sources to reorder N-best lists of hypotheses produced by a speech recognizer. The method is automatically trainable, acquiring information from both positive and negative examples. Experiments are described in which it was tested on a 1000-utterance sample of unseen ATIS data., Comment: 13 pages, Latex source. To appear in Proc. HLT '94
Published: 1994

26. Abductive Equivalential Translation and its application to Natural Language Database Interfacing

Author: Rayner, Manny
Subjects: Computer Science - Computation and Language
Abstract: The thesis describes a logical formalization of natural-language database interfacing. We assume the existence of a ``natural language engine'' capable of mediating between surface linguistic string and their representations as ``literal'' logical forms: the focus of interest will be the question of relating ``literal'' logical forms to representations in terms of primitives meaningful to the underlying database engine. We begin by describing the nature of the problem, and show how a variety of interface functionalities can be considered as instances of a type of formal inference task which we call ``Abductive Equivalential Translation'' (AET); functionalities which can be reduced to this form include answering questions, responding to commands, reasoning about the completeness of answers, answering meta-questions of type ``Do you know...'', and generating assertions and questions. In each case, a ``linguistic domain theory'' (LDT) $\Gamma$ and an input formula $F$ are given, and the goal is to construct a formula with certain properties which is equivalent to $F$, given $\Gamma$ and a set of permitted assumptions. If the LDT is of a certain specified type, whose formulas are either conditional equivalences or Horn-clauses, we show that the AET problem can be reduced to a goal-directed inference method. We present an abstract description of this method, and sketch its realization in Prolog. The relationship between AET and several problems previously discussed in the literature is discussed. In particular, we show how AET can provide a simple and elegant solution to the so-called ``Doctor on Board'' problem, and in effect allows a ``relativization'' of the Closed World Assumption. The ideas in the thesis have all been implemented concretely within the SRI CLARE project, using a real projects and payments database. The LDT for the example database is described in detail, and examples of the types of functionality that can be achieved within the example domain are presented., Comment: 162 pages, Latex source, PhD thesis (U Stockholm, 1993). Uses style-file ustockholm_thesis.sty
Published: 1994

27. Build Your Own Speech-Enabled Online CALL Course, No Experience Required

Author: Rayner Manny, Baur Claudia, Bouillon Pierrette, Chua Cathy, and Tsourakis Nikos
Subjects: CALL, ddc:410.2, Speech recognition, Open architectures, Web
Abstract: We describe Open CALL-SLT, a framework that allows people who may only have modest computer skills to create interactive speech-enabled CALL courses on the web. User content is uploaded to a server and remotely compiled and deployed. Courses can be created at six levels of increasing sophistication, where the specification of the content ranges from simple prompt/response pairs at one end to multimedia-enabled gamified dialogues with multiple paths at the other. We briefly describe the overall framework and the different levels.
Published: 2015

28. Supervised Learning of Response Grammars in a Spoken CALL System

Author: Rayner Manny, Baur Claudia, Chua Cathy, and Tsourakis Nikos
Abstract: We summarise experiments carried out using a system initiative spoken CALL system in which permitted responses to prompts are defined using a minimal formalism based on templates and regular expressions and describe a simple structural learning algorithm that uses annotated data to update response definitions. Using 1 927 utterances of training data we obtained a relative improvement of 20 in the system’s ability to react differentially to correct and incorrect input measured on a previously unseen test set. The results are significant at p < 0:005.
Published: 2015

29. Formative Feedback in an Interactive Spoken CALL System

Author: Tsourakis Nikos, Rayner Manny, and Baur Claudia
Abstract: By definition spoken dialogue CALL systems should be easy to use and understand. However interaction in this context is often far from unhindered. In this paper we introduce a formative feedback mechanism in our CALL system which can monitor interaction report errors and provide advice and suggestions to users. The distinctive feature of this mechanism is the ability to combine information from different sources and decide on the most pertinent feedback which can also be adapted in terms of phrasing style and language. We conducted experiments at three secondary schools in German speaking Switzerland and the obtained results suggest that our feedback mechanism helps students during interaction and contributes as a motivating factor.
Published: 2014

30. Crafting Interesting Dialogues in an Interactive Spoken CALL System

Author: Baur Claudia, Rayner Manny, and Tsourakis Nikos
Subjects: ComputingMilieux_COMPUTERSANDEDUCATION
Abstract: We describe work carried out in the context of a project whose goal is to develop a web enabled serious game designed to help German speaking beginner students of English improve their generative and auditory competence. The game which is currently undergoing a first round of formal user testing is intended as a self study adjunct to normal classroom instruction. It was developed in collaboration with a secondary school teacher with the content taken from a textbook commonly used in German speaking Switzerland and offers a short course of 8 interactive lessons using a combined vocabulary of about 450 words. A lesson is structured as a short prompted dialogue between the student and the machine where the student is encouraged to use simple language in practical contexts like booking a hotel room buying clothes or ordering a meal in a restaurant. We present an overview of the architecture which is designed to balance three main considerations: it should support coherent grammatical responses and reasonably interesting dialogues but also allow course content to be constructed by personnel who do not necessarily have a computer science background. We use as an example a new strategy we have recently developed where the central idea is to add further dialogue paths to the scripts graded by their degree of “cooperativeness”. The system starts with a simple cooperative interaction and opens up increasingly "uncooperative" dialogue paths as the student gains familiarity with the game.
Published: 2014

31. CALL-SLT Lite: A Minimal Framework for Building Interactive Speech-Enabled CALL Applications

Author: Rayner Manny, Baur Claudia, and Tsourakis Nikos
Abstract: We present a framework CALL SLT Lite which can be used by people with only very basic software skills to create interactive multimodal speech enabled CALL games suitable for beginner/low intermediate child language learners. The games are deployed over the web and can be accessed through a normal browser and the framework is completely language independent. As the name suggests the framework grew out of an earlier platform CALL SLT which enables construction of similar games but uses a more sophisticated architecture. We review the history of the project describing the type of game we are aiming to build and our reasons for believing that they are useful and then present CALL SLT Lite and an initial evaluation comparing the performance of the two versions of the framework. The results suggest that the Lite framework although much simpler offers performance at least as good as that of the original system.
Published: 2014

32. Using a Serious Game to Collect a Child Learner Speech Corpus

Author: Baur Claudia, Rayner Manny, and Tsourakis Nikos
Abstract: We present an English L2 child learner speech corpus produced by Swiss German L1 students in their third year of learning English which is currently in the process of being collected. The collection method uses a web enabled multimodal language game implemented using the CALL SLT platform in which subjects hold prompted conversations with an animated agent. Prompts consist of a short animated English language video clip together with a German language piece of text indicating the semantic content of the requested response. Grammar based speech understanding is used to decide whether responses are accepted or rejected and dialogue flow is controlled using a simple XML based scripting language; the scripts are written to allow multiple dialogue paths the choice being made randomly. The system is gamified using a score and badge framework with four levels of badges. We describe the application the data collection and annotation procedures and the initial tranche of data. The full corpus when complete should contain at least 5 000 annotated utterances.
Published: 2014

33. A web-deployed Swedish spoken CALL systembased on a large shared English/Swedish feature grammar

Author: Rayner, Manny, Gerlach, Johanna, Starlander, Marianne, Tsourakis, Nikos, Kruckenberg, Anita, Eklund, Robert, Jönsson, Arne, McAllister, Anita, and Chua, Cathy
Subjects: Teknik och teknologier, Engineering and Technology
Abstract: We describe a Swedish version of CALL-SLT,a web-deployed CALL system that allows beginner/intermediate students to practise generativespoken language skills. Speech recognitionis grammar-based, with language modelsderived, using the Regulus platform, fromsubstantial domain-independent feature grammars.The paper focusses on the Swedishgrammar resources, which were developedby generalising the existing English featuregrammar into a shared grammar for Englishand Swedish. It turns out that this can be donevery economically: all but a handful of rulesand features are shared, and English grammaressentially ends up being treated as a reducedform of Swedish. We conclude by presentinga simple evaluation which compares theSwedish and French versions of CALL-SLT.
Published: 2012

34. A Scalable Architecture For Web Deployment of Spoken Dialogue Systems

Author: Fuchs Matthew, Tsourakis Nikos, and Rayner Manny
Abstract: We describe a scalable architecture particularly well suited to cloud based computing which can be used for Web deployment of spoken dialogue systems. In common with similar platforms like WAMI and the Nuance Mobile Developer Platform we use a client/server approach in which speech recognition is carried out on the server side; our architecture however differs from these systems in offering considerably more elaborate server side functionality based on large scale grammar based language processing and generic dialogue management. We describe two substantial applications built using our framework which we argue would have been hard to construct in WAMI or NMDP. Finally we present a series of evaluations carried out using CALL SLT a speech translation game where we contrast performance in Web and desktop versions. Task Error Rate in the Web version is only slightly inferior that in the desktop one and the average additional latency is under half a second. The software is generally available for research purposes.
Published: 2012

35. Evaluating A Web-Based Spoken Language Translation Game For Learning Domain Language

Author: Bouillon Pierrette, Halimi Sonia, Rayner Manny, and Tsourakis Nikos
Abstract: We present an evaluation of CALL SLT a web based CALL application based on the “translation game” idea that can be used for practicing fluency in a limited domain. The version tested was configured to teach basic restaurant French to students whose native language is Arabic. Students spent about an hour and a half each working with the system and explored between five and eight lessons. The focus was on investigating how they interacted with the system and what they learned most effectively.
Published: 2011

36. Evaluation of a Mobile Language Learning System Using Language-Neutral Prompts

Author: Tsourakis Nikos, Rayner Manny, and Bouillon Pierrette
Subjects: ComputingMilieux_COMPUTERSANDEDUCATION
Abstract: We describe two versions of a prototype system for computer assisted language learning hosted on a mobile device. In both versions the student is given prompts by the machine which they then have to render in the L2. The two versions differ with respect to the modality of the prompt: one presents it as an L1 based text string while the other one presents it in pictorial form. The two versions were tested on a group of 32 high school students as part of the Geneva University Student Week. Our findings suggest that male students found the pictorial version significantly easier to use while female students preferred the text version.
Published: 2011

37. A Student-Centered Evaluation of a Web-Based Spoken Translation Game

Author: Bouillon Pierrette, Rayner Manny, Tsourakis Nikos, and Zhang Q
Abstract: We present an evaluation of CALL SLT a web deployed speech enabled platform for improving fluency in a limited domain based on the “translation game” idea ofWang and Seneff. The evaluation used 10 Chinese speaking students of French who spent an average of about three hours each practising on a set of five lessons covering elementary and intermediate grammar topics in a restaurant domain. We found significant improvements in student performance measured both in terms of their ability to be recognised correctly by the application and according to written vocabulary and grammar tests.
Published: 2011

38. For A Fistful Of Dollars: Using Crowd-Sourcing To Evaluate A Spoken Language CALL Application

Author: Rayner Manny, Frank Ian, Chua Cathy, Tsourakis Nikos, and Bouillon Pierrette
Subjects: CALL, Japanese, ddc:410.2, Crowd-sourcing, Speech recognition, Evaluation
Abstract: We present an evaluation of a Web-deployed spoken language CALL system, carried out using crowd-sourcing methods. The system, “Survival Japanese”, is a crash course in tourist Japanese implemented within the platform CALL-SLT. The evaluation was carried out over one week using the Amazon Mechanical Turk. Although we found a high proportion of attempted scammers, there was a core of 23 subjects who used the system in a responsible manner. The evidence that these subjects learned from their 111 sessions and 9092 spoken interactions was significant at P=0.001. Our conclusion is that crowd-sourcing is a potentially valid method for evaluating spoken CALL systems.
Published: 2011

39. Bootstrapping a statistical speech translator from a rule-based one

Author: Rayner, Manny, Estrella, Paula, Bouillon, Pierrette, and International Workshop on Free/Open-Source Rule-Based Machine Translation (2nd : 2011 : Barcelona)
Subjects: Programari lliure, Traducció automàtica, Software libre, Traducción automática, bootstrapping, Computational linguistics, Lingüística computacional, Traducció automàtica estadística, Statistical, Open source software, Traducción automática estadística, Machine translating
Abstract: We describe a series of experiments in which we start with English to French and English to Japanese versions of an Open Source rule-based speech translation system for a medical domain, and bootstrap correspondign statistical systems. Comparative evaluation reveals that the rule-based systems are still significantly better than the statistical ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; also, a hybrid system only marginally improved recall at the cost of a los in precision. The result suggests that rule-based architectures may still be preferable to statistical ones for safety-critical speech translation tasks. Describimos una serie de experimentos en los que comenzamos con las versiones de inglés a francés y de inglés a japonés de un sistema de traducción oral basado en reglas de código abierto para un dominio médico, e iniciamos los sistemas estadísticos correspondientes. La evaluación comparativa revela que los sistemas basados en reglas son significativamente mejores que los estadísticos, a pesar del hecho de que se ha dedicado un esfuerzo considerable en mejorar los componentes de reconocimiento y traducción; además, un sistema híbrido sólo mejoró de modo marginal la memoria a costa de una pérdida de precisión. Los resultados indican que las arquitecturas basadas en reglas podrían ser mejores para tareas de traducción oral críticas. Descrivim una sèrie d'experiments en què comencem amb les versions d'anglès a francès i d'anglès a japonès d'un sistema de traducció oral basat en regles en codi obert per a un domini mèdic, i iniciem els sistemes estadístics corresponents. L'avaluació comparativa revela que els sistemes basats en regles encara són significativament millors que els estadístics, malgrat el fet que s'ha dedicat un esforç important a millorar els components tant de reconeixement com de traducció; a més a més, un sistema híbrid només va millorar marginalment la memòria a costa d'una pèrdua de precisió. Els resultats indiquen que les arquitectures basades en regles podrien ser millors que les estadístiques per a tasques de traducció oral crítiques.
Published: 2010

40. A Multilingual Platform for Building Speech-Enabled Language Courses

Author: Rayner Manny, Bouillon Pierrette, Tsourakis Nikos, Gerlach Johanna, Baur Claudia, Georgecul Maria, and Nakao Yukie
Published: 2010

41. Design Issues for a Bidirectional Mobile Medical Speech Translator

Author: Tsourakis, Nikolaos, Bouillon, Pierrette, and Rayner, Manny
Subjects: MedSLT, ddc:410.2, Regulus, Medical Speech Translation
Abstract: We argue that there is an urgent need for spoken dialogue translation systems in the medical domain. In this work we describe how an existing system of this kind, originally intended for a desktop PC, was adapted to run in a mobile environment. The different characteristics of the target device necessitate new interaction approaches and interface design. The new configuration uses two client applications running on two different devices, one for the doctor and one for the patient, together with a server; the components are connected over a wireless network. The system supports context-dependent translation in both directions. We give a general overview of the system, and discuss some of the relevant design issues pertaining to deployment of speech-enabled systems on mobile devices.
Published: 2009

42. Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System

Author: Bouillon, Pierrette, Halimi, Sonia, Nakao, Yukie, Kanzaki, Kyoko, Isahara, Hitoshi, Tsourakis, Nikos, Starlander, Marianne, Hockey, Beth Ann, Rayner, Manny, and Nakao, Yukie
Subjects: [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], ddc:410.2, ComputingMilieux_MISCELLANEOUS, [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD]
Abstract: We describe recent work on MedSLT, a medium-vocabulary interlingua-based medical speech translation system, focussing on issues that arise when handling languages of which the grammar engineer has little or no knowledge. We describe how we can systematically create and maintain multiple forms of grammars, lexica and interlingual representations, with some versions being used by language informants, and some by grammar engineers. In particular, we describe the advantages of structuring the interlingua definition as a simple semantic grammar, which includes a human-readable surface form. We show how this allows us to rationalise the process of evaluating translations between languages lacking common speakers. The grammar-based interlingua definition can also be used in other ways. We describe two applications: a simple generic tool for debugging to-interlingua translation rules, and a method for improving speech understanding performance by rescoring N-best speech hypothesis lists. Examples presented focus on the concrete case of translation between Japanese and Arabic in both directions.
Published: 2008

43. Many-to-Many Multilingual Medical Speech Translation on a PDA

Author: Bouillon, Pierrette, Flores, Glenn, Georgescul, Maria, Halimi, Sonia, Hockey, Beth Ann, Isahara, Hitoshi, Kanzaki, Kyoko, Nakao, Yukie, Rayner, Manny, Santaholma, Marianne, Starlander, Marianne, Tsourakis, Nikos, and Nakao, Yukie
Subjects: ddc:410.2, ComputingMilieux_MISCELLANEOUS, [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD], [INFO.INFO-PL] Computer Science [cs]/Programming Languages [cs.PL]
Abstract: Particularly considering the requirement of high reliability, we argue that the most appropriate architecture for a medical speech translator that can be realised using today's technology combines unidirectional (doctor to patient) translation, medium-vocabulary controlled language coverage, interlingua-based translation, an embedded help component, and deployability on a hand-held hardware platform. We present an overview of the Open Source MedSLT prototype, which has been developed in accordance with these design principles. The system is implemented on top of the Regulus and Nuance 8.5 platforms, translates patient examination questions for all language pairs in the set {English, French, Japanese, Arabic, Catalan}, using vocabularies of about 400 to 1 100 words, and can be run in a distributed client/server environment, where the client application is hosted on a Nokia Internet Tablet device.
Published: 2008

44. Representational and architectural issues in a limited-domain medical speech translator

Author: Rayner, Manny, Bouillon, Pierrette, Santaholma, Marianne Elina, and Nakao, Yukie
Subjects: ddc:410.2, Speech understanding, Computer-aided diagnosis, Speech translation
Abstract: We present an overview of MedSLT, a medium vocabulary medical speech translation system, focussing on the representational issues that arise when translating temporal and causal concepts. Although flat key/value structures are strongly preferred as semantic representations in speech understanding systems, we argue that it is infeasible to handle the necessary range of concepts using only flat structures. By exploiting the specific nature of the task, we show that it is possible to implement a solution which only slightly extends the representational complexity of the semantic representation language, by permitting an optional single nested level representing a subordinate clause construct. We sketch our solutions to the key problems of producing minimally nested representations using phrase-spotting methods, and writing cleanly structured rule-sets that map temporal and phrasal representations into a canonical interlingual form.
Published: 2005

45. A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation

Author: Bouillon, Pierrette, Rayner, Manny, Chatzichrisafis, Nikos, Hockey, Beth Ann, Santaholma, Marianne Elina, Starlander, Marianne, Nakao, Yukie, Kanzaki, Kyoko, and Isahara, Hitoshi
Subjects: ddc:410.2
Abstract: We present an overview of MedSLT, an Open Source platform for developing limited-domain medical speech translation systems. We focus in particular on the speech understanding architecture, which uses grammar-based language models derived using corpus-based specialisation methods from a single linguistically motivated grammar, and summarise the results of two evaluations which investigate the appropriateness of these design choices. Other sections describe the interlingua and its relationship with the recognition architecture, and the current demo system.
Published: 2005

46. Comparing Rule-Based and Statistical Approaches to Speech Understanding in a Limited Domain Speech Translation System

Author: Rayner, Manny, Bouillon, Pierrette, Hockey, Beth Ann, Chatzichrisafis, Nikos, and Starlander, Marianne
Subjects: ddc:410.2
Abstract: The paper directly compares two versions of a medical speech translation system, one with a grammar based language model (GLM) recognizer and the other with a statistical language model (SLM) recognizer. We construct the GLM using a corpus-based method, so that both the GLM and the SLM can be derived from the same corpus; evaluation is carried out with respect to performance on the speech translation task. Despite using a very small training set for both the GLM and the SLM, the SLM delivers much better word error rates on unseen test material. Nonetheless, evaluating both systems on translation performance rather than word error rates, the GLM-based version of the system outperforms the SLM on the actual translation task.
Published: 2004

47. CALL-SLT: A Spoken CALL System

Author: Rayner, Manny, primary, Tsourakis, Nikos, additional, Baur, Claudia, additional, Bouillon, Pierrette, additional, and Gerlach, Johanna, additional
Published: 2014
Full Text: View/download PDF

48. Language-Processing Strategies and Mixed-Initiative Dialogues

Author: Boye, Johan, Wirén, Mats, Rayner, Manny, Lewin, Ian, Carter, David, and Becket, Ralph
Subjects: Språkteknologi (språkvetenskaplig databehandling), Language Technology (Computational Linguistics)
Abstract: We describe an implemented spoken-language dialogue system for a travel-planning domain, which accesses a commercially available travel-information web-server and supports a flexible mixed-initiative dialogue strategy. We argue, based on data from initial Wizard-of-Oz experiments, that mixed-initiative strategies are appropriate for many types of user, but require more sophisticated architectures for processing of language and dialogue; we then use these observations to motivate an architecture which combines parallel deep and shallow natural language analysis engines and an agenda-driven dialogue manager. We outline the top-level processing strategy used by the dialogue manager, and also a novel formalism, which we call Flat Utterance Description, that allows us to reduce the output of the deep and shallow language-processing engines to a common representation.
Published: 1999

49. Bootstrapping a Statistical Speech Translator From a Rule-Based One

Author: Rayner, Manny, primary, Bouillon, Pierrette, additional, Estrella, Paula, additional, Nakao, Yukie, additional, and Christian, Gwen, additional
Published: 2012
Full Text: View/download PDF

50. The Swedish Core Language Engine

Author: Gambäck, Björn and Rayner, Manny
Subjects: Computer and Information Sciences, Data- och informationsvetenskap
Abstract: The paper describes a Swedish-language customization (S-CLE) of the SRI Core Language Engine, which has been developed at SICS from the original English-language version by replacing English-specific modules with corresponding Swedish-language versions. The S-CLE is intended to be used as a building block in a broad range of applications, such as data-base query system, machine translation systems, NL front-ends, speech-to-text/text-to-speech systems, and so on. Examples of the first two types of application already exist. The main part of the S-CLE is an extensive Swedish grammar that is compiled into parsing and generation modules. The grammar formalism is a type of unification grammar loosely based on Generalized Phrase Structure Grammar (GPSG). Generation is performed using the Semantic-Head-Driven algorithm. Analysis turns sentences into ``Quasi-Logical Form'' (QLF), a logical-form representation, while generation works in the opposite direction. Intermediate stages include processing of morphology, syntax and semantics. For knowledge-base applications, a separate module can convert QLFs into conventional scoped logical forms. After two-and-a-half years of work (approximately 45 person months), the first prototype system has a vocabulary of about 1900 words and covers a fairly broad range of possible grammatical constructions. Based on our experience in this project, we present in this paper detailed arguments to support the claim that customization of an English-language NLP system is a highly cost-effective way of constructing Swedish language systems with corresponding functionality. A shorter version of this paper appears in L. Ahrenberg (ed.): Papers from the Third Nordic Conference on Text Comprehension in Man and Machine , Link Sweden, 1992.
Published: 1992

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

79 results on '"Rayner, Manny"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources