79 results on '"Rayner, Manny"'
Search Results
2. Easy as ABC: Using LARA to Build Multimedia Alphabet Books
- Author
-
Akhlaghi-Baghoojari, Elham, Bédi, Branislav, Chua, Cathy, Horváthová, Ivana, Ivanova, Nedelina, Maizonniaux, Christèle, Mykhats, Marta, Chiaráin, Neasa Ní, Rayner, Manny, Orian Weiss, Catherine, and Zviel-Girshin, Rina
- Abstract
We present a study in which multimedia alphabet books were constructed for ten languages using the Learning And Reading Assistant (LARA) platform. We describe the alphabet books we built, the different design features they instantiate, and an initial evaluation using an anonymous online questionnaire. Links are provided to the books themselves, which are freely available on the web. [For the complete volume, "Intelligent CALL, Granular Systems and Learner Data: Short Papers from EUROCALL 2022 (30th, Reykjavik, Iceland, August 17-19, 2022)," see ED624779.]
- Published
- 2022
3. Assessing the Quality of TTS Audio in the LARA Learning-by-Reading Platform
- Author
-
Akhlaghi, Elham, Baczkowska, Anna, Berthelsen, Harald, Bédi, Branislav, Chua, Cathy, Cucchiarini, Catia, Habibi, Hanieh, Horváthová, Ivana, Hvalsøe, Pernille, Lotz, Roy, Maizonniaux, Christèle, Chiaráin, Neasa Ní, Rayner, Manny, Tsourakis, Nikos, and Yao, Chunlin
- Abstract
A popular idea in Computer Assisted Language Learning (CALL) is to use multimodal annotated texts, with annotations typically including embedded audio and translations, to support L2 learning through reading. An important question is how to create the audio, which can be done either through human recording or by a Text-To-Speech (TTS) synthesis engine. We may reasonably expect TTS to be quicker and easier, but humans to be of higher quality. Here, we report a study using the open-source LARA platform and ten languages. Samples of LARA audio totaling about three and a half minutes were provided for each language in both human and TTS form; subjects used a web form to compare different versions of the same item and rate the voices as a whole. Although human voice was more often preferred, TTS achieved higher ratings in some languages and was close in others. [For the complete volume, "CALL and Professionalisation: Short Papers from EUROCALL 2021 (29th, Online, August 26-27, 2021)," see ED616972.]
- Published
- 2021
4. LARA: An Extensible Open Source Platform for Learning Languages by Reading
- Author
-
Bédi, Branislav, Butterweck, Matt, Chua, Cathy, Gerlach, Johanna, Guðmarsdóttir, Birgitta Björg, Habibi, Hanieh, Jónsson, Bjartur Örn, Rayner, Manny, and Vigfússon, Sigurður
- Abstract
Learning and Reading Assistant (LARA) is an open source platform that enables conversion of plain texts into an interactive multimedia form designed to support second- and foreign-language (L2) learners. In this workshop, we illustrate the open source aspects using collaborative work carried out during a six-week summer project at the Árni Magnússon Institute for Icelandic Studies. Three undergraduate level students extended the platform in different directions in cooperation with other members of the international LARA team. The three subprojects were respectively concerned with adding automatically generated flashcards, adding multimedia versions of poetic texts in the archaic language Old Norse, and extending LARA to allow the inclusion of sign language content in Icelandic sign language -- Íslenskt TáknMál (ÍTM). All three reached successful conclusions. [For the complete volume, "CALL for Widening Participation: Short Papers from EUROCALL 2020 (28th, Online, August 20-21, 2020)," see ED610330.]
- Published
- 2020
5. Constructing an Interactive Old Norse Text with LARA
- Author
-
Bédi, Branislav, Bernharðsson, Haraldur, Chua, Cathy, Guðmarsdóttir, Birgitta Björg, Habibi, Hanieh, and Rayner, Manny
- Abstract
We describe how the open-source Learning and Reading Assistant (LARA) platform was used to convert a classic Old Norse text, the "Völuspá," into an interactive online form. The LARA version includes high-quality recorded audio, translations, notes on key words and phrases, an automatically generated concordance, and links to other online resources. The interactive text was created in two different editions, one with Modern Icelandic translations designed to support Icelandic school students who read the poem as a set text, and one with English translations designed for English readers with basic Old Norse who wish to able to appreciate the poem in the original as a piece of literature. Initial feedback from groups has been positive. [For the complete volume, "CALL for Widening Participation: Short Papers from EUROCALL 2020 (28th, Online, August 20-21, 2020)," see ED610330.]
- Published
- 2020
6. Using LARA for Language Learning: A Pilot Study for Icelandic
- Author
-
Bédi, Branislav, Chua, Cathy, Habibi, Hanieh, Martinez-Lopez, Ruth, and Rayner, Manny
- Abstract
This paper presents a brief overview of LARA (Learning And Reading Assistant), an open source online tool that has been under development since summer 2018. LARA currently contains a corpus of about 25 texts in ten languages and a crowdsourcing model used to expand the corpus. The central goal is to provide support for improving second language (L2) reading comprehension. The focus here is on the development of Icelandic content and its use during pilot testing amongst adult L2 learners of Icelandic. Preliminary feedback from users, while mostly positive, contained suggestions on how the tool might be improved. [For the complete proceedings, see ED600837.]
- Published
- 2019
7. Alexa as a CALL platform for children: Where do we start?
- Author
-
Tsourakis, Nikos, Rayner, Manny, Habibi, Hanieh, Gallais, Pierre-Emmanuel, Chua, Cathy, and Butterweck, Matt
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Amazon's Alexa is now widely available and shows interesting potential as a platform for hosting CALL games aimed at children. In this paper, we describe an initial informal experiment where we created some simple CALL games and made them available to a few child testers. We report the children's and parents' reactions. Our overall conclusion is that, although Alexa has many positive features, there are still fundamental platform issues in the current version that make it very difficult to build compelling CALL games for children. The games used will soon be freely available for download on the Alexa store., Comment: 4 pages. Based on talk given at enetCollect WG3 & WG5 Meeting, Leiden, Holland, 2018
- Published
- 2019
8. What do the founders of online communities owe to their users?
- Author
-
Chua, Cathy and Rayner, Manny
- Subjects
Computer Science - Computers and Society - Abstract
We discuss the organisation of internet communities, focusing on what we call the principle of "bait and switch": founders of internet communities often find it advantageous to recruit members by promising inducements which are later not honoured. We look at some of the dilemmas and ways of attempting to resolve them through two paradigmatic examples, Wikispaces and Wordpress. Our analysis is to a large extent motivated by the demands of CALLector, a university-centred social network we are in the process of establishing. We consider the question of what ethical standards are imposed on universities engaged in this type of activity., Comment: 6 pages. Paper based on talk at enetCollect WG3 & WG5 Meeting, Leiden 2018
- Published
- 2019
9. Decentralising power: how we are trying to keep CALLector ethical
- Author
-
Chua, Cathy, Habibi, Hanieh, Rayner, Manny, and Tsourakis, Nikos
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Computers and Society - Abstract
We present a brief overview of the CALLector project, and consider ethical questions arising from its overall goal of creating a social network to support creation and use of online CALL resources. We argue that these questions are best addressed in a decentralised, pluralistic open source architecture., Comment: 6 pages; based on talk presented at enetCollect WG3 & WG5 Meeting, Leiden, Holland, 2018
- Published
- 2019
10. Helping Domain Experts Build Speech Translation Systems
- Author
-
Rayner, Manny, Armando, Alejandro, Bouillon, Pierrette, Ebling, Sarah, Gerlach, Johanna, Halimi, Sonia, Strasly, Irene, and Tsourakis, Nikos
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Computation and Language - Abstract
We present a new platform, "Regulus Lite", which supports rapid development and web deployment of several types of phrasal speech translation systems using a minimal formalism. A distinguishing feature is that most development work can be performed directly by domain experts. We motivate the need for platforms of this type and discuss three specific cases: medical speech translation, speech-to-sign-language translation and voice questionnaires. We briefly describe initial experiences in developing practical systems., Comment: 12 pages, 1 figure, to appear in Proc. Future and Emerging Trends in Language Technology 2015, Seville, Spain
- Published
- 2015
11. A Compact Architecture for Dialogue Management Based on Scripts and Meta-Outputs
- Author
-
Rayner, Manny, Hockey, Beth Ann, and James, Frankie
- Subjects
Computer Science - Computation and Language ,I.2.7 ,H.5.2 - Abstract
We describe an architecture for spoken dialogue interfaces to semi-autonomous systems that transforms speech signals through successive representations of linguistic, dialogue, and domain knowledge. Each step produces an output, and a meta-output describing the transformation, with an executable program in a simple scripting language as the final result. The output/meta-output distinction permits perspicuous treatment of diverse tasks such as resolving pronouns, correcting user misconceptions, and optimizing scripts.
- Published
- 2000
12. A Comparison of the XTAG and CLE Grammars for English
- Author
-
Hockey, Beth Ann, Rayner, Manny, and James, Frankie
- Subjects
Computer Science - Computation and Language ,I.2.7 - Abstract
When people develop something intended as a large broad-coverage grammar, they usually have a more specific goal in mind. Sometimes this goal is covering a corpus; sometimes the developers have theoretical ideas they wish to investigate; most often, work is driven by a combination of these two main types of goal. What tends to happen after a while is that the community of people working with the grammar starts thinking of some phenomena as ``central'', and makes serious efforts to deal with them; other phenomena are labelled ``marginal'', and ignored. Before long, the distinction between ``central'' and ``marginal'' becomes so ingrained that it is automatic, and people virtually stop thinking about the ``marginal'' phenomena. In practice, the only way to bring the marginal things back into focus is to look at what other people are doing and compare it with one's own work. In this paper, we will take two large grammars, XTAG and the CLE, and examine each of them from the other's point of view. We will find in both cases not only that important things are missing, but that the perspective offered by the other grammar suggests simple and practical ways of filling in the holes. It turns out that there is a pleasing symmetry to the picture. XTAG has a very good treatment of complement structure, which the CLE to some extent lacks; conversely, the CLE offers a powerful and general account of adjuncts, which the XTAG grammar does not fully duplicate. If we examine the way in which each grammar does the thing it is good at, we find that the relevant methods are quite easy to port to the other framework, and in fact only involve generalization and systematization of existing mechanisms., Comment: 5th International Workshop on Tree Adjoining Grammars and Related Formalisms. 25-27 May 2000, Paris, France
- Published
- 2000
13. Accuracy, Coverage, and Speed: What Do They Mean to Users?
- Author
-
James, Frankie, Rayner, Manny, and Hockey, Beth Ann
- Subjects
Computer Science - Computation and Language ,Computer Science - Human-Computer Interaction ,H.5.2 ,I.2.7 - Abstract
Speech is becoming increasingly popular as an interface modality, especially in hands- and eyes-busy situations where the use of a keyboard or mouse is difficult. However, despite the fact that many have hailed speech as being inherently usable (since everyone already knows how to talk), most users of speech input are left feeling disappointed by the quality of the interaction. Clearly, there is much work to be done on the design of usable spoken interfaces. We believe that there are two major problems in the design of speech interfaces, namely, (a) the people who are currently working on the design of speech interfaces are, for the most part, not interface designers and therefore do not have as much experience with usability issues as we in the CHI community do, and (b) speech, as an interface modality, has vastly different properties than other modalities, and therefore requires different usability measures., Comment: Position paper for CHI 2000 Workshop on Natural-Language Interaction
- Published
- 2000
14. Compiling Language Models from a Linguistically Motivated Unification Grammar
- Author
-
Rayner, Manny, Hockey, Beth Ann, James, Frankie, Bratt, Elizabeth O., Goldwater, Sharon, and Gawron, Mark
- Subjects
Computer Science - Computation and Language ,I.2.7 - Abstract
Systems now exist which are able to compile unification grammars into language models that can be included in a speech recognizer, but it is so far unclear whether non-trivial linguistically principled grammars can be used for this purpose. We describe a series of experiments which investigate the question empirically, by incrementally constructing a grammar and discovering what problems emerge when successively larger versions are compiled into finite state graph representations and used as language models for a medium-vocabulary recognition task., Comment: To be published in COLING 2000
- Published
- 2000
15. Turning Speech Into Scripts
- Author
-
Rayner, Manny, Hockey, Beth Ann, and James, Frankie
- Subjects
Computer Science - Computation and Language ,H.5.2 ,I.2.7 - Abstract
We describe an architecture for implementing spoken natural language dialogue interfaces to semi-autonomous systems, in which the central idea is to transform the input speech signal through successive levels of representation corresponding roughly to linguistic knowledge, dialogue knowledge, and domain knowledge. The final representation is an executable program in a simple scripting language equivalent to a subset of Cshell. At each stage of the translation process, an input is transformed into an output, producing as a byproduct a "meta-output" which describes the nature of the transformation performed. We show how consistent use of the output/meta-output distinction permits a simple and perspicuous treatment of apparently diverse topics including resolution of pronouns, correction of user misconceptions, and optimization of scripts. The methods described have been concretely realized in a prototype speech interface to a simulation of the Personal Satellite Assistant., Comment: Working notes from AAAI Spring Symposium
- Published
- 2000
16. Translation Methodology in the Spoken Language Translator: An Evaluation
- Author
-
Carter, David, Becket, Ralph, Rayner, Manny, Eklund, ; Robert, MacDermid, Catriona, Wiren, Mats, Kirchmeier-Andersen, ; Sabine, and Philp, Christina
- Subjects
Computer Science - Computation and Language - Abstract
In this paper we describe how the translation methodology adopted for the Spoken Language Translator (SLT) addresses the characteristics of the speech translation task in a context where it is essential to achieve easy customization to new languages and new domains. We then discuss the issues that arise in any attempt to evaluate a speech translator, and present the results of such an evaluation carried out on SLT for several language pairs., Comment: 10 pages, needs aclap.sty. To appear in Spoken Language Translation workshop at (E)ACL-97
- Published
- 1997
17. Recycling Lingware in a Multilingual MT System
- Author
-
Rayner, Manny, Carter, David, Bretan, Ivan, Eklund, Robert, Wiren, Mats, Hansen, Steffen Leo, Kirchmeier-Andersen, Sabine, Philp, Christina, Sorensen, Finn, and Thomsen, Hanne Erdman
- Subjects
Computer Science - Computation and Language - Abstract
We describe two methods relevant to multi-lingual machine translation systems, which can be used to port linguistic data (grammars, lexicons and transfer rules) between systems used for processing related languages. The methods are fully implemented within the Spoken Language Translator system, and were used to create versions of the system for two new language pairs using only a month of expert effort., Comment: 6 pages, needs aclap.sty. To appear in "From Research to Commercial Applications" workshop at ACL-97, see also http://www.cam.sri.com
- Published
- 1997
18. Hybrid language processing in the Spoken Language Translator
- Author
-
Rayner, Manny and Carter, David
- Subjects
Computer Science - Computation and Language - Abstract
The paper presents an overview of the Spoken Language Translator (SLT) system's hybrid language-processing architecture, focussing on the way in which rule-based and statistical methods are combined to achieve robust and efficient performance within a linguistically motivated framework. In general, we argue that rules are desirable in order to encode domain-independent linguistic constraints and achieve high-quality grammatical output, while corpus-derived statistics are needed if systems are to be efficient and robust; further, that hybrid architectures are superior from the point of view of portability to architectures which only make use of one type of information. We address the topics of ``multi-engine'' strategies for robust translation; robust bottom-up parsing using pruning and grammar specialization; rational development of linguistic rule-sets using balanced domain corpora; and efficient supervised training by interactive disambiguation. All work described is fully implemented in the current version of the SLT-2 system., Comment: 4 pages, uses icassp97.sty; to appear in ICASSP-97; see http://www.cam.sri.com for related material
- Published
- 1997
19. Adapting the Core Language Engine to French and Spanish
- Author
-
Rayner, Manny, Carter, David, and Bouillon, Pierrette
- Subjects
Computer Science - Computation and Language - Abstract
We describe how substantial domain-independent language-processing systems for French and Spanish were quickly developed by manually adapting an existing English-language system, the SRI Core Language Engine. We explain the adaptation process in detail, and argue that it provides a fairly general recipe for converting a grammar-based system for English into a corresponding one for a Romance language., Comment: 9 pages, aclap.sty; to appear in NLP+IA 96; see also http://www.cam.sri.com/
- Published
- 1996
20. Fast Parsing using Pruning and Grammar Specialization
- Author
-
Rayner, Manny and Carter, David
- Subjects
Computer Science - Computation and Language - Abstract
We show how a general grammar may be automatically adapted for fast parsing of utterances from a specific domain by means of constituent pruning and grammar specialization based on explanation-based learning. These methods together give an order of magnitude increase in speed, and the coverage loss entailed by grammar specialization is reduced to approximately half that reported in previous work. Experiments described here suggest that the loss of coverage has been reduced to the point where it no longer causes significant performance degradation in the context of a real application., Comment: 8 pages LaTeX, ACL-96, needs aclap.sty; see also http://www.cam.sri.com/
- Published
- 1996
21. Hybrid Transfer in an English-French Spoken Language Translator
- Author
-
Rayner, Manny and Bouillon, Pierrette
- Subjects
Computer Science - Computation and Language - Abstract
The paper argues the importance of high-quality translation for spoken language translation systems. It describes an architecture suitable for rapid development of high-quality limited-domain translation systems, which has been implemented within an advanced prototype English to French spoken language translator. The focus of the paper is the hybrid transfer model which combines unification-based rules and a set of trainable statistical preferences; roughly, rules encode domain-independent grammatical information and preferences encode domain-dependent distributional information. The preferences are trained from sets of examples produced by the system, which have been annotated by human judges as correct or incorrect. An experiment is described in which the model was tested on a 2000 utterance sample of previously unseen data., Comment: 7 pages, LaTeX (2.09 preferred); eaclap.sty; Procs of IA '95 (Montpellier, France)
- Published
- 1995
22. The Speech-Language Interface in the Spoken Language Translator
- Author
-
Carter, David and Rayner, Manny
- Subjects
Computer Science - Computation and Language - Abstract
The Spoken Language Translator is a prototype for practically useful systems capable of translating continuous spoken language within restricted domains. The prototype system translates air travel (ATIS) queries from spoken English to spoken Swedish and to French. It is constructed, with as few modifications as possible, from existing pieces of speech and language processing software. The speech recognizer and language understander are connected by a fairly conventional pipelined N-best interface. This paper focuses on the ways in which the language processor makes intelligent use of the sentence hypotheses delivered by the recognizer. These ways include (1) producing modified hypotheses to reflect the possible presence of repairs in the uttered word sequence; (2) fast parsing with a version of the grammar automatically specialized to the more frequent constructions in the training corpus; and (3) allowing syntactic and semantic factors to interact with acoustic ones in the choice of a meaning structure for translation, so that the acoustically preferred hypothesis is not always selected even if it is within linguistic coverage., Comment: 9 pages, LaTeX. Published: Proceedings of TWLT-8, December 1994
- Published
- 1994
23. CLARE: A Contextual Reasoning and Cooperative Response Framework for the Core Language Engine
- Author
-
Alshawi, Hiyan, Carter, David, Crouch, Richard, Pulman, Steve, Rayner, Manny, and Smith, Arnold
- Subjects
Computer Science - Computation and Language - Abstract
This report describes the research, design and implementation work carried out in building the CLARE system at SRI International, Cambridge, England. CLARE was designed as a natural language processing system with facilities for reasoning and understanding in context and for generating cooperative responses. The project involved both further development of SRI's Core Language Engine (Alshawi, 1992, MIT Press) natural language processor and the design and implementation of new components for reasoning and response generation. The CLARE system has advanced the state of the art in a wide variety of areas, both through the use of novel techniques developed on the project, and by extending the coverage or scale of known techniques. The language components are application-independent and provide interfaces for the development of new types of application., Comment: 250 pages, uuencoded compressed tar-ed LaTeX. Written 1992
- Published
- 1994
24. Estimating Performance of Pipelined Spoken Language Translation Systems
- Author
-
Rayner, Manny, Carter, David, Price, Patti, and Lyberg, Bertil
- Subjects
Computer Science - Computation and Language - Abstract
Most spoken language translation systems developed to date rely on a pipelined architecture, in which the main stages are speech recognition, linguistic analysis, transfer, generation and speech synthesis. When making projections of error rates for systems of this kind, it is natural to assume that the error rates for the individual components are independent, making the system accuracy the product of the component accuracies. The paper reports experiments carried out using the SRI-SICS-Telia Research Spoken Language Translator and a 1000-utterance sample of unseen data. The results suggest that the naive performance model leads to serious overestimates of system error rates, since there are in fact strong dependencies between the components. Predicting the system error rate on the independence assumption by simple multiplication resulted in a 16\% proportional overestimate for all utterances, and a 19\% overestimate when only utterances of length 1-10 words were considered., Comment: 10 pages, Latex source. To appear in Proc. ICSLP '94
- Published
- 1994
25. Combining Knowledge Sources to Reorder N-Best Speech Hypothesis Lists
- Author
-
Rayner, Manny, Carter, David, Digalakis, Vassilios, and Price, Patti
- Subjects
Computer Science - Computation and Language - Abstract
A simple and general method is described that can combine different knowledge sources to reorder N-best lists of hypotheses produced by a speech recognizer. The method is automatically trainable, acquiring information from both positive and negative examples. Experiments are described in which it was tested on a 1000-utterance sample of unseen ATIS data., Comment: 13 pages, Latex source. To appear in Proc. HLT '94
- Published
- 1994
26. Abductive Equivalential Translation and its application to Natural Language Database Interfacing
- Author
-
Rayner, Manny
- Subjects
Computer Science - Computation and Language - Abstract
The thesis describes a logical formalization of natural-language database interfacing. We assume the existence of a ``natural language engine'' capable of mediating between surface linguistic string and their representations as ``literal'' logical forms: the focus of interest will be the question of relating ``literal'' logical forms to representations in terms of primitives meaningful to the underlying database engine. We begin by describing the nature of the problem, and show how a variety of interface functionalities can be considered as instances of a type of formal inference task which we call ``Abductive Equivalential Translation'' (AET); functionalities which can be reduced to this form include answering questions, responding to commands, reasoning about the completeness of answers, answering meta-questions of type ``Do you know...'', and generating assertions and questions. In each case, a ``linguistic domain theory'' (LDT) $\Gamma$ and an input formula $F$ are given, and the goal is to construct a formula with certain properties which is equivalent to $F$, given $\Gamma$ and a set of permitted assumptions. If the LDT is of a certain specified type, whose formulas are either conditional equivalences or Horn-clauses, we show that the AET problem can be reduced to a goal-directed inference method. We present an abstract description of this method, and sketch its realization in Prolog. The relationship between AET and several problems previously discussed in the literature is discussed. In particular, we show how AET can provide a simple and elegant solution to the so-called ``Doctor on Board'' problem, and in effect allows a ``relativization'' of the Closed World Assumption. The ideas in the thesis have all been implemented concretely within the SRI CLARE project, using a real projects and payments database. The LDT for the example database is described in detail, and examples of the types of functionality that can be achieved within the example domain are presented., Comment: 162 pages, Latex source, PhD thesis (U Stockholm, 1993). Uses style-file ustockholm_thesis.sty
- Published
- 1994
27. Build Your Own Speech-Enabled Online CALL Course, No Experience Required
- Author
-
Rayner Manny, Baur Claudia, Bouillon Pierrette, Chua Cathy, and Tsourakis Nikos
- Subjects
CALL ,ddc:410.2 ,Speech recognition ,Open architectures ,Web - Abstract
We describe Open CALL-SLT, a framework that allows people who may only have modest computer skills to create interactive speech-enabled CALL courses on the web. User content is uploaded to a server and remotely compiled and deployed. Courses can be created at six levels of increasing sophistication, where the specification of the content ranges from simple prompt/response pairs at one end to multimedia-enabled gamified dialogues with multiple paths at the other. We briefly describe the overall framework and the different levels.
- Published
- 2015
28. Supervised Learning of Response Grammars in a Spoken CALL System
- Author
-
Rayner Manny, Baur Claudia, Chua Cathy, and Tsourakis Nikos
- Abstract
We summarise experiments carried out using a system initiative spoken CALL system in which permitted responses to prompts are defined using a minimal formalism based on templates and regular expressions and describe a simple structural learning algorithm that uses annotated data to update response definitions. Using 1 927 utterances of training data we obtained a relative improvement of 20 in the system’s ability to react differentially to correct and incorrect input measured on a previously unseen test set. The results are significant at p < 0:005.
- Published
- 2015
29. Formative Feedback in an Interactive Spoken CALL System
- Author
-
Tsourakis Nikos, Rayner Manny, and Baur Claudia
- Abstract
By definition spoken dialogue CALL systems should be easy to use and understand. However interaction in this context is often far from unhindered. In this paper we introduce a formative feedback mechanism in our CALL system which can monitor interaction report errors and provide advice and suggestions to users. The distinctive feature of this mechanism is the ability to combine information from different sources and decide on the most pertinent feedback which can also be adapted in terms of phrasing style and language. We conducted experiments at three secondary schools in German speaking Switzerland and the obtained results suggest that our feedback mechanism helps students during interaction and contributes as a motivating factor.
- Published
- 2014
30. Crafting Interesting Dialogues in an Interactive Spoken CALL System
- Author
-
Baur Claudia, Rayner Manny, and Tsourakis Nikos
- Subjects
ComputingMilieux_COMPUTERSANDEDUCATION - Abstract
We describe work carried out in the context of a project whose goal is to develop a web enabled serious game designed to help German speaking beginner students of English improve their generative and auditory competence. The game which is currently undergoing a first round of formal user testing is intended as a self study adjunct to normal classroom instruction. It was developed in collaboration with a secondary school teacher with the content taken from a textbook commonly used in German speaking Switzerland and offers a short course of 8 interactive lessons using a combined vocabulary of about 450 words. A lesson is structured as a short prompted dialogue between the student and the machine where the student is encouraged to use simple language in practical contexts like booking a hotel room buying clothes or ordering a meal in a restaurant. We present an overview of the architecture which is designed to balance three main considerations: it should support coherent grammatical responses and reasonably interesting dialogues but also allow course content to be constructed by personnel who do not necessarily have a computer science background. We use as an example a new strategy we have recently developed where the central idea is to add further dialogue paths to the scripts graded by their degree of “cooperativeness”. The system starts with a simple cooperative interaction and opens up increasingly "uncooperative" dialogue paths as the student gains familiarity with the game.
- Published
- 2014
31. CALL-SLT Lite: A Minimal Framework for Building Interactive Speech-Enabled CALL Applications
- Author
-
Rayner Manny, Baur Claudia, and Tsourakis Nikos
- Abstract
We present a framework CALL SLT Lite which can be used by people with only very basic software skills to create interactive multimodal speech enabled CALL games suitable for beginner/low intermediate child language learners. The games are deployed over the web and can be accessed through a normal browser and the framework is completely language independent. As the name suggests the framework grew out of an earlier platform CALL SLT which enables construction of similar games but uses a more sophisticated architecture. We review the history of the project describing the type of game we are aiming to build and our reasons for believing that they are useful and then present CALL SLT Lite and an initial evaluation comparing the performance of the two versions of the framework. The results suggest that the Lite framework although much simpler offers performance at least as good as that of the original system.
- Published
- 2014
32. Using a Serious Game to Collect a Child Learner Speech Corpus
- Author
-
Baur Claudia, Rayner Manny, and Tsourakis Nikos
- Abstract
We present an English L2 child learner speech corpus produced by Swiss German L1 students in their third year of learning English which is currently in the process of being collected. The collection method uses a web enabled multimodal language game implemented using the CALL SLT platform in which subjects hold prompted conversations with an animated agent. Prompts consist of a short animated English language video clip together with a German language piece of text indicating the semantic content of the requested response. Grammar based speech understanding is used to decide whether responses are accepted or rejected and dialogue flow is controlled using a simple XML based scripting language; the scripts are written to allow multiple dialogue paths the choice being made randomly. The system is gamified using a score and badge framework with four levels of badges. We describe the application the data collection and annotation procedures and the initial tranche of data. The full corpus when complete should contain at least 5 000 annotated utterances.
- Published
- 2014
33. A web-deployed Swedish spoken CALL systembased on a large shared English/Swedish feature grammar
- Author
-
Rayner, Manny, Gerlach, Johanna, Starlander, Marianne, Tsourakis, Nikos, Kruckenberg, Anita, Eklund, Robert, Jönsson, Arne, McAllister, Anita, and Chua, Cathy
- Subjects
Teknik och teknologier ,Engineering and Technology - Abstract
We describe a Swedish version of CALL-SLT,a web-deployed CALL system that allows beginner/intermediate students to practise generativespoken language skills. Speech recognitionis grammar-based, with language modelsderived, using the Regulus platform, fromsubstantial domain-independent feature grammars.The paper focusses on the Swedishgrammar resources, which were developedby generalising the existing English featuregrammar into a shared grammar for Englishand Swedish. It turns out that this can be donevery economically: all but a handful of rulesand features are shared, and English grammaressentially ends up being treated as a reducedform of Swedish. We conclude by presentinga simple evaluation which compares theSwedish and French versions of CALL-SLT.
- Published
- 2012
34. A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
- Author
-
Fuchs Matthew, Tsourakis Nikos, and Rayner Manny
- Abstract
We describe a scalable architecture particularly well suited to cloud based computing which can be used for Web deployment of spoken dialogue systems. In common with similar platforms like WAMI and the Nuance Mobile Developer Platform we use a client/server approach in which speech recognition is carried out on the server side; our architecture however differs from these systems in offering considerably more elaborate server side functionality based on large scale grammar based language processing and generic dialogue management. We describe two substantial applications built using our framework which we argue would have been hard to construct in WAMI or NMDP. Finally we present a series of evaluations carried out using CALL SLT a speech translation game where we contrast performance in Web and desktop versions. Task Error Rate in the Web version is only slightly inferior that in the desktop one and the average additional latency is under half a second. The software is generally available for research purposes.
- Published
- 2012
35. Evaluating A Web-Based Spoken Language Translation Game For Learning Domain Language
- Author
-
Bouillon Pierrette, Halimi Sonia, Rayner Manny, and Tsourakis Nikos
- Abstract
We present an evaluation of CALL SLT a web based CALL application based on the “translation game” idea that can be used for practicing fluency in a limited domain. The version tested was configured to teach basic restaurant French to students whose native language is Arabic. Students spent about an hour and a half each working with the system and explored between five and eight lessons. The focus was on investigating how they interacted with the system and what they learned most effectively.
- Published
- 2011
36. Evaluation of a Mobile Language Learning System Using Language-Neutral Prompts
- Author
-
Tsourakis Nikos, Rayner Manny, and Bouillon Pierrette
- Subjects
ComputingMilieux_COMPUTERSANDEDUCATION - Abstract
We describe two versions of a prototype system for computer assisted language learning hosted on a mobile device. In both versions the student is given prompts by the machine which they then have to render in the L2. The two versions differ with respect to the modality of the prompt: one presents it as an L1 based text string while the other one presents it in pictorial form. The two versions were tested on a group of 32 high school students as part of the Geneva University Student Week. Our findings suggest that male students found the pictorial version significantly easier to use while female students preferred the text version.
- Published
- 2011
37. A Student-Centered Evaluation of a Web-Based Spoken Translation Game
- Author
-
Bouillon Pierrette, Rayner Manny, Tsourakis Nikos, and Zhang Q
- Abstract
We present an evaluation of CALL SLT a web deployed speech enabled platform for improving fluency in a limited domain based on the “translation game” idea ofWang and Seneff. The evaluation used 10 Chinese speaking students of French who spent an average of about three hours each practising on a set of five lessons covering elementary and intermediate grammar topics in a restaurant domain. We found significant improvements in student performance measured both in terms of their ability to be recognised correctly by the application and according to written vocabulary and grammar tests.
- Published
- 2011
38. For A Fistful Of Dollars: Using Crowd-Sourcing To Evaluate A Spoken Language CALL Application
- Author
-
Rayner Manny, Frank Ian, Chua Cathy, Tsourakis Nikos, and Bouillon Pierrette
- Subjects
CALL ,Japanese ,ddc:410.2 ,Crowd-sourcing ,Speech recognition ,Evaluation - Abstract
We present an evaluation of a Web-deployed spoken language CALL system, carried out using crowd-sourcing methods. The system, “Survival Japanese”, is a crash course in tourist Japanese implemented within the platform CALL-SLT. The evaluation was carried out over one week using the Amazon Mechanical Turk. Although we found a high proportion of attempted scammers, there was a core of 23 subjects who used the system in a responsible manner. The evidence that these subjects learned from their 111 sessions and 9092 spoken interactions was significant at P=0.001. Our conclusion is that crowd-sourcing is a potentially valid method for evaluating spoken CALL systems.
- Published
- 2011
39. Bootstrapping a statistical speech translator from a rule-based one
- Author
-
Rayner, Manny, Estrella, Paula, Bouillon, Pierrette, and International Workshop on Free/Open-Source Rule-Based Machine Translation (2nd : 2011 : Barcelona)
- Subjects
Programari lliure ,Traducció automàtica ,Software libre ,Traducción automática ,bootstrapping ,Computational linguistics ,Lingüística computacional ,Traducció automàtica estadística ,Statistical ,Open source software ,Traducción automática estadística ,Machine translating - Abstract
We describe a series of experiments in which we start with English to French and English to Japanese versions of an Open Source rule-based speech translation system for a medical domain, and bootstrap correspondign statistical systems. Comparative evaluation reveals that the rule-based systems are still significantly better than the statistical ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; also, a hybrid system only marginally improved recall at the cost of a los in precision. The result suggests that rule-based architectures may still be preferable to statistical ones for safety-critical speech translation tasks. Describimos una serie de experimentos en los que comenzamos con las versiones de inglés a francés y de inglés a japonés de un sistema de traducción oral basado en reglas de código abierto para un dominio médico, e iniciamos los sistemas estadísticos correspondientes. La evaluación comparativa revela que los sistemas basados en reglas son significativamente mejores que los estadísticos, a pesar del hecho de que se ha dedicado un esfuerzo considerable en mejorar los componentes de reconocimiento y traducción; además, un sistema híbrido sólo mejoró de modo marginal la memoria a costa de una pérdida de precisión. Los resultados indican que las arquitecturas basadas en reglas podrían ser mejores para tareas de traducción oral críticas. Descrivim una sèrie d'experiments en què comencem amb les versions d'anglès a francès i d'anglès a japonès d'un sistema de traducció oral basat en regles en codi obert per a un domini mèdic, i iniciem els sistemes estadístics corresponents. L'avaluació comparativa revela que els sistemes basats en regles encara són significativament millors que els estadístics, malgrat el fet que s'ha dedicat un esforç important a millorar els components tant de reconeixement com de traducció; a més a més, un sistema híbrid només va millorar marginalment la memòria a costa d'una pèrdua de precisió. Els resultats indiquen que les arquitectures basades en regles podrien ser millors que les estadístiques per a tasques de traducció oral crítiques.
- Published
- 2010
40. A Multilingual Platform for Building Speech-Enabled Language Courses
- Author
-
Rayner Manny, Bouillon Pierrette, Tsourakis Nikos, Gerlach Johanna, Baur Claudia, Georgecul Maria, and Nakao Yukie
- Published
- 2010
41. Design Issues for a Bidirectional Mobile Medical Speech Translator
- Author
-
Tsourakis, Nikolaos, Bouillon, Pierrette, and Rayner, Manny
- Subjects
MedSLT ,ddc:410.2 ,Regulus ,Medical Speech Translation - Abstract
We argue that there is an urgent need for spoken dialogue translation systems in the medical domain. In this work we describe how an existing system of this kind, originally intended for a desktop PC, was adapted to run in a mobile environment. The different characteristics of the target device necessitate new interaction approaches and interface design. The new configuration uses two client applications running on two different devices, one for the doctor and one for the patient, together with a server; the components are connected over a wireless network. The system supports context-dependent translation in both directions. We give a general overview of the system, and discuss some of the relevant design issues pertaining to deployment of speech-enabled systems on mobile devices.
- Published
- 2009
42. Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
- Author
-
Bouillon, Pierrette, Halimi, Sonia, Nakao, Yukie, Kanzaki, Kyoko, Isahara, Hitoshi, Tsourakis, Nikos, Starlander, Marianne, Hockey, Beth Ann, Rayner, Manny, and Nakao, Yukie
- Subjects
[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL] ,ddc:410.2 ,ComputingMilieux_MISCELLANEOUS ,[INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD] - Abstract
We describe recent work on MedSLT, a medium-vocabulary interlingua-based medical speech translation system, focussing on issues that arise when handling languages of which the grammar engineer has little or no knowledge. We describe how we can systematically create and maintain multiple forms of grammars, lexica and interlingual representations, with some versions being used by language informants, and some by grammar engineers. In particular, we describe the advantages of structuring the interlingua definition as a simple semantic grammar, which includes a human-readable surface form. We show how this allows us to rationalise the process of evaluating translations between languages lacking common speakers. The grammar-based interlingua definition can also be used in other ways. We describe two applications: a simple generic tool for debugging to-interlingua translation rules, and a method for improving speech understanding performance by rescoring N-best speech hypothesis lists. Examples presented focus on the concrete case of translation between Japanese and Arabic in both directions.
- Published
- 2008
43. Many-to-Many Multilingual Medical Speech Translation on a PDA
- Author
-
Bouillon, Pierrette, Flores, Glenn, Georgescul, Maria, Halimi, Sonia, Hockey, Beth Ann, Isahara, Hitoshi, Kanzaki, Kyoko, Nakao, Yukie, Rayner, Manny, Santaholma, Marianne, Starlander, Marianne, Tsourakis, Nikos, and Nakao, Yukie
- Subjects
ddc:410.2 ,ComputingMilieux_MISCELLANEOUS ,[INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD] ,[INFO.INFO-PL] Computer Science [cs]/Programming Languages [cs.PL] - Abstract
Particularly considering the requirement of high reliability, we argue that the most appropriate architecture for a medical speech translator that can be realised using today's technology combines unidirectional (doctor to patient) translation, medium-vocabulary controlled language coverage, interlingua-based translation, an embedded help component, and deployability on a hand-held hardware platform. We present an overview of the Open Source MedSLT prototype, which has been developed in accordance with these design principles. The system is implemented on top of the Regulus and Nuance 8.5 platforms, translates patient examination questions for all language pairs in the set {English, French, Japanese, Arabic, Catalan}, using vocabularies of about 400 to 1 100 words, and can be run in a distributed client/server environment, where the client application is hosted on a Nokia Internet Tablet device.
- Published
- 2008
44. Representational and architectural issues in a limited-domain medical speech translator
- Author
-
Rayner, Manny, Bouillon, Pierrette, Santaholma, Marianne Elina, and Nakao, Yukie
- Subjects
ddc:410.2 ,Speech understanding ,Computer-aided diagnosis ,Speech translation - Abstract
We present an overview of MedSLT, a medium vocabulary medical speech translation system, focussing on the representational issues that arise when translating temporal and causal concepts. Although flat key/value structures are strongly preferred as semantic representations in speech understanding systems, we argue that it is infeasible to handle the necessary range of concepts using only flat structures. By exploiting the specific nature of the task, we show that it is possible to implement a solution which only slightly extends the representational complexity of the semantic representation language, by permitting an optional single nested level representing a subordinate clause construct. We sketch our solutions to the key problems of producing minimally nested representations using phrase-spotting methods, and writing cleanly structured rule-sets that map temporal and phrasal representations into a canonical interlingual form.
- Published
- 2005
45. A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation
- Author
-
Bouillon, Pierrette, Rayner, Manny, Chatzichrisafis, Nikos, Hockey, Beth Ann, Santaholma, Marianne Elina, Starlander, Marianne, Nakao, Yukie, Kanzaki, Kyoko, and Isahara, Hitoshi
- Subjects
ddc:410.2 - Abstract
We present an overview of MedSLT, an Open Source platform for developing limited-domain medical speech translation systems. We focus in particular on the speech understanding architecture, which uses grammar-based language models derived using corpus-based specialisation methods from a single linguistically motivated grammar, and summarise the results of two evaluations which investigate the appropriateness of these design choices. Other sections describe the interlingua and its relationship with the recognition architecture, and the current demo system.
- Published
- 2005
46. Comparing Rule-Based and Statistical Approaches to Speech Understanding in a Limited Domain Speech Translation System
- Author
-
Rayner, Manny, Bouillon, Pierrette, Hockey, Beth Ann, Chatzichrisafis, Nikos, and Starlander, Marianne
- Subjects
ddc:410.2 - Abstract
The paper directly compares two versions of a medical speech translation system, one with a grammar based language model (GLM) recognizer and the other with a statistical language model (SLM) recognizer. We construct the GLM using a corpus-based method, so that both the GLM and the SLM can be derived from the same corpus; evaluation is carried out with respect to performance on the speech translation task. Despite using a very small training set for both the GLM and the SLM, the SLM delivers much better word error rates on unseen test material. Nonetheless, evaluating both systems on translation performance rather than word error rates, the GLM-based version of the system outperforms the SLM on the actual translation task.
- Published
- 2004
47. CALL-SLT: A Spoken CALL System
- Author
-
Rayner, Manny, primary, Tsourakis, Nikos, additional, Baur, Claudia, additional, Bouillon, Pierrette, additional, and Gerlach, Johanna, additional
- Published
- 2014
- Full Text
- View/download PDF
48. Language-Processing Strategies and Mixed-Initiative Dialogues
- Author
-
Boye, Johan, Wirén, Mats, Rayner, Manny, Lewin, Ian, Carter, David, and Becket, Ralph
- Subjects
Språkteknologi (språkvetenskaplig databehandling) ,Language Technology (Computational Linguistics) - Abstract
We describe an implemented spoken-language dialogue system for a travel-planning domain, which accesses a commercially available travel-information web-server and supports a flexible mixed-initiative dialogue strategy. We argue, based on data from initial Wizard-of-Oz experiments, that mixed-initiative strategies are appropriate for many types of user, but require more sophisticated architectures for processing of language and dialogue; we then use these observations to motivate an architecture which combines parallel deep and shallow natural language analysis engines and an agenda-driven dialogue manager. We outline the top-level processing strategy used by the dialogue manager, and also a novel formalism, which we call Flat Utterance Description, that allows us to reduce the output of the deep and shallow language-processing engines to a common representation.
- Published
- 1999
49. Bootstrapping a Statistical Speech Translator From a Rule-Based One
- Author
-
Rayner, Manny, primary, Bouillon, Pierrette, additional, Estrella, Paula, additional, Nakao, Yukie, additional, and Christian, Gwen, additional
- Published
- 2012
- Full Text
- View/download PDF
50. The Swedish Core Language Engine
- Author
-
Gambäck, Björn and Rayner, Manny
- Subjects
Computer and Information Sciences ,Data- och informationsvetenskap - Abstract
The paper describes a Swedish-language customization (S-CLE) of the SRI Core Language Engine, which has been developed at SICS from the original English-language version by replacing English-specific modules with corresponding Swedish-language versions. The S-CLE is intended to be used as a building block in a broad range of applications, such as data-base query system, machine translation systems, NL front-ends, speech-to-text/text-to-speech systems, and so on. Examples of the first two types of application already exist. The main part of the S-CLE is an extensive Swedish grammar that is compiled into parsing and generation modules. The grammar formalism is a type of unification grammar loosely based on Generalized Phrase Structure Grammar (GPSG). Generation is performed using the Semantic-Head-Driven algorithm. Analysis turns sentences into ``Quasi-Logical Form'' (QLF), a logical-form representation, while generation works in the opposite direction. Intermediate stages include processing of morphology, syntax and semantics. For knowledge-base applications, a separate module can convert QLFs into conventional scoped logical forms. After two-and-a-half years of work (approximately 45 person months), the first prototype system has a vocabulary of about 1900 words and covers a fairly broad range of possible grammatical constructions. Based on our experience in this project, we present in this paper detailed arguments to support the claim that customization of an English-language NLP system is a highly cost-effective way of constructing Swedish language systems with corresponding functionality. A shorter version of this paper appears in L. Ahrenberg (ed.): Papers from the Third Nordic Conference on Text Comprehension in Man and Machine , Link Sweden, 1992.
- Published
- 1992
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.