Descriptor: "Textual statistics" - Searchworks@Jio Institute Digital Library Search Results

Your search for "Textual statistics" returned 43 results

1. A keyword extraction model study in the movie domain with synopsis and reviews

Author: González-Santos, Carlos, Vega-Rodríguez, Miguel A., Pérez, Carlos J., Martínez-Sarriegui, Iñaki, and López-Muñoz, Joaquín M.
Published: 2025
Full Text: View/download PDF

2. EXPLORER LES MOTS DU POLITIQUE DANS LA TRANSFORMATION NUMERIQUE Analyser le lexique politique dans des contextes et selon des ressources en évolution.

Author: LONGHI, JULIEN
Subjects: DIGITAL transformation, PUBLIC opinion, ARTIFICIAL intelligence, POLITICAL organizations, DISCOURSE analysis, RUMOR
Abstract: The evolution of digital technologies has profoundly transformed the way we exchange political ideas. The rise of social networks, blogs, and online discussion forums has provided an increasingly accessible platform for political organizations to communicate and interact with voters. In order to explore words in digital transformation, and the use of tools and/or approaches for studying lexicon and phraseology in evolving discursive domains, we chose to focus our study on words in the political domain, in the context of the 2017 and 2022 French political elections. Thus, in the interval of two presidential campaigns, the political context has changed enormously (evolution of the French political landscape, recomposition of parties and the electorate), and corpus analysis technologies have also undergone a great evolution. The analysis of digital political discourse has become increasingly important due to the growing importance of the Internet and social media in public debate and opinion formation. New challenges in digital political discourse analysis include the sheer volume of data (which is often noisy and contains redundant or irrelevant information), polarization, misinformation, and the difficulty of distinguishing between facts, opinions, and rumors, especially in short messages. This paper will therefore address both the methodological and technological transformations, as well as the discursive and argumentative transformations, of the analysis of political words in an electoral context, through the presentation of different projects and initiatives that have marked the scientific landscape during the campaigns. 
Finally, to deepen this inventory, and to address the issue of lexicon and political discourse, we will focus on the theme of the analysis of the candidates’ style, by highlighting the way in which the use of deep learning and textual statistics can help to better understand the evolution of political discourse, and to measure the contribution of recent technologies and tools mobilizing Artificial Intelligence. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

3. Trends in Environmental Management Systems Research. A Content Analysis

Author: Fagioli Filippo Fiume, Paolotti Luisa, and Boggia Antonio
Subjects: emas, environmental management systems (ems), iso 14000, text mining, textual statistics, Renewable energy sources, TJ807-830
Abstract: Environmental Management Systems (EMS) represent a solid opportunity for companies that voluntarily undertake a path towards the adoption of environmental protection policies. Many studies have investigated the link between EMS and improved environmental performance, demonstrating how the adoption of an international standard such as ISO 14000 or EMAS guarantees the company an improvement in its performance and an increase in transparency towards stakeholders. However, scientific research in this field reflects the strong variability due to the breadth and diversity of sectors in which the two international standards can be adopted. The aim of this work is to analyse, through textual statistics and text mining methods, the trends that characterize scientific production, highlighting the most debated topics useful for developing new research perspectives in this field.
Published: 2022
Full Text: View/download PDF

4. Rationalization Processes Between Social Representations and Semantic Block Theory

Author: Bordarie, Jimmy, Sarti, Alessandro, Series editor, and Compagno, Dario, editor
Published: 2018
Full Text: View/download PDF

5. Ciudadanías ciudadanas: un estudio exploratorio sobre representaciones sociales de la ciudadanía en Francia y Colombia.

Author: BETANCOURT, VALERIA AYOLA
Subjects: *CITIZENSHIP, *COLOMBIANS, *CITIZENS, *STATISTICS
Abstract: How do Colombians and how do the French understand the notion of citizenship? This article seeks to answer this broad question, studying citizenship through the representations it gives rise to in citizens themselves. Using data from an exploratory and non-representative survey, I compare the representations of citizenship held by a group of French and Colombian citizens, through word association and an exercise in textual statistics. The results highlight citizenship's consensus zone, while describing the particularities found among the respondents in relation to their declared nationality. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

6. Uso de la familia léxica de sovereignty en los discursos de la Presidencia del Banco Central Europeo (2003-2016)

Author: Albert Morales
Subjects: institutional discourse, textual statistics, lexicometrics, discourse analysis, European Central Bank (ECB), Political science, Political theory, JC11-607
Abstract: The financial crisis stemming from the US subprime mortgage meltdown represented a setback for the European economy that still affects many European countries. The European Central Bank (ECB), a key monetary policy institution, is one of the major actors shaping European neoliberal financial discourse. An earlier lexicometric analysis of a corpus of speeches delivered by ECB presidents between 2003 and 2016 revealed a high specificity index for the term sovereignty in certain years. This article aims to analyze how the ECB justifies transferring, inter alia, financial and national sovereignty to the European Union and its institutions and what the ECB refers to when utilizing this highly symbolic concept from public discourse. Ultimately, this work aims to determine how the ECB, “Europe’s illicit emperor” (Bouchard 2013), frames its discourse of economic domination. Consequently, this article quantitatively (using Lexico 3.6) and qualitatively analyses the various forms in this lexical family (sovereignty, sovereignties, sovereign, sovereigns) to assess how they have evolved over time in the speeches given by ECB presidents. It adopts a methodology steeped in the theory and practice of discourse analysis, studying the contexts in which the selected terms appear in order to shed light on the implications of these changing backdrops.
Published: 2017
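
The abstract of item 6 refers to a "specificity index" for a term in certain years. As a rough illustration only, the sketch below assumes the hypergeometric model commonly used in lexicometry, in which a part of the corpus (for example one year of speeches) is treated as a random draw of tokens from the whole corpus. Whether Lexico 3.6 computes exactly this quantity is an assumption, the function name is hypothetical, and the counts are invented for the example rather than taken from the article.

```python
from math import comb


def over_representation_p(k: int, f: int, t: int, T: int) -> float:
    """Probability of seeing at least k occurrences of a term in one part.

    T: total tokens in the corpus, t: tokens in the part,
    f: occurrences of the term in the whole corpus.
    Assumes the part is a random draw of t tokens without replacement.
    """
    total = comb(T, t)
    favourable = sum(
        comb(f, i) * comb(T - f, t - i) for i in range(k, min(f, t) + 1)
    )
    return favourable / total


# Hypothetical counts, not taken from the article: the term occurs 40 times
# in a 10,000-token corpus, and 12 times in a 1,000-token part.
p = over_representation_p(k=12, f=40, t=1000, T=10000)
print(f"P(X >= 12) = {p:.6f}")
```

A very small probability suggests the term is over-represented in that part; lexicometric tools typically report such probabilities on a logarithmic scale.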

7. Quousque tandem abutere, Catilina, patientia nostra : quand Cicéron disait nous…

Author: Longrée, Dominique
Subjects: motif textuel, Ciceron, nostra, textual statistics, statistique textuelle, text patterns, Cicero
Abstract: In his philosophical dialogues, Cicero regularly uses an ambiguous "we" (nous). In the Tusculan Disputations, the pronoun nostra refers to the Romans as a whole as opposed to the Greeks, while in other cases, mainly in his correspondence, "we" may refer to Cicero alone in particularly affective contexts, for example when he addresses his wife from his place of exile. There are many examples where Cicero refers to himself as nos instead of ego without associating anyone else with his own person. In his speeches, which all have a political dimension in one way or another, even when they are judicial speeches, the Arpinate sometimes uses "we" in a deliberately ambiguous way. Previous research has shown that the use of the singular or plural of the first person can be one of the parameters for identifying variants of the same textual motif: the singular is peculiar to Cicero, while the plural is characteristic of Caesar and Sallust. The aim of this research is to determine to what extent a textometric analysis, based on a topological approach and using the notion of "textual motif", can isolate, through Cicero's speeches and correspondence, criteria for identifying the various types of use of "we" just mentioned.
Published: 2022

8. A estatística textual computadorizada e a literatura brasileira: uma análise do romance Miragem, de Coelho Neto.

Author: Pires de Assis, Emanoel Cesar and Lopes, Daniel
Abstract: In this article, we reveal, through a stylometric reading, characteristics of Coelho Neto's style in Miragem (1895). In addition, we compare the style of the writer with the style of three contemporary writers: Aluísio Azevedo, O Cortiço (1890); Adolfo Caminha, Bom-Crioulo (1895); and Machado de Assis, Dom Casmurro (1900). The research methodology adopted in this study is supported by stylometry (Brandão, 2006, Cúrcio, 2006, 2007, 2013, Freitas, 2007, Paiva, 2013, Assis, 2013 and Silva, 2014) and dialogues, whenever possible, with the literary criticism directed to Coelho Neto and his work, mainly in relation to his style. Among our findings, we have found that the uses of adjectives, verbs, and adverbs in Coelho Neto's criticism of the composition of his style are apparently normal when compared to the style of other writers. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

9. "My Inordinate Reluctance to Repeat a Word." A Lexicometric Report on Peirce's Collected Papers.

Author: Gaspard, Jeoffrey
Subjects: *PHILOSOPHY, *TRUTH, *PROPOSITION (Logic), *ANTHOLOGIES
Abstract: In this report, we present the results of two lexicometric analyses performed on a digitalized version of Charles S. Peirce's Collected Papers. We calculated the most frequently cited terms in the corpus and its most overrepresented terms, as compared with a sample of the Corpus of Historical American English. Combining both analyses allowed us to produce a shorter list of the "most important terms" of the Collected Papers, deemed to be distinctive of Peirce's philosophical interests. One of the most significant findings is the quantitative evidence that Peirce's work is primarily concerned with truth and propositions. We suggest that such statistical analyses offer an interesting overview of Peirce's philosophical interests in an unprecedented way. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF
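
The abstract of item 9 describes two steps: counting the most frequent terms and finding the terms that are overrepresented relative to a reference corpus. The report's abstract does not say which statistic was used, so the sketch below swaps in the log-likelihood ratio (G2), a common choice for this kind of comparison. The function names and the toy token lists are hypothetical.

```python
from collections import Counter
from math import log


def log_likelihood(a: int, b: int, n1: int, n2: int) -> float:
    """G2 score for a term seen a times in a study corpus of n1 tokens
    and b times in a reference corpus of n2 tokens."""
    e1 = n1 * (a + b) / (n1 + n2)  # expected count in the study corpus
    e2 = n2 * (a + b) / (n1 + n2)  # expected count in the reference corpus
    g2 = 0.0
    if a > 0:
        g2 += a * log(a / e1)
    if b > 0:
        g2 += b * log(b / e2)
    return 2 * g2


def overrepresented_terms(study_tokens, reference_tokens, top=5):
    """Rank terms that are relatively more frequent in the study corpus."""
    study, ref = Counter(study_tokens), Counter(reference_tokens)
    n1, n2 = sum(study.values()), sum(ref.values())
    scored = []
    for term, a in study.items():
        b = ref.get(term, 0)
        if a / n1 > b / n2:  # skip terms that are not overrepresented
            scored.append((term, log_likelihood(a, b, n1, n2)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top]


# Toy data, invented for illustration only.
study = "truth proposition sign truth logic sign truth the the of".split()
reference = "the of the and of the river truth house the".split()

print(Counter(study).most_common(3))           # most frequent terms
print(overrepresented_terms(study, reference))  # most overrepresented terms
```

Intersecting the two rankings is one way to arrive at a shorter list of distinctive terms, in the spirit of what the abstract describes.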

10. Paradoxical territorial management: the case of peri-urban agricultural areas.

Author: Hernandez, Solange
Subjects: URBAN land use, AGRICULTURE, PUBLIC administration
Abstract: The purpose of this research is to understand the paradoxical practices of territorial management and to look into ways of supporting the local actors of peri-urban agriculture. To do so, we conducted a case study into the policy implemented for the past 25 years by the urban community of the Pays d’Aubagne et de l’Etoile (Bouches-du-Rhône). Drawing on quantitative analyses of qualitative data, we demonstrate the relevance of these management practices in action. We go on to propose managerial tools that can be applied in the framework of public action in peri-urban agriculture. Points for practitioners: the practical uses of the article are, first, to identify and understand the paradoxes, a recurring source of tension and deadlock in territorial public administration; and, second, to contribute concrete avenues of reflection and solutions to the territorial managers faced with these situations, especially in the management of peri-urban agricultural areas. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

11. Acquiring an Ontology from the Text : A Legal Case Study

Author: Casellas, Núria, Jakulin, Aleks, Vallbé, Joan-Josep, Casanovas, Pompeu, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Ali, Moonis, editor, and Dapoigny, Richard, editor
Published: 2006
Full Text: View/download PDF

12. Les personnages de théâtre ont-ils un style ? Recherche outillée sur un corpus théâtral trilingue

Author: Vandersmissen, Marc
Subjects: stylistique outillée, digital stylistics, études du discours, textual statistics, General Earth and Planetary Sciences, statistique textuelle, discourse studies, theatre characters, personnages de théâtre, General Environmental Science
Abstract: In the context of the recent development of digital stylistics, this paper proposes a study on the application of this concept and its methods to theatrical characters on the basis of a trilingual corpus of tragedies: Euripides, Seneca and Corneille. To conduct the research, we will first address the question of the nature of theatrical roles between recomposed textual units and character discourse in the context of a stage performance. Then, we will try to define whether the characteristics of these units can be related to the notion of style. We will study its contribution to the understanding of a work of theatre. To achieve this objective, particular attention will be paid to the research corpus (which works should be chosen?), to its creation (how should this corpus be divided and with what level of granularity?) and to the exploration tools (qualitative or quantitative?). In digital stylistics, the corpus itself becomes a research tool as a contrast operator. Finally, pursuing this investigation leads us to question the frontier between discourse studies and digital stylistics in terms of their objectives and methods.
Published: 2022
Full Text: View/download PDF

13. Traduire le polar: une étude textométrique comparée de la phraséologie du roman policier en français source et cible.

Author: Kraif, Olivier
Abstract: Copyright of Synergies Pologne is the property of GERFLINT (Groupe d'Etudes et de Recherches pour le Francais Langue Internationale) and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2017

14. Analyses of open-ended questions by renormalized associativities and textual networks: A study of perception of minerality in wine.

Author: Deneulin, Pascale and Bavaud, François
Subjects: *CONSUMER preferences, *WINES, *MINERAL analysis, *FOOD quality, *FOOD science
Abstract: Verbal tasks are increasingly used in food science, but still often suffer from time-consuming manual preprocessing procedures. Also, traditional visualization techniques are not always successful at clearly revealing the structure of term co-occurrences. The present study proposes a few statistical innovations in the analysis of textual data resulting from an open-ended survey on minerality perception, without tasting phase. First, we use dedicated, amenable software aimed at producing term lemmatization and construction of contingency table, enabling minimum manual verification and correction. Furthermore, co-occurrences are treated as a textual weighted network, which can be further iterated and renormalized in a flexible way, filtering out rare terms and their associations. In addition, visualization and clustering techniques, initially developed in social networks studies, reveal meaningful and well-defined terms communities, corresponding to distinct conceptions of minerality. Results are exclusively based upon statistical methods, without resorting to semantic nor linguistic considerations. Altogether, they demonstrate the polysemy and ill-definiteness of the concept of minerality among wine professionals. [ABSTRACT FROM AUTHOR]
Published: 2016
Full Text: View/download PDF
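
The abstract of item 14 treats term co-occurrences in open-ended answers as a weighted network and filters out rare terms. The sketch below shows only that basic construction, assuming each answer has already been reduced to a list of lemmas. It does not reproduce the authors' renormalization or iteration steps; the function name, the threshold, and the example answers are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_network(responses, min_term_count=2):
    """Return {(term_a, term_b): weight}, where weight is the number of
    responses in which both terms appear. Terms found in fewer than
    min_term_count responses are dropped before edges are counted."""
    term_sets = [set(r) for r in responses]
    doc_freq = Counter(term for terms in term_sets for term in terms)
    kept = {t for t, c in doc_freq.items() if c >= min_term_count}
    edges = Counter()
    for terms in term_sets:
        # Sorting gives each unordered pair a single canonical key.
        for a, b in combinations(sorted(terms & kept), 2):
            edges[(a, b)] += 1
    return dict(edges)


# Invented lemmatized answers, for illustration only.
responses = [
    ["stone", "fresh", "acidity"],
    ["stone", "salty"],
    ["fresh", "acidity", "salty"],
    ["stone", "fresh"],
]
network = cooccurrence_network(responses)
for pair, weight in sorted(network.items()):
    print(pair, weight)
```

The resulting edge weights could then be passed to a graph library for clustering or visualization, which is where community detection of the kind mentioned in the abstract would come in.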

15. La textométrie, une approche quantitative et qualitative des données textuelles

Author: Pincemin, Bénédicte; Institut d’Histoire des Représentations et des Idées dans les Modernités (IHRIM), École normale supérieure de Lyon (ENS de Lyon)-Université Lumière - Lyon 2 (UL2)-Université Jean Moulin - Lyon 3 (UJML)-Université de Lyon-Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-Université Jean Monnet - Saint-Étienne (UJM)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS); MSH Lyon Saint-Étienne; Data infrastructure Progedo; ANR-17-CE38-0010, ANTRACT, Analyse Transdisciplinaire des Actualités filmées (1945-1969) (2017); ANR-10-EQPX-0021, MATRICE, Outils de recherche pour l'analyse de la mémoire par la coopération internationale et les expérimentations. Entre mémoire individuelle et mémoire sociale: les outils et nécessités de l'innovation (2010)
Subjects: Textual Data Analysis, Textual Statistics, Textometry, Text Analysis Software, TXM software, IRaMuTeQ software, DtmVic software, Digital Humanities, open-source, [SHS.STAT] Humanities and Social Sciences/Methods and statistics, [SHS.LANGUE] Humanities and Social Sciences/Linguistics
Abstract: This talk first questions the concrete realities of "textual data" and acknowledges the diversity of viewpoints on such data, which can differ widely between production and research contexts and, within research itself, between the social sciences and the humanities. This diversity of viewpoints is matched by a diversity of methods and tools (CAQDAS, concordancers, text mining, etc.), among which textometry needs to be situated. The talk focuses on the textometric approach. We illustrate this approach by going through the main types of questions for which textometry has developed tools: what can one do with textometry software? Then, taking a step back, we highlight the underlying principles and the key methodological elements of the textometric approach, to be discussed in light of the expectations of research in the humanities and social sciences. The open-source software TXM is the main support for the talk.
Published: 2020

16. Rhetorical Strategy in Forensic Speeches: Multidimensional Statistics-Based Methodology.

Author: Bécue-Bertaut, Mónica, Kostov, Belchin, Morin, Annie, and Naro, Guilhem
Subjects: *RHETORICAL analysis, *MULTIDIMENSIONAL scaling, *FORENSIC orations, *AUTOMATION, *COMMUNICATION, *CORRESPONDENCE analysis (Communications)
Abstract: Rhetorical strategy is relevant in the law domain, where language is a vital instrument. Textual statistics have much to offer for uncovering such a strategy. We propose a methodology that starts from a non-structured text; first, the breakpoints are automatically detected and lexically homogeneous parts are identified; then, the shape of the text through the trajectory of these parts and their hierarchical structure are uncovered; finally, the argument flow is tracked along. Several methods are combined. Chronological clustering of multidimensional count series detects the breakpoints; the shape of the text is revealed by applying correspondence analysis to the parts×words table while the progression of the argument is described by labelled time-constrained hierarchical clustering. This methodology is illustrated on a rhetoric forensic application, concretely a closing speech delivered by a prosecutor at Barcelona Criminal Court. This approach could also be useful in politics, communication and professional writing. [ABSTRACT FROM AUTHOR]
Published: 2014
Full Text: View/download PDF

17. Tracking verbal-based methods beyond conventional descriptive analysis in food science bibliography. A statistical approach.

Author: Bécue-Bertaut, Mónica
Subjects: *FOOD science, *BIBLIOGRAPHY, *FACTOR analysis, *STATISTICAL correlation, *DISTRIBUTION (Probability theory), *PATH analysis (Statistics)
Abstract: Highlights: [•] A bibliographic study of a base of abstracts is performed. [•] Textual analysis methods are used to detect changes in the verbal-based methods. [•] Multiple factor analysis for contingency tables is a favoured tool. [•] Novelties and pioneer works are reported. [ABSTRACT FROM AUTHOR]
Published: 2014
Full Text: View/download PDF

18. Vers une cartographie géo-lexicale

Author: William Martinez
Subjects: lexical co-occurrences, associated word networks, textual statistics, GIS, Fine Arts
Abstract: The statistical analysis of vocabulary distribution in French tourist guides of the 19th and 20th century reveals the essential descriptive terminology used in these texts. A more structured interpretation of this data is made possible by way of co-occurrence methods that produce associated word maps describing the preferred usage of nouns, adjectives, adverbs etc. regarding a given town, region or route. Based on this original visualizing of word structures we will consider the convergence of lexical and cartographic data in a GIS-type database (Geographic Information System).
Published: 2011
Full Text: View/download PDF

19. Recuperación de información automática de ofertas de empleo: estudio de las competencias de los profesionales de recursos humanos en el mercado de trabajo español.

Author: VALENCIA GARCÍA, OLGA
Subjects: *CORE competencies, *JOB skills, *PERSONNEL management, *LABOR market, *JOB offers, *CASE studies
Abstract: Access to information on competencies demanded by companies may be obtained through a prospective study of job offers posted by employers. However, these include a large amount of quantitative and qualitative but mainly textual data, so an appropriate approach should be chosen in order to obtain a detailed knowledge of competencies in certain areas and professional profiles. The aim of this paper is to provide empirical evidence of the competencies required of HR professionals both generally and in terms of different job profiles in this area. The identification of competencies is accomplished by the application of statistical methods to automatically processed texts, which dispenses with previous categorization of competencies in order to preserve the texts in their native format. The research is based on a textual corpus on 'Soft Competencies', which has been built from hundreds of job offers for HR professionals, posted by either recruitment consultants or direct employers within the Spanish labor market. After a process of standardization and lemmatization of the textual corpus, the key terms for HR soft competencies have been extracted, and thus the most requested competencies overall. Likewise, by combining the textual information about competencies with qualitative data concerning job profiles, modal texts may be determined and different competency profiles can be outlined. Finally, the visualization of the associations between soft competencies and HR job profiles has been carried out by means of a Correspondence Analysis of an aggregated lexical table. [ABSTRACT FROM AUTHOR]
Published: 2013
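
The abstract of item 19 ends with a Correspondence Analysis of an "aggregated lexical table", that is, a table of term counts summed over all texts that share a category such as a job profile. The sketch below illustrates only the aggregation step, assuming the texts are already lemmatized. The function name, the profile labels, and the tokens are hypothetical, and the correspondence analysis itself is not implemented here.

```python
from collections import Counter, defaultdict


def aggregated_lexical_table(documents):
    """Sum term counts over all documents sharing a category.

    documents: iterable of (category, tokens) pairs.
    Returns {category: {term: count}}.
    """
    table = defaultdict(Counter)
    for category, tokens in documents:
        table[category].update(tokens)
    return {category: dict(counts) for category, counts in table.items()}


# Invented job offers, already lemmatized, for illustration only.
offers = [
    ("recruiter", ["teamwork", "communication"]),
    ("recruiter", ["teamwork", "initiative"]),
    ("hr_manager", ["leadership", "communication"]),
]
table = aggregated_lexical_table(offers)
for profile, counts in table.items():
    print(profile, counts)
```

Each row of the resulting table is one job profile and each column one term, which is the categories-by-words layout that a correspondence analysis takes as input.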

20. Analyser un corpus illisible? Le logiciel Alceste confronté à des registres de condoléances.

Author: Truc, Gérôme
Subjects: CONTENT analysis, SOCIOLOGY methodology, CONDOLENCE notes, COMMUNICATIONS research, SOCIAL scientists, COMPUTER software
Abstract: Copyright of Langage & Societe is the property of Fondation Maison des Sciences de l'Homme.
Published: 2011
Full Text: View/download PDF

21. Postures énonciatives et mondes lexicaux stabilisés en analyse statistique de discours.

Author: Reinert, Max
Subjects: DISCOURSE analysis, DISCURSIVE practices, FRENCH language, LANGUAGE & languages, LINGUISTICS, SEMANTICS, LEXICOLOGY
Abstract: Copyright of Langage & Societe is the property of Fondation Maison des Sciences de l'Homme.
Published: 2007
Full Text: View/download PDF

22. Uso de la familia léxica de sovereignty en los discursos de la Presidencia del Banco Central Europeo (2003-2016)

Author: Morales, Albert
Subjects: Lexicometrics, Textual statistics, Institutional discourse, Discourse analysis, European Central Bank (ECB), lcsh:Political science, lcsh:Political theory, lcsh:JC11-607, lcsh:J, 3 - Ciencias sociales::33 - Economía [CDU]
Abstract: The financial crisis stemming from the US subprime mortgage meltdown represented a setback for the European economy that still affects many European countries. The European Central Bank (ECB), a key monetary policy institution, is one of the major actors shaping European neoliberal financial discourse. An earlier lexicometric analysis of a corpus of speeches delivered by ECB presidents between 2003 and 2016 revealed a high specificity index for the term sovereignty in certain years. This article aims to analyze how the ECB justifies transferring, inter alia, financial and national sovereignty to the European Union and its institutions and what the ECB refers to when utilizing this highly symbolic concept from public discourse. Ultimately, this work aims to determine how the ECB, “Europe’s illicit emperor” (Bouchard 2013), frames its discourse of economic domination. Consequently, this article quantitatively (using Lexico 3.6) and qualitatively analyses the various forms in this lexical family (sovereignty, sovereignties, sovereign, sovereigns) to assess how they have evolved over time in the speeches given by ECB presidents. It adopts a methodology steeped in the theory and practice of discourse analysis, studying the contexts in which the selected terms appear in order to shed light on the implications of these changing backdrops.
Published: 2017

23. Socioenvironmental changes and dynamics of agropastoral practices in sahelian Africa : comparative study between Ferlo (Senegal), Gourma (Mali) and Fakara (Niger)

Author: MAREGA, OUMAR, Territoires, Villes, Environnement & Société - ULR 4477 (TVES), Université du Littoral Côte d'Opale (ULCO)-Université de Lille, Pôle de recherche pour l'organisation et la diffusion de l'information géographique (PRODIG), Université Paris 1 Panthéon-Sorbonne (UP1)-Institut de Recherche pour le Développement (IRD)-École pratique des hautes études (EPHE), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Université Paris-Sorbonne (UP4)-AgroParisTech-Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), Université paris Diderot-Paris 7, UMR 8586 PRODIG-CNRS, Catherine MERING et Frédéric ALEXANDRE, and Université Paris 1 Panthéon-Sorbonne (UP1)-Institut de Recherche pour le Développement (IRD)-École Pratique des Hautes Études (EPHE)
Subjects: [SHS.STAT]Humanities and Social Sciences/Methods and statistics, Natural resources management and use, Vulnerability and adaptability, [SDE.IE]Environmental Sciences/Environmental Engineering, Surveys and perception, Environnement et dynamique des paysages, Enquêtes et perception, [SDE.MCG]Environmental Sciences/Global Changes, Textual statistics, Remote Sensing and Geomatics, [SHS.GEO]Humanities and Social Sciences/Geography, Télédétection et Géomatique, Statistiques textuelles, [SDE.ES]Environmental Sciences/Environmental and Society, Agropastoralism and rural development, [SHS.ENVIR]Humanities and Social Sciences/Environmental studies, Agropastoralisme et développement rural, Gestion et usages des ressources naturelles, Vulnérabilité et adaptabilité, Environment and landscape dynamics, [SDE.BE]Environmental Sciences/Biodiversity and Ecology, [SHS.LANGUE]Humanities and Social Sciences/Linguistics
Abstract: In a biophysical context characterized by major socioenvironmental changes, the main objective of this PhD is to understand to what extent agropastoral practices modulate, in space and time, the vulnerability and adaptability of agroecosystems and Sahelian agropastoral societies. To this end, the question was addressed at the macro-regional scale, relying on three micro-regional study areas, Ferlo (Senegal), Gourma (Mali) and Fakara (Niger), in which the perceptions and representations of societies were studied in a sample of villages and camps. The method developed in this PhD has two components: 1) remote sensing mapping based on satellite image processing, and 2) geo-textual analyses based on lexicometry applied to semi-structured interviews. The remote sensing mapping provided a diagnosis, at micro-regional and local scales, of land cover change from the 1970s to 2010. This diagnosis, based on a multiscale approach, shows intra- and inter-regional contrasts in the evolution of vegetation cover that call for nuancing observations of a greening of the Sahel. Combining remote sensing with field investigations helped, first, to understand the evolution of plant resources and, second, to consider the variety of responses of Sahelian rural societies to the climate crisis of the 1970s and 1980s and to the recent socio-economic changes experienced by Sahelian countries. Through lexicometry, geo-textual analysis based on semi-structured interviews made it possible to approach livestock farmers’ perceptions of the vulnerability and resilience of natural resources. In light of the perceptions expressed, the adaptive practices were identified in their variety. 
This made it possible to show the sociospatial reconfigurations in the trajectories of agricultural and pastoral activities that partly explain the contrasts within and between regions observed in Ferlo, Gourma and Fakara. Dans un contexte biophysique caractérisé par d’importants changements socio-environnementaux, l’objectif principal de la thèse vise à comprendre dans quelle mesure les pratiques agropastorales modulent dans l’espace et dans le temps la vulnérabilité et l’adaptabilité des agro-écosystèmes et des sociétés agropastorales du Sahel. Afin d’atteindre le but visé, cette problématique a été envisagée à l’échelle macro-régionale, celle du Sahel, en s’appuyant sur trois sites d’étude micro-régionaux à savoir le Ferlo (Sénégal) ; le Gourma (Mali) et le Fakara (Niger) au sein desquels les perceptions et les représentations des sociétés ont été étudiées dans un échantillon de villages et de campements. La méthode d’analyse élaborée comporte deux volets : 1) un travail de cartographie par télédétection basé sur le traitement d’images satellites, et 2) des analyses géo-textuelles basées sur la lexicométrie à partir d’entretiens semi-directifs. La cartographie par télédétection a permis d’établir un diagnostic aux échelles micro-régionales et locales sur l’évolution du couvert végétal depuis les années 70 jusqu’en 2010. Du diagnostic tiré d’une approche multiscalaire, il ressort des contrastes intra et inter régionaux dans l’évolution de la couverture du sol par la végétation qui invitent à nuancer les observations faisant état d’un reverdissement du Sahel. L’articulation de la télédétection avec les enquêtes de terrain a permis d’appréhender l’évolution des ressources végétales d’une part, et d’autre part d’examiner la diversité des réponses données par les sociétés rurales du Sahel à la crise climatique des années 70 et 80, et aux évolutions socio-économiques récentes qu’ont connu leur pays et leur région. 
A travers la lexicométrie, des analyses géo-textuelles basées sur les entretiens semi-directifs ont permis d’approcher les perceptions des agropasteurs au sujet de la vulnérabilité et de la résilience des ressources naturelles. A la lumière des perceptions exprimées, les pratiques adaptatives ont été appréhendées dans leur variété. Il en ressort des reconfigurations socio-spatiales dans les trajectoires des activités agricoles et pastorales qui expliquent en partie les contrastes intra et inter régions observés entre le Ferlo, le Gourma et le Fakara.
Published: 2016

24. Les thèmes dans le discours électoral de candidature à la députation sous la Cinquième République. Perspective de genre (1958-2007)

Author: Magali Guaresi
Subjects: isotropes, thème politique, Energy Engineering and Power Technology, Management Science and Operations Research, cooccurrence, genre, textual statistics, gender, 0601 history and archaeology, statistique textuelle, cooccurrences, 060201 languages & linguistics, 060103 classics, temas políticos, estadística textual, Mechanical Engineering, coocurrencias, 06 humanities and the arts, 16. Peace & justice, discurso electoral, isotropías, political issues, 0602 languages and literature, isotropie, discours électoral, electoral discourse, género
Abstract: À partir de l’étude d’un corpus de professions de foi de femmes et d’hommes élus à la députation sous la Cinquième République, cette contribution vise à observer la structure thématique du discours électoral à l’aune d’une perspective de genre (féminin/masculin). La féminisation de la sphère politique introduit-elle des évolutions ou des modulations de l’organisation thématique du discours de candidature ? En parallèle, cette analyse se fixe pour objectif d’expliciter les méthodologies de recherche cooccurrentielles mises en œuvre dans les repérages thématiques en logométrie. Drawing on a corpus of election manifestos of women and men elected to parliament under the Fifth Republic, this paper examines the thematic structure of electoral discourse from a gender perspective (women/men). It asks whether the feminisation of the political sphere brings about any evolutions or modulations in the thematic organisation of candidates’ discourse. The analysis also aims to clarify the co-occurrence-based research methodologies used for thematic identification in logometry. Basándose sobre el estudio de un corpus de actos de candidatura de mujeres y de hombres elegidos a la diputación bajo la Quinta República francesa, esta contribución tiene como objetivo observar la estructura temática del discurso electoral dentro de una perspectiva de género (femenino/masculino). ¿Se operará la feminización de la esfera política introduciendo evoluciones o modulaciones en la organización temática del discurso de candidatura? De manera conjunta, este análisis tiende a explicitar las metodologías de investigación coocurrenciales desarrolladas a favor de las determinaciones temáticas en logometría.
Published: 2015

25. L'UE et ses voisinages : acteurs et espaces d'un discours

Author: Beauguitte, Laurent, Guérin-Pace, France, Richard, Yann, Collège international des sciences territoriales (FR2007 CIST), Université Paris 1 Panthéon-Sorbonne (UP1)-Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), Institut national d'études démographiques (INED), Pôle de recherche pour l'organisation et la diffusion de l'information géographique (PRODIG), Université Paris 1 Panthéon-Sorbonne (UP1)-Institut de Recherche pour le Développement (IRD)-École pratique des hautes études (EPHE), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Université Paris-Sorbonne (UP4)-AgroParisTech-Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), Collège international des sciences du territoire (CIST), Université Paris Diderot - Paris 7 (UPD7)-Université Panthéon-Sorbonne (UP1)-Centre National de la Recherche Scientifique (CNRS), Centre National de la Recherche Scientifique (CNRS)-Université Paris Diderot - Paris 7 (UPD7)-AgroParisTech-Université Paris-Sorbonne (UP4)-École pratique des hautes études (EPHE)-Institut de Recherche pour le Développement (IRD)-Université Panthéon-Sorbonne (UP1), Beauguitte, Laurent, Université Paris 1 Panthéon-Sorbonne (UP1)-Institut de Recherche pour le Développement (IRD)-École Pratique des Hautes Études (EPHE), Université Paris Diderot - Paris 7 (UPD7)-Université Paris 1 Panthéon-Sorbonne (UP1)-Centre National de la Recherche Scientifique (CNRS), and Sciencesconf.org, CCSD
Subjects: [SHS.GEO] Humanities and Social Sciences/Geography, international relations, voisinage, [SHS.GEO]Humanities and Social Sciences/Geography, political geography, géographie politique, [SHS]Humanities and Social Sciences, textual statistics, [SHS] Humanities and Social Sciences, European Union, statistique textuelle, Union européenne, neighbourhood, relations internationales
Abstract: In this paper, we analyse the European Neighbourhood Policy (ENP) on the basis of 7 texts published by the European Commission and then by the European External Action Service between 2003 and 2013. Analysing this corpus with the methods of textual statistics brings a complementary perspective to research in the field of critical discourse analysis. Without formulating any prior hypothesis, we let the content of the texts emerge through a quantitative analysis of the vocabulary and of the lexical fields that take shape over the decade studied. We paid particular attention to the naming of geographical entities in the way the neighbourhood policy is written. This lexical analysis of the “official” discourse of the neighbourhood policy makes a significant contribution to debates on regional integration at the macrogeographical level. Cette communication propose une analyse de la politique européenne de voisinage à partir des sept communications-cadres publiées par la Commission européenne et le Service européen d’Action extérieure entre 2003 et 2013. L’analyse de ce corpus de textes par les méthodes de la statistique textuelle apporte un éclairage complémentaire aux travaux qui s’inscrivent dans le champ de la critical discourse analysis. Le contenu des textes a été exploré par une analyse quantitative du vocabulaire et des champs lexicaux qui se dessinent au cours de la décennie étudiée. Nous avons porté une attention particulière à la désignation des entités géographiques dans la manière d’écrire la politique de voisinage. Cette analyse lexicale du discours « officiel » de la politique européenne de voisinage permet d’apporter une contribution significative aux débats sur l’intégration régionale à l’échelle macroscopique en décryptant le regard porté par les institutions européennes.
Published: 2014

26. Lexiques descriptifs et modélisation de la description en géographie. Du plan répétitif au découpage par des problématiques spatialisées

Author: Jean-Pierre Chevalier
Subjects: Géographie universelle, Modélisation, Histoire de la Géographie, Analyse lexicale, Ville, Geography, Planning and Development, Town, History of geography, Models, Textual statistics
Abstract: Descriptive lexicon and modelling description in geography. From repetitive schemes to division by spatialized problematics. Textual statistics are applied to four French Géographies Universelles directed or written by Conrad Malte-Brun (1842), Élisée Reclus (1876), Emmanuel de Martonne (1930) and Roger Brunet (1996). They help to model changes in geographical description. The analysis builds an interpretative model for each Géographie Universelle and a general model. It thereby shows the changes in geographical descriptions of Central European towns during the 19th and 20th centuries. In the past, the description of each town was divided into several textual contexts distributed throughout these descriptive texts. Today, there are fewer and fewer lexical contexts. Each description is now structured by a specific spatial problematic. Les procédés d'analyse lexicale automatisée sont utilisés ici pour construire un modèle interprétatif de l'évolution du genre descriptif en Géographie. Le corpus est composé des descriptions de 20 villes provenant de 4 Géographies universelles, de l'édition de 1842 de la G. U. de Conrad Malte-Brun à celle de Roger Brunet en 1996. Cette technique d'émergence de contextes lexicaux permet de caractériser les descriptions des différents auteurs. La confrontation de ces résultats permet de mettre en valeur les permanences et les ruptures dans ce genre de texte. En effet, depuis le début du 19ème siècle les discours sont devenus moins analytiques, aussi les contextes lexicaux se réduisent en nombre. Ils ne se construisent plus au travers d'un ordre d'exposition dicté par un itinéraire descriptif type, mais se structurent autour de problématiques propres à chaque espace étudié. Chevalier Jean-Pierre. Lexiques descriptifs et modélisation de la description en géographie. Du plan répétitif au découpage par des problématiques spatialisées. In: Hommes et Terres du Nord, 2000/3. Description et modélisation en géographie. pp. 127-137.
Published: 2000
Full Text: View/download PDF

27. La répétition dans la phrase. Étude statistique

Author: Etienne Brunet, BCL, équipe Logométrie : corpus, traitements, modèles, Bases, Corpus, Langage (UMR 7320 - UCA / CNRS) (BCL), Université Côte d'Azur (UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Université Côte d'Azur (UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA), Presses universitaires de Franche-Comté, and A. Rabatel
Subjects: [SHS.LITT]Humanities and Social Sciences/Literature, 050801 communication & media studies, corpus, Récurrence, Lexicométrie, Frantext, [SHS]Humanities and Social Sciences, 03 medical and health sciences, 0302 clinical medicine, 0508 media and communications, [SHS.LANGUE]Humanities and Social Sciences/Linguistics, Googlebooks, Giraudoux, Lexicometry, Repetition (rhetorical device), Philosophy, 05 social sciences, Répétition, Textual statistics, Google Books, 16. Peace & justice, stylistique, répétition, lexicométrie, Textométrie, Humanities, 030217 neurology & neurosurgery, linguistique
Abstract: Tout l’univers lexicométrique est fondé sur la répétition. La fréquence d’un mot dans un texte, c’est le nombre qui mesure sa répétition. Or la fréquence des mots et des segments a été considérée généralement à l’échelle large du texte et du corpus, à un niveau où la répétition n’est sensible ni à l’oreille ni à la mémoire. Il convient de se situer aussi à l’échelle du micro-texte et de la séquence courte qui s’arrête à la fin de la phrase ou du paragraphe et où se limite la portée de la rémanence mémorielle. On se propose de mener une enquête statistique à ce niveau du micro-texte, à travers un roman de Giraudoux, puis de son œuvre entière, avant de s’attaquer à l’ensemble des écrivains français et enfin au corpus français de Google Books. The entire lexicometric universe is based on repetition. Indeed, the frequency of a word in a text is the measure of its repetition. However, the frequency of words and segments has been generally considered at the broad scale of the text and the corpus, at a level where repetition is neither detectable to the ear nor to memory. It is therefore useful to also examine frequency at the level of the microtext, of the short sequences that terminate at the end of sentences and paragraphs and that fall within the range of the perdurance of memory. We propose performing a statistical investigation at the microtext level, first of a novel by Giraudoux and then of his entire body of work, before undertaking an analysis of all French writers and ultimately the entire French corpus within Google Books.
Published: 2013

28. A cognitive process of responding to a spatial survey

Author: Grasland, Claude, Guérin-Pace, France, De Ruffray, Sophie, Giraud, Timothée, Géographie-cités (GC (UMR_8504)), Université Paris 1 Panthéon-Sorbonne (UP1)-Centre National de la Recherche Scientifique (CNRS)-Université Paris Diderot - Paris 7 (UPD7), Institut national d'études démographiques (INED), Identité et Différenciation de l’Espace, de l’Environnement et des Sociétés (IDEES), Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche Interdisciplinaire Homme et Société (IRIHS), Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Normandie Université (NU)-Université Le Havre Normandie (ULH), Normandie Université (NU)-Université de Caen Normandie (UNICAEN), Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Normandie Université (NU), Réseau interdisciplinaire pour l'aménagement et la cohésion des territoires de l'Europe et de ses voisinages (RIATE), Université Paris Diderot - Paris 7 (UPD7)-DATAR-Centre National de la Recherche Scientifique (CNRS), and European Project: 225260,EC:FP7:SSH,FP7-SSH-2007-1,EUROBROADMAP(2009)
Subjects: [SHS.STAT]Humanities and Social Sciences/Methods and statistics, spatial analysis, co-citations, textual statistics, [SHS.GEO]Humanities and Social Sciences/Geography, network analysis
Abstract: As demonstrated by the analysis of the general results of the EuroBroadMap survey on 18 countries and 42 places of survey (Didelon & al., 2011; Grasland & Beauguitte, 2012; Brennetot & al., 2013), the declaration by students of the countries and cities where they would or would not like to live is governed at the aggregated level by some general gravity laws. Our contribution proposes three alternative approaches for the analysis of datasets containing explicit or implicit lists of places where the order of the list is possibly important. Much research is based on this type of data, in which individuals are characterised by lists of places with an explicit or implicit ranking. The first hypothesis is that the places given by the respondents are not randomly distributed at the individual level. It means that, even if we do not consider the ranking of answers, an important part of the initial information is lost when we aggregate the lists proposed by individuals to upper levels of social or spatial aggregation. If we ask, for example, 6 students of a given country to quote 3 countries among seven proposed where they would like to live, and we obtain the set of answers (A,B,C), (A,B,C), (A,B,C), (C,D,E), (C,D,E), (C,D,E), we will conclude at an aggregated level that C is the most appreciated country (100% of answers contain the country C) and that A, B, D, E are equally appreciated (50% of answers). But we will lose crucial information about the fact that, apart from C, answers are strongly organised by the association of answers A-B on one hand, and D-E on the other hand. If we now ask the same students to give 3 countries where they would not like to live, with answers (D,E,F), (D,E,F), (D,E,F), (A,B,F), (A,B,F), (A,B,F), it will produce the same loss of information, but we will also miss the existence of interaction between answers to both questions. 
To demonstrate the existence of such effects, we propose to apply a set of methods from textual analysis where each place is considered as a word, each list as a sentence and each set of lists as a paragraph. These methods are able to take into account the contextual effects at different levels and can therefore help to validate or invalidate the existence of interactions between answers, not only at the level of individual lists but also between the different lists. The second hypothesis is the existence of a non-random distribution of ranks within answers. To illustrate this approach, let’s consider the following distribution of answers to the declaration of 3 countries among 8 countries where students would like to live: (A,B,C), (A,B,E), (A,B,F), (G,H,I), (H,I,G), (I,G,H). The aggregated approach will obviously miss a lot of information by concluding that there are only two levels of countries, (A,B,G,H,I) and (C,E,F). Textual analysis will reveal the existence of two separate clusters of associated answers, (A,B,C,D,E,F) against (G,H,I). We propose here to first apply network analysis methods based on the analysis of successive couples of countries, based on ranks, in order to build an oriented network (A is followed by B which is followed by C, which defines only two edges A-B and B-C, optionally completed by A-C). The network of followers in the list of countries reveals differences between countries that were not initially visible: the cluster A,B,C,D,E is based on a “tree” structure whereas the cluster (G,H,I) is rather like a “clique”. The third hypothesis is that the distance between places of a given list is not randomly distributed. We apply here spatial analysis methods in order to verify whether the choice of the places’ ranks is governed or not by rules of distance or other forms of proximity (common language, contiguity, colonial relations). 
For this, we compare the average distance between countries of the same list according to their ranks. Applying this set of methods to the results of the EuroBroadMap survey demonstrates first their great interest but also their complementarity. We suggest in conclusion extending this kind of application to different data corpora, such as co-citations of countries in press articles.
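The six-answer example in this abstract can be reproduced with a small sketch. It is only an illustration of the arithmetic quoted above, not the authors' method: the function names `country_shares` and `pair_counts` are hypothetical, and plain co-occurrence counting stands in for the textual-analysis methods the abstract mentions.

```python
from collections import Counter
from itertools import combinations

# Answers quoted in the abstract: six students each name three countries.
answers = [("A", "B", "C")] * 3 + [("C", "D", "E")] * 3


def country_shares(lists):
    """Share of lists containing each country (the aggregated view)."""
    counts = Counter(c for lst in lists for c in set(lst))
    return {c: n / len(lists) for c, n in counts.items()}


def pair_counts(lists):
    """Number of lists in which each unordered pair of countries co-occurs."""
    counts = Counter()
    for lst in lists:
        counts.update(combinations(sorted(set(lst)), 2))
    return counts


shares = country_shares(answers)
pairs = pair_counts(answers)

print(shares)  # C appears in every list; A, B, D, E in half of them
print(pairs)   # A-B and D-E co-occur; A-D, A-E, B-D, B-E never do
```

The aggregated shares treat A, B, D and E as interchangeable, while the pair counts show that A only ever appears with B and D only with E, which is the information the abstract says is lost under aggregation.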
Published: 2013

29. A cognitive process of responding to a spatial survey

Author: Claude GRASLAND, France Guérin-Pace, Sophie de Ruffray, Timothée Giraud, Géographie-cités (GC (UMR_8504)), Université Panthéon-Sorbonne (UP1)-Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), Institut national d'études démographiques (INED), Identités et Différenciation de l'Environnement des Espaces et des Sociétés (IDEES), Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Normandie Université (NU)-Université de Caen Normandie (UNICAEN), Normandie Université (NU)-Université Le Havre Normandie (ULH), Normandie Université (NU)-Institut de Recherche Interdisciplinaire Homme et Société (IRIHS), Normandie Université (NU)-Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Centre National de la Recherche Scientifique (CNRS), Réseau interdisciplinaire pour l'aménagement et la cohésion des territoires de l'Europe et de ses voisinages (RIATE), Université Paris Diderot - Paris 7 (UPD7)-DATAR-Centre National de la Recherche Scientifique (CNRS), European Project: 225260,EC:FP7:SSH,FP7-SSH-2007-1,EUROBROADMAP(2009), Université Paris 1 Panthéon-Sorbonne (UP1)-Centre National de la Recherche Scientifique (CNRS)-Université Paris Diderot - Paris 7 (UPD7), Université de Caen Normandie (UNICAEN), Normandie Université (NU)-Normandie Université (NU)-Université Le Havre Normandie (ULH), Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche Interdisciplinaire Homme et Société (IRIHS), Normandie Université (NU), Université Paris 1 Panthéon-Sorbonne (UP1)-Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), and Identité et Différenciation de l’Espace, de l’Environnement et des Sociétés (IDEES)
Subjects: [SHS.STAT]Humanities and Social Sciences/Methods and statistics, spatial analysis, co-citations, textual statistics, [SHS.GEO]Humanities and Social Sciences/Geography, network analysis
Abstract: As demonstrated by the analysis of the general results of the EuroBroadMap survey on 18 countries and 42 places of survey (Didelon & al., 2011; Grasland & Beauguitte, 2012; Brennetot & al., 2013), the declaration by students of the countries and cities where they would or would not like to live is governed at the aggregated level by some general gravity laws. Our contribution proposes three alternative approaches for the analysis of datasets containing explicit or implicit lists of places where the order of the list is possibly important. Much research is based on this type of data, in which individuals are characterised by lists of places with an explicit or implicit ranking. The first hypothesis is that the places given by the respondents are not randomly distributed at the individual level. It means that, even if we do not consider the ranking of answers, an important part of the initial information is lost when we aggregate the lists proposed by individuals to upper levels of social or spatial aggregation. If we ask, for example, 6 students of a given country to quote 3 countries among seven proposed where they would like to live, and we obtain the set of answers (A,B,C), (A,B,C), (A,B,C), (C,D,E), (C,D,E), (C,D,E), we will conclude at an aggregated level that C is the most appreciated country (100% of answers contain the country C) and that A, B, D, E are equally appreciated (50% of answers). But we will lose crucial information about the fact that, apart from C, answers are strongly organised by the association of answers A-B on one hand, and D-E on the other hand. If we now ask the same students to give 3 countries where they would not like to live, with answers (D,E,F), (D,E,F), (D,E,F), (A,B,F), (A,B,F), (A,B,F), it will produce the same loss of information, but we will also miss the existence of interaction between answers to both questions. 
To demonstrate the existence of such effects, we propose to apply a set of methods from textual analysis where each place is considered as a word, each list as a sentence and each set of lists as a paragraph. These methods are able to take into account the contextual effects at different levels and can therefore help to validate or invalidate the existence of interactions between answers, not only at the level of individual lists but also between the different lists. The second hypothesis is the existence of a non-random distribution of ranks within answers. To illustrate this approach, let’s consider the following distribution of answers to the declaration of 3 countries among 8 countries where students would like to live: (A,B,C), (A,B,E), (A,B,F), (G,H,I), (H,I,G), (I,G,H). The aggregated approach will obviously miss a lot of information by concluding that there are only two levels of countries, (A,B,G,H,I) and (C,E,F). Textual analysis will reveal the existence of two separate clusters of associated answers, (A,B,C,D,E,F) against (G,H,I). We propose here to first apply network analysis methods based on the analysis of successive couples of countries, based on ranks, in order to build an oriented network (A is followed by B which is followed by C, which defines only two edges A-B and B-C, optionally completed by A-C). The network of followers in the list of countries reveals differences between countries that were not initially visible: the cluster A,B,C,D,E is based on a “tree” structure whereas the cluster (G,H,I) is rather like a “clique”. The third hypothesis is that the distance between places of a given list is not randomly distributed. We apply here spatial analysis methods in order to verify whether the choice of the places’ ranks is governed or not by rules of distance or other forms of proximity (common language, contiguity, colonial relations). 
For this, we compare the average distance between countries of the same list according to their ranks. Applying this set of methods to the results of the EuroBroadMap survey demonstrates first their great interest but also their complementarity. We suggest in conclusion extending this kind of application to different data corpora, such as co-citations of countries in press articles.
Published: 2013

30. Analyse d'un discours politique présidentiel : étude lexicométrique (Paul Biya, Cameroun, 1982 à 2002)

Author: Ambomo , Claudine, Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours - UFC ( ELLIADD ), Université Bourgogne Franche-Comté ( UBFC ) -Université de Franche-Comté ( UFC ), Université de Franche-Comté, Andrée Chauvin-Vileno, Alpha Ousmane Barry, Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours - UFC (EA 4661) (ELLIADD), Université de Franche-Comté (UFC), Université Bourgogne Franche-Comté [COMUE] (UBFC)-Université Bourgogne Franche-Comté [COMUE] (UBFC), and Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours - UFC (UR 4661) (ELLIADD)
Subjects: Lexicometrics, Analyse du discours, Textual statistics, Lexicométrie, Presidential speech, Paul Biya, Cameroun, Discours présidentiel, Statistique textuelle, Political discourse, [ SHS.LANGUE ] Humanities and Social Sciences/Linguistics, Cameroon, Discours politique, [SHS.LANGUE]Humanities and Social Sciences/Linguistics, Discourse analysis
Abstract: The analysis of a corpus of 297 speeches made by the President of the Republic of Cameroon between 1982 and 2002, using the methods and software of textual statistics and the linguistic concepts of discourse analysis, has brought to light lexical, rhetorical and structural characteristics of Paul Biya’s speeches. After the identification of some of the themes of the corpus, a lexical analysis and a lexicometric study of the evolution of vocabulary were carried out. Statistical methods helped to clarify enunciation through the study of lexical time and adaptation to the target audience. Finally, the analysis of two types of speeches, those made by the President of the Republic every year on February 10, on the eve of the Youth Day celebration, and the end-of-year speeches made on December 31 to the Nation and to foreign diplomats, has shown a diachronic change of vocabulary and a clear adaptation to the audience.; Mobilisant les concepts de l’analyse du discours combinés aux méthodes de la lexicométrie, cette thèse se propose de mettre en lumière, à travers l’analyse d’un corpus de 297 discours, les principales caractéristiques tant lexicales que rhétoriques et structurelles du discours du président de la République du Cameroun, Paul Biya, de 1982 et 2002. Après avoir dégagé quelques-uns des axes thématiques du corpus, on poursuit l’analyse lexicale par l’étude lexicométrique de l’évolution du vocabulaire. Les méthodes statistiques permettent ensuite d’appréhender certains phénomènes énonciatifs en fonction du temps lexical et de l’adaptation à l’auditoire. Enfin, dans un dernier temps, ce sont deux types de discours « rituels » qui sont analysés : les discours à la Jeunesse, prononcés par le président de la République le 10 février de chaque année pour célébrer la fête de la Jeunesse, et les discours de voeux de fin d’année. 
La prise en compte de ces deux types de discours permet de poursuivre l’étude du phénomène d’adaptation à l’auditoire : d’abord en confrontant les discours à la Jeunesse au reste du corpus présidentiel ensuite en faisant contraster les discours de voeux à la Nation avec les discours de voeux au diplomates.
Published: 2013

31. Sémantique interprétative et textométrie – Version abrégée

Author: Bénédicte Pincemin
Subjects: media_common.quotation_subject, corpus linguistics, Art history, text analysis software, contexte textuel, lemmatization, logiciels d’analyse de textes, unités linguistiques, interprétation, textual statistics, textuality, linguistic units, sémantique, semantics, statistiques textuelles, General Environmental Science, media_common, textométrie, textual context, lemmatisation, textualité, Art, logométrie, linguistique de corpus, textometry, General Earth and Planetary Sciences, lexicométrie, Humanities
Abstract: Interpretative Semantics and Textometry (abridged version). Textometry offers a methodology and software tools for analyzing digital corpora, which researchers in interpretative semantics have been using for some fifteen years. This paper aims to explain this success by identifying the main points of agreement between the linguistic theory of interpretative semantics and the founding principles of textometry. The two approaches have much in common: a central place for texts at every stage of the analysis; a concern to stay as close as possible to the text and to avoid reductive preconceptions; the decisive role of the global context built by the reference corpus, which determines the identity and meaning of every sub-part and sub-unit; and reliance on difference as a key to meaning in processes such as contrastive statistics and sorting. As a new generation of textometric software is developed, interpretative semantics is called upon to inform the underlying theoretical thinking.
Published: 2012

32. Propositions pour la détection automatique de la néologie sémantique

Author: Reutenauer, Coralie
Subjects: Sémantique textuelle, Corpus linguistics, Domaines, Textual statistics, Semantic tagging, Semantic neology, Sémème, Text-based semantics, meaning representation, Traitement automatique, Annotation sémantique, Domains, Linguistique de corpus, Textométrie, Néologie
Abstract: Various techniques have been developed and implemented to process formal neology automatically, but the automated processing of semantic neology remains difficult. Work in text semantics (Rastier and Valette 2009) describes the semic mechanisms underlying semantic neology in a way that is both detailed and grounded in practice. This paper presents a prospective approach to detecting semantic neology using statistical methods on corpora, in line with text semantics and in particular the key role it gives to the notion of seme. Within this framework, we developed a computerized procedure that relies on annotating the corpus with semantic features in order to bring semantic neologisms to light. The procedure is applied to a case study: the French adjective toxique in the context of the economic and financial crisis from late 2008 to early 2009.
Published: 2012
Full Text: View/download PDF

33. Discours d'entreprise et organisation de l'information - Apports de la textométrie dans la construction de référentiels terminologiques adaptables au contexte

Author: Erlos, Frédéric, SYLED - Systèmes Linguistiques, Énonciation et Discursivité - EA 2290 (SYLED), Université Sorbonne Nouvelle - Paris 3, Université de la Sorbonne nouvelle - Paris III, André Salem(Andre.Salem@univ-paris3.fr), and Erlos, Frédéric
Subjects: linguistique de corpus, textual statistics, nom propre, textométrie, corpus linguistics, terminology, organisation de l'information, terminologie, information organization, intranet, [SHS.LANGUE]Humanities and Social Sciences/Linguistics, [SHS.LANGUE] Humanities and Social Sciences/Linguistics, proper name
Abstract: Organizing information on an intranet (an organization's internal network, built on Internet technologies) calls for new approaches to the question of how well the structure of intranet sites matches the linguistic usages of their visitors. One way to take these usages into account is to explore textual data representative of a specific communication situation. Such an exploration is carried out with textual statistics tools such as the hierarchical index, concordances, repeated segments, the textual map, co-occurrence and cluster analysis. This corpus-based approach allows us to extract, from texts of corporate communication (annual reports), the lexical units used to build a terminological thesaurus of a particular type. To take the context of communication into account, we suggest using three kinds of markers: the particular ontology of an organization as evoked in its discourse; the pragmatic properties of proper nouns; and the collection of part of the characteristic vocabulary of the source corpus, starting from a selection of proper nouns. This collection is therefore not limited to terminological units; it also contains words of the common language and proper nouns. Elements of the corpus vocabulary are selected according to the type of semantic relations they establish with the proper nouns in the texts. Finally, the results are assessed in terms of productivity, reliability and representativeness.
Published: 2009

34. Approcci statistici e qualitativi all'analisi del contenuto. Similarità, specificità, e una proposta di triangolazione

Author: Alessandri, G. and Tomasetto, Carlo
Subjects: QUALITATIVE CONTENT ANALYSIS, ATLAS.TI, TALTAC, TEXTUAL STATISTICS, GROUNDED THEORY
Abstract: This contribution proposes a way of integrating two different methodological approaches to the examination of textual data: statistically based lexical analysis and computer-assisted qualitative analysis based on the Grounded Theory approach. The theoretical and methodological assumptions of each are outlined, with particular attention to their respective heuristic potential. Using a large data set of 450 open-ended questionnaires completed by teachers and auxiliary staff from first- and second-level schools, a possible path of triangulation between the two approaches is presented, based on two dedicated software packages, TALTAC and ATLAS.ti. The conclusions also discuss the limits and respective strengths of each approach and the usefulness of such procedures in psychological research.
Published: 2008

35. L'analyse de données textuelles aujourd'hui : du corpus comme une urne au corpus comme un plan

Author: Mayaffre, Damon, BCL, équipe Logométrie : corpus, traitements, modèles, Bases, Corpus, Langage (UMR 7320 - UCA / CNRS) (BCL), Université Côte d'Azur (UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Nice Sophia Antipolis (... - 2019) (UNS), and COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)
Subjects: textométrie, Text Topology, Text Topography, ADT, Topologie textuelle, Lexico, logométrie, topographie textuelle, Textual Data Analysis, Textual Statistics, statistique textuelle, lexicométrie, [SHS.LANGUE]Humanities and Social Sciences/Linguistics, Hyperbase
Abstract: Textual data analysis and lexical statistics now seek to treat the text as an ordered structure and an organized space. Textual data analysis software such as Lexico and Hyperbase can now reveal the linear organization and progression of texts and corpora. Textual data analysis thus converges with traditional text linguistics in its fundamental concerns.
Published: 2007

36. Professions de foi et élection présidentielle de 2002. Une approche verbale de la distance entre candidats

Author: Jean-Michel Rampon, Equipe de recherche de Lyon en sciences de l'information et de la communication (ELICO), Sciences Po Lyon - Institut d'études politiques de Lyon (IEP Lyon), Université de Lyon-Université de Lyon-École nationale supérieure des sciences de l'information et des bibliothèques (ENSSIB), Université de Lyon-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Université Jean Moulin - Lyon 3 (UJML), Université de Lyon-Université Lumière - Lyon 2 (UL2), Université Lumière - Lyon 2 (UL2)-École nationale supérieure des sciences de l'information et des bibliothèques (ENSSIB), Université de Lyon-Université de Lyon-Sciences Po Lyon - Institut d'études politiques de Lyon (IEP Lyon), and Université de Lyon
Subjects: morphologie lexicale, verbe, verb, élection présidentielle, estadística textual, Mechanical Engineering, [SHS.INFO]Humanities and Social Sciences/Library and information sciences, Energy Engineering and Power Technology, Management Science and Operations Research, morfología y léxico, Lexicométrie, Profession de foi, presidential election, textual statistics, lexical morphology, profession de foi, statistique textuelle, proclamación electoral, Election présidentielle française, electoral leaflet, elecciones presidenciales, verbo, ComputingMilieux_MISCELLANEOUS
Abstract: Based on the candidates' election manifestos (professions de foi) for the first round of the 2002 French presidential election, this paper tests a strong hypothesis: lexical proximities are correlated with the candidates' proximities on the political spectrum. To this end, the analysis relies on verbs in the infinitive, which are characteristic of electoral leaflets, and on "ordinary" verbs, with which related terms have been associated.
Published: 2006
Full Text: View/download PDF

37. Métodos de análisis de las verbalizaciones: ontologías y proceso de abducción

Author: Escalante Gómez, Eduardo
Abstract: This study argues for the need to develop what are known as ontologies as a way to process natural language and as tools for exploring and analyzing the verbalizations obtained through in-depth interviews and focus groups. Each ontology operates within an abductive process, which allows the initial ontology to be enriched. The aim is not to construct an ontology for computerized formalization; rather, the concept of ontology is used because of its pertinence for representing a body of knowledge selected from a specific domain of reality that is of interest. The hypothesis of the text is that the analysis of subjects' verbalizations in a research process should follow the abductive method, which begins with the construction of an E-ontology and then develops into a recursive process of refinement and reconfiguration of what Samaja (2003) calls modeling preconceptions.
Published: 2010

38. Statistica testuale e text mining: alcuni paradigmi applicativi

Author: Bolasco, Sergio
Subjects: Textual statistics, text mining, automatic textual data analysis, lexical analysis, information extraction
Published: 2005

39. Approches quantitatives de l'extraction de ressources traductionnelles à partir de corpus parallèles

Author: Zimina-Poirot, Maria, SYLED - Systèmes Linguistiques, Énonciation et Discursivité - EA 2290 (SYLED), Université Sorbonne Nouvelle - Paris 3, Université de la Sorbonne nouvelle - Paris III, Salem André(salem@univ-paris3.fr), and Zimina-Poirot, Maria
Subjects: correspondances traductionnelles, translation correspondences, topographie textuelle, textométrie, alignement, parallel corpora, alignment, textual topography, [SHS.LANGUE] Humanities and Social Sciences/Linguistics, [SHS]Humanities and Social Sciences, textual statistics, bi-texte, bi-text, [SHS] Humanities and Social Sciences, statistique textuelle, [SHS.LANGUE]Humanities and Social Sciences/Linguistics, textometrics, corpus parallèles
Abstract: This work presents the results of a series of experiments devoted to developing a new family of tools for the intertextual textometric exploration of translation corpora. Various methods of textual statistics have been adapted to a multilingual context and applied to parallel text processing: repeated segment extraction, characteristic element computation (specificities), bi-textual topography, multiple co-occurrences, correspondence factor analysis, automatic classification, etc. The use of each method in a multilingual context is illustrated by examples of concrete applications, accompanied by sample translation resources obtained on a quantitative basis from the parallel French/English corpus of the Convention for the Protection of Human Rights. The approach offers translators, foreign language teachers, terminologists, lexicographers and other users automated means of exploring the structure of lexical equivalences in translation corpora.
Published: 2004

40. Stylistique et statistique textuelle : à partir de l'article de C. Muller sur les pronoms de dialogue

Author: Malrieu, Denise, Modèles, Dynamiques, Corpus (MoDyCo), Université Paris Nanterre (UPN)-Centre National de la Recherche Scientifique (CNRS), IRISA, INRIA, and Malrieu, Denise
Subjects: pronoms et adjectifs de personne, textual genres, théâtre classique, textual statistics, interpretative semantics, classic theater, sémantique interprétative, genres textuels, personal pronouns and adjectives, statistique textuelle, [SHS.LANGUE]Humanities and Social Sciences/Linguistics, [SHS.LANGUE] Humanities and Social Sciences/Linguistics
Abstract: Starting from the approach adopted by C. Muller to build stylistic indices from index frequencies, we discuss the methodological problems involved in designing such indices: the nature of the working and reference corpora; the nature of the variables taken into account; and the problem of the semantic ambiguity of grammemes across textual genres and textual sequences, in particular the dialogic status of personal pronouns. Since the aim is to move from lexical statistics to textual statistics, we develop this discussion through the concrete example of the indices proposed by C. Muller, proposing other indices that can be computed from the outputs of CODIAL Analyseur and Hyperbase, and suggesting directions for development that take into account locality constraints and variable scopes.
Published: 2002

41. Information Extraction from Free Text Medical Data

Author: Hercigonja-Szekeres, Mira, Jureša, Vesna, Kern, Josipa, Ivanković, Davor, and Vuletić, Silvije
Subjects: medical record, free text medical data, textual statistics
Abstract: A glance at any medical record shows that free text is the basic data type in a GP's office. It is full of words, phrases and abbreviations, and is often chaotic, depending on the GP's habits. A physician specializing in school medicine built a large database of records of children undergoing pre-school examination, containing both free text and coded data. The data were analysed with SPAD.T, a software package for the statistical analysis of textual data. The first results, the vocabulary statistics, showed two things: a relatively large total number of words and a small number of distinct words. Correspondence analysis was also applied to tables crossing vocabulary and individuals. The most remarkable feature was that the result persisted in the same success/failure group. Another interesting feature was the clouds of words around the success/failure categories. The value of this analysis is that it revealed results not easily seen in numerical or coded textual data (hidden information).
Published: 2000

42. Analyser un corpus illisible ? Le logiciel Alceste confronté à des registres de condoléances

Author: Truc, Gérôme
Abstract: Can one analyze an unreadable corpus? Using Alceste to analyze a condolence book. Condolence books are thought to be a difficult primary source for the researcher. The formalism and repetitiveness of the messages they contain, along with their sheer volume, seem to make any exhaustive reading impossible and any attempt at content analysis vain. This article first reviews the various strategies used to get around this difficulty, then examines to what extent Alceste, the textual statistics software developed by Max Reinert, can be particularly relevant for this specific type of corpus. The discussion is grounded in an Alceste analysis of a collection of almost 60,000 electronic condolence messages written in the aftermath of the Madrid terrorist attacks of 11 March 2004, whose main results are presented in conclusion.

43. Words for Ills: Diagnostic Theories and Health Problems

Author: Béliard, Aude, Eideliman, Jean-Sébastien, and Jacobs, Amy
Published: 2014
