186 results on '"Michal Ptaszynski"'
Search Results
102. Deep Learning for Information Triage on Twitter
- Author
-
Fumito Masui, Kiyoshi Takahashi, Hiroshi Hayakawa, Yuuto Fukushima, Shunzo Kawajiri, Yuuto Oikawa, Michal Ptaszynski, and Yasunori Miyamori
- Subjects
Technology ,Computer science ,QH301-705.5 ,QC1-999 ,050801 communication & media studies ,Convolutional neural network ,0508 media and communications ,Credibility ,General Materials Science ,natural language processing ,Biology (General) ,Natural disaster ,Instrumentation ,QD1-999 ,disaster detection ,Fluid Flow and Transfer Processes ,business.industry ,Event (computing) ,Process Chemistry and Technology ,Deep learning ,Physics ,05 social sciences ,Sentiment analysis ,General Engineering ,050301 education ,deep learning ,Engineering (General). Civil engineering (General) ,Data science ,Triage ,Computer Science Applications ,Classified information ,Chemistry ,Artificial intelligence ,information triage ,TA1-2040 ,business ,0503 education - Abstract
In this paper, we present a Deep Learning-based system for the support of information triaging on Twitter during emergency situations, such as disasters, or other influential events, such as political elections. The system is based on the assumption that a different type of information is required right after the event and some time after the event occurs. In a preliminary study, we analyze the language behavior of Twitter users during two kinds of influential events, namely, natural disasters and political elections. In the study, we analyze the credibility of information included by users in tweets in the above-mentioned situations, by classifying the information into two kinds: Primary Information (first-hand reports) and Secondary Information (second-hand reports, retweets, etc.). We also perform sentiment analysis of the data to check user attitudes toward the occurring events. Next, we present the structure of the system and compare a number of classifiers, including the proposed one based on Convolutional Neural Networks. Finally, we validate the system by performing an in-depth analysis of information obtained after a number of additional events, including an eruption of a Japanese volcano Ontake on 27 September 2014, as well as heavy rains and typhoons that occurred in 2020. We confirm that the methods works sufficiently well even when trained on data from nearly 10 years ago, which strongly suggests that the model is well-generalized and sufficiently grasps important aspects of each type of classified information.
- Published
- 2021
103. Quality Improvement of a Gear Transmission by Means of Genetic Algorithm
- Author
-
Edward Lisowski, Fumito Masui, Michal Ptaszynski, M. Domagala, Grzegorz Filo, Paweł Lempa, and Joanna Fabiś-Domagała
- Subjects
Quality management ,05 social sciences ,0211 other engineering and technologies ,02 engineering and technology ,Gear transmission ,gear transmision ,Industrial engineering ,0502 economics and business ,Genetic algorithm ,genetic algorithm ,021108 energy ,Business management ,optimization ,050203 business & management - Abstract
The article deals with the issue of quality improvement of a gear transmission by optimizing its geometry with the use of genetic algorithms. The optimization method is focused on increasing productivity and efficiency of the pump and reducing its pulsation. The best results are tested on mathematical model and automatically modelled in 3D be means of PTC Creo Software. The developed solution proved to be an effective tool in the search for better results, which greatly improved parameters of pump especially reduced flow pulsation.
- Published
- 2019
104. Brute-Force Sentence Pattern Extortion from Harmful Messages for Cyberbullying Detection
- Author
-
Fumito Masui, Gniewosz Leliwa, Yasutomo Kimura, Rafal Rzepka, Paweł Lempa, Michal Ptaszynski, Michal Wroczynski, and Kenji Araki
- Subjects
Extortion ,Automatic Cyberbullying Detection ,Brute force ,Computer science ,Language Combinatorics ,Computer security ,computer.software_genre ,computer ,Sentence ,Computer Science Applications ,Information Systems ,Natural Language Processing - Abstract
Cyberbullying, or humiliating people using the Internet, has existed almost since the beginning of Internet communication.The relatively recent introduction of smartphones and tablet computers has caused cyberbullying to evolve into a serious social problem. In Japan, members of a parent-teacher association (PTA)attempted to address the problem by scanning the Internet for cyber bullying entries. To help these PTA members and other interested parties confront this difficult task we propose a novel method for automatic detection of malicious Internet content. This method is based on a combinatorial approach resembling brute-force search algorithms, but applied in language classification. The method extracts sophisticated patterns from sentences and uses them in classification. The experiments performed on actual cyberbullying data reveal an advantage of our method vis-à-visprevious methods. Next, we implemented the method into an application for Android smartphones to automatically detect possible harmful content in messages. The method performed well in the Android environment, but still needs to be optimized for time efficiency in order to be used in practice
- Published
- 2019
105. Towards Socialized Machines: Emotions and Sense of Humour in Conversational Agents
- Author
-
Kenji Araki, Pawel Dybala, Rafal Rzepka, Shinsuke Higuhi, Michal Ptaszynski, and Wenhan Shi
- Subjects
Communication ,business.industry ,Psychology ,business - Abstract
5.1 The socialized conversational agent algorithm With already started attempts of combining the systems described above, we decided to combine them all and make the next step on the path to a naturally chatting agent. As the first step we decided to implement the following algorithm (see Figure 7): first, the emotive recogniser tries to categorize the emotion included inside the user's utterance. Here
- Published
- 2021
106. Predicting Increase in Demand for Public Buses in University Students Daily Life Needs: Case Study Based on a City in Japan
- Author
-
Fumito Masui, Michal Ptaszynski, and Ali Bakdur
- Subjects
Status quo ,media_common.quotation_subject ,Geography, Planning and Development ,Population ,TJ807-830 ,Management, Monitoring, Policy and Law ,TD194-195 ,Adaptability ,Renewable energy sources ,0502 economics and business ,predictive analysis ,student decision ,qualitative analysis ,statistical test ,GE1-350 ,Marketing ,Everyday life ,education ,media_common ,050210 logistics & transportation ,education.field_of_study ,Environmental effects of industries and plants ,Renewable Energy, Sustainability and the Environment ,business.industry ,05 social sciences ,Flexibility (personality) ,Usability ,Environmental sciences ,Business ,Rural area ,Prejudice ,050212 sport, leisure & tourism ,public bus services - Abstract
Accessibility and economic sustainability of public bus services (PBS) have been in a continuous decline in Japan’s countryside. Rural cities also suffer from population transformation toward industrial centers experiencing rapid economic growth. In the present study, we reviewed the current demand status of PBS in Kitami, a rural city in Japan that hosts a national university. The investigation was performed by examining students’ daily lives using a survey to collect data representing a portion of the population. The objective was to predict the change in demand rate for PBS concerning the necessities of everyday life from the perspective of university students as potential users of PBS. Intuitively, decision-makers at every level display a distinct prejudice toward alternatives that intend to change the long-lasting status quo, hence in the question sequence, a two-step verification probe was used to reveal a person’s actual perceived opinion. Accordingly, the respondents’ initial demand rate for PBS was around 60%, however, this score increased to 71% in the secondary confirmation. Afterward, using machine learning-based prediction methods, we could predict this demand at over 90% of F-measure, with the most reliable and stable prediction method reaching 80% by other daily life indicators’ weight. Finally, we supplied thorough evidence for our approach’s usability by collecting and processing the data’s right set regarding this study’s objective. This method’s highlighted outcomes would help to reduce the local governments’ and relevant initiatives’ adaptability time to demands and improve decision-making flexibility.
- Published
- 2021
- Full Text
- View/download PDF
107. Predicting increase of demand for public buses in university students > daily life needs: Case Study Based on a City in Japan
- Author
-
Ali Bakdur, Fumito Masui, Michal Ptaszynski, Ali Bakdur, Fumito Masui, and Michal Ptaszynski
- Abstract
Accessibility and economic sustainability of public bus services (PBS) have been in a continuous decline in Japan’s countryside. Rural cities also suffer from population transformation toward industrial centers experiencing rapid economic growth. In the present study, we reviewed the current demand status of PBS in Kitami, a rural city in Japan that hosts a national university. The investigation was performed by examining students’ daily lives using a survey to collect data representing a portion of the population. The objective was to predict the change in demand rate for PBS concerning the necessities of everyday life from the perspective of university students as potential users of PBS. Intuitively, decision-makers at every level display a distinct prejudice toward alternatives that intend to change the long-lasting status quo, hence in the question sequence, a two-step verification probe was used to reveal a person’s actual perceived opinion. Accordingly, the respondents’ initial demand rate for PBS was around 60%; however, this score increased to 71% in the secondary confirmation. Afterward, using machine learning-based prediction methods, we could predict this demand at over 90% of F-measure, with the most reliable and stable prediction method reaching 80% by other daily life indicators’ weight. Finally, we supplied thorough evidence for our approach’s usability by collecting and processing the data’s right set regarding this study’s objective. This method’s highlighted outcomes would help to reduce the local governments’ and relevant initiatives’ adaptability time to demands and improve decision-making flexibility.
- Published
- 2021
108. Multi-humoroid: joking system that reacts with humor to humans' bad moods.
- Author
-
Pawel Dybala, Michal Ptaszynski, Rafal Rzepka, and Kenji Araki
- Published
- 2010
109. Humoroids: conversational agents that induce positive emotions with humor.
- Author
-
Pawel Dybala, Michal Ptaszynski, Rafal Rzepka, and Kenji Araki
- Published
- 2009
110. Serious processing for frivolous purpose: a chatbot using web-mining supported affect analysis and pun generation.
- Author
-
Rafal Rzepka, Wenhan Shi, Michal Ptaszynski, Pawel Dybala, Shinsuke Higuchi, and Kenji Araki
- Published
- 2009
- Full Text
- View/download PDF
111. Towards Better Text Processing Tools for the Ainu Language
- Author
-
Fumito Masui, Michal Ptaszynski, and Karol Nowakowski
- Subjects
0209 industrial biotechnology ,Training set ,Part-of-speech tagging ,business.industry ,Computer science ,Text segmentation ,02 engineering and technology ,computer.software_genre ,020901 industrial engineering & automation ,Text processing ,Transcription (linguistics) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Language model ,Artificial intelligence ,Overall performance ,business ,computer ,Natural language processing - Abstract
In this paper we present our research devoted to the development of Natural Language Processing technologies for the Ainu language, a critically endangered language isolate spoken by the Ainu people, the native inhabitants of northern parts of the Japanese archipelago. In particular, we focused on improving the existing tools for transcription normalization, word segmentation (tokenization) and part-of-speech tagging. In the experiments we applied two Ainu language dictionaries from different domains (literary and colloquial) and created a new data set by combining them. The experiments confirmed the positive effect of these modifications on the overall performance of the tools, especially with objective samples unrelated to the training data. We also discuss further improvements obtained by applying corpus-driven language models to the problem of word segmentation and using a state-of-the-art tool for training part-of-speech taggers.
- Published
- 2020
- Full Text
- View/download PDF
112. Epistolary Education in 21st Century: A System to Support Composition of E-mails by Students to Superiors in Japanese
- Author
-
Michal Ptaszynski and Kenji Ryu
- Subjects
Literacy education ,business.industry ,Computer science ,Mathematics education ,The Internet ,business ,Set (psychology) ,Composition (language) - Abstract
E-mail is a communication tool widely used by people of all ages on the Internet today, often in business and formal situations, especially in Japan. Moreover, Japanese E-mail communication has a set of specific rules taught using specialized guidebooks. E-mail literacy education for many Japanese students is typically provided in a traditional, yet inefficient lecture-based way. We propose a system to support Japanese students in writing E-mails to superiors (teachers, job hunting representatives, etc.). We firstly make an investigation into the importance of formal E-mails in Japan, and what is needed to successfully write a formal E-mail. Next, we develop the system with accordance to those rules. Finally, we evaluated the system twofold. The results, although performed on a small number of samples, were generally positive, and clearly indicated additional ways to improve the system.
- Published
- 2020
- Full Text
- View/download PDF
113. Brute Force Search Method for Cyberbullying Detection
- Author
-
Fumito Masui and Michal Ptaszynski
- Subjects
Computer science ,05 social sciences ,050301 education ,Brute-force search ,0501 psychology and cognitive sciences ,0503 education ,Algorithm ,050107 human factors - Abstract
In this chapter, the authors present a method for automatic detection of malicious internet contents, based on a combinatorial approach resembling brute force search algorithms, with application to language classification. The method automatically extracts sophisticated patterns from sentences and applies them in classification. The experiments performed on actual cyberbullying data showed advantage of this method to previous methods, including the one described in Chapter 4. Pros and cons of this method when compared to previous ones are also discussed in this chapter.
- Published
- 2020
- Full Text
- View/download PDF
114. Development of a dialogue-based guidance system for narrow area navigation
- Author
-
Fumito Masui, Michal Ptaszynski, and Yasuhiro Yoshida
- Subjects
0209 industrial biotechnology ,Multimedia ,Computer science ,Navigation system ,Satellite system ,02 engineering and technology ,Library and Information Sciences ,Management Science and Operations Research ,computer.software_genre ,Viewpoints ,Computer Science Applications ,Task (project management) ,020901 industrial engineering & automation ,GNSS applications ,ComputingMilieux_COMPUTERSANDEDUCATION ,Media Technology ,Area navigation ,Guidance system ,computer ,Natural language ,Information Systems - Abstract
In this paper, we propose a dialogue-based navigation system for narrow areas (e.g., buildings, etc.). In particular, we develop Campus Navigation Partner System (CNPS), intended for the use by freshmen (first-year students) who are likely to get lost in the new environment on the university campus (with the campus of Kitami Institute of Technology set as the experimental environment). The CNPS operates on a smartphone and has the functions of facility navigation and indoor information presentation. Indoor environment navigation is typically a problematic task because GNSS (Global Navigation Satellite System) signals cannot be received with sufficient accuracy within concrete walls. Therefore, we utilized natural language dialogue as a navigation method that guides the users within the facility using navigation messages and images. To investigate the effectiveness of the constructed system, we conducted practical experiments with freshmen and senior students of the university and evaluated the performance of the navigation functionality. Moreover, we investigated the general impressions of the system which we evaluated separately with both freshmen and senior students to clarify the differences in evaluation viewpoints between freshmen and users experienced in the campus environment (graduate students, etc.).
- Published
- 2021
- Full Text
- View/download PDF
115. Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection
- Author
-
Zheng Lin Chia, Michal Wroczynski, Fumito Masui, Michal Ptaszynski, and Gniewosz Leliwa
- Subjects
Feature engineering ,Computer science ,media_common.quotation_subject ,02 engineering and technology ,Library and Information Sciences ,Management Science and Operations Research ,Machine learning ,computer.software_genre ,Data type ,Task (project management) ,Set (abstract data type) ,Similarity (psychology) ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,Preprocessor ,media_common ,Sarcasm ,business.industry ,05 social sciences ,Computer Science Applications ,Irony ,020201 artificial intelligence & image processing ,Artificial intelligence ,0509 other social sciences ,050904 information & library sciences ,business ,computer ,Information Systems - Abstract
Irony and sarcasm detection is considered a complex task in Natural Language Processing. This paper set out to explore the sarcasm and irony on Twitter, using Machine Learning and Feature Engineering techniques. First we review and clarify the definition of irony and sarcasm by discussing various studies focusing on the terms. Next the first experiment is conducted comparing between various types of classification methods including some popular classifiers for text classification task. For the second experiment, different types of data preprocessing methods were compared and analyzed. Finally, the relationship between irony, sarcasm, and cyberbullying are discussed. The results are interesting as we observed high similarity between them.
- Published
- 2021
- Full Text
- View/download PDF
116. A Method for Detecting Harmful Entries on Informal School Websites Using Morphosemantic Patterns
- Author
-
Yoko Nakajima, Kenji Araki, Michal Ptaszynski, Rafal Rzepka, Fumito Masui, and Yasutomo Kimura
- Subjects
Multimedia ,Computer science ,05 social sciences ,02 engineering and technology ,computer.software_genre ,Human-Computer Interaction ,World Wide Web ,Semantic role labeling ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,0501 psychology and cognitive sciences ,Computer Vision and Pattern Recognition ,computer ,050104 developmental & child psychology - Abstract
This paper presents a novel method of analyzing morphosemantic patterns in language to the detect cyberbullying, or frequently appearing harmful messages and entries that aim to humiliate other users. The morphosemantic patterns represent a novel concept, with the assumption that analyzed elements can be perceived as a combination of morphological information, such as parts of speech, and semantic information, such as semantic roles, categories, etc. The patterns are further automatically extracted from the data containing harmful entries (cyberbullying) and non-harmful entries found on the informal websites of Japanese high schools. These website data were prepared and standardized by the Human Rights Center in Mie Prefecture, Japan. The patterns extracted in this way are further applied to a document classification task using the provided data in 10-fold cross-validation. The results indicate that morphosemantic sentence representation can be considered useful in the task of detecting the deceptive and provocative language used in cyberbullying.
- Published
- 2017
- Full Text
- View/download PDF
117. Cohousing – idea współzamieszkania
- Author
-
Michal Ptaszynski and Magdalena Jagiełło-Kowalczyk
- Subjects
Environmental ethics ,Sociology ,Cohousing - Abstract
BIM w planowaniu przestrzennym – omowienie mozliwości wykorzystania makiet BIM w procesie optymalizacji zarządzania przestrzenią w skali urbanistycznej
- Published
- 2017
- Full Text
- View/download PDF
118. Subjective? Emotional? Emotive?: Language Combinatorics based Automatic Detection of Emotionally Loaded Sentences
- Author
-
Rzepka Rafal, Masui Fumito, Michal Ptaszynski, and Araki Kenji
- Subjects
Point (typography) ,Recall ,Computer science ,Language Combinatorics ,Sentiment analysis ,Context (language use) ,Language Modeling ,Combinatorics ,Emotive ,Emotive Expressions ,Pattern Extraction ,Fully automatic ,Sentiment Analysis ,State (computer science) ,Language model ,Affect Analysis - Abstract
In this paper presents our research in automatic detection of emotionally loaded, or emotive sentences. We define the problem from a linguistic point of view assuming that emotive sentences stand out both lexically and grammatically. To verify this assumption we prepare a text classification experiment. In the experiment we apply language combinatorics approach to automatically extract emotive patterns from training sentences. The applied approach allows automatic extraction of not only widely used unigrams (tokens), or n-grams, but also more sophisticated patterns with disjointed elements. The results of experiments are explained with the use of means such as standard Precision, Recall and balanced F-score. The algorithm also provides a refined list of most frequent sophisticated patterns typical for both emotive and non-emotive context. The method reached results comparable to the state of the art, while the fact that it is fully automatic makes it more efficient and language independent.
- Published
- 2017
119. Emotional and Moral Impressions Associated with Buddhist Religious Terms in Japanese Blogs-Preliminary Analysis
- Author
-
Michal Ptaszynski, Rafal Rzepka, Kenji Araki, Jagna Nieuważny, Fumito Masui, and Karol Nowakowski
- Subjects
Vocabulary ,Social space ,Point (typography) ,media_common.quotation_subject ,Scale (social sciences) ,Buddhism ,Machine ethics ,Consciousness ,Psychology ,Social psychology ,media_common ,Terminology - Abstract
This paper is an attempt at analyzing how much religious vocabulary (in this case Buddhist vocabulary taken from a large scale dictionary of Buddhist terms available online) is present in everyday Japanese social space (in this case in a repository of blog entries form the Ameba blog service) and thus in the consciousness of people. We also investigate and what associations (positive or negative) it generates, thus indicating the connotations associated with several Buddhist terms – whether expressions containing Buddhist vocabulary are considered proper or not from a moral point of view – as well as the emotional response of Internet users to Buddhist terminology.
- Published
- 2019
- Full Text
- View/download PDF
120. Applying Support Vector Machines to POS tagging of the Ainu Language
- Author
-
Yoshio Momouchi, Karol Nowakowski, Michal Ptaszynski, and Fumito Masui
- Subjects
Support vector machine ,Annotation ,Computer science ,business.industry ,Artificial intelligence ,computer.software_genre ,business ,computer ,Natural language processing ,Task (project management) - Abstract
We describe our attempt to apply a state-of-the-art sequential tagger – SVMTool – in the task of automatic part-of-speech annotation of the Ainu language, a critically endangered language isolate spoken by the native inhabitants of northern Japan. Our experiments indicated that it performs better than the custom system proposed in previous research (POST-AL), especially when applied to out-of-domain data. The biggest advantage of the model trained using SVMTool over the POST-AL tagger is its ability to guess part-of-speech tags for OoV words, with the accuracy of up to 63%.
- Published
- 2019
- Full Text
- View/download PDF
121. PolEval 2019 : the next chapter in evaluating Natural Language Processing tools for Polish
- Author
-
Aleksander Smywiński-Pohl, Łukasz Kobyliński, Maciej Ogrodniczuk, Jan Kocoń, Michał Marcińczuk, Krzysztof Wołk, Danijel Koržinek, Michal Ptaszynski, Agata Pieciukiewicz, and Paweł Dybała
- Abstract
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish. Submitted tools compete againstone another within certain tasks selected by organizers, using available data and are evaluated according to pre-established procedures.It is organized since 2017 and each year the winning systems become the state-of-the-art in Polish language processing in the respectivetasks. In 2019 we have organized six different tasks, creating an even greater opportunity for NLP researchers to evaluate their systemsin an objective manner.
- Published
- 2019
122. Study on Game Information Analysis for Support to Tactics and Strategies in Curling
- Author
-
Hiromu Otani, Michal Ptaszynski, Fumito Masui, and Hitoshi Yanagi
- Subjects
Computer science ,Shot (filmmaking) ,Factor (programming language) ,Informatics ,Applied psychology ,ComputingMilieux_PERSONALCOMPUTING ,Information analysis ,Subject (documents) ,Physical factor ,computer ,computer.programming_language ,Curling - Abstract
Curling is a winter sport often referred to as “chess on ice”. There are three factors influencing game performance in this game: physical factor, human factor, and strategic/tactical factor. The strategic/tactical factor is considered as the most important at the top level. In this paper, we report on new knowledge we obtained regarding the relationship between shot accuracy and difference in game score, and difference in correlation for each level in 378 games, covering around 60,000 shots as a research subject of Curling Informatics project.
- Published
- 2019
- Full Text
- View/download PDF
123. Statistical Analysis of Automatic Seed Word Acquisition to Improve Harmful Expression Extraction in Cyberbullying Detection
- Author
-
Suzuha Hatakeyama, Fumito Masui, Michal Ptaszynski, and Kazuhide Yamamoto
- Subjects
lcsh:T ,lcsh:TA1-2040 ,SO-PMI-IR ,information extraction ,text mining ,lcsh:Engineering (General). Civil engineering (General) ,seed word ,lcsh:Technology ,cyberbullying - Abstract
We study the social problem of cyberbullying, defined as a new form of bullying that takes place in the Internet space. This paper proposes a method for automatic acquisition of seed words to improve performance of the original method for the cyberbullying detection by Nitta et al. [1]. We conduct an experiment exactly in the same settings to find out that the method based on a Web mining technique, lost over 30% points of its performance since being proposed in 2013. Thus, we hypothesize on the reasons for the decrease in the performance and propose a number of improvements, from which we experimentally choose the best one. Furthermore, we collect several seed word sets using different approaches, evaluate and their precision. We found out that the influential factor in extraction of harmful expressions is not the number of seed words, but the way the seed words were collected and filtered.
- Published
- 2016
124. Sustainable cyberbullying detection with category-maximized relevance of harmful phrases and double-filtered automatic optimization
- Author
-
Fumito Masui, Taisei Nitta, Kenji Araki, Suzuha Hatakeyama, Rafal Rzepka, Yasutomo Kimura, and Michal Ptaszynski
- Subjects
Multimedia ,business.industry ,media_common.quotation_subject ,05 social sciences ,050301 education ,02 engineering and technology ,computer.software_genre ,Machine learning ,Education ,Task (project management) ,Human-Computer Interaction ,020204 information systems ,Reading (process) ,0202 electrical engineering, electronic engineering, information engineering ,Relevance (information retrieval) ,The Internet ,Artificial intelligence ,business ,Baseline (configuration management) ,Psychology ,Association (psychology) ,0503 education ,computer ,media_common - Abstract
We developed a supporting solution for “cyberbullying” prevention based on recent discoveries in Artificial Intelligence and Natural Language Processing. Cyberbullying, defined as using the Internet to humiliate and slander other people has become a serious problem. In Japan members of the Parent–Teacher Association manually perform Web monitoring to stop cyberbullying activities. Unfortunately, reading through the whole Web manually is an impossible task. Although the complexity of cyberbullying makes it a problem unsolvable solely with the help of technology, we found that technology could make cyberbullying prevention more efficient. We developed a novel method of automatic detection of cyberbullying entries on the Internet. In the method we use seed words from three categories to calculate a semantic orientation score and then maximize the relevance of categories. The proposed method outperformed baseline settings in both laboratory and real world conditions. The developed system was deployed and tested in practice. After a year of testing we noticed a greater than 30 percent-point-drop in its performance. We hypothesize on the reasons for the drop. To regain the lost performance and retain it in the future we propose additional improvements including automatic acquisition and filtering of seed words. Experimentally selected optimal improvements regained much of the lost performance.
- Published
- 2016
- Full Text
- View/download PDF
125. Informatics to Support Tactics and Strategies in Curling
- Author
-
Hiromu Otani, Fumito Masui, Hitoshi Yanagi, Michal Ptaszynski, and Kohsuke Hirata
- Subjects
Engineering ,business.industry ,Mechanical Engineering ,ComputingMilieux_PERSONALCOMPUTING ,030229 sport sciences ,01 natural sciences ,Industrial and Manufacturing Engineering ,Curling ,03 medical and health sciences ,Engineering management ,0302 clinical medicine ,Informatics ,0103 physical sciences ,Artificial intelligence ,010306 general physics ,business - Abstract
This paper presents game information analysis by utilizing a digital scorebook system, as the first tool for curling informatics, which supports coaches and players in realizing smart tactics and strategies in the sport of curling. Our research project, called “Curling Informatics,” aims to develop an environment to support curling strategies and tactics by realizing methods to record game information, perform analysis, and provide visualization and sharing of the information.We found a significant correlation between the differences in shot accuracies and scores from game information collected by our digital scorebook system for more than 200 games played by the Japanese national class. The results suggest that the difference in shot accuracies is related to the difference in game scores. This is valuable new knowledge to support strategic/tactical planning in curling games. However, the correlation for games involving world-class teams becomes weaker than for the Japanese national class because there is scarcely any difference in shot accuracies.
- Published
- 2016
- Full Text
- View/download PDF
126. HEMOS: A novel deep learning-based fine-grained humor detecting method for sentiment analysis of social media
- Author
-
Rafal Rzepka, Da Li, Michal Ptaszynski, and Kenji Araki
- Subjects
Computer science ,Emoji ,media_common.quotation_subject ,02 engineering and technology ,Library and Information Sciences ,Management Science and Operations Research ,Lexicon ,computer.software_genre ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,media_common ,Social network ,business.industry ,Deep learning ,Sentiment analysis ,Computer Science Applications ,Recurrent neural network ,Slang ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Internet slang ,computer ,Natural language processing ,Information Systems - Abstract
In this paper we introduce HEMOS (Humor-EMOji-Slang-based) system for fine-grained sentiment classification for the Chinese language using deep learning approach. We investigate the importance of recognizing the influence of humor, pictograms and slang on the task of affective processing of the social media. In the first step, we collected 576 frequent Internet slang expressions as a slang lexicon; then, we converted 109 Weibo emojis into textual features creating a Chinese emoji lexicon. In the next step, by performing two polarity annotations with new “optimistic humorous type” and “pessimistic humorous type” added to standard “positive” and “negative” sentiment categories, we applied both lexicons to attention-based bi-directional long short-term memory recurrent neural network (AttBiLSTM) and tested its performance on undersized labeled data. Our experimental results show that the proposed method can significantly improve the state-of-the-art methods in predicting sentiment polarity on Weibo, the largest Chinese social network.
- Published
- 2020
- Full Text
- View/download PDF
127. Emoticon-Aware Recurrent Neural Network Model for Chinese Sentiment Analysis
- Author
-
Michal Ptaszynski, Rafal Rzepka, Kenji Araki, and Da Li
- Subjects
business.industry ,Computer science ,Polarity (physics) ,Sentiment analysis ,02 engineering and technology ,computer.software_genre ,Expression (mathematics) ,Textual information ,Recurrent neural network ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Emoticon ,Social media ,Artificial intelligence ,business ,Recurrent neural network model ,computer ,Natural language processing - Abstract
Pictograms (emoticons/emojis) have been widely used in social media as a mean for graphical expression of emotions. People can express delicate nuances through textual information when supported with emoticons, and the effectiveness of computer-mediated communication (CMC) is also improved. Therefore it is important to fully understand the influence of emoticons on CMC. In this paper, we propose an emoticon polarity-aware recurrent neural network method for sentiment analysis of Weibo, a Chinese social media platform. In the first step, we analyzed the usage of 67 emoticons with racial expression used on Weibo. By performing a polarity annotation with a new “humorous type” added, we have confirmed that 23 emoticons can be considered more as humorous than positive or negative. On this basis, we applied the emoticons polarity in a Long Short-Term Memory recurrent neural network (LSTM) for sentiment analysis of undersized labelled data. Our experimental results show that the proposed method can significantly improve the precision for predicting sentiment polarity on Weibo.
- Published
- 2018
- Full Text
- View/download PDF
128. The use of genetic algorithm to optimize quantitative learner's motivation model
- Author
-
Paweł Lempa, Michal Ptaszynski, and Fumito Masui
- Subjects
Quantitative Learner’s Motivation Model ,business.industry ,Computer science ,Genetic algorithm ,genetic algorithm ,Artificial intelligence ,Machine learning ,computer.software_genre ,business ,optimization ,computer - Abstract
The paper presents a method of optimizing Quantitative Learner’s Motivation Model with the use of genetic algorithm. It is focused on optimizing the formula for prediction of learning motivation by means of different weights for three values: interest, usefulness in the future and satisfaction. For the purpose of this optimization, we developed a C++ library that implements a genetic algorithm and an application in C# which uses that library with data acquired from questionnaires enquiring about those three elements. The results of the experiment showed improvement in the estimation of student’s learning motivation.
- Published
- 2018
- Full Text
- View/download PDF
129. Automatic Extraction of Harmful Sentence Patterns with Application in Cyberbullying Detection
- Author
-
Fumito Masui, Yasutomo Kimura, Rafal Rzepka, Michal Ptaszynski, and Kenji Araki
- Subjects
Information retrieval ,business.industry ,Computer science ,Association (object-oriented programming) ,Reading (process) ,media_common.quotation_subject ,Brute-force search ,The Internet ,Language model ,Internet users ,business ,Sentence ,media_common - Abstract
The problem of humiliating and slandering people through Internet, generally defined as cyberbullying (later: CB), has been recently noticed as a serious social problem disturbing mental health of Internet users. In Japan, to deal with the problem, members of Parent-Teacher Association (PTA) perform Internet Patrol – a voluntary work by reading through the whole Web contents to spot cyberbullying entries. To help PTA members we propose a novel method for automatic detection of malicious contents on the Internet. The method is based on a brute force search algorithm-inspired combinatorial approach to language modeling. The method automatically extracts sophisticated sentence patterns and uses them in classification. We tested the method on actual data containing cyberbullying provided by Human Rights Center. The results show our method outperformed previous methods. It is also more efficient as it requires minimal human effort.
- Published
- 2018
- Full Text
- View/download PDF
130. Automatic extraction of future references from news using morphosemantic patterns with application to future trend prediction
- Author
-
Fumito Masui, Hirotoshi Honma, Michal Ptaszynski, and Yoko Nakajima
- Subjects
Computer science ,business.industry ,Future trend ,02 engineering and technology ,General Medicine ,Data science ,Newspaper ,Resource (project management) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,The Internet ,business ,Everyday life - Abstract
In everyday life people use past events and their own knowledge to predict future events. In such everyday predictions people use widely available resources (newspapers, Internet). This study focused on sentences referring to the future, such as the one below, as one of such resource.
- Published
- 2016
- Full Text
- View/download PDF
131. Advances in Curling Game Information Analysis by Considering Starting Position
- Author
-
Hitoshi Yanagi, Fumito Masui, Hiromu Otani, and Michal Ptaszynski
- Subjects
Engineering drawing ,Position (vector) ,business.industry ,Computer science ,Information analysis ,Artificial intelligence ,business ,Curling - Published
- 2017
- Full Text
- View/download PDF
132. The influence of changing the road pavement and the method of using a wheelchair on the vibration perception in accordance with ISO 2631
- Author
-
Fumito Masui and Michal Ptaszynski
- Subjects
Combinatorics ,Computer science ,TheoryofComputation_GENERAL ,MathematicsofComputing_DISCRETEMATHEMATICS - Abstract
Experiments with language combinatorics in text classification: lessons learned and future implications
- Published
- 2017
- Full Text
- View/download PDF
133. Automatically annotating a five-billion-word corpus of Japanese blogs for sentiment and affect analysis
- Author
-
Yoshio Momouchi, Michal Ptaszynski, Rafal Rzepka, and Kenji Araki
- Subjects
Information retrieval ,Computer science ,business.industry ,media_common.quotation_subject ,InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL ,Sentiment analysis ,computer.software_genre ,Affect (psychology) ,Theoretical Computer Science ,Human-Computer Interaction ,Sadness ,Annotation ,Emotive ,Test set ,Artificial intelligence ,Valence (psychology) ,business ,GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries) ,computer ,Software ,Sentence ,Natural language processing ,media_common - Abstract
This paper presents our research on automatic annotation of a five-billion-word corpus of Japanese blogs with information on affect and sentiment. We first perform a study in emotion blog corpora to discover that there has been no large scale emotion corpus available for the Japanese language. We choose the largest blog corpus for the language and annotate it with the use of two systems for affect analysis: ML-Ask for word- and sentence-level affect analysis and CAO for detailed analysis of emoticons. The annotated information includes affective features like sentence subjectivity (emotive/non-emotive) or emotion classes (joy, sadness, etc.), useful in affect analysis. The annotations are also generalized on a two-dimensional model of affect to obtain information on sentence valence (positive/negative), useful in sentiment analysis. The annotations are evaluated in several ways. Firstly, on a test set of a thousand sentences extracted randomly and evaluated by over forty respondents. Secondly, the statistics of annotations are compared to other existing emotion blog corpora. Finally, the corpus is applied in several tasks, such as generation of emotion object ontology or retrieval of emotional and moral consequences of actions.
- Published
- 2014
- Full Text
- View/download PDF
134. Detecting Emotive Sentences with Pattern-based Language Modelling
- Author
-
Fumito Masui, Rafal Rzepka, Michal Ptaszynski, and Kenji Araki
- Subjects
Emotive expressions ,Recall ,Computer science ,business.industry ,Language Combinatorics ,Language Modelling ,Context (language use) ,Brute Force Search Algorithms ,computer.software_genre ,Task (project management) ,Emotive ,Pattern Extraction ,General Earth and Planetary Sciences ,Language modelling ,Artificial intelligence ,business ,computer ,Natural language processing ,General Environmental Science - Abstract
This paper presents our research in detection of emotive (emotionally loaded) sentences. The task is defined as a text classification problem with an assumption that emotive sentences stand out both lexically and grammatically. The assumption is verified exper- imentally. The experiment is based on n-grams as well as more sophisticated patterns with disjointed elements. To deal with the sophisticated patterns a novel language modelling algorithm based on the idea of language combinatorics is applied. The results of experiments are explained with the standard means of Precision, Recall and balanced F-score. The algorithm also provides a refined list of most frequent sophisticated patterns typical for both emotive and non-emotive context.
- Published
- 2014
- Full Text
- View/download PDF
135. Straight thinking straight from the net - on the web-based intelligent talking toy development
- Author
-
Rafal Rzepka, Kenji Araki, Michal Ptaszynski, and Shinsuke Higuchi
- Subjects
Multimedia ,business.industry ,Computer science ,Joke ,media_common.quotation_subject ,Intelligent decision support system ,Machine ethics ,Common sense ,computer.software_genre ,Knowledge-based systems ,Robot ,The Internet ,business ,computer ,Generator (mathematics) ,media_common - Abstract
This paper introduces an early stage of a smart toy development project which combines several techniques to achieve a level of conversational skills and knowledge higher than currently available robots for children. We describe our ideas and achievements for three modules which we treat as the most important - topic unlimited talking engine, emotions recognizer and the moral behavior analyzer. We will also mention our novel evaluation method for freely speaking agents and possibilities of adding another module - an automatic joke generator. Index Terms—Intelligent systems, common sense, affect analysis, machine ethics
- Published
- 2013
136. Affect analysis in context of characters in narratives
- Author
-
Satoshi Oyama, Rafal Rzepka, Michal Ptaszynski, Masahito Kurihara, Hiroaki Dokoshi, Kenji Araki, and Yoshio Momouchi
- Subjects
business.industry ,General Engineering ,Realization (linguistics) ,Context (language use) ,computer.software_genre ,Linguistics ,Computer Science Applications ,Artificial Intelligence ,Semiotics ,Narrative ,Emotional expression ,Affect (linguistics) ,Artificial intelligence ,Psychology ,business ,computer ,Modality (semiotics) ,Natural language processing ,Sentence - Abstract
This paper presents our research in text-based affect analysis (AA) of narratives. AA represents a task of estimating or recognizing emotions elicited by a certain semiotic modality. In text-based AA the modality in focus is the textual representation of language. In this research we study particularly one type of language realization, namely narratives (e.g., stories, fairy tales, etc.). Affect analysis within the context of narratives is a challenging task because narratives are created of different kinds of sentences (descriptions, dialogs, etc.). Moreover, different characters become subjects of different emotional expressions in different parts of narratives. In this research we address the problem of person/character related affect recognition in narratives. We propose a method for emotion subject extraction from a sentence based on analysis of anaphoric expressions and compare two methods for affect analysis. We evaluate the system and discuss its possible future improvements.
- Published
- 2013
- Full Text
- View/download PDF
137. Subjective? Emotional? Emotive?: Language Combinatorics based Automatic Detection of Emotionally Loaded Sentences
- Author
-
Michal, Ptaszynski, Fumito, Masui, Rafal, Rzepka, Kenji, Araki, Michal, Ptaszynski, Fumito, Masui, Rafal, Rzepka, and Kenji, Araki
- Abstract
In this paper presents our research in automatic detection of emotionally loaded, or emotive sentences. We define the problem from a linguistic point of view assuming that emotive sentences stand out both lexically and grammatically. To verify this assumption we prepare a text classification experiment. In the experiment we apply language combinatorics approach to automatically extract emotive patterns from training sentences. the applied approach allows automatic extraction of not only widely used unigrams (tokens), or n-grams, but also more sophisticated patterns with disjointed elements. The results of experiments are explained with the use of means such as standard Precision, Recall and balanced F-score. The algorithm also provides a refined list of most frequent sophisticated patterns typical for both emotive and non-emotive context. The method reached results comparable to the state of the art, while the fact that it is fully automatic makes it more efficient and language independent.
- Published
- 2017
138. Towards Joking, Humor Sense Equipped and Emotion Aware Conversational Systems
- Author
-
Rafal Rzepka, Pawel Dybala, Michal Ptaszynski, Kenji Araki, and Motoki Yatsu
- Subjects
Emotive ,Human–computer interaction ,Computer science ,business.industry ,ComputingMilieux_PERSONALCOMPUTING ,Realization (linguistics) ,Sense of humor ,Conversational system ,Emotion recognition ,Modular design ,business ,GeneralLiterature_MISCELLANEOUS ,Generator (mathematics) - Abstract
In this paper, we present our progress so far in realization of project aimed to create a complex, modular humor-equipped conversational system. By complex, we mean that it should be able to: (1) detect users’ emotions, (2) detect users’ humorous behaviors and react to them properly, (3) generate humor according to users’ emotive states and (4) learn each user’s individual sense of humor. The research is conducted in Japanese. We chose puns as a relatively computable genre of humor. We describe a general outline of our system, as well as its four modules: humor detection module, emotion recognition module, response generator module and individualisation module. We present the algorithm of systems used in each module, along with some evaluation results.
- Published
- 2016
- Full Text
- View/download PDF
139. Analysis of Curling Team Strategy and Tactics using Curling Informatics
- Author
-
Fumito Masui, Hitoshi Yanagi, Kohsuke Hirata, Michal Ptaszynski, and Hiromu Otani
- Subjects
Engineering ,Engineering management ,Operations research ,business.industry ,Team strategy ,Informatics ,business ,Curling - Published
- 2016
- Full Text
- View/download PDF
140. Evaluating Subjective Aspects of HCI on an Example of a Non-Task Oriented Conversational System
- Author
-
Rafal Rzepka, Michal Ptaszynski, Pawel Dybala, and Kenji Araki
- Subjects
HCI ,business.industry ,Computer science ,Conversational system ,Usability ,Bridge (nautical) ,Field (computer science) ,Artificial Intelligence ,Human–computer interaction ,Task oriented ,conversational systems ,business ,Brain scanning ,evaluation methodology - Abstract
The evaluation of subjective aspects of HCI, such as human-likeness, likeability or users' emotions towards computers is still quite a neglected issue, especially in the field of non-task oriented conversational systems (chatterbots). In this paper we try to bridge this gap by proposing a new methodology of evaluation. The methods presented were tested in our research on humor-equipped chatterbots. We describe them in details, discuss their drawbacks and usability. In one of the presented methods we used an emotiveness analysis system, which itself can be considered an AI tool, as it was used to detect users' emotions towards conversational systems, and to perform their automatic evaluation. We also propose some methods that we have not used yet, which, however, seem applicable in this field, such as brain scanning techniques. Finally, we give some ideas that should be addressed in the future.
- Published
- 2010
141. Big data analytics - towards the enrichment of content tourism for revitalization of Japanese rural area
- Author
-
Rafal Rzepka, Kenji Araki, Pawel Dybala, Michal Ptaszynski, and Fumito Masui
- Subjects
lcsh:TA1-2040 ,0502 economics and business ,05 social sciences ,Business ,050207 economics ,lcsh:Engineering (General). Civil engineering (General) ,050212 sport, leisure & tourism - Abstract
Japan's domestic travel and tourism industry expenditure has been declining gradually since 1998 (from 33.5 in 1998 to 21.6 trillion JPY in 2016). Our research purpose is to construct a data analysis model to transform the collected data to a meaningful graphical format by using big data analytics techniques to discover anomalies and sustainable development possibilities for economy and tourism of Japan's rural areas, with a particular focus on the prefecture of Hokkaido, subprefecture of Okhotsk. To strengthen the reliability of this model we apply popular Monte Carlo simulation combined with Bayesian statistic and implement it on an Apache Spark platform to acquire results within the span of the study. Through this research, we focus on observing and analyzing interests, expectations and tendencies of Japanese people living in rural areas. From such collected information, we can obtain reasons for the decline of this sector’s impact on Japan’s economy. Measuring public awareness has become more efficient since the content generator role has been passed on to ordinary people. Therefore, the analysis of Big Data with the use of data science techniques has become important to comprehend human behavior from multiple points of view, including the scientific, economic, political, historical and sociological.
- Published
- 2018
142. CAO: A Fully Automatic Emoticon Analysis System Based on Theory of Kinesics
- Author
-
Rafal Rzepka, Jacek Maciejewski, Kenji Araki, Pawel Dybala, and Michal Ptaszynski
- Subjects
Structure (mathematical logic) ,Kinesics ,Computer science ,business.industry ,Speech recognition ,computer.software_genre ,Semantics ,Human-Computer Interaction ,Annotation ,Text processing ,Emoticon ,Affect (linguistics) ,Artificial intelligence ,Computer-mediated communication ,business ,computer ,Software ,Natural language processing - Abstract
This paper presents CAO, a system for affect analysis of emoticons in Japanese online communication. Emoticons are strings of symbols widely used in text-based online communication to convey user emotions. The presented system extracts emoticons from input and determines the specific emotion types they express with a three-step procedure. First, it matches the extracted emoticons to a predetermined raw emoticon database. The database contains over 10,000 emoticon samples extracted from the Web and annotated automatically. The emoticons for which emotion types could not be determined using only this database, are automatically divided into semantic areas representing “mouths” or “eyes,” based on the idea of kinemes from the theory of kinesics. The areas are automatically annotated according to their co-occurrence in the database. The annotation is first based on the eye-mouth-eye triplet, and if no such triplet is found, all semantic areas are estimated separately. This provides hints about potential groups of expressed emotions, giving the system coverage exceeding 3 million possibilities. The evaluation, performed on both training and test sets, confirmed the system's capability to sufficiently detect and extract any emoticon, analyze its semantic structure, and estimate the potential emotion types expressed. The system achieved nearly ideal scores, outperforming existing emoticon analysis systems.
- Published
- 2010
- Full Text
- View/download PDF
143. Multiagent system for joke generation: Humor and emotions combined in human-agent conversation
- Author
-
Mizuki Takahashi, Pawel Dybala, Jacek Maciejewski, Rafal Rzepka, Kenji Araki, and Michal Ptaszynski
- Subjects
Joke ,business.industry ,Computer science ,media_common.quotation_subject ,Humor processing ,computer.software_genre ,Outcome (game theory) ,emotiveness analysis ,Intelligent agent ,human-computer interaction ,Human–computer interaction ,chatterbots ,Conversation ,Artificial intelligence ,Dialog system ,User interface ,business ,Baseline (configuration management) ,computer ,conversational agents ,Software ,Generator (mathematics) ,media_common - Abstract
In this paper we present an innovative work on a multiagent joking conversational system. In our research so far we have shown that implementing humor into a chatterbot can visibly improve its performance. The results presented in this paper are the outcome of the next step of our work. They show that a multiagent system, combining a conversational agent, a pun generator and an emotiveness analysis engine, works reasonably well in interactions with users. In the setup used in this research, the emotiveness analysis agent analyses users' utterances and decides whether it is appropriate to tell a pun. Depending on the results of this analysis, the agent chooses either the pun generator, if the decision is that a joke should be told, or the non-humor-equipped agent when the decision is different. Two evaluation experiments were conducted: user (first person) focused and automatic (emotiveness-analysis-based). In both, we compared the performance of the multiagent joking system and a baseline (non-humorous) conversation agent. The results show that in both cases the humor-equipped engine was evaluated as better than the baseline agent. The results are discussed and some ideas for the future are given.
- Published
- 2010
- Full Text
- View/download PDF
144. Activating Humans with Humor : A Dialogue System That Users Want to Interact with
- Author
-
Rafal Rzepka, Pawel Dybala, Michal Ptaszynski, and Kenji Araki
- Subjects
dialogue ,Casual ,Computer science ,Field (Bourdieu) ,media_common.quotation_subject ,AI and social sciences ,Negativity effect ,Space (commercial competition) ,Psycholinguistics ,Human-Computer Interaction ,User engagement ,Emotive ,Artificial Intelligence ,Hardware and Architecture ,Human–computer interaction ,Conversation ,Computer Vision and Pattern Recognition ,Electrical and Electronic Engineering ,Social psychology ,Software ,psycholinguistics ,media_common - Abstract
The topic of Human Computer Interaction (HCI) has been gathering more and more scientific attention of late. A very important, but often undervalued area in this field is human engagement. That is, a person's commitment to take part in and continue the interaction. In this paper we describe work on a humor-equipped casual conversational system (chatterbot) and investigate the effect of humor on a user's engagement in the conversation. A group of users was made to converse with two systems: one with and one without humor. The chat logs were then analyzed using an emotive analysis system to check user reactions and attitudes towards each system. Results were projected on Russell's two-dimensional emotiveness space to evaluate the positivity/negativity and activation/deactivation of these emotions. This analysis indicated emotions elicited by the humor-equipped system were more positively active and less negatively active than by the system without humor. The implications of results and relation between them and user engagement in the conversation are discussed. We also propose a distinction between positive and negative engagement.
- Published
- 2009
145. A System for Affect Analysis of Utterances in Japanese Supported with Web Mining
- Author
-
Rafal Rzepka, Kenji Araki, Pawel Dybala, Wenhan Shi, and Michal Ptaszynski
- Subjects
Analysis of emotiveness ,Evaluation methods ,Web mining ,Computer science ,business.industry ,Speech recognition ,Context (language use) ,computer.software_genre ,Expression (mathematics) ,Emotive ,Emotiveness ,Emotional expression ,Affect (linguistics) ,Artificial intelligence ,Affect analysis ,business ,computer ,Sentence ,Natural language processing ,Utterance - Abstract
We propose a method for affect analysis of textual input in Japanese supported with Web mining. The method is based on a pragmatic reasoning that emotional states of a speaker are conveyed by emotional expressions used in emotive utterances. It means that if an emotive expression is used in a sentence in a context described as emotive, the emotion conveyed in the text is revealed by the used emotive expression. The system ML-Ask (Emotive Elements / Expressions Analysis System) is constructed on the basis of this idea. An evaluation of the system is performed in which two evaluation methods are compared. To choose the most objective evaluation method we compare the most popular method in the field and a method proposed by us. The proposed evaluation method was shown to be more objective and revealed the strong and weak points of the system in detail. In the evaluation experiment ML-Ask reached human level in recognizing the general emotiveness of an utterance (0.83 balanced F-score) and 63% of human level in recognizing the specific types of emotions. We support the system with a Web mining technique to improve the performance of emotional state types extraction. In the Web mining technique emotive associations are extracted from the Web using co-occurrences of emotive expressions with morphemes of causality. The Web mining technique improved the performance of the emotional states types extraction to 85% of human performance.
- Published
- 2009
146. Statistical Analysis of Automatic Seed Word Acquisition to Improve Harmful Expression Extraction in Cyberbullying Detection
- Author
-
Suzuha, Hatakeyama, Fumito, Masui, Michal, Ptaszynski, Kazuhide, Yamamoto, Suzuha, Hatakeyama, Fumito, Masui, Michal, Ptaszynski, and Kazuhide, Yamamoto
- Abstract
We study the social problem of cyberbullying, defined as a new form of bullying that takes place in the Internet space. This paper proposes a method for automatic acquisition of seed words to improve performance of the original method for the cyberbullying detection by Nitta et al. [1]. We conduct an experiment exactly in the same settings to find out that the method based on a Web mining technique, lost over 30% points of its performance since being proposed in 2013. Thus, we hypothesize on the reasons for the decrease in the performance and propose a number of improvements, from which we experimentally choose the best one. Furthermore, we collect several seed word sets using different approaches, evaluate and their precision. We found out that the influential factor in extraction of harmful expressions is not the number of seed words, but the way the seed words were collected and filtered.
- Published
- 2016
147. Annotating Japanese blogs with syntactic and affective information
- Author
-
Jacek Maciejewski, Yoshio Momouchi, Michal Ptaszynski, Kenji Araki, Pawel Dybala, and Rafal Rzepka
- Subjects
Computer science ,business.industry ,Artificial intelligence ,business ,computer.software_genre ,computer ,Natural language processing - Published
- 2014
148. Emotive or Non-emotive: That is The Question
- Author
-
Kenji Araki, Rafal Rzepka, Michal Ptaszynski, and Fumito Masui
- Subjects
Point (typography) ,Emotive ,business.industry ,Computer science ,Artificial intelligence ,computer.software_genre ,business ,computer ,Natural language processing ,Linguistics ,Focus (linguistics) - Abstract
In this research we focus on discriminating between emotive (emotionally loaded) and non-emotive sentences. We define the problem from a linguistic point of view assuming that emotive sentences stand out both lexically and grammatically. We verify this assumption experimentally by comparing two sets of such sentences in Japanese. The comparison is based on words, longer n-grams as well as more sophisticated patterns. In the classification weuse a novelunsupervised learning algorithm based on the idea of language combinatorics. The method reached results comparable to the state of the art, while the fact that it is fully automatic makes it more efficient and language independent.
- Published
- 2014
- Full Text
- View/download PDF
149. Towards computational fronesis : verifying contextual appropriateness of emotions
- Author
-
Pawel Dybala, Kenji Araki, Yoshio Momouchi, Michal Mazur, Michal Ptaszynski, and Rafal Rzepka
- Subjects
Multimedia ,Computer Networks and Communications ,Computer science ,media_common.quotation_subject ,Educational technology ,Context (language use) ,computer.software_genre ,Computer Science Applications ,Education ,Task (project management) ,Intelligent agent ,Human–computer interaction ,Conversation ,Affect (linguistics) ,Computational linguistics ,TUTOR ,computer ,computer.programming_language ,media_common - Abstract
This paper presents research in Contextual Affect Analysis (CAA) for the need of future application in intelligent agents, such as conversational agents or artificial tutors. The authors propose a new term, Computational Fronesis (CF), to embrace the tasks included in CAA applied to development of conversational agents such as artificial tutors. In tutor-student discourse it is crucial that the artificial tutor was able not only to detect user/student emotions, but also to verify toward whom they were directed and whether they were appropriate for the context of the conversation. Therefore, as the first task in CF the authors focus on verification of contextual appropriateness of emotions. They performed some of the first experiments in this task for the Japanese language and discuss future directions in development and implications of Computational Fronesis.
- Published
- 2013
150. Science of Emoticons
- Author
-
Kenji Araki, Rafal Rzepka, Jacek Maciejewski, Pawel Dybala, Michal Ptaszynski, and Yoshio Momouchi
- Subjects
Conceptual framework ,Computer science ,business.industry ,Artificial intelligence ,State (computer science) ,Type (model theory) ,computer.software_genre ,business ,computer ,Natural language processing - Abstract
Emoticons are string of symbols representing body language in text-based communication. For a long time they have been considered as unnatural language entities. This chapter argues that, in over 40-year-long history of text-based communication, emoticons have gained a status of an indispensable means of support for text-based messages. This makes them fully a part of Natural Language Processing. The fact the emoticons have been considered as unnatural language expressions has two causes. Firstly, emoticons represent body language, which by definition is nonverbal. Secondly, there has been a lack of sufficient methods for the analysis of emoticons. Emoticons represent a multimodal (bimodal in particular) type of information. Although they are embedded in lexical form, they convey non-linguistic information. To prove this argument the authors propose that the analysis of emoticons was based on a theory designed for the analysis of body language. In particular, the authors apply the theory of kinesics to develop a state of the art system for extraction and analysis of kaomoji, Japanese emoticons. The system performance is verified in comparison with other emoticon analysis systems. Experiments showed that the presented approach provides nearly ideal results in different aspects of emoticon analysis, thus proving that emoticons possess features of multimodal expressions.
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.