190 results on '"Bernard J Jansen"'
Search Results
2. Optimal advertising for a generalized Vidale–Wolfe response model
- Author
-
Yanwu Yang, Joni Salminen, Baozhu Feng, and Bernard J. Jansen
- Subjects
Computer science ,05 social sciences ,Economics, Econometrics and Finance (miscellaneous) ,Stochastic game ,Advertising ,Monotonic function ,02 engineering and technology ,Optimal control ,Human-Computer Interaction ,Advertising campaign ,Elasticity (cloud computing) ,020204 information systems ,0502 economics and business ,0202 electrical engineering, electronic engineering, information engineering ,Key (cryptography) ,050211 marketing ,Market share ,Marginal utility - Abstract
In this research, we formulate budget allocation decisions as an optimal control problem using a generalized Vidale–Wolfe model (GVW) as its advertising dynamics under a finite time horizon. One key element of our modeling work is that the proposed optimal budget allocation model (called GVW-OB) takes into account the roles of two useful indexes of the GVW model representing the advertising elasticity and the word-of-mouth (WoM) effect, respectively, in determining optimal budget. Moreover, we discuss desirable properties and provide a feasible solution to our GVW-OB model. We conduct computational experiments to assess our model’s performance and its identified properties, based on real-world datasets obtained from advertising campaigns by three e-commerce companies on Google AdWords, Facebook Ads and Baidu Ads, respectively. Experimental results show that (1) our GVW-OB strategy outperforms four baselines in terms of both payoff and ROI in either concave or S-shaped settings; (2) linear budget allocation strategies favor concave advertising responses, while nonlinear strategies support S-shaped responses; (3) a larger ad elasticity empowers higher levels of optimal budget and corresponding market share and thus achieves higher payoff and ROI, so does a larger WoM effect; and (4) as the total budget increases, the resulting payoff by the GVW-OB strategy increases monotonically, but the ROI decreases, which is consistent with the law of diminishing marginal utility. From a methodological perspective, our GVW-OB strategy provides a feasible solution for advertisers to make optimal budget allocation over time, which can be easily applied to a variety of advertising media. The identified properties and experimental findings of this research illuminate critical managerial insights for advertisers and media providers.
- Published
- 2021
3. Making Meaningful User Segments from Datasets Using Product Dissemination and Product Impact
- Author
-
Soon-Gyo Jung, Joni Salminen, and Bernard J. Jansen
- Subjects
Information management ,Knowledge management ,business.industry ,Computer science ,05 social sciences ,Pharmaceutical Science ,Business informatics ,0502 economics and business ,050211 marketing ,Product (category theory) ,0509 other social sciences ,050904 information & library sciences ,business ,Business management - Published
- 2020
4. Data-Driven Personas for Enhanced User Understanding: Combining Empathy with Rationality for Better Insights to Analytics
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
Information management ,Computer science ,business.industry ,media_common.quotation_subject ,05 social sciences ,Pharmaceutical Science ,Rationality ,Empathy ,Persona ,Data science ,Data-driven ,Business informatics ,Analytics ,0502 economics and business ,050211 marketing ,0501 psychology and cognitive sciences ,business ,Business management ,050107 human factors ,media_common - Published
- 2020
5. Influence of Social Media Attitude in Cross Screen Conversation
- Author
-
Bernard J. Jansen and Partha Mukherjee
- Subjects
Computer science ,Event (computing) ,media_common.quotation_subject ,020206 networking & telecommunications ,Advertising ,02 engineering and technology ,Feeling ,0202 electrical engineering, electronic engineering, information engineering ,General Earth and Planetary Sciences ,Second screen ,020201 artificial intelligence & image processing ,Social media ,Conversation ,Association (psychology) ,General Environmental Science ,media_common - Abstract
We carry out our study on interactions via second screens by analyzing more than 3M, 800K and 50K Super Bowl related posts from Twitter, Instagram, and Tumblr respectively to measure the influence of second screen based interaction on social media attitude. We took commercials, musicals and game as the categories of interest into consideration for temporal analysis in Pre, During, and Post Super Bowl phases. Research results show that the change in attitude is significant among categories in phases. Posts containing URLs and posts that contain only texts with no URLs, retweets or response show a positive association with viewers’ second screen attitude in Pre and Post phases. This research is important in identifying the interplay that technology has on the use of social interactions for sharing feelings and information via cross screens across disparate social platforms at different phases of an in real life event.
- Published
- 2020
6. Keyword Optimization in Sponsored Search Advertising: A Multi-Level Computational Framework
- Author
-
Yanwu Yang, Yinghui Yang, Xunhua Guo, Daniel Dajun Zeng, and Bernard J. Jansen
- Subjects
FOS: Computer and information sciences ,Information retrieval ,Computer Science - Artificial Intelligence ,Computer Networks and Communications ,Keyword search ,Computer science ,I.2.6 ,Intelligent decision support system ,68Txx ,02 engineering and technology ,Bridge (interpersonal) ,Bridge (nautical) ,Computer Science - Information Retrieval ,Management information systems ,Search engine ,Artificial Intelligence (cs.AI) ,Artificial Intelligence ,Search advertising ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Baseline (configuration management) ,Information Retrieval (cs.IR) - Abstract
In sponsored search advertising, keywords serve as an essential bridge linking advertisers, search users and search engines. Advertisers have to deal with a series of keyword decisions throughout the entire lifecycle of search advertising campaigns. This paper proposes a multi-level and closed-form computational framework for keyword optimization (MKOF) to support various keyword decisions. Based on this framework, we develop corresponding optimization strategies for keyword targeting, keyword assignment and keyword grouping at different levels (e.g., market, campaign and adgroup). With two real-world datasets obtained from past search advertising campaigns, we conduct computational experiments to evaluate our keyword optimization framework and instantiated strategies. Experimental results show that our method can approach the optimal solution in a steady way, and it outperforms two baseline keyword strategies commonly used in practice. The proposed MKOF framework also provides a valid experimental environment to implement and assess various keyword strategies in sponsored search advertising., Comment: 21 pages, 3 figures,1 table
- Published
- 2022
- Full Text
- View/download PDF
7. Manual and Automatic Methods for User Needs Detection in Requirements Engineering: Key Concepts and Challenges
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
Work (electrical) ,Requirements engineering ,business.industry ,Computer science ,Key (cryptography) ,User needs ,Software engineering ,business ,Focus group ,User Research - Abstract
User needs inform designers and developers of essential functionalities for requirements engineering. In this work, we summarize key concepts and challenges relating to manual and automatic user needs detection methods. We discuss six challenges with manual and eight challenges with automated methods. Despite the promise of automated methods, the challenges imply that artificial intelligence and machine learning are not yet mature enough to replace manual methods, such as interviews and focus groups, for discovering user needs in requirements engineering.
- Published
- 2021
8. Taking Back Control of Social Media Feeds with Take Back Control
- Author
-
Soon-Gyo Jung, Joni Salminen, Juan Corporan, and Bernard J. Jansen
- Subjects
Web browser ,Computer science ,End user ,media_common.quotation_subject ,Control (management) ,JavaScript ,Filter (software) ,World Wide Web ,Selection (linguistics) ,Social media ,Quality (business) ,computer ,computer.programming_language ,media_common - Abstract
Controlling the quality of social media feeds poses an issue for many users. Platforms such as Twitter give users some options to influence their feeds. Still, the selection of content predominantly relies on implicit rather than explicit user actions, as manual options for "cleaning the feed" are often cumbersome and difficult to use for most users. Here, we present Take Back Control, a web browser extension that gives users control to hide undesirable content from their social media feeds. The extension combines JavaScript (for hiding the content) and machine learning (for deciding what content to hide). Our current demonstration includes three filter types: Toxic, Political, and Negative content, with a possibility to add more filters, all of this with the overarching aim of helping end users control the information visible in their social media feeds.
- Published
- 2021
9. Comparing Persona Analytics and Social Media Analytics for a User-Centric Task Using Eye-Tracking and Think-Aloud
- Author
-
Soon-Gyo Jung, Joni Salminen, Sercan Şengün, and Bernard J. Jansen
- Subjects
Identification (information) ,Human–computer interaction ,Analytics ,business.industry ,Computer science ,Persona ,Think aloud protocol ,business ,Design paradigm ,Social media analytics ,User-centered design ,Task (project management) - Abstract
We compare a data-driven persona system and an analytics system for efficiency and effectiveness for a user identification task. Findings from the 34-participant experiment show that the data-driven persona system affords faster task completion, is easier for users to engage with, and provides better user identification accuracy. Eye-tracking data indicates that the participants focus most of their attention on the persona content while focusing more on navigation features when using the analytics system. The combined results provide empirical support for the use of data-driven personas for a user identification task, which we surmise to be a result of the persona system following a user-centered design paradigm instead of an information-centered paradigm. That analytics system afforded capabilities and insights that the persona system did not suggest that the triangulation of features may lead to a better overall user understanding.
- Published
- 2021
10. Machine learning approach to auto-tagging online content for content marketing efficiency: A comparative analysis between methods and content type
- Author
-
Bernard J. Jansen, Juan Corporan, Vignesh Yoganathan, Soon-Gyo Jung, and Joni Salminen
- Subjects
Marketing ,Content marketing ,Artificial neural network ,business.industry ,Computer science ,05 social sciences ,Unstructured data ,Machine learning ,computer.software_genre ,Random forest ,User experience design ,0502 economics and business ,050211 marketing ,Content type ,Web content ,Artificial intelligence ,business ,F1 score ,computer ,050203 business & management - Abstract
As complex data becomes the norm, greater understanding of machine learning (ML) applications is needed for content marketers. Unstructured data, scattered across platforms in multiple forms, impedes performance and user experience. Automated classification offers a solution to this. We compare three state-of-the-art ML techniques for multilabel classification - Random Forest, K-Nearest Neighbor, and Neural Network - to automatically tag and classify online news articles. Neural Network performs the best, yielding an F1 Score of 70% and provides satisfactory cross-platform applicability on the same organisation's YouTube content. The developed model can automatically label 99.6% of the unlabelled website and 96.1% of the unlabelled YouTube content. Thus, we contribute to marketing literature via comparative evaluation of ML models for multilabel content classification, and cross-channel validation for a different type of content. Results suggest that organisations may optimise ML to auto-tag content across various platforms, opening avenues for aggregated analyses of content performance.
- Published
- 2019
11. Capturing the change in topical interests of personas over time
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
Web analytics ,World Wide Web ,General Computer Science ,business.industry ,Computer science ,Persona ,Library and Information Sciences ,business - Published
- 2019
12. Implementing Eye-Tracking for Persona Analytics
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
Scarcity ,User studies ,Human–computer interaction ,Analytics ,business.industry ,Computer science ,media_common.quotation_subject ,Yield (finance) ,Eye tracking ,Persona ,business ,media_common - Abstract
Investigating users’ engagement with interactive persona systems can yield crucial insights for the design of such systems. Using eye-tracking, researchers can address the scarcity of behavioral user studies, even during times when physical user studies are difficult or impossible to carry out. In this research, we implement a webcam-based eye-tracking module into an interactive persona system, facilitating remote user studies. Findings from the implementation can show what information users pay attention to in persona profiles.
- Published
- 2021
13. Persona Analytics: Implementing Mouse-Tracking for an Interactive Persona System
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
User studies ,InformationSystems_MODELSANDPRINCIPLES ,InformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.,HCI) ,Analytics ,business.industry ,Computer science ,Human–computer interaction ,ComputingMilieux_COMPUTERSANDSOCIETY ,Persona ,Mouse tracking ,business - Abstract
Observing user interactions with interactive persona systems offers important insights for the design and application of such systems. Using an interactive persona system, user behavior and interaction with personas can be tracked with high precision, addressing the scarcity of behavioral persona user studies. In this research, we introduce and evaluate an implementation of persona analytics based on mouse tracking, which offers researchers new possibilities for conducting persona user studies, especially during times when in-person user studies are challenging to carry out.
- Published
- 2021
14. Suggestions for Online User Studies
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
User studies ,World Wide Web ,Important research ,Computer science ,Physical access ,Online research methods - Abstract
During exceptional times when researchers do not have physical access to users of technology, the importance of remote user studies increases. We provide recommendations based on lessons learned from conducting online user studies utilizing four online research platforms (Appen, MTurk, Prolific, and Upwork). Our recommendations aim to help those inexperienced with online user studies. They are also beneficial for those interested in increasing their proficiency, employing this increasingly important research methodology for studying people’s interactions with technology and information.
- Published
- 2021
15. Automatically Mapping Ad Targeting Criteria between Online Ad Platforms
- Author
-
Bernard J. Jansen, Joni Salminen, and Soon-Gyo Jung
- Subjects
Computer science - Published
- 2021
16. Information Design for Personas in Four Professional Domains of User Experience Design, Healthcare, Market Research, and Social Media Strategy
- Author
-
Soon-Gyo Jung, Lene Rostgaard Nielsen, Kathleen W. Guan, Joni Salminen, and Bernard J. Jansen
- Subjects
Knowledge management ,InformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.,HCI) ,business.industry ,Computer science ,Persona ,Information design ,Market research ,InformationSystems_MODELSANDPRINCIPLES ,personas, professionals, use ,User experience design ,Health care ,ComputingMilieux_COMPUTERSANDSOCIETY ,Social media ,business - Abstract
Practitioners in user-centric industries have increasingly recognized the applicability of personas. However, the methods used to create personas in different domains remain inconsistent and unsystematic. We analyzed 51 studies focused on designing personas for professional purposes and found the practice most prevalent in the user experience design, healthcare, market research, and social media strategy domains. Within these domains, user experience design personas are characterized by their focus on user activity goals, health personas on medical patients’ physical symptoms, market research personas on customers’ lifestyles, and social media strategy personas on interactions within and between online communities. We identify and compare the elements in the personas. Based on these, we provide guidelines for professionals interested in developing personas for understanding barriers to positive user experience, recruiting users, and building online communities, including how to represent persona details related to lifestyle and health, contexts of product usage, and scaling of online data. Practitioners in user-centric industries have increasingly recognized the applicability of personas. However, the methods used to create personas in different domains remain inconsistent and unsystematic. We analyzed 51 studies focused on designing personas for professional purposes and found the practice most prevalent in the user experience design, healthcare, market research, and social media strategy domains. Within these domains, user experience design personas are characterized by their focus on user activity goals, health personas on medical patients’ physical symptoms, market research personas on customers’ lifestyles, and social media strategy personas on interactions within and between online communities. We identify and compare the elements in the personas. Based on these, we provide guidelines for professionals interested in developing personas for understanding barriers to positive user experience, recruiting users, and building online communities, including how to represent persona details related to lifestyle and health, contexts of product usage, and scaling of online data.
- Published
- 2021
17. From flat file to interface: Synthesis of personas and analytics for enhanced user understanding
- Author
-
Soon-Gyo Jung, Joni Salminen, and Bernard J. Jansen
- Subjects
Web analytics ,General Computer Science ,business.industry ,Interface (Java) ,Computer science ,Human–computer interaction ,Analytics ,Flat file database ,Persona ,Library and Information Sciences ,Design methods ,business - Published
- 2020
18. Designing Prototype Player Personas from a Game Preference Survey
- Author
-
Shammur Absar Chowdhury, Joni Salminen, Aki Koponen, Jukka Vahlo, Soon-Gyo Jung, and Bernard J. Jansen
- Subjects
Value (ethics) ,Computer science ,05 social sciences ,ComputingMilieux_PERSONALCOMPUTING ,020207 software engineering ,02 engineering and technology ,Variation (game tree) ,Persona ,Preference ,Market segmentation ,Human–computer interaction ,0202 electrical engineering, electronic engineering, information engineering ,Survey data collection ,0501 psychology and cognitive sciences ,Social media ,Video game ,050107 human factors - Abstract
The competitiveness of the video game market has increased the need for understanding players. We generate player personas from survey data of 15,402 players' 195,158 stated game preferences from 130,495 game titles using the methodology of automatic persona generation. Our purpose is to demonstrate the potential of data-driven personas for segmenting players by their game preferences. The resulting prototype personas provide potential value for game marketing purposes, e.g., targeting gamers with social media advertising, although they can also be used for understanding demographic variation among various game preference patterns.
- Published
- 2020
19. Analyzing Demographic Bias in Artificially Generated Facial Pictures
- Author
-
Bernard J. Jansen, Shammur Absar Chowdhury, Soon-Gyo Jung, and Joni Salminen
- Subjects
White (horse) ,Demographics ,business.industry ,Computer science ,05 social sciences ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,0202 electrical engineering, electronic engineering, information engineering ,0501 psychology and cognitive sciences ,Artificial intelligence ,business ,050107 human factors - Abstract
Artificial generation of facial images is increasingly popular, with machine learning achieving photo-realistic results. Yet, there is a concern that the generated images might not fairly represent all demographic groups. We use a state-of-the-art method to generate 10,000 facial images and find that the generated images are skewed towards young people, especially white women. We provide recommendations to reduce demographic bias in artificial image generation.
- Published
- 2020
20. The Ethics of Data-Driven Personas
- Author
-
Shammur Absar Chowdhury, Joni Salminen, Bernard J. Jansen, Soon-Gyo Jung, and Willemien Froneman
- Subjects
Computer science ,Human–computer interaction ,Persona ,Base (topology) ,Diversity (business) ,Data-driven - Abstract
Quantitative methods are becoming more common for persona creation, but it is not clear to which extent online data and opaque machine learning algorithms introduce bias at various steps of data-driven persona creation (DDPC) and if these methods violate user rights. In this conceptual analysis, we use Gillespie's framework of algorithmic ethics to analyze DDPC for ethical considerations. We propose five design questions for evaluating the ethics of DDPC. DDPC should demonstrate the diversity of the user base but represent the actual data, be accompanied by explanations of their creation, and mitigate the possibility of unfair decisions.
- Published
- 2020
21. Things Change: Comparing Results Using Historical Data and User Testing for Evaluating a Recommendation Task
- Author
-
Shammur Absar Chowdhury, Soon-Gyo Jung, Dianne Ramirez Robillos, Joni Salminen, and Bernard J. Jansen
- Subjects
User testing ,Information retrieval ,Computer science ,05 social sciences ,Control (management) ,020207 software engineering ,02 engineering and technology ,Recommender system ,Task (project management) ,Domain (software engineering) ,Test set ,0202 electrical engineering, electronic engineering, information engineering ,Collaborative filtering ,0501 psychology and cognitive sciences ,050107 human factors - Abstract
We address a recommendation task for next likely flight destination to customers of a major international airline company. We compare performance using historical flight data and an actual user evaluation. Using two years of historical flight data consisting of tens of millions of flights, an ensemble and a collaborative filtering approach obtained an accuracy of 47% and 20% using a test set of 100,000 customers, respectively, highlighting the challenge of the domain. We then evaluated our recommendations on 10,000 actual customers, with a 45-45-10 split among ensemble, collaborative filtering, and control group. The overall predictive power employed with real users was 23%, with the ensemble method having a predictive power of 19% and 30% for collaborative filtering. Results indicate that, in complex and shifting domains such as this one, one cannot rely solely on historical data for evaluating the impact of user recommendations. We discuss implications for recommendation systems and future research in this and related domains.
- Published
- 2020
22. Personas and Analytics: A Comparative User Study of Efficiency and Effectiveness for a User Identification Task
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, Sercan Şengün, Shammur Absar Chowdhury, and Joni Salminen
- Subjects
InformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.,HCI) ,Computer science ,business.industry ,Learnability ,05 social sciences ,020207 software engineering ,02 engineering and technology ,Persona ,Task (project management) ,Consistency (database systems) ,Identification (information) ,InformationSystems_MODELSANDPRINCIPLES ,Empirical research ,Qualitative analysis ,Analytics ,Human–computer interaction ,0202 electrical engineering, electronic engineering, information engineering ,ComputingMilieux_COMPUTERSANDSOCIETY ,0501 psychology and cognitive sciences ,business ,050107 human factors - Abstract
Personas are a well-known technique in human computer interaction. However, there is a lack of rigorous empirical research evaluating personas relative to other methods. In this 34-participant experiment, we compare a persona system and an analytics system, both using identical user data, for efficiency and effectiveness for a user identification task. Results show that personas afford faster task completion than the analytics system, as well as outperforming analytics with significantly higher user identification accuracy. Qualitative analysis of think-aloud transcripts shows that personas have other benefits regarding learnability and consistency. However, the analytics system affords insights and capabilities that personas cannot due to inherent design differences. Findings support the use of personas to learn about users, empirically confirming some of the stated benefits in the literature, while also highlighting the limitations of personas that may necessitate the use of accompanying methods.
- Published
- 2020
23. A Literature Review of Quantitative Persona Creation
- Author
-
Kathleen W. Guan, Shammur Absar Chowdhury, Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
business.industry ,Computer science ,Best practice ,05 social sciences ,020207 software engineering ,02 engineering and technology ,Persona ,Diversification (marketing strategy) ,Data science ,Digital media ,Leverage (negotiation) ,Analytics ,0202 electrical engineering, electronic engineering, information engineering ,0501 psychology and cognitive sciences ,business ,050107 human factors - Abstract
Quantitative persona creation (QPC) has tremendous potential, as HCI researchers and practitioners can leverage user data from online analytics and digital media platforms to better understand their users and customers. However, there is a lack of a systematic overview of the QPC methods and progress made, with no standard methodology or known best practices. To address this gap, we review 49 QPC research articles from 2005 to 2019. Results indicate three stages of QPC research: Emergence, Diversification, and Sophistication. Sharing resources, such as datasets, code, and algorithms, is crucial to achieving the next stage (Maturity). For practitioners, we provide guiding questions for assessing QPC readiness in organizations.
- Published
- 2020
24. Statistical Modeling of Harassment against Reddit Moderators
- Author
-
Hind Almerekhi, Bernard J. Jansen, and Haewoon Kwak
- Subjects
Variables ,Computer science ,media_common.quotation_subject ,05 social sciences ,020207 software engineering ,Statistical model ,02 engineering and technology ,Affect (psychology) ,Moderation ,Community norms ,0202 electrical engineering, electronic engineering, information engineering ,Harassment ,0501 psychology and cognitive sciences ,Social psychology ,050107 human factors ,media_common - Abstract
Despite the dedication that some volunteer moderators of online communities display when performing their moderation duties, they become targets of hate and harassment by other users. To understand what causes the change in moderator role from heroes to victims, we analyze the responses of 1,818 moderators on Reddit to an online survey about moderation practices and harassment. We built a statistical model and found 6 significant independent variables that affect harassment on moderators, such as the knowledge of community norms, which increases harassment on moderators the most. Our findings imply that vulnerable moderators in toxic communities need countermeasures against harassment.
- Published
- 2020
25. Are These Comments Triggering? Predicting Triggers of Toxicity in Online Discussions
- Author
-
Haewoon Kwak, Joni Salminen, Hind Almerekhi, and Bernard J. Jansen
- Subjects
Online discussion ,0508 media and communications ,Computer science ,020204 information systems ,05 social sciences ,Toxicity ,0202 electrical engineering, electronic engineering, information engineering ,050801 communication & media studies ,Context (language use) ,02 engineering and technology ,Data science - Abstract
Understanding the causes or triggers of toxicity adds a new dimension to the prevention of toxic behavior in online discussions. In this research, we define toxicity triggers in online discussions as a non-toxic comment that lead to toxic replies. Then, we build a neural network-based prediction model for toxicity trigger. The prediction model incorporates text-based features and derived features from previous studies that pertain to shifts in sentiment, topic flow, and discussion context. Our findings show that triggers of toxicity contain identifiable features and that incorporating shift features with the discussion context can be detected with a ROC-AUC score of 0.87. We discuss implications for online communities and also possible further analysis of online toxicity and its root causes.
- Published
- 2020
26. The effect of numerical and textual information on visual engagement and perceptions of AI-driven persona interfaces
- Author
-
João M. Santos, Ying-Hsang Liu, Sercan Şengün, Bernard J. Jansen, Joni Salminen, and Soon-Gyo Jung
- Subjects
Data collection ,Computer science ,End user ,Perception ,media_common.quotation_subject ,Completeness (order theory) ,Eye tracking ,Persona ,Information design ,Affect (psychology) ,Cognitive psychology ,media_common - Abstract
In an experiment, we present 38 marketing and data analysts professionals with two online AI-driven persona interfaces, one using numbers and the other using text. We employ eye tracking, think-aloud, and a post-engagement survey for data collection to measure perception and visual engagement with the personas along 7 constructs. Results show that the use of numbers has a mixed effect on the perceptions and visual engagement of the persona profile, with job role as a determining factor on whether numbers/text affect end users for 2 of the constructs. The use of numbers has a significant positive effect on user perceptions of usefulness by analysts but a significantly negative effect on user perceptions of completeness for both marketers and analysts. The use of numbers decreases the perceived completeness of the personas for both marketer and analysts. This research has both theoretical and practical consequences for AI-driven persona development and their interface design, suggesting that the inclusion of numbers can have a desirable effect for certain roles but with possible negative effects on user perceptions.
- Published
- 2020
27. Giving Faces to Data
- Author
-
Soon-Gyo Jung, Joni Salminen, and Bernard J. Jansen
- Subjects
World Wide Web ,Computer science ,business.industry ,Analytics ,Scalability ,Big data ,Social media ,Persona ,User interface ,Architecture ,business ,Data-driven - Abstract
Creating personas from large amounts of online data is useful but difficult with manual methods. To address this difficulty, we present Automatic Persona Generation (APG), which is an implementation of a methodology for quantitatively generating data-driven personas from online social media data. APG is functional, and it is deployed with several organizations in multiple industry verticals. APG employs a scalable web front-end user interface and robust back-end database framework processing tens of millions of user interactions with tens of thousands of online digital products across multiple online platforms, including Facebook, Google Analytics, and YouTube. APG identifies audience segments that are both distinct and impactful for an organization to create persona profiles. APG enhances numerical social media data with relevant human attributes, such as names, photos, topics, etc. Here, we discuss the architecture development and central system features. Overall, APG can benefit organizations distributing content via online platforms or with online content that relates to commercial products. APG is unique in its algorithmic approach to processing social media data for customer insights. APG can be found online at https://persona.qcri.org.
- Published
- 2020
28. A Template for Data-Driven Personas: Analyzing 31 Quantitatively Oriented Persona Profiles
- Author
-
Bernard J. Jansen, Soon-Gyo Jung, Kathleen W. Guan, Lene Nielsen, and Joni Salminen
- Subjects
Information retrieval ,Computer science ,business.industry ,05 social sciences ,Big data ,020207 software engineering ,Qualitative property ,Information needs ,02 engineering and technology ,Persona ,Information design ,Data-driven ,Variety (cybernetics) ,0202 electrical engineering, electronic engineering, information engineering ,0501 psychology and cognitive sciences ,business ,050107 human factors - Abstract
Following the proliferation of personified big data and data science algorithms, data-driven user personas (DDPs) are becoming more common in persona design. However, the DDP templates are seemingly diverse and fragmented, prompting a need for a synthesis of the information included in these personas. Analyzing 31 templates for DDPs, we find that DDPs vary greatly by their information richness, as the most informative layout has more than 300% more information categories than the least informative layout. We also find that graphical complexity and information richness do not necessarily correlate. Furthermore, the chosen persona development method may carry over to the information presentation, with quantitative data typically presented as scores, metrics, or tables and qualitative data as text-rich narratives. We did not find one “general template” for DDPs and defining this is difficult due to the variety of the outputs of different methods as well as different information needs of the persona users.
- Published
- 2020
29. Rethinking Personas for Fairness: Algorithmic Transparency and Accountability in Data-Driven Personas
- Author
-
Soon-Gyo Jung, Shammur Absar Chowdhury, Joni Salminen, and Bernard J. Jansen
- Subjects
InformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.,HCI) ,business.industry ,End user ,Computer science ,Internet privacy ,Accountability ,Ethical concerns ,ComputingMilieux_COMPUTERSANDSOCIETY ,Persona ,business ,Ethical standards ,Transparency (behavior) ,Data-driven - Abstract
Algorithmic fairness criteria for machine learning models are gathering widespread research interest. They are also relevant in the context of data-driven personas that rely on online user data and opaque algorithmic processes. Overall, while technology provides lucrative opportunities for the persona design practice, several ethical concerns need to be addressed to adhere to ethical standards and to achieve end user trust. In this research, we outline the key ethical concerns in data-driven persona generation and provide design implications to overcome these ethical concerns. Good practices of data-driven persona development include (a) creating personas also from outliers (not only majority groups), (b) using data to demonstrate diversity within a persona, (c) explaining the methods and their limitations as a form of transparency, and (d) triangulating the persona information to increase truthfulness.
- Published
- 2020
30. Creating and detecting fake reviews of online products
- Author
-
Chandrashekhar Kandpal, Joni Salminen, Bernard J. Jansen, Ahmed Kamel, and Soon-Gyo Jung
- Subjects
Marketing ,Computer science ,media_common.quotation_subject ,ComputingMilieux_LEGALASPECTSOFCOMPUTING ,Fake reviews ,Unfair competition ,Consumer protection ,Data science ,Task (project management) ,Classifier (linguistics) ,Quality (business) ,Language model ,Product (category theory) ,media_common - Abstract
Customers increasingly rely on reviews for product information. However, the usefulness of online reviews is impeded by fake reviews that give an untruthful picture of product quality. Therefore, detection of fake reviews is needed. Unfortunately, so far, automatic detection has only had partial success in this challenging task. In this research, we address the creation and detection of fake reviews. First, we experiment with two language models, ULMFiT and GPT-2, to generate fake product reviews based on an Amazon e-commerce dataset. Using the better model, GPT-2, we create a dataset for a classification task of fake review detection. We show that a machine classifier can accomplish this goal near-perfectly, whereas human raters exhibit significantly lower accuracy and agreement than the tested algorithms. The model was also effective on detected human generated fake reviews. The results imply that, while fake review detection is challenging for humans, “machines can fight machines” in the task of detecting fake reviews. Our findings have implications for consumer protection, defense of firms from unfair competition, and responsibility of review platforms.
- Published
- 2022
31. Analyzing Attitude of Second Screen Social Media Messages
- Author
-
Partha Mukherjee and Bernard J. Jansen
- Subjects
Data collection ,Computer Networks and Communications ,Computer science ,Event (computing) ,05 social sciences ,Intelligent decision support system ,Advertising ,Information needs ,02 engineering and technology ,Artificial Intelligence ,Phenomenon ,0502 economics and business ,0202 electrical engineering, electronic engineering, information engineering ,Second screen ,050211 marketing ,020201 artificial intelligence & image processing ,Social media ,Panel data - Abstract
We analyze more than 3,000,000 Twitter, 800,000 Instagram, and 50,000 Tumblr posts concerning a single major in-real-life event, Super Bowl XLIX, to determine attitude. We consider three event phases ( Pre , During , and Post ). Findings show link-based recommendations and undirected broadcast patterns positively correlate with attitude in the Pre and Post phases, respectively. The usage of these specific features highlights the differing information needs of viewers during these phases, specifically the sharing of information in the Pre phase and the sharing of opinions in the Post phase. The volume of postings indicates a negative attitude for all social media platforms, demonstrating that adverse information is more likely to be shared than positive information; this finding contradicts prior findings. This second screens phenomenon research is important in identifying the sharing information on multiple social media platforms during an in real life event.
- Published
- 2018
32. Are personas done? Evaluating their usefulness in the age of digital analytics
- Author
-
Soon-Gyo Jung, Joni Salminen, Jisun An, Bernard J. Jansen, and Haewoon Kwak
- Subjects
Computer science ,business.industry ,InformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.,HCI) ,lcsh:BF1-990 ,Context (language use) ,Persona ,Data-Driven Personas, Digital Analytics, Customer Insights, Digital Marketing, Marketing Automation ,Data science ,Customer base ,InformationSystems_MODELSANDPRINCIPLES ,lcsh:Psychology ,Customer analytics ,Market segmentation ,Analytics ,Criticism ,business ,Interpretability - Abstract
In this critique, we conceptually examine the use of personas in an age of availability of large-scale online analytics data. Based on the criticism and benefits outlined in prior work, we formulate the major arguments for and against the use of personas, analyze these arguments, and demonstrate areas for the productive employment of personas by leveraging digital analytics data in their creation. From our review of the prior literature and the given availability of online customer data, our key tenet is that personas are located between aggregated and individual statistics. At their best, personas capture the coverage of the customer base attributed to aggregated data representations while retaining the interpretability of individual-level analytics. Persona creation benefits from both novel computational techniques and data sources. To demonstrate this, we propose and implement automatically generated personas primarily based on quantitative data. We also review key persona validation issues and examine how these issues can be addressed with automated persona generation using real user data from online analytics platforms. Finally, we outline areas of future research in the persona domain within the field of digital marketing and advertising. Overall, to survive in the rapidly developing marketing industry and online customer analytics, personas must evolve by adopting new practices. There are implications for this evolution of personas in a variety of domains, including design, content creation, and digital marketing.
- Published
- 2018
33. Persona analytics: Analyzing the stability of online segments and content interests over time using non-negative matrix factorization
- Author
-
Soon-Gyo Jung, Shammur Absar Chowdhury, Bernard J. Jansen, and Joni Salminen
- Subjects
Data collection ,business.industry ,Computer science ,Big data ,General Engineering ,Persona ,Data science ,Computer Science Applications ,Consistency (database systems) ,Artificial Intelligence ,Analytics ,Publishing ,Leverage (statistics) ,Social media ,business - Abstract
Personified big data and rapidly developing data science techniques enable previously unforeseen methodological developments for longitudinal analysis of online audiences. Applying data-driven persona generation on online customer statistics from a real organizational social media channel, we demonstrate how personas can be deployed to understand online customer patterns over time. We conduct 32 monthly rounds of data collection of customer demographics and content consumption patterns on the YouTube channel of a major publishing organization posting thousands of items of content and then algorithmically generate 15 personas monthly. We analyze the data-driven persona for changes monthly, yearly, and lifetime (period). Results show an average 40% change in the personas, and 78% of the personas experience more change than consistency for topic interests. The implications are that organizations frequently publishing online content should employ automatic data collection and periodic persona creation to ensure their customer understanding is current. For this, algorithmic data-driven systems that leverage methods for persona creation are recommended.
- Published
- 2021
34. Too few, too many, just right: Creating the necessary number of segments for large online customer populations
- Author
-
Dianne Ramirez Robillos, Bernard J. Jansen, Soon-Gyo Jung, and Joni Salminen
- Subjects
Marketing ,education.field_of_study ,Computer Networks and Communications ,Computer science ,business.industry ,Population ,Persona ,Data science ,Automation ,Computer Science Applications ,Task (project management) ,Market segmentation ,Management of Technology and Innovation ,Segmentation ,Heuristics ,Construct (philosophy) ,education ,business - Abstract
We develop a framework to reduce the number of customer segments to the smallest quantity without losing essential information of the underlying population in the electronic marketplace. As a use case of this approach, we create personas for these segments to enhance customer understanding. We use (a) matrix factorization to identify customer behaviors and construct customer segments, (b) statistical heuristics to collapse into meaningful segments, and (c) automation to enrich by generating a persona profile for each segment. We evaluate our approach in a case study using more than 21 million online flight bookings of a major airline company resulting in a 57.5% decrease from 1194 to 507 segments, thereby reducing segment noise. Three customer mega-segments emerge: Behaviorally same – Demographically different, Behaviorally different - Demographically same, and Behaviorally and Demographically different. As one of the first efforts at the essential task of customer segmentation reduction for large customer populations, findings have implications for organizations desiring to employ segmentation and/or personas for enhanced customer understanding.
- Published
- 2021
35. Conversion potential: a metric for evaluating search engine advertising performance
- Author
-
Theresa B. Clarke and Bernard J. Jansen
- Subjects
Marketing ,Measure (data warehouse) ,Operations research ,business.industry ,Computer science ,media_common.quotation_subject ,05 social sciences ,Rank (computer programming) ,Online advertising ,Data set ,Originality ,0502 economics and business ,Premise ,050211 marketing ,Metric (unit) ,business ,Construct (philosophy) ,050203 business & management ,media_common - Abstract
Purpose This research is based on the premise that current metrics for search engine advertising (SEA) are misleading and do not sufficiently allow managers to evaluate traffic and conversions simultaneously. This study aimed to conceptually develop and assess conversion potential (CvP) as a unifying construct for both measuring and evaluating the performance of SEA campaigns. Design/methodology/approach A data set of nearly seven million records covering almost three years of a multi-million-dollar keyword marketing campaign from a major US retailer was used to validate the construct of CvP. Findings Results empirically validate how CvP measures both campaign traffic and sales in SEA, using the optimization factor of ad rank, which is one of many possible factors. Research limitations/implications Although the data set is large and covers a lengthy period of time, it is limited to one company in the retail sector. Practical implications The research instantiates CvP as a metric for overall SEA account performance while demonstrating that it is a practical tool for future campaign planning. The metric simultaneously incorporates a sales ratio and a traffic ratio. Originality/value This is the first study to formalize and provide a working definition of CvP in the academic literature. The contribution is a theoretical and practical managerial framework to mutually evaluate, measure and make decisions about SEA efforts.
- Published
- 2017
36. Computational Advertising: A Paradigm Shift for Advertising and Marketing?
- Author
-
Yanwu Yang, Mounia Lalmas, Yinghui Catherine Yang, and Bernard J. Jansen
- Subjects
Computer Networks and Communications ,business.industry ,Computer science ,05 social sciences ,Intelligent decision support system ,Umbrella term ,Advertising ,Compensation methods ,02 engineering and technology ,Online advertising ,Advertising research ,Advertising campaign ,Artificial Intelligence ,Paradigm shift ,0502 economics and business ,Search advertising ,0202 electrical engineering, electronic engineering, information engineering ,050211 marketing ,020201 artificial intelligence & image processing ,Marketing ,business - Abstract
The umbrella term "computational advertising" encompasses a spectrum of computational systems, technologies, and methods of advertising and promotional behaviors and decision-making activities. The guest editors of this special issue on computational advertising discuss the field in general and the highlighted articles in particular.
- Published
- 2017
37. Identifying and predicting the desire to help in social question and answering
- Author
-
Zhe Liu and Bernard J. Jansen
- Subjects
Service (business) ,Knowledge management ,Social connectedness ,Computer science ,Process (engineering) ,business.industry ,05 social sciences ,Internet privacy ,Social environment ,Context (language use) ,02 engineering and technology ,Library and Information Sciences ,Management Science and Operations Research ,Computer Science Applications ,Knowledge sharing ,Order (business) ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,020201 artificial intelligence & image processing ,0509 other social sciences ,Routing (electronic design automation) ,050904 information & library sciences ,business ,Information Systems - Abstract
Evaluate the effectiveness of question routing systems in the social Q&A process.Find that individuals are more willing to share their knowledge under question routing context whereas less connected.Build an effective model to automatically identify active knowledge sharers from non-shares using non-Q&A features from four dimensions: profile, posting behavior, language style, and social activities. The increasing volume of questions posted on social question and answering sites has triggered the development of question routing services. Most of these routing algorithms are able to recognize effectively individuals with the required knowledge to answer a specific question. However, just because people have the capability to answer a question, does not mean that they have the desire to help. In this research, we evaluate the practical performance of the question routing services in social context by analyzing the knowledge sharing behavior of users in social Q&A process in terms of their participation, interests, and connectedness. We collect questions and answers over a ten-month period from Wenwo, a major Chinese question routing service. Using 340,658 questions and 1,754,280 replies, findings reveal separate roles for knowledge sharers and consumers. Based on this finding, we identify knowledge sharers from non-sharers a priori in order to increase the response probabilities. We evaluate our model based on an analysis of 3006 Wenwo knowledge sharers and non-sharers. Our experimental results demonstrate knowledge sharer prediction based solely on non-Q&A features achieves a 70% success rate in accurately identifying willing respondents.
- Published
- 2017
38. Information Sharing by Viewers Via Second Screens for In-Real-Life Events
- Author
-
Partha Mukherjee and Bernard J. Jansen
- Subjects
Social network ,Computer Networks and Communications ,business.industry ,Computer science ,Information sharing ,Perspective (graphical) ,02 engineering and technology ,World Wide Web ,Identification (information) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Second screen ,In real life ,020201 artificial intelligence & image processing ,Social media ,business ,Affordance - Abstract
The use of second screen devices with social media facilitates conversational interaction concerning broadcast media events, creating what we refer to as the social soundtrack. In this research, we evaluate the change of the Super Bowl XLIX social soundtrack across three social media platforms on the topical categories of commercials, music, and game at three game phases ( Pre , During , and Post ). We perform statistical analysis on more than 3M, 800K, and 50K posts from Twitter, Instagram, and Tumblr, respectively. Findings show that the volume of posts in the During phase is fewer compared to Pre and Post phases; however, the hourly mean in the During phase is considerably higher than it is in the other two phases. We identify the predominant phase and category of interaction across all three social media sites. We also determine the significance of change in absolute scale across the Super Bowl categories (commercials, music, game) and in both absolute and relative scales across Super Bowl phases ( Pre , During , Post ) for the three social network platforms (Twitter, Tumblr, Instagram). Results show that significant phase-category relationships exist for all three social networks. The results identify the During phase as the predominant one for all three categories on all social media sites with respect to the absolute volume of conversations in a continuous scale. From the relative volume perspective, the During phase is highest for the music category for most social networks. For the commercials and game categories, however, the Post phase is higher than the During phase for Twitter and Instagram, respectively. Regarding category identification, the game category is the highest for Twitter and Instagram but not for Tumblr, which has dominant peaks for music and/or commercials in all three phases. It is apparent that different social media platforms offer various phase and category affordances. These results are important in identifying the influence that second screen technology has on information sharing across different social media platforms and indicates that the viewer role is transitioning from passive to more active.
- Published
- 2017
39. Viewed by too many or viewed too little: Using information dissemination for audience segmentation
- Author
-
Haewoon Kwak, Joni Salminen, Jisun An, Bernard J. Jansen, and Soon-Gyo Jung
- Subjects
General Computer Science ,Market segmentation ,Computer science ,0502 economics and business ,05 social sciences ,Information Dissemination ,050211 marketing ,0509 other social sciences ,Library and Information Sciences ,Audience segmentation ,050904 information & library sciences ,Data science ,Social media analytics - Published
- 2017
40. Detecting Toxicity Triggers in Online Discussions
- Author
-
Haewoon Kwak, Bernard J. Jansen, Joni Salminen, and Hind Almerekhi
- Subjects
Artificial neural network ,Computer science ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Social media ,02 engineering and technology ,Data science ,Formal description - Abstract
Despite the considerable interest in the detection of toxic comments, there has been little research investigating the causes -- i.e., triggers -- of toxicity. In this work, we first propose a formal definition of triggers of toxicity in online communities. We proceed to build an LSTM neural network model using textual features of comments, and then, based on a comprehensive review of previous literature, we incorporate topical and sentiment shift in interactions as features. Our model achieves an average accuracy of 82.5% of detecting toxicity triggers from diverse Reddit communities.
- Published
- 2019
41. The Effect of Smiling Pictures on Perceptions of Personas
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, Joni Salminen, and João M. Santos
- Subjects
Willingness to use ,Computer science ,Perception ,media_common.quotation_subject ,Credibility ,Similarity (psychology) ,Contrast (statistics) ,Persona ,Social psychology ,media_common - Abstract
We analyze the effect of a smile in personas pictures on persona perceptions, including credibility, likability, similarity, and willingness to use. We conduct an online experiment with 2,400 participants using a 16-item survey and multiple persona profile treatments of which half have a smiling photo and half do not. We find that persona profiles with a smiling photo result in an increase in perceived similarity with, likability of, and willingness to use the personas. In contrast, a smile does not increase the credibility of the personas. Our research has implications for the design of persona profiles and adds to previous findings of persona research that the picture choice influences individuals' persona perceptions in profound ways.
- Published
- 2019
42. Detecting Demographic Bias in Automatically Generated Personas
- Author
-
Joni Salminen, Bernard J. Jansen, and Soon-Gyo Jung
- Subjects
education.field_of_study ,Information retrieval ,Computer science ,business.industry ,05 social sciences ,Population ,020207 software engineering ,02 engineering and technology ,Persona ,Analytics ,0202 electrical engineering, electronic engineering, information engineering ,0501 psychology and cognitive sciences ,Objectivity (science) ,business ,education ,Raw data ,050107 human factors - Abstract
We investigate the existence of demographic bias in automatically generated personas by producing personas from YouTube Analytics data. Despite the intended objectivity of the methodology, we find elements of bias in the data-driven personas. The bias is highest when doing an exact match comparison, and the bias decreases when comparing at age or gender level. The bias also decreases when increasing the number of generated personas. For example, the smaller number of personas resulted in underrepresentation of female personas. This suggests that a higher number of personas gives a more balanced representation of the user population and a smaller number increases biases. Researchers and practitioners developing data-driven personas should consider the possibility of algorithmic bias, even unintentional, in their personas by comparing the personas against the underlying raw data.
- Published
- 2019
43. Creating Manageable Persona Sets from Large User Populations
- Author
-
Bernard J. Jansen, Soon-Gyo Jung, and Joni Salminen
- Subjects
User information ,Web analytics ,education.field_of_study ,InformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.,HCI) ,Computer science ,business.industry ,End user ,media_common.quotation_subject ,05 social sciences ,Population ,Big data ,020207 software engineering ,02 engineering and technology ,Persona ,World Wide Web ,InformationSystems_MODELSANDPRINCIPLES ,0202 electrical engineering, electronic engineering, information engineering ,Key (cryptography) ,ComputingMilieux_COMPUTERSANDSOCIETY ,0501 psychology and cognitive sciences ,Function (engineering) ,business ,education ,050107 human factors ,media_common - Abstract
Creating personas from actual online user information is an advantage of the data-driven persona approach. However, modern online systems often provide big data from millions of users that display vastly different behaviors, resulting in possibly thousands of personas representing the entire user population. We present a technique for reducing the number of personas to a smaller number that efficiently represents the complete user population, while being more manageable for end users of personas. We first isolate the key user behaviors and demographical attributes, creating thin personas, and we then apply an algorithmic cost function to collapse the set to the minimum needed to represent the whole population. We evaluate our approach on 26 million user records of a major international airline, isolating 1593 personas. Applying our approach, we collapse this number to 493, a 69% decrease in the number of personas. Our research findings have implications for organizations that have a large user population and desire to employ personas.
- Published
- 2019
44. Using Machine Learning to Predict Ranking of Webpages in the Gift Industry
- Author
-
Joni Salminen, Roope Marttila, Juan Corporan, Tommi Salenius, and Bernard J. Jansen
- Subjects
business.industry ,Computer science ,Rank (computer programming) ,Machine learning ,computer.software_genre ,Online advertising ,Ranking (information retrieval) ,Search engine ,Ranking ,Search engine optimization ,Web page ,Alternating decision tree ,Artificial intelligence ,Gradient boosting ,business ,computer - Abstract
We use machine learning to predict the search engine rank of webpages. We use a list of keywords for 30 content blogs of an e-commerce company in the gift industry to retrieve 733 content pages occupying the first-page Google rankings and predict their rank using 30 ranking factors. We test two models, Light Gradient Boosting Machine (LightGBM) and Extreme Gradient Boosted Decision Trees (XGBoost), finding that XGBoost performs better for predicting actual search rankings, with an average accuracy of 0.86. The feature analysis shows the most impactful features are (a) internal and external links, (b) security of the web domain, and (c) length of H3 headings, and the least impactful features are (a) keyword mentioned in domain address, (b) keyword mentioned in the H1 headings, and (c) overall number of keyword mentions in the text. The results highlight the persistent importance of links in search-engine optimization. We provide actionable insights for online marketers and content creators.
- Published
- 2019
45. Confusion Prediction from Eye-Tracking Data
- Author
-
Mridul Nagpal, Haewoon Kwak, Joni Salminen, Soon-Gyo Jung, Jisun An, and Bernard J. Jansen
- Subjects
business.industry ,Computer science ,05 social sciences ,Cognition ,02 engineering and technology ,Virtual reality ,Machine learning ,computer.software_genre ,Gaze ,050105 experimental psychology ,Random forest ,020204 information systems ,Fixation (visual) ,0202 electrical engineering, electronic engineering, information engineering ,medicine ,Eye tracking ,0501 psychology and cognitive sciences ,Artificial intelligence ,medicine.symptom ,business ,computer ,Confusion - Abstract
Predicting user confusion can help improve information presentation on websites, mobile apps, and virtual reality interfaces. One promising information source for such prediction is eye-tracking data about gaze movements on the screen. Coupled with think-aloud records, we explore if user's confusion is correlated with primarily fixation-level features. We find that random forest achieves an accuracy of more than 70% when prediction user confusion using only fixation features. In addition, adding user-level features (age and gender) improves the accuracy to more than 90%. We also find that balancing the classes before training improves performance. We test two balancing algorithms, Synthetic Minority Over Sampling Technique (SMOTE) and Adaptive Synthetic Sampling (ADASYN) finding that SMOTE provides a higher performance increase. Overall, this research contains implications for researchers interested in inferring users' cognitive states from eye-tracking data.
- Published
- 2019
46. Design Issues in Automatically Generated Persona Profiles
- Author
-
Soon-Gyo Jung, Joni Salminen, Sercan Şengün, and Bernard J. Jansen
- Subjects
InformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.,HCI) ,business.industry ,Process (engineering) ,Computer science ,05 social sciences ,Usability ,02 engineering and technology ,Persona ,Space (commercial competition) ,Information design ,InformationSystems_MODELSANDPRINCIPLES ,Data access ,Customer analytics ,Human–computer interaction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,ComputingMilieux_COMPUTERSANDSOCIETY ,0501 psychology and cognitive sciences ,business ,Think aloud protocol ,050107 human factors - Abstract
Increased access to data and computational techniques enable innovations in the space of automated customer analytics, for example, automatic persona generation. Automatic persona generation is the process of creating data-driven representations from user or customer statistics. Even though automatic persona generation is technically possible and provides advantages compared to manual persona creation regarding the speed and freshness of the personas, it is not clear (a) what information to include in the persona profiles and (b) how to display that information. To query into these aspects relating information design of personas, we conducted a user study with 38 participants. In the findings, we report several challenges relating to the design of automatically generated persona profiles, including usability issues, perceptual issues, and issues relating to information content. Our research has implications for the information design of data-driven personas.
- Published
- 2019
47. Automatic Persona Generation for Online Content Creators: Conceptual Rationale and a Research Agenda
- Author
-
Soon-Gyo Jung, Joni Salminen, Haewoon Kwak, Jisun An, and Bernard J. Jansen
- Subjects
World Wide Web ,Open research ,Customer analytics ,Analytics ,business.industry ,End user ,Computer science ,Profiling (information science) ,Social media ,Persona ,business ,Data mapping - Abstract
As the quantity of social and online analytics data has drastically increased, a wide variety of methods are deployed to make sense of this data, typically via computational and algorithmic approaches. However, in many cases, these approaches trade one form of complexity for another by ignoring the principles of human cognitive processing. In this perspective manuscript, we propose an approach of employing Personas as an alternative form of making large volumes of online user analytics information useful to end users of the user and customer analytics, with results applicable in software development, business sectors, communication industry, and other domains where understanding online user behavior is deemed important. Toward this end, we have developed a system that automatically generates data-driven Personas from social media and online analytics data, capable of handling hundreds of millions of user interactions from tens of thousands of pieces of content on YouTube, Facebook and Google Analytics, while retaining the privacy of individual users of those channels. Our approach (1) identifies and prioritizes user segments by their online behavior, (2) associates the segments with demographic data, and (3) creates rich Persona profiles by dynamically adding characteristics, such as names, photos, and descriptive quotes. This chapter characterizes the currently open research problems in automatic Persona generation, such as de-aggregation of data, cross-platform data mapping, filtering of toxic comments, and choosing the right information content according to end-user needs. Addressing these problems requires the use of state-of-the-art techniques of computer and information science within one system and benefits greatly from inter-disciplinary collaboration. Overall, the research agenda set in this work aims at achieving the vision for automatic user profiling using diverse online and social media platforms and advanced data processing methods for the end goal of making complex analytics data more useful for human decision makers, especially those working with online content.
- Published
- 2019
48. The Future of Data-driven Personas: A Marriage of Online Analytics Numbers and Human Attributes
- Author
-
Soon-Gyo Jung, Bernard J. Jansen, and Joni Salminen
- Subjects
Computer science ,business.industry ,05 social sciences ,Big data ,020207 software engineering ,02 engineering and technology ,Persona ,Data science ,Automation ,Data-driven ,Open research ,Market segmentation ,Analytics ,0202 electrical engineering, electronic engineering, information engineering ,0501 psychology and cognitive sciences ,business ,050107 human factors - Abstract
The massive volume of online analytics data about customers has led to novel opportunities for user segmentation. However, getting real value from data remains challenging for many organizations. One of the recent innovations in online analytics is data-driven persona generation that can be used to create high-quality human representations from online analytics data. This manuscript (a) summarizes the potential of data-driven persona generation for online analytics, (b) characterizes nine open research questions for data-driven persona generation, and (c) outlines a research agenda for making persona analytics more useful for decision makers.
- Published
- 2019
49. Associating Searching on Search Engines to Subsequent Searching on Sites
- Author
-
Adan Ortiz-Cordova and Bernard J. Jansen
- Subjects
Information Systems and Management ,Information retrieval ,Web search query ,Computer science ,business.industry ,Strategy and Management ,Search analytics ,05 social sciences ,Semantic search ,020207 software engineering ,02 engineering and technology ,Management Science and Operations Research ,Phrase search ,Organic search ,Management Information Systems ,World Wide Web ,Online search ,0202 electrical engineering, electronic engineering, information engineering ,Web search engine ,0509 other social sciences ,050904 information & library sciences ,Metasearch engine ,business ,Information Systems - Abstract
In this research study, the authors investigate the association between external searching, which is searching on a web search engine, and internal searching, which is searching on a website. They classify 295,571 external – internal searches where each search is composed of a search engine query that is submitted to a web search engine and then one or more subsequent queries submitted to a commercial website by the same user. The authors examine 891,453 queries from all searches, of which 295,571 were external search queries and 595,882 were internal search queries. They algorithmically classify all queries into states, and then clustered the searching episodes into major searching configurations and identify the most commonly occurring search patterns for both external, internal, and external-to-internal searching episodes. The research implications of this study are that external sessions and internal sessions must be considered as part of a continuous search episode and that online businesses can leverage external search information to more effectively target potential consumers.
- Published
- 2016
50. ASK: A taxonomy of accuracy, social, and knowledge information seeking posts in social question and answering
- Author
-
Zhe Liu and Bernard J. Jansen
- Subjects
Information Systems and Management ,Information retrieval ,Computer Networks and Communications ,Computer science ,Information seeking ,05 social sciences ,Social question ,Information needs ,02 engineering and technology ,Library and Information Sciences ,Syntax ,Ask price ,0202 electrical engineering, electronic engineering, information engineering ,Word usage ,020201 artificial intelligence & image processing ,Syntactic structure ,0509 other social sciences ,050904 information & library sciences ,Classifier (UML) ,Information Systems - Abstract
Many people turn to their social networks to find information through the practice of question and answering. We believe it is necessary to use different answering strategies based on the type of questions to accommodate the different information needs. In this research, we propose the ASK taxonomy that categorizes questions posted on social networking sites into three types according to the nature of the questioner's inquiry of accuracy, social, or knowledge. To automatically decide which answering strategy to use, we develop a predictive model based on ASK question types using question features from the perspectives of lexical, topical, contextual, and syntactic as well as answer features. By applying the classifier on an annotated data set, we present a comprehensive analysis to compare questions in terms of their word usage, topical interests, temporal and spatial restrictions, syntactic structure, and response characteristics. Our research results show that the three types of questions exhibited different characteristics in the way they are asked. Our automatic classification algorithm achieves an 83% correct labeling result, showing the value of the ASK taxonomy for the design of social question and answering systems.
- Published
- 2016
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.