13 results
Search Results
2. Using Tensor Completion Method to Achieving Better Coverage of Traffic State Estimation from Sparse Floating Car Data.
- Author
-
Ran, Bin, Song, Li, Zhang, Jian, Cheng, Yang, and Tan, Huachun
- Subjects
TRAFFIC engineering ,ESTIMATION theory ,PROBLEM solving ,STATISTICAL correlation ,MISSING data (Statistics) - Abstract
Traffic state estimation from the floating car system is a challenging problem. The low penetration rate and random distribution make available floating car samples usually cover part space and time points of the road networks. To obtain a wide range of traffic state from the floating car system, many methods have been proposed to estimate the traffic state for the uncovered links. However, these methods cannot provide traffic state of the entire road networks. In this paper, the traffic state estimation is transformed to solve a missing data imputation problem, and the tensor completion framework is proposed to estimate missing traffic state. A tensor is constructed to model traffic state in which observed entries are directly derived from floating car system and unobserved traffic states are modeled as missing entries of constructed tensor. The constructed traffic state tensor can represent spatial and temporal correlations of traffic data and encode the multi-way properties of traffic state. The advantage of the proposed approach is that it can fully mine and utilize the multi-dimensional inherent correlations of traffic state. We tested the proposed approach on a well calibrated simulation network. Experimental results demonstrated that the proposed approach yield reliable traffic state estimation from very sparse floating car data, particularly when dealing with the floating car penetration rate is below 1%. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
3. Windowed persistent homology: A topological signal processing algorithm applied to clinical obesity data.
- Author
-
Biwer, Craig, Rothberg, Amy, IglayReger, Heidi, Derksen, Harm, Burant, Charles F., and Najarian, Kayvan
- Subjects
OBESITY ,HOMOLOGY theory ,SIGNAL processing ,ALGORITHMS ,WEIGHT loss ,WEIGHT gain - Abstract
Overweight and obesity are highly prevalent in the population of the United States, affecting roughly 2/3 of Americans. These diseases, along with their associated conditions, are a major burden on the healthcare industry in terms of both dollars spent and effort expended. Volitional weight loss is attempted by many, but weight regain is common. The ability to predict which patients will lose weight and successfully maintain the loss versus those prone to regain weight would help ease this burden by allowing clinicians the ability to skip treatments likely to be ineffective. In this paper we introduce a new windowed approach to the persistent homology signal processing algorithm that, when paired with a modified, semimetric version of the Hausdorff distance, can differentiate the two groups where other commonly used methods fail. The novel approach is tested on accelerometer data gathered from an ongoing study at the University of Michigan. While most standard approaches to signal processing show no difference between the two groups, windowed persistent homology and the modified Hausdorff semimetric show a clear separation. This has significant implications for clinical decision making and patient care. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
4. Estimation of the shared mobility demand based on the daily regularity of the urban mobility and the similarity of individual trips
- Author
-
Veve, Cyril, Chiabaut, Nicolas, Laboratoire d'Ingénierie Circulation Transport (LICIT UMR TE ), and École Nationale des Travaux Publics de l'État (ENTPE)-Université de Lyon-Université Gustave Eiffel
- Subjects
Economics ,Shared mobility ,NEW YORK ,INDIVIDUAL TRIPS ,Social Sciences ,Transportation ,Geographical locations ,Cognition ,Customer base ,11. Sustainability ,TRAFIC ROUTIER ,Psychology ,Economic impact analysis ,ECOMOBILITE ,0303 health sciences ,Multidisciplinary ,Geography ,Applied Mathematics ,Simulation and Modeling ,05 social sciences ,Transportation Infrastructure ,COVOITURAGE ,GESTION DU TRAFIC ,Physical Sciences ,Engineering and Technology ,Medicine ,Algorithms ,Research Article ,Science ,Decision Making ,DEPLACEMENT URBAIN ,Human Geography ,Research and Analysis Methods ,Civil Engineering ,03 medical and health sciences ,MOBILITY DEMAND ,0502 economics and business ,Similarity (psychology) ,Cities ,030304 developmental biology ,Estimation ,MOBILITE ,050210 logistics & transportation ,Models, Statistical ,TRAITEMENT DES DONNEES ,Arithmetic ,Cognitive Psychology ,Biology and Life Sciences ,Environmental economics ,URBAN MOBILITY ,[INFO.INFO-MO]Computer Science [cs]/Modeling and Simulation ,United States ,Economic Analysis ,Roads ,Economic Impact Analysis ,North America ,Earth Sciences ,Human Mobility ,Cognitive Science ,TRIPS architecture ,Business ,VOLUME DE TRAFIC ,People and places ,Mathematics ,Neuroscience - Abstract
Even if shared mobility services are encouraged by transportation policies, they remain underused and inefficient transportation modes because they struggle to find their customer base. This paper aims to estimate the potential demand for such services by focusing on individual trips and determining the number of passengers who perform similar trips. Contrary to existing papers, this study focuses on the demand without assuming any specific shared mobility system. The experiment performed on data coming from New York City conducts to cluster more than 85% of the trips. Consequently, shared mobility services such as ride-sharing can find their customer base and, at a long time, to a significantly reduce the number of cars flowing in the city. After a detailed analysis, commonalities in the clusters are identified: regular patterns from one day to the next exist in shared mobility demand. This regularity makes it possible to anticipate the potential shared mobility demand to help transportation suppliers to optimize their operations.
- Published
- 2020
5. A leader-follower model for discrete competitive facility location problem under the partially proportional rule with a threshold
- Author
-
Wuyang Yu
- Subjects
0209 industrial biotechnology ,Computer science ,0211 other engineering and technologies ,Social Sciences ,02 engineering and technology ,Geographical locations ,020901 industrial engineering & automation ,Cognition ,Mississippi ,Chain (algebraic topology) ,Psychology ,Market share ,Workplace ,021103 operations research ,Multidisciplinary ,Economic Competition ,Applied Mathematics ,Simulation and Modeling ,Commerce ,Facility location problem ,Models, Economic ,Physical Sciences ,Florida ,Medicine ,Leader follower ,Algorithms ,Research Article ,Mathematical optimization ,Current (mathematics) ,Science ,Decision Making ,New York ,Models, Psychological ,Research and Analysis Methods ,Ranking Algorithms ,Humans ,Cognitive Psychology ,Biology and Life Sciences ,Consumer Behavior ,Pennsylvania ,Louisiana ,United States ,Leadership ,Ranking ,North America ,Cognitive Science ,People and places ,Mathematics ,Neuroscience - Abstract
When consumers are faced with the choice of competitive chain facilities that offer exclusive services, current rules do not properly describe the behavior pattern of these consumers. To eliminate the gap between the current rules and this kind of customers behavior pattern, the partially proportional rule with a threshold is proposed in this paper. A leader-follower model for discrete competitive facility location problem is established under the partially proportional rule with a threshold. Combining with the greedy strategy and the 2-opt strategy, a heuristical algorithm (GFA) is designed to solve the follower's problem. By embedding the algorithm (GFA), an improved ranking-based algorithm (IRGA) is proposed to solve the leader-follower model. Numerical tests show that the algorithm proposed in this paper can solve the leader-follower model for discrete competitive facility location problem effectively. The effects of different parameters on the market share captured by the leader firm and the follower firm are analyzed in detail using a quasi-real example. An interesting finding is that in some cases the leader firm does not have a first-mover advantage.
- Published
- 2019
6. A personalized channel recommendation and scheduling system considering both section video clips and full video clips
- Author
-
SeungGwan Lee and Daeho Lee
- Subjects
Computer science ,Section (typography) ,Video Recording ,Social Sciences ,lcsh:Medicine ,02 engineering and technology ,computer.software_genre ,Geographical locations ,Machine Learning ,Database and Informatics Methods ,Learning and Memory ,0202 electrical engineering, electronic engineering, information engineering ,Data Mining ,Psychology ,Computer Networks ,CLIPS ,lcsh:Science ,Statistical Data ,computer.programming_language ,Multidisciplinary ,Multimedia ,Applied Mathematics ,Simulation and Modeling ,IPTV ,Scheduling system ,Physical Sciences ,Information Retrieval ,020201 artificial intelligence & image processing ,Information Technology ,Algorithms ,Statistics (Mathematics) ,Research Article ,Communication channel ,Computer and Information Sciences ,Schedule ,Minnesota ,Broadcasting ,Research and Analysis Methods ,Computer Communication Networks ,Artificial Intelligence ,Learning ,Humans ,Internet ,business.industry ,Communications Media ,lcsh:R ,Cognitive Psychology ,Biology and Life Sciences ,020207 software engineering ,Models, Theoretical ,United States ,North America ,Cognitive Science ,lcsh:Q ,People and places ,business ,computer ,Mathematics ,Neuroscience - Abstract
With the convergence of various broadcasting systems, the amount of content available in mobile terminals including IPTV has significantly increased. In this paper, we propose a system that enables users to schedule programs considering both section video clips and full video clips based on the user detection method with similar preference. And, since the system constituting the contents can be classified according to the program, the proposed method can store a program desired by the user, and thus create and schedule a kind of individual channel. Experimental results show that the proposed method has a higher prediction accuracy; this is accomplished by comparing existing channel recommendation methods with the program recommendation methods proposed in this paper.
- Published
- 2018
7. Global and country-specific mainstreaminess measures: Definitions, analysis, and usage for improving personalized music recommendation systems.
- Author
-
Bauer, Christine and Schedl, Markus
- Abstract
Relevance: Popularity-based approaches are widely adopted in music recommendation systems, both in industry and research. These approaches recommend to the target user what is currently popular among all users of the system. However, as the popularity distribution of music items typically is a long-tail distribution, popularity-based approaches to music recommendation fall short in satisfying listeners that have specialized music preferences far away from the global music mainstream. Addressing this gap, the contribution of this article is three-fold. Definition of mainstreaminess measures: First, we provide several quantitative measures describing the proximity of a user’s music preference to the music mainstream. Assuming that there is a difference between the global music mainstream and a country-specific one, we define the measures at two levels: relating a listener’s music preferences to the global music preferences of all users, or relating them to music preferences of the user’s country. To quantify such music preferences, we define a music item’s popularity in terms of artist playcounts (APC) and artist listener counts (ALC). Moreover, we adopt a distribution-based and a rank-based approach as means to decrease bias towards the head of the long-tail distribution. This eventually results in a framework of 6 measures to quantify music mainstream. Differences between countries with respect to music mainstream: Second, we perform in-depth quantitative and qualitative studies of music mainstream in that we (i) analyze differences between countries in terms of their level of mainstreaminess, (ii) uncover both positive and negative outliers (substantially higher and lower country-specific popularity, respectively, compared to the global mainstream), analyzing these with a mixed-methods approach, and (iii) investigate differences between countries in terms of listening preferences related to popular music artists. We conduct our studies and experiments using the standardized LFM-1b dataset, from which we analyze about 800,000,000 listening events shared by about 53,000 users (from 47 countries) of the music streaming platform Last.fm. We show that there are substantial country-specific differences in listeners’ music consumption behavior with respect to the most popular artists listened to. Rating prediction experiments: Third, we demonstrate the applicability of our study results to improve music recommendation systems. To this end, we conduct rating prediction experiments in which we tailor recommendations to a user’s level of preference for the music mainstream using the proposed 6 mainstreaminess measures: defined by a distribution-based or rank-based approach, defined on a global level or on a country level (for the user’s country), and for APC or ALC. Our approach roughly equals a hybrid recommendation approach in which a demographic filtering strategy is implemented before collaborative filtering is performed. Results suggest that, in terms of rating prediction accuracy, each of the presented mainstreaminess definitions has its merits. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
8. Fuel shortages during hurricanes: Epidemiological modeling and optimal control
- Author
-
Sirish Namilae, Dahai Liu, Sabique Islam, and Richard J. Prazenica
- Subjects
0301 basic medicine ,0209 industrial biotechnology ,Operations research ,Economics ,Social Sciences ,Economic shortage ,02 engineering and technology ,Shortages ,Systems Science ,Geographical locations ,020901 industrial engineering & automation ,Sociology ,Per capita ,Medicine and Health Sciences ,Resource Management ,Public and Occupational Health ,Materials ,Multidisciplinary ,Emergency management ,Covariance ,Cyclonic Storms ,Applied Mathematics ,Simulation and Modeling ,Resource constraints ,Social Communication ,Vaccination and Immunization ,Southeastern United States ,Dynamical Systems ,Social Networks ,Physical Sciences ,Florida ,Medicine ,Engineering and Technology ,Kalman Filter ,Gasoline ,Algorithms ,Network Analysis ,Research Article ,Computer and Information Sciences ,Science ,Materials Science ,Immunology ,Disaster Planning ,Fuels ,Research and Analysis Methods ,03 medical and health sciences ,Humans ,Landfall ,Estimation ,business.industry ,Biology and Life Sciences ,Random Variables ,Optimal control ,Probability Theory ,United States ,Communications ,Energy and Power ,030104 developmental biology ,North America ,Environmental science ,Preventive Medicine ,People and places ,business ,Epidemic model ,Social Media ,Mathematics - Abstract
Hurricanes are powerful agents of destruction with significant socioeconomic impacts. A persistent problem due to the large-scale evacuations during hurricanes in the southeastern United States is the fuel shortages during the evacuation. Computational models can aid in emergency preparedness and help mitigate the impacts of hurricanes. In this paper, we model the hurricane fuel shortages using the SIR epidemic model. We utilize the crowd-sourced data corresponding to Hurricane Irma and Florence to parametrize the model. An estimation technique based on Unscented Kalman filter (UKF) is employed to evaluate the SIR dynamic parameters. Finally, an optimal control approach for refueling based on a vaccination analogue is presented to effectively reduce the fuel shortages under a resource constraint. We find the basic reproduction number corresponding to fuel shortages in Miami during Hurricane Irma to be 3.98. Using the control model we estimated the level of intervention needed to mitigate the fuel-shortage epidemic. For example, our results indicate that for Naples- Fort Myers affected by Hurricane Irma, a per capita refueling rate of 0.1 for 2.2 days would have reduced the peak fuel shortage from 55% to 48% and a refueling rate of 0.75 for half a day before landfall would have reduced to 37%.
- Published
- 2019
9. Using meta-predictions to identify experts in the crowd when past performance is unknown
- Author
-
Marcellin Martinie, Piers D. L. Howe, and Tom Wilkening
- Subjects
Computer science ,Social Sciences ,Surveys ,computer.software_genre ,Geographical locations ,Mathematical and Statistical Techniques ,Sociology ,Psychology ,050207 economics ,Schools ,050208 finance ,Multidisciplinary ,Applied Mathematics ,Simulation and Modeling ,Experimental Design ,Physics ,Statistics ,05 social sciences ,Sports Science ,Research Design ,Physical Sciences ,Medicine ,Algorithms ,Research Article ,Sports ,Leverage (finance) ,Science ,Decision Making ,Research and Analysis Methods ,Machine learning ,Education ,0502 economics and business ,Humans ,Leverage (statistics) ,Statistical Methods ,Sound Waves ,Behavior ,Survey Research ,business.industry ,Probabilistic logic ,Biology and Life Sciences ,Acoustics ,United States ,North America ,Recreation ,Artificial intelligence ,People and places ,business ,computer ,Mathematics ,Forecasting - Abstract
A common approach to improving probabilistic forecasts is to identify and leverage the forecasts from experts in the crowd based on forecasters' performance on prior questions with known outcomes. However, such information is often unavailable to decision-makers on many forecasting problems, and thus it can be difficult to identify and leverage expertise. In the current paper, we propose a novel algorithm for aggregating probabilistic forecasts using forecasters' meta-predictions about what other forecasters will predict. We test the performance of an extremised version of our algorithm against current forecasting approaches in the literature and show that our algorithm significantly outperforms all other approaches on a large collection of 500 binary decision problems varying in five levels of difficulty. The success of our algorithm demonstrates the potential of using meta-predictions to leverage latent expertise in environments where forecasters' expertise cannot otherwise be easily identified.
- Published
- 2020
10. Global and country-specific mainstreaminess measures: Definitions, analysis, and usage for improving personalized music recommendation systems
- Author
-
Bauer, Christine and Schedl, Markus
- Subjects
music recommendation ,FOS: Computer and information sciences ,country-specific differences ,Databases, Factual ,country ,Culture ,Social Sciences ,Geographical Locations ,Japan ,Sociology ,Animal Cells ,Medicine and Health Sciences ,Psychology ,recommender system ,music preferences ,Applied Mathematics ,Simulation and Modeling ,Computer Science - Social and Information Networks ,Music Perception ,Dewey Decimal Classification -- Computer science, information & general works (0) ,Physical Sciences ,Medicine ,Sensory Perception ,mainstreaminess ,Cellular Types ,Information Retrieval (cs.IR) ,Brazil ,Algorithms ,Research Article ,Asia ,Science ,Immune Cells ,Immunology ,Antigen-Presenting Cells ,Research and Analysis Methods ,Computer Science - Information Retrieval ,Clustering Algorithms ,Humans ,personalization ,Social and Information Networks (cs.SI) ,Internet ,Behavior ,Music Cognition ,Dewey-Dezimalklassifikation -- Informatik, Informationswissenschaft, allgemeine Werke (0) ,Cognitive Psychology ,Biology and Life Sciences ,Cell Biology ,South America ,United States ,culture ,People and Places ,North America ,Cognitive Science ,Social Media ,Music ,Mathematics ,Neuroscience - Abstract
Popularity-based approaches are widely adopted in music recommendation systems, both in industry and research. However, as the popularity distribution of music items typically is a long-tail distribution, popularity-based approaches to music recommendation fall short in satisfying listeners that have specialized music. The contribution of this article is three-fold. We provide several quantitative measures describing the proximity of a user's music preference to the music mainstream. We define the measures at two levels: relating a listener's music preferences to the global music preferences of all users, or relating them to music preferences of the user's country. Moreover, we adopt a distribution-based and a rank-based approach as means to decrease bias towards the head of the long-tail distribution. We analyze differences between countries in terms of their level of mainstreaminess, uncover both positive and negative outliers (substantially higher and lower country-specific popularity, respectively, compared to the global mainstream), and investigate differences between countries in terms of listening preferences related to popular music artists. We use the standardized LFM-1b dataset, from which we analyze about 8 million listening events shared by about 53,000 users (from 47 countries) of the music streaming platform Last.fm. We show that there are substantial country-specific differences in listeners' music consumption behavior with respect to the most popular artists listened to. We conduct rating prediction experiments in which we tailor recommendations to a user's level of preference for the music mainstream using the proposed 6 mainstreaminess measures. Results suggest that, in terms of rating prediction accuracy, each of the presented mainstreaminess definitions has its merits., Comment: 36 pages, 4 figures, 10 tables, PLOS ONE 14(6), paper e0217389
- Published
- 2018
11. Windowed persistent homology: A topological signal processing algorithm applied to clinical obesity data
- Author
-
Kayvan Najarian, Amy E. Rothberg, Harm Derksen, Craig Biwer, Heidi B. IglayReger, and Charles F. Burant
- Subjects
Michigan ,Computer science ,Physiology ,Economics ,Entropy ,Social Sciences ,Health Care Sector ,lcsh:Medicine ,02 engineering and technology ,Overweight ,computer.software_genre ,01 natural sciences ,Geographical locations ,Weight loss ,0202 electrical engineering, electronic engineering, information engineering ,Medicine and Health Sciences ,Entropy (energy dispersal) ,lcsh:Science ,education.field_of_study ,Multidisciplinary ,Entropy (statistical thermodynamics) ,Applied Mathematics ,Simulation and Modeling ,Physics ,Hausdorff space ,Signal Processing, Computer-Assisted ,Physiological Parameters ,Physical Sciences ,Signal processing algorithms ,Engineering and Technology ,Thermodynamics ,medicine.symptom ,Algorithms ,Research Article ,Population ,Machine learning ,Research and Analysis Methods ,Entropy (classical thermodynamics) ,Health Economics ,Weight Loss ,medicine ,Entropy (information theory) ,Humans ,Obesity ,0101 mathematics ,education ,Entropy (arrow of time) ,Persistent homology ,business.industry ,010102 general mathematics ,Body Weight ,lcsh:R ,Biology and Life Sciences ,020206 networking & telecommunications ,medicine.disease ,United States ,Health Care ,Hausdorff distance ,Signal Processing ,North America ,lcsh:Q ,Artificial intelligence ,People and places ,Electronics ,Accelerometers ,business ,computer ,Mathematics ,Entropy (order and disorder) - Abstract
Overweight and obesity are highly prevalent in the population of the United States, affecting roughly 2/3 of Americans. These diseases, along with their associated conditions, are a major burden on the healthcare industry in terms of both dollars spent and effort expended. Volitional weight loss is attempted by many, but weight regain is common. The ability to predict which patients will lose weight and successfully maintain the loss versus those prone to regain weight would help ease this burden by allowing clinicians the ability to skip treatments likely to be ineffective. In this paper we introduce a new windowed approach to the persistent homology signal processing algorithm that, when paired with a modified, semimetric version of the Hausdorff distance, can differentiate the two groups where other commonly used methods fail. The novel approach is tested on accelerometer data gathered from an ongoing study at the University of Michigan. While most standard approaches to signal processing show no difference between the two groups, windowed persistent homology and the modified Hausdorff semimetric show a clear separation. This has significant implications for clinical decision making and patient care.
- Published
- 2017
12. Using Tensor Completion Method to Achieving Better Coverage of Traffic State Estimation from Sparse Floating Car Data
- Author
-
Yang Cheng, Jian Zhang, Huachun Tan, Li Song, and Bin Ran
- Subjects
Computer science ,Aviation ,Intelligence ,lcsh:Medicine ,Social Sciences ,Transportation ,02 engineering and technology ,Geographical locations ,Mathematical and Statistical Techniques ,0202 electrical engineering, electronic engineering, information engineering ,Range (statistics) ,Computer Science::Networking and Internet Architecture ,Psychology ,lcsh:Science ,Intelligent transportation system ,Principal Component Analysis ,Multidisciplinary ,geography.geographical_feature_category ,Applied Mathematics ,Simulation and Modeling ,05 social sciences ,Floating car data ,Transportation Infrastructure ,Physical Sciences ,Engineering and Technology ,020201 artificial intelligence & image processing ,Algorithm ,Algorithms ,Statistics (Mathematics) ,Network analysis ,Research Article ,Optimization ,Computer and Information Sciences ,Research and Analysis Methods ,Civil Engineering ,Wisconsin ,0502 economics and business ,Computer Simulation ,Tensor ,Statistical Methods ,Traffic generation model ,050210 logistics & transportation ,geography ,business.industry ,lcsh:R ,Cognitive Psychology ,Biology and Life Sciences ,Computing Methods ,United States ,Roads ,ComputerSystemsOrganization_MISCELLANEOUS ,Multivariate Analysis ,North America ,Cognitive Science ,lcsh:Q ,State (computer science) ,People and places ,business ,Automobiles ,Mathematics ,Water well ,Neuroscience - Abstract
Traffic state estimation from the floating car system is a challenging problem. The low penetration rate and random distribution make available floating car samples usually cover part space and time points of the road networks. To obtain a wide range of traffic state from the floating car system, many methods have been proposed to estimate the traffic state for the uncovered links. However, these methods cannot provide traffic state of the entire road networks. In this paper, the traffic state estimation is transformed to solve a missing data imputation problem, and the tensor completion framework is proposed to estimate missing traffic state. A tensor is constructed to model traffic state in which observed entries are directly derived from floating car system and unobserved traffic states are modeled as missing entries of constructed tensor. The constructed traffic state tensor can represent spatial and temporal correlations of traffic data and encode the multi-way properties of traffic state. The advantage of the proposed approach is that it can fully mine and utilize the multi-dimensional inherent correlations of traffic state. We tested the proposed approach on a well calibrated simulation network. Experimental results demonstrated that the proposed approach yield reliable traffic state estimation from very sparse floating car data, particularly when dealing with the floating car penetration rate is below 1%.
- Published
- 2016
13. Avian Influenza Risk Surveillance in North America with Online Media
- Author
-
Lauren Yee and Colin Robertson
- Subjects
Epidemiology ,lcsh:Medicine ,Social Sciences ,Disease Outbreaks ,Animal Diseases ,0302 clinical medicine ,Sociology ,Zoonoses ,Medicine and Health Sciences ,030212 general & internal medicine ,lcsh:Science ,Statistical Data ,Multidisciplinary ,Warning system ,Animal Behavior ,Social Communication ,Geography ,Infectious Diseases ,Data extraction ,Social Networks ,Veterinary Diseases ,Vertebrates ,Physical Sciences ,The Internet ,Algorithms ,Network Analysis ,Statistics (Mathematics) ,Research Article ,Avian Influenza ,Computer and Information Sciences ,Infectious Disease Control ,030231 tropical medicine ,Twitter ,Context (language use) ,Disease Surveillance ,Digital media ,Birds ,03 medical and health sciences ,Animal Influenza ,Animals ,Social media ,Internet ,Behavior ,Operationalization ,business.industry ,lcsh:R ,Organisms ,Biology and Life Sciences ,Data science ,Communications ,Risk perception ,Influenza in Birds ,Infectious Disease Surveillance ,North America ,Amniotes ,lcsh:Q ,Animal Migration ,Veterinary Science ,business ,Social Media ,Zoology ,Mathematics - Abstract
The use of Internet-based sources of information for health surveillance applications has increased in recent years, as a greater share of social and media activity happens through online channels. The potential surveillance value in online sources of information about emergent health events include early warning, situational awareness, risk perception and evaluation of health messaging among others. The challenge in harnessing these sources of data is the vast number of potential sources to monitor and developing the tools to translate dynamic unstructured content into actionable information. In this paper we investigated the use of one social media outlet, Twitter, for surveillance of avian influenza risk in North America. We collected AI-related messages over a five-month period and compared these to official surveillance records of AI outbreaks. A fully automated data extraction and analysis pipeline was developed to acquire, structure, and analyze social media messages in an online context. Two methods of outbreak detection; a static threshold and a cumulative-sum dynamic threshold; based on a time series model of normal activity were evaluated for their ability to discern important time periods of AI-related messaging and media activity. Our findings show that peaks in activity were related to real-world events, with outbreaks in Nigeria, France and the USA receiving the most attention while those in China were less evident in the social media data. Topic models found themes related to specific AI events for the dynamic threshold method, while many for the static method were ambiguous. Further analyses of these data might focus on quantifying the bias in coverage and relation between outbreak characteristics and detectability in social media data. Finally, while the analyses here focused on broad themes and trends, there is likely additional value in developing methods for identifying low-frequency messages, operationalizing this methodology into a comprehensive system for visualizing patterns extracted from the Internet, and integrating these data with other sources of information such as wildlife, environment, and agricultural data.
- Published
- 2016
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.