20 results on '"Peng, Wen-Chih"'
Search Results
2. Mining and clustering mobility evolution patterns from social media for urban informatics.
- Author
-
Chen, Chien-Cheng, Chiang, Meng-Fen, and Peng, Wen-Chih
- Subjects
DATA mining ,INFORMATION science ,SOCIAL media ,CITIES & towns ,ALGORITHMS - Abstract
In this paper, given a set of check-in data, we aim at discovering representative daily movement behavior of users in a city. For example, daily movement behavior on a weekday may show users moving from one to another spatial region associated with time information. Since check-in data contain both spatial and temporal information, we propose a mobility evolution pattern to capture the daily movement behavior of users in a city. Furthermore, given a set of daily mobility evolution patterns, we formulate their similarity distances and then discover representative mobility evolution patterns via the clustering process. Representative mobility evolution patterns are able to infer major movement behavior in a city, which could bring some valuable knowledge for urban planning. Specifically, mobility evolution patterns consist of segments with the spatial region distribution and the corresponding time interval. To measure good segmentation from a set of check-in data, we formulate the problem of mining evolution patterns as a compression problem. In particular, we compute the representation length of the patterns based on the Minimum Description Length principle. Since the number of daily mobility evolution patterns is huge, we further cluster the daily mobility evolution patterns into groups and discover representative patterns. Note that we use the concept of locality-sensitive hashing to accelerate the cluster performance. To evaluate our proposed algorithms, we conducted experiments on the Gowalla and Brightkite datasets, and the experimental results show the effectiveness and efficiency of our proposed algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
3. Associative Classification for Human Activity Inference on Smart Phones.
- Author
-
Peng, Yu-Hsiang, Njoo, Gunarto Sindoro, Li, Shou-Chun, and Peng, Wen-Chih
- Published
- 2014
- Full Text
- View/download PDF
4. Mining Correlation Patterns among Appliances in Smart Home Environment.
- Author
-
Chen, Yi-Cheng, Chen, Chien-Chih, Peng, Wen-Chih, and Lee, Wang-Chien
- Published
- 2014
- Full Text
- View/download PDF
5. Dynamic Circle Recommendation: A Probabilistic Model.
- Author
-
Chou, Fan-Kai, Chiang, Meng-Fen, Chen, Yi-Cheng, and Peng, Wen-Chih
- Published
- 2014
- Full Text
- View/download PDF
6. Discovering pattern-aware routes from trajectories.
- Author
-
Wei, Ling-Yin, Chang, Kai-Ping, and Peng, Wen-Chih
- Subjects
PATTERNS (Mathematics) ,GLOBAL Positioning System ,COMPUTER users ,ROAD maps ,DATA analysis - Abstract
With the prevalence of GPS-equipped devices and navigation services, users can record and share their driving movements via trajectories. These trajectories reveal users' driving behaviors for planning routes. In this paper, we propose a novel pattern-aware route discovery framework that considers users' preferred routes. The proposed framework is comprised of two components: pattern-aware road map generation and route planning. In the first component, we mine significant road segments from historical trajectories, and generate a pattern-aware road map. We design a route score function that strikes a balance between user preference degrees and the length of the route. For the second component, given a source, a destination, and a user pre-defined value k, we intend to derive the top- k routes that consist of road segments from the source to the destination in the pattern-aware road map. To support on-line route planning in most navigation services, we propose a constrained breadth-first-search (CBFS) algorithm. We evaluate the performance of our framework using real trajectory data, and compare our framework with an existing approach in terms of effectiveness and efficiency. The experimental results demonstrate the effectiveness and efficiency of our proposed framework. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
7. Clustering and aggregating clues of trajectories for mining trajectory patterns and routes.
- Author
-
Hung, Chih-Chieh, Peng, Wen-Chih, and Lee, Wang-Chien
- Abstract
In this paper, we propose a new trajectory pattern mining framework, namely Clustering and Aggregating Clues of Trajectories (CACT), for discovering trajectory routes that represent the frequent movement behaviors of a user. In addition to spatial and temporal biases, we observe that trajectories contain silent durations, i.e., the time durations when no data points are available to describe the movements of users, which bring many challenging issues to trajectory pattern mining. We claim that a movement behavior would leave some clues in its various sampled/observed trajectories. These clues may be extracted from spatially and temporally co-located data points from the observed trajectories. Based on this observation, we propose clue-aware trajectory similarity to measure the clues between two trajectories. Accordingly, we further propose the clue-aware trajectory clustering algorithm to cluster similar trajectories into groups to capture the movement behaviors of the user. Finally, we devise the clue-aware trajectory aggregation algorithm to aggregate trajectories in the same group to derive the corresponding trajectory pattern and route. We validate our ideas and evaluate the proposed CACT framework by experiments using both synthetic and real datasets. The experimental results show that CACT is more effective in discovering trajectory patterns than the state-of-the-art techniques for mining trajectory patterns. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
8. A Hybrid Prediction Algorithm for Traffic Speed Prediction.
- Author
-
Huang, Bo-Wei, Wang, Kun-Wei, Wei, Ling-Yin, and Peng, Wen-Chih
- Published
- 2013
- Full Text
- View/download PDF
9. Mining Usage Traces of Mobile Apps for Dynamic Preference Prediction.
- Author
-
Liao, Zhung-Xun, Peng, Wen-Chih, and Yu, Philip S.
- Published
- 2013
- Full Text
- View/download PDF
10. Mining Appliance Usage Patterns in Smart Home Environment.
- Author
-
Chen, Yi-Cheng, Ko, Yu-Lun, Peng, Wen-Chih, and Lee, Wang-Chien
- Published
- 2013
- Full Text
- View/download PDF
11. Clustering Clues of Trajectories for Discovering Frequent Movement Behaviors.
- Author
-
Hung, Chih-Chieh, Wei, Ling-Yin, and Peng, Wen-Chih
- Published
- 2012
- Full Text
- View/download PDF
12. Exploring Spatio-Temporal Features for Traffic Estimation on Road Networks.
- Author
-
Wei, Ling-Yin, Peng, Wen-Chih, Lin, Chun-Shuo, and Jung, Chen-Hen
- Abstract
In this paper, given a query that indicates a query road segment and a query time, we intend to accurately estimate the traffic status (i.e., the driving speed) on the query road segment at the query time from traffic databases. Note that a traffic behavior in the same time usually reflects similar patterns (referring to the temporal feature), and nearby road segments have the similar traffic behaviors (referring to the spatial feature). By exploring the temporal and spatial features, more GPS data points are retrieved. In light of these GPS data retrieved, we exploit the weighted moving average approach to estimate traffic status on road networks. Experimental results show the effectiveness of our proposed algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
13. Clustering Data Streams in Optimization and Geography Domains.
- Author
-
Wei, Ling-Yin and Peng, Wen-Chih
- Abstract
In this paper, we formulate a dual clustering problem in spatial data streams. A spatial data stream consists of data points with attributes in the optimization and geography domains. We aim at partitioning these objects into disjoint clusters such that at each time window (1) objects in the same cluster satisfy the transitively r-connected relation in the optimization and geography domains, and (2) the number of clusters is as minimal as possible. We propose a Hierarchical-Based Clustering algorithm (HBC). Specifically, objects are represented as a graph structure, called RGraph, where each node represents an object and edges indicate their similarity relationships. In light of RGraph, algorithm HBC iteratively merges clusters. Experimental results show the performance of the algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
14. Efficient Joint Clustering Algorithms in Optimization and Geography Domains.
- Author
-
Lo, Chia-Hao and Peng, Wen-Chih
- Abstract
Prior works have elaborated on the problem of joint clustering in the optimization and geography domains. However, prior works neither clearly specify the connected constraint in the geography domain nor propose efficient algorithms. In this paper, we formulate the joint clustering problem in which a connected constraint and the number of clusters should be specified. We propose an algorithm K-means with Local Search (abbreviated as KLS) to solve the joint clustering problem with the connected constraint. Experimental results show that KLS can find correct clusters efficiently. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
15. An incremental algorithm for clustering spatial data streams: exploring temporal locality.
- Author
-
Wei, Ling-Yin and Peng, Wen-Chih
- Subjects
ALGORITHMS ,DOCUMENT clustering ,SENSOR networks ,DATA mining ,TRAFFIC monitoring ,GEOGRAPHIC information systems - Abstract
Clustering sensor data discovers useful information hidden in sensor networks. In sensor networks, a sensor has two types of attributes: a geographic attribute (i.e, its spatial location) and non-geographic attributes (e.g., sensed readings). Sensor data are periodically collected and viewed as spatial data streams, where a spatial data stream consists of a sequence of data points exhibiting attributes in both the geographic and non-geographic domains. Previous studies have developed a dual clustering problem for spatial data by considering similarity-connected relationships in both geographic and non-geographic domains. However, the clustering processes in stream environments are time-sensitive because of frequently updated sensor data. For sensor data, the readings from one sensor are similar for a period, and the readings refer to temporal locality features. Using the temporal locality features of the sensor data, this study proposes an incremental clustering (IC) algorithm to discover clusters efficiently. The IC algorithm comprises two phases: cluster prediction and cluster refinement. The first phase estimates the probability of two sensors belonging to a cluster from the previous clustering results. According to the estimation, a coarse clustering result is derived. The cluster refinement phase then refines the coarse result. This study evaluates the performance of the IC algorithm using synthetic and real datasets. Experimental results show that the IC algorithm outperforms exiting approaches confirming the scalability of the IC algorithm. In addition, the effect of temporal locality features on the IC algorithm is analyzed and thoroughly examined in the experiments. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
16. Exploring heterogeneous information networks and random walk with restart for academic search.
- Author
-
Chiang, Meng-Fen, Liou, Jiun-Jiue, Wang, Jen-Liang, Peng, Wen-Chih, and Shan, Man-Kwan
- Subjects
INFORMATION networks ,RANDOM walks ,QUERYING (Computer science) ,INFORMATION storage & retrieval systems ,GRAPH theory ,ELECTRONIC information resource searching ,ALGORITHMS - Abstract
In this paper, we explore heterogenous information networks in which each vertex represents one entity and the edges reflect linkage relationships. Heterogenous information networks contain vertices of several entity types, such as papers, authors and terms, and hence can fully reflect multiple linkage relationships among different entities. Such a heterogeneous information network is similar to a mixed media graph (MMG). By representing a bibliographic dataset as an MMG, the performance obtained when searching relevant entities (e.g., papers) can be improved. Furthermore, our academic search enables multiple-entity search, where a variety of entity search results are provided, such as relevant papers, authors and conferences, via a one-time query. Explicitly, given a bibliographic dataset, we propose a Global-MMG, in which a global heterogeneous information network is built. When a user submits a query keyword, we perform a random walk with restart (RWR) to retrieve papers or other types of entity objects. To reduce the query response time, algorithm Net-MMG (standing for NetClus-based MMG) is developed. Algorithm Net-MMG first divides a heterogeneous information network into a collection of sub-networks. Afterward, the Net-MMG performs a RWR on a set of selected relevant sub-networks. We implemented our academic search and conducted extensive experiments using the ACM Digital Library. The experimental results show that by exploring heterogeneous information networks and RWR, both the Global-MMG and Net-MMG achieve better search quality compared with existing academic search services. In addition, the Net-MMG has a shorter query response time while still guaranteeing good quality in search results. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
17. QS-STT: QuadSection clustering and spatial-temporal trajectory model for location prediction.
- Author
-
Lei, Po-Ruey, Li, Shou-Chung, and Peng, Wen-Chih
- Subjects
LOCATION-based services ,PREDICTION theory ,DATA mining ,FEATURE extraction ,ALGORITHMS ,PROBABILITY theory - Abstract
Location prediction is a crucial need for location-aware services and applications. Given an object's recent movement and a future time, the goal of location prediction is to predict the location of the object at the future time specified. Different from traditional location prediction using motion function, some research works have elaborated on mining movement behavior from historical trajectories for location prediction. Without loss of generality, given a set of trajectories of an object, prior works on mining movement behaviors will first extract regions of popularity, in which the object frequently appears, and then discover the sequential relationships among regions. However, the quality of the frequent regions extracted affects the accuracy of the location prediction. Furthermore, trajectory data has both spatial and temporal information. To further enhance the accuracy of location prediction, one could utilize not only spatial information but also temporal information to predict the locations of objects. In this paper, we propose a framework QS-STT (standing for QuadSection clustering and Spatial-Temporal Trajectory model) to capture the movement behaviors of objects for location prediction. Specifically, we have developed QuadSection clustering to extract a reasonable and near-optimal set of frequent regions. Then, based on the set of frequent regions, we propose a spatial-temporal trajectory model to explore the object's movement behavior as a probabilistic suffix tree with both spatial and temporal information of movements. Note that STT is not only able to discover sequential relationships among regions but also derives the corresponding probabilities of time, indicating when the object appears in each region. Based on STT, we further propose an algorithm to traverse STT for location prediction. By enhancing the quality of the frequent region extracted and exploring both the spatial and temporal information of STT, the accuracy of location prediction in QS-STT is improved. QS-STT is designed for individual location prediction. For verifying the effectiveness of QS-STT for location prediction under the different spatial density, we have conducted experiments on four types of real trajectory datasets with different speed. The experimental results show that our proposed QS-STT is able to capture both spatial and temporal patterns of movement behaviors and by exploring QS-STT, our proposed prediction algorithm outperforms existing works. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
18. Efficient algorithms for influence maximization in social networks.
- Author
-
Chen, Yi-Cheng, Peng, Wen-Chih, and Lee, Suh-Yin
- Subjects
ALGORITHMS ,SOCIAL networks ,WEBSITES ,SOCIAL network theory ,HEURISTIC programming ,SCALABILITY - Abstract
In recent years, due to the surge in popularity of social-networking web sites, considerable interest has arisen regarding influence maximization in social networks. Given a social network structure, the problem of influence maximization is to determine a minimum set of nodes that could maximize the spread of influences. With a large-scale social network, the efficiency and practicability of such algorithms are critical. Although many recent studies have focused on the problem of influence maximization, these works in general are time-consuming when a social network is large-scale. In this paper, we propose two novel algorithms, CDH-Kcut and Community and Degree Heuristic on Kcut/SHRINK, to solve the influence maximization problem based on a realistic model. The algorithms utilize the community structure, which significantly decreases the number of candidates of influential nodes, to avoid information overlap. The experimental results on both synthetic and real datasets indicate that our algorithms not only significantly outperform the state-of-the-art algorithms in efficiency but also possess graceful scalability. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
19. Exploring latent browsing graph for question answering recommendation.
- Author
-
Chiang, Meng-Fen, Peng, Wen-Chih, and Yu, Philip
- Subjects
- *
WEB browsing , *QUESTION answering systems , *GRAPHIC methods , *INTERNET users - Abstract
In this paper, we develop a framework of Question Answering Pages (referred to as QA pages) recommendation. Our proposed framework consists of the two modules: the off-line module to determine the importance of QA pages and the on-line module for on-line QA page recommendation. In the off-line module, we claim that the importance of QA pages could be discovered from user click streams. If the QA pages are of higher importance, many users will click and spend their time on these QA pages. Moreover, the relevant relationships among QA pages are captured by the browsing behavior on these QA pages. As such, we exploit user click streams to model the browsing behavior among QA pages as QA browsing graph structures. The importance of QA pages is derived from our proposed QA browsing graph structures. However, we observe that the QA browsing graph is sparse and that most of the QA pages do not link to other QA pages. This is referred to as a sparsity problem. To overcome this problem, we utilize the latent browsing relations among QA pages to build a QA Latent Browsing Graph. In light of QA latent browsing graph, the importance score of QA pages (referred to as Latent Browsing Rank) and the relevance score of QA pages (referred to as Latent Browsing Recommendation Rank) are proposed. These scores demonstrate the use of a QA latent browsing graph not only to determine the importance of QA pages but also to recommend QA pages. We conducted extensive empirical experiments on Yahoo! Asia Knowledge Plus to evaluate our proposed framework. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
20. Clustering spatial data with a geographic constraint: exploring local search.
- Author
-
Liao, Zhung-Xun and Peng, Wen-Chih
- Subjects
DOCUMENT clustering ,SPATIAL data structures ,MATHEMATICAL optimization ,DATA mining ,SUPPORT vector machines - Abstract
Spatial data objects that possess attributes in the optimization domain and the geographic domain are now widely available. For example, sensor data are one kind of spatial data objects. The location of a sensor is an attribute in the geographic domain, while its reading is an attribute in the optimization domain. Previous studies discuss dual clustering problems that attempt to partition spatial data objects into several groups, such that objects in the same group have similar values in their optimization attributes and form a compact region in the geographic domain. However, previous studies do not clearly define compact regions. Therefore, this paper formulates a connective dual clustering problem with an explicit connected constraint given. Objects with a geographic distance smaller than or equal to the connected constraint are connected. The goal of the connective dual clustering problem is to derive clusters that contain objects with similar values in the optimization domain and are connected in the geographic domain. This study further proposes an algorithm CLS (Clustering with Local Search) to efficiently derive clusters. This algorithm consists of two phases: the ConGraph (standing for Connective Graph) transformation phase and the clustering phase. In the ConGraph transformation phase, CLS first transforms the data objects into a ConGraph that captures geographic constraints among data objects and selects initial seeds for clustering. Then, the initial seeds selected nearby data objects and formed coarse clusters by exploring local search in the clustering phase. Moreover, coarse clusters are merged and finely turned. Experiments show that CLS algorithm is more efficient and scalable than existing methods. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.