3 results on '"Liu, Honglei"'
Search Results
2. Mining Patterns and Networks from Sequence Data
- Author
-
Liu, Honglei
- Subjects
- Computer science
- Abstract
Sequence data are ubiquitous in diverse domains such as bioinformatics, computational neuroscience, and user behavior analysis. As a result, many critical applications require extracting knowledge from sequences in multi-level. For example, mining frequent patterns is the central goal of motif discovery in biological sequences, while in computational neuronal science, one essential task is to infer causal networks from neural event sequences (spike trains). Given the wide application of pattern and network mining tools for sequence data, they are facing new challenges posted by modern instruments. That is, as large scale and high resolution sequence data become available, we need new methods with better efficiency and higher accuracy.In this dissertation, we propose several approaches to improve existing pattern and network mining tools to meet new challenges in terms of efficiency and accuracy. The first problem is how to scale existing motif discovery algorithms. Our work on motif discovery focuses on the challenge of discovering motifs from a large scale of short sequences that none of existing motif finding algorithms can handle. We propose an anchor based clustering algorithm that could significantly improve the scalability of all the existing motif finding algorithms without losing accuracy at all. In particular, our algorithm could reduce the running time of a very popular motif finding algorithm, MEME, from weeks to a few minutes with even better accuracy.In another work, we study the problem of how to accurately infer a functional network from neural recordings (spike trains), which is an essential task in many real world applications such as diagnosing neurodegenerative diseases. We introduce a statistical tool that could be used to accurately identify inhibitory causal relations from spike trains. While most of existing works devote their efforts on characterizing the statistics of neural spike trains, we show that it is crucial to make predictions about the response of neurons to changes. More importantly, our results are validated by real biological experiments with a novel instrument, which makes this work the first of its kind. Furthermore, while most existing methods focus on learning functional networks from purely observational data, we propose an active learning framework that could intelligently generate and utilize interventional data. We demonstrate that by intelligently adopting interventional data using the active learning models we propose, the accuracy of the inferred functional network could be substantially improved with the same amount of training data.
- Published
- 2017
3. The near real-time solar irradiance mapping in California based on satellite data and economic and emission benefits analysis
- Author
-
Liu, Honglei
- Subjects
- Environmental Studies., solar irradiance, California, satellite data
- Abstract
As the most abundant, sustainable, and green energy source on the earth, solar energy has the potential to resolve environmental problems such as climate change and air pollution caused by fossil energy. Real-time solar irradiance mapping, which gives the real-time data on local solar energy distribution, would provide valuable information and lead to more efficient use of solar energy. State of California (CA) is abundant in solar energy. However, the data of real-time direct normal irradiance (DNI) in CA is still currently not available. This thesis focuses on developing-models that would use satellite data to map near real-time solar irradiance in CA and analyzing the potential economic and emission benefits of solar installation based on the modeled DNI data. The transmittance data was derived from GOES-WEST for the development of ON I model. The study showed that the major atmospheric factors influencing the solar irradiance are cloud coverage and aerosol concentration in non-clear and clear sky condition, respectively. This suggests that the transmittances of cloud and aerosol should be as accurate as possible when calculating DNI using satellite data. Next, the modeled DNI was compared with actual DNI data, and the results indicated a good agreement between them, implying that the modeled DNI data is a good estimate of the actual solar irradiance. Finally, the economic and emission benefits of solar use were analyzed based upon process-modeling and empirical regression approach. The results indicated that 0.2% coverage of solar PV in CA could reduce the C02, NOx and S02 emissions on average by 9.08%, 4.51 % and 2.73% during 2007 summertime, respectively. The cost savings are amount to 48.7%, 32.0% and 16.9% in summer under three levels of PV installation rate. In summary, we developed a model to map near real-time DNI spatial and temporal distribution based on satellite data. The outcomes from solar mapping are used to predict potential economic and emission benefits using empirical regression and process-based approach.
- Published
- 2008
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.