Author: "Fu, Mingsheng" / Topic: reinforcement learning - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Fu, Mingsheng"' showing total 4 results

Start Over Author "Fu, Mingsheng" Topic reinforcement learning

4 results on '"Fu, Mingsheng"'

1. Reinforcement Learning for Mobile Robot Obstacle Avoidance Under Dynamic Environments

Author: Huang, Liwei, Qu, Hong, Fu, Mingsheng, Deng, Wu, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Geng, Xin, editor, and Kang, Byeong-Ho, editor
Published: 2018
Full Text: View/download PDF

2. An Attention-Based Interactive Learning-to-Rank Model for Document Retrieval.

Author: Zhang, Fan, Chen, Wenyu, Fu, Mingsheng, Li, Fan, Qu, Hong, and Yi, Zhang
Subjects: INFORMATION retrieval, INFORMATION storage & retrieval systems
Abstract: The core issue of learning-to-rank (LTR) for document retrieval lies in finding an optimal ranking policy to meet the search intent of the user. The majority of proposed LTR approaches treat the ranking as a static process, employing a fixed ranking policy to immediately assign scores to documents. By contrast, ranking is not a static but an interactive process where the user continues interacting with the document retrieval system through information exchange such as search intent (e.g., rating or clicking for the retrieved items). We model the interactive ranking process (IRP), and propose an Attention-Based Interactive LTR model (AIRank) to constitute an intent-aware flexible ranking policy to gratify the user’s need. To enhance the ranking quality, the inherent relations among documents are procured by the self-attention method to contribute to an enriched user intent representation. Furthermore, we mend the policy gradient learning method to train the AIRank in the IRP. Experiments demonstrate the effectiveness of AIRank compared to the state-of-the-art methods in terms of normalized discounted cumulative gain and expected reciprocal rank. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

3. A deep reinforcement learning-based method applied for solving multi-agent defense and attack problems.

Author: Huang, Liwei, Fu, Mingsheng, Qu, Hong, Wang, Siying, and Hu, Shangqian
Subjects: *REINFORCEMENT learning, *REWARD (Psychology), *ARTIFICIAL intelligence, *MACHINE learning, *DEEP learning, *FUNCTION spaces
Abstract: • The multi-agent defense and attack environment is reconstructed. • Several algorithms are applied to solve the considered problem. • We redefine the state space, the action space and the reward functions accordingly. • Comparison experiments are conducted to show the performance of the employed models. Learning to cooperate among agents has always been an important research topic in artificial intelligence. Multi-agent defense and attack, one of the important issues in multi-agent cooperation, requires multiple agents in the environment to learn effective strategies to achieve their goals. Deep reinforcement learning (DRL) algorithms have natural advantages dealing with continuous control problems especially under situations with dynamic interactions, and have provided new solutions for those long-studied multi-agent cooperation problems. In this paper, we start from deep deterministic policy gradient (DDPG) algorithm and then introduce multi-agent DDPG (MADDPG) to solve the multi-agent defense and attack problem under different situations. We reconstruct the considered environment, redefine the continuous state space, continuous action space, reward functions accordingly, and then apply deep reinforcement learning algorithms to obtain effective decision strategies. Several experiments considering different confrontation scenarios are conducted to validate the feasibility and effectiveness of the DRL-based methods. Experimental results show that through learning the agents can make better decisions, and learning with MADDPG achieves superior performance than learning with other DRL-based models, which also explains the importance and necessity of mastering other agents' information. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

4. A deep reinforcement learning based long-term recommender system.

Author: Huang, Liwei, Fu, Mingsheng, Li, Fan, Qu, Hong, Liu, Yangjun, and Chen, Wenyu
Subjects: *RECOMMENDER systems, *DEEP learning, *REINFORCEMENT learning, *SUPERVISED learning, *MARKOV processes, *RECURRENT neural networks, *DECISION making
Abstract: Recommender systems aim to maximize the overall accuracy for long-term recommendations. However, most of the existing recommendation models adopt a static view, and ignore the fact that recommendation is a dynamic sequential decision-making process. As a result, they fail to adapt to new situations and suffer from the cold-start problem. Although sequential recommendation methods have been gaining attention recently, the objective of long-term recommendation still has not been explicitly addressed since these methods are developed for short-term prediction situations. To overcome these problems, we propose a novel top-N interactive recommender system based on deep reinforcement learning. In our model, the processes of recommendation are viewed as Markov decision processes (MDP), wherein the interactions between agent (recommender system) and environment (user) are simulated by the recurrent neural network (RNN). In addition, reinforcement learning is employed to optimize the proposed model for the purpose of maximizing long-term recommendation accuracy. Experimental results based on several benchmarks show that our model significantly outperforms previous top-N methods in terms of Hit-Rate and NDCG for the long-term recommendation, and can be applied to both cold-start and warm-start scenarios. • A novel top-N interactive recommender system based on deep reinforcement learning is proposed. • The interactions between recommender system and users are simulated by recurrent neural networks. • The proposed model can deal with both cold-start and warm-start scenarios. • Reinforcement learning and supervised learning are employed to optimize the proposed model for long-term recommendation accuracy. • Experiments and comparisons are conducted to show the merits of the proposed model. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

4 results on '"Fu, Mingsheng"'

1. Reinforcement Learning for Mobile Robot Obstacle Avoidance Under Dynamic Environments

2. An Attention-Based Interactive Learning-to-Rank Model for Document Retrieval.

3. A deep reinforcement learning-based method applied for solving multi-agent defense and attack problems.

4. A deep reinforcement learning based long-term recommender system.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

4 results on '"Fu, Mingsheng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources