Author: "Fan, Changjie" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Fan, Changjie"' showing total 294 results

Start Over Author "Fan, Changjie"

294 results on '"Fan, Changjie"'

51. Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation

Author: Li, Lincheng, Wang, Suzhen, Zhang, Zhimeng, Ding, Yu, Zheng, Yixing, Yu, Xin, and Fan, Changjie
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we propose a novel text-based talking-head video generation framework that synthesizes high-fidelity facial expressions and head motions in accordance with contextual sentiments as well as speech rhythm and pauses. To be specific, our framework consists of a speaker-independent stage and a speaker-specific stage. In the speaker-independent stage, we design three parallel networks to generate animation parameters of the mouth, upper face, and head from texts, separately. In the speaker-specific stage, we present a 3D face model guided attention network to synthesize videos tailored for different individuals. It takes the animation parameters as input and exploits an attention mask to manipulate facial expression changes for the input individuals. Furthermore, to better establish authentic correspondences between visual motions (i.e., facial expression changes and head movements) and audios, we leverage a high-accuracy motion capture dataset instead of relying on long videos of specific individuals. After attaining the visual and audio correspondences, we can effectively train our network in an end-to-end fashion. Extensive experiments on qualitative and quantitative results demonstrate that our algorithm achieves high-quality photo-realistic talking-head videos including various facial expressions and head motions according to speech rhythms and outperforms the state-of-the-art.
Published: 2021

52. Personalized Bundle Recommendation in Online Games

Author: Deng, Qilin, Wang, Kai, Zhao, Minghao, Zou, Zhene, Wu, Runze, Tao, Jianrong, Fan, Changjie, and Chen, Liang
Subjects: Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: In business domains, \textit{bundling} is one of the most important marketing strategies to conduct product promotions, which is commonly used in online e-commerce and offline retailers. Existing recommender systems mostly focus on recommending individual items that users may be interested in. In this paper, we target at a practical but less explored recommendation problem named bundle recommendation, which aims to offer a combination of items to users. To tackle this specific recommendation problem in the context of the \emph{virtual mall} in online games, we formalize it as a link prediction problem on a user-item-bundle tripartite graph constructed from the historical interactions, and solve it with a neural network model that can learn directly on the graph-structure data. Extensive experiments on three public datasets and one industrial game dataset demonstrate the effectiveness of the proposed method. Further, the bundle recommendation model has been deployed in production for more than one year in a popular online game developed by Netease Games, and the launch of the model yields more than 60\% improvement on conversion rate of bundles, and a relative improvement of more than 15\% on gross merchandise volume (GMV)., Comment: 8 pages, 10 figures, accepted paper on CIKM 2020
Published: 2021

53. Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation

Author: Wang, Kai, Zou, Zhene, Deng, Qilin, Wu, Runze, Tao, Jianrong, Fan, Changjie, Chen, Liang, and Cui, Peng
Subjects: Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: In recent years, there are great interests as well as challenges in applying reinforcement learning (RL) to recommendation systems (RS). In this paper, we summarize three key practical challenges of large-scale RL-based recommender systems: massive state and action spaces, high-variance environment, and the unspecific reward setting in recommendation. All these problems remain largely unexplored in the existing literature and make the application of RL challenging. We develop a model-based reinforcement learning framework, called GoalRec. Inspired by the ideas of world model (model-based), value function estimation (model-free), and goal-based RL, a novel disentangled universal value function designed for item recommendation is proposed. It can generalize to various goals that the recommender may have, and disentangle the stochastic environmental dynamics and high-variance reward signals accordingly. As a part of the value function, free from the sparse and high-variance reward signals, a high-capacity reward-independent world model is trained to simulate complex environmental dynamics under a certain goal. Based on the predicted environmental dynamics, the disentangled universal value function is related to the user's future trajectory instead of a monolithic state and a scalar reward. We demonstrate the superiority of GoalRec over previous approaches in terms of the above three practical challenges in a series of simulations and a real application., Comment: 9 pages, 4 figures, to be published in Proceedings of the AAAI Conference on Artificial Intelligence 2021
Published: 2021

54. Fever Basketball: A Complex, Flexible, and Asynchronized Sports Game Environment for Multi-agent Reinforcement Learning

Author: Jia, Hangtian, Hu, Yujing, Chen, Yingfeng, Ren, Chunxu, Lv, Tangjie, Fan, Changjie, and Zhang, Chongjie
Subjects: Computer Science - Artificial Intelligence
Abstract: The development of deep reinforcement learning (DRL) has benefited from the emergency of a variety type of game environments where new challenging problems are proposed and new algorithms can be tested safely and quickly, such as Board games, RTS, FPS, and MOBA games. However, many existing environments lack complexity and flexibility and assume the actions are synchronously executed in multi-agent settings, which become less valuable. We introduce the Fever Basketball game, a novel reinforcement learning environment where agents are trained to play basketball game. It is a complex and challenging environment that supports multiple characters, multiple positions, and both the single-agent and multi-agent player control modes. In addition, to better simulate real-world basketball games, the execution time of actions differs among players, which makes Fever Basketball a novel asynchronized environment. We evaluate commonly used multi-agent algorithms of both independent learners and joint-action learners in three game scenarios with varying difficulties, and heuristically propose two baseline methods to diminish the extra non-stationarity brought by asynchronism in Fever Basketball Benchmarks. Besides, we propose an integrated curricula training (ICT) framework to better handle Fever Basketball problems, which includes several game-rule based cascading curricula learners and a coordination curricula switcher focusing on enhancing coordination within the team. The results show that the game remains challenging and can be used as a benchmark environment for studies like long-time horizon, sparse rewards, credit assignment, and non-stationarity, etc. in multi-agent settings., Comment: 7 pages,12 figures
Published: 2020

55. Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

Author: Hu, Yujing, Wang, Weixun, Jia, Hangtian, Wang, Yixiang, Chen, Yingfeng, Hao, Jianye, Wu, Feng, and Fan, Changjie
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Reward shaping is an effective technique for incorporating domain knowledge into reinforcement learning (RL). Existing approaches such as potential-based reward shaping normally make full use of a given shaping reward function. However, since the transformation of human knowledge into numeric reward values is often imperfect due to reasons such as human cognitive bias, completely utilizing the shaping reward function may fail to improve the performance of RL algorithms. In this paper, we consider the problem of adaptively utilizing a given shaping reward function. We formulate the utilization of shaping rewards as a bi-level optimization problem, where the lower level is to optimize policy using the shaping rewards and the upper level is to optimize a parameterized shaping weight function for true reward maximization. We formally derive the gradient of the expected true reward with respect to the shaping weight function parameters and accordingly propose three learning algorithms based on different assumptions. Experiments in sparse-reward cartpole and MuJoCo environments show that our algorithms can fully exploit beneficial shaping rewards, and meanwhile ignore unbeneficial shaping rewards or even transform them into beneficial ones., Comment: Accepted by NeurIPS2020
Published: 2020

56. Semi-Supervised Learning for In-Game Expert-Level Music-to-Dance Translation

Author: Duan, Yinglin, Shi, Tianyang, Zou, Zhengxia, Qin, Jia, Zhao, Yifei, Yuan, Yi, Hou, Jie, Wen, Xiang, and Fan, Changjie
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: Music-to-dance translation is a brand-new and powerful feature in recent role-playing games. Players can now let their characters dance along with specified music clips and even generate fan-made dance videos. Previous works of this topic consider music-to-dance as a supervised motion generation problem based on time-series data. However, these methods suffer from limited training data pairs and the degradation of movements. This paper provides a new perspective for this task where we re-formulate the translation problem as a piece-wise dance phrase retrieval problem based on the choreography theory. With such a design, players are allowed to further edit the dance movements on top of our generation while other regression based methods ignore such user interactivity. Considering that the dance motion capture is an expensive and time-consuming procedure which requires the assistance of professional dancers, we train our method under a semi-supervised learning framework with a large unlabeled dataset (20x than labeled data) collected. A co-ascent mechanism is introduced to improve the robustness of our network. Using this unlabeled dataset, we also introduce self-supervised pre-training so that the translator can understand the melody, rhythm, and other components of music phrases. We show that the pre-training significantly improves the translation accuracy than that of training from scratch. Experimental results suggest that our method not only generalizes well over various styles of music but also succeeds in expert-level choreography for game players., Comment: 14 pages, 8 figures
Published: 2020

57. GraphFederator: Federated Visual Analysis for Multi-party Graphs

Author: Han, Dongming, Chen, Wei, Pan, Rusheng, Liu, Yijing, Zhou, Jiehui, Xu, Ying, Zhang, Tianye, Fan, Changjie, Tao, Jianrong, Xiaolong, and Zhang
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Cryptography and Security, Computer Science - Graphics
Abstract: This paper presents GraphFederator, a novel approach to construct joint representations of multi-party graphs and supports privacy-preserving visual analysis of graphs. Inspired by the concept of federated learning, we reformulate the analysis of multi-party graphs into a decentralization process. The new federation framework consists of a shared module that is responsible for joint modeling and analysis, and a set of local modules that run on respective graph data. Specifically, we propose a federated graph representation model (FGRM) that is learned from encrypted characteristics of multi-party graphs in local modules. We also design multiple visualization views for joint visualization, exploration, and analysis of multi-party graphs. Experimental results with two datasets demonstrate the effectiveness of our approach., Comment: 12 pages,8 figures
Published: 2020

58. Unsupervised Learning Facial Parameter Regressor for Action Unit Intensity Estimation via Differentiable Renderer

Author: Song, Xinhui, Shi, Tianyang, Feng, Zunlei, Song, Mingli, Lin, Jackie, Lin, Chuanjie, Fan, Changjie, and Yuan, Yi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Facial action unit (AU) intensity is an index to describe all visually discernible facial movements. Most existing methods learn intensity estimator with limited AU data, while they lack generalization ability out of the dataset. In this paper, we present a framework to predict the facial parameters (including identity parameters and AU parameters) based on a bone-driven face model (BDFM) under different views. The proposed framework consists of a feature extractor, a generator, and a facial parameter regressor. The regressor can fit the physical meaning parameters of the BDFM from a single face image with the help of the generator, which maps the facial parameters to the game-face images as a differentiable renderer. Besides, identity loss, loopback loss, and adversarial loss can improve the regressive results. Quantitative evaluations are performed on two public databases BP4D and DISFA, which demonstrates that the proposed method can achieve comparable or better performance than the state-of-the-art methods. What's more, the qualitative results also demonstrate the validity of our method in the wild.
Published: 2020
Full Text: View/download PDF

59. Neutral Face Game Character Auto-Creation via PokerFace-GAN

Author: Shi, Tianyang, Zou, Zhengxia, Song, Xinhui, Song, Zheng, Gu, Changjian, Fan, Changjie, and Yuan, Yi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Game character customization is one of the core features of many recent Role-Playing Games (RPGs), where players can edit the appearance of their in-game characters with their preferences. This paper studies the problem of automatically creating in-game characters with a single photo. In recent literature on this topic, neural networks are introduced to make game engine differentiable and the self-supervised learning is used to predict facial customization parameters. However, in previous methods, the expression parameters and facial identity parameters are highly coupled with each other, making it difficult to model the intrinsic facial features of the character. Besides, the neural network based renderer used in previous methods is also difficult to be extended to multi-view rendering cases. In this paper, considering the above problems, we propose a novel method named "PokerFace-GAN" for neutral face game character auto-creation. We first build a differentiable character renderer which is more flexible than the previous methods in multi-view rendering cases. We then take advantage of the adversarial training to effectively disentangle the expression parameters from the identity parameters and thus generate player-preferred neutral face (expression-less) characters. Since all components of our method are differentiable, our method can be easily trained under a multi-task self-supervised learning paradigm. Experiment results show that our method can generate vivid neutral face game characters that are highly similar to the input photos. The effectiveness of our method is verified by comparison results and ablation studies., Comment: Accepted by ACMMM 2020
Published: 2020

60. Fast and Robust Face-to-Parameter Translation for Game Character Auto-Creation

Author: Shi, Tianyang, Zou, Zhengxia, Yuan, Yi, and Fan, Changjie
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: With the rapid development of Role-Playing Games (RPGs), players are now allowed to edit the facial appearance of their in-game characters with their preferences rather than using default templates. This paper proposes a game character auto-creation framework that generates in-game characters according to a player's input face photo. Different from the previous methods that are designed based on neural style transfer or monocular 3D face reconstruction, we re-formulate the character auto-creation process in a different point of view: by predicting a large set of physically meaningful facial parameters under a self-supervised learning paradigm. Instead of updating facial parameters iteratively at the input end of the renderer as suggested by previous methods, which are time-consuming, we introduce a facial parameter translator so that the creation can be done efficiently through a single forward propagation from the face embeddings to parameters, with a considerable 1000x computational speedup. Despite its high efficiency, the interactivity is preserved in our method where users are allowed to optionally fine-tune the facial parameters on our creation according to their needs. Our approach also shows better robustness than previous methods, especially for those photos with head-pose variance. Comparison results and ablation analysis on seven public face verification datasets suggest the effectiveness of our method., Comment: Accepted by AAAI 2020 with supplementary material
Published: 2020

61. MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration

Author: Zhang, Jin, Wang, Jianhao, Hu, Hao, Chen, Tong, Chen, Yingfeng, Fan, Changjie, and Zhang, Chongjie
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Meta reinforcement learning (meta-RL) extracts knowledge from previous tasks and achieves fast adaptation to new tasks. Despite recent progress, efficient exploration in meta-RL remains a key challenge in sparse-reward tasks, as it requires quickly finding informative task-relevant experiences in both meta-training and adaptation. To address this challenge, we explicitly model an exploration policy learning problem for meta-RL, which is separated from exploitation policy learning, and introduce a novel empowerment-driven exploration objective, which aims to maximize information gain for task identification. We derive a corresponding intrinsic reward and develop a new off-policy meta-RL framework, which efficiently learns separate context-aware exploration and exploitation policies by sharing the knowledge of task inference. Experimental evaluation shows that our meta-RL method significantly outperforms state-of-the-art baselines on various sparse-reward MuJoCo locomotion tasks and more complex sparse-reward Meta-World tasks.
Published: 2020

62. Unsupervised Facial Action Unit Intensity Estimation via Differentiable Optimization

Author: Song, Xinhui, Shi, Tianyang, Shao, Tianjia, Yuan, Yi, Feng, Zunlei, and Fan, Changjie
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The automatic intensity estimation of facial action units (AUs) from a single image plays a vital role in facial analysis systems. One big challenge for data-driven AU intensity estimation is the lack of sufficient AU label data. Due to the fact that AU annotation requires strong domain expertise, it is expensive to construct an extensive database to learn deep models. The limited number of labeled AUs as well as identity differences and pose variations further increases the estimation difficulties. Considering all these difficulties, we propose an unsupervised framework GE-Net for facial AU intensity estimation from a single image, without requiring any annotated AU data. Our framework performs differentiable optimization, which iteratively updates the facial parameters (i.e., head pose, AU parameters and identity parameters) to match the input image. GE-Net consists of two modules: a generator and a feature extractor. The generator learns to "render" a face image from a set of facial parameters in a differentiable way, and the feature extractor extracts deep features for measuring the similarity of the rendered image and input real image. After the two modules are trained and fixed, the framework searches optimal facial parameters by minimizing the differences of the extracted features between the rendered image and the input image. Experimental results demonstrate that our method can achieve state-of-the-art results compared with existing methods.
Published: 2020

63. Exploring Unknown States with Action Balance

Author: Song, Yan, Chen, Yingfeng, Hu, Yujing, and Fan, Changjie
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Exploration is a key problem in reinforcement learning. Recently bonus-based methods have achieved considerable successes in environments where exploration is difficult such as Montezuma's Revenge, which assign additional bonuses (e.g., intrinsic rewards) to guide the agent to rarely visited states. Since the bonus is calculated according to the novelty of the next state after performing an action, we call such methods as the next-state bonus methods. However, the next-state bonus methods force the agent to pay overmuch attention in exploring known states and ignore finding unknown states since the exploration is driven by the next state already visited, which may slow the pace of finding reward in some environments. In this paper, we focus on improving the effectiveness of finding unknown states and propose action balance exploration, which balances the frequency of selecting each action at a given state and can be treated as an extension of upper confidence bound (UCB) to deep reinforcement learning. Moreover, we propose action balance RND that combines the next-state bonus methods (e.g., random network distillation exploration, RND) and our action balance exploration to take advantage of both sides. The experiments on the grid world and Atari games demonstrate action balance exploration has a better capability in finding unknown states and can improve the performance of RND in some hard exploration environments respectively.
Published: 2020

64. Efficient Deep Reinforcement Learning via Adaptive Policy Transfer

Author: Yang, Tianpei, Hao, Jianye, Meng, Zhaopeng, Zhang, Zongzhang, Hu, Yujing, Cheng, Yingfeng, Fan, Changjie, Wang, Weixun, Liu, Wulong, Wang, Zhaodong, and Peng, Jiajie
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from past learned policies of relevant tasks. Existing transfer approaches either explicitly computes the similarity between tasks or select appropriate source policies to provide guided explorations for the target task. However, how to directly optimize the target policy by alternatively utilizing knowledge from appropriate source policies without explicitly measuring the similarity is currently missing. In this paper, we propose a novel Policy Transfer Framework (PTF) to accelerate RL by taking advantage of this idea. Our framework learns when and which source policy is the best to reuse for the target policy and when to terminate it by modeling multi-policy transfer as the option learning problem. PTF can be easily combined with existing deep RL approaches. Experimental results show it significantly accelerates the learning process and surpasses state-of-the-art policy transfer methods in terms of learning efficiency and final performance in both discrete and continuous action spaces., Comment: Accepted by IJCAI'2020
Published: 2020

65. An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning

Author: Yang, Tianpei, Wang, Weixun, Tang, Hongyao, Hao, Jianye, Meng, Zhaopeng, Mao, Hangyu, Li, Dong, Liu, Wulong, Zhang, Chengwei, Hu, Yujing, Chen, Yingfeng, and Fan, Changjie
Subjects: Computer Science - Multiagent Systems
Abstract: Transfer Learning has shown great potential to enhance single-agent Reinforcement Learning (RL) efficiency. Similarly, Multiagent RL (MARL) can also be accelerated if agents can share knowledge with each other. However, it remains a problem of how an agent should learn from other agents. In this paper, we propose a novel Multiagent Policy Transfer Framework (MAPTF) to improve MARL efficiency. MAPTF learns which agent's policy is the best to reuse for each agent and when to terminate it by modeling multiagent policy transfer as the option learning problem. Furthermore, in practice, the option module can only collect all agent's local experiences for update due to the partial observability of the environment. While in this setting, each agent's experience may be inconsistent with each other, which may cause the inaccuracy and oscillation of the option-value's estimation. Therefore, we propose a novel option learning algorithm, the successor representation option learning to solve it by decoupling the environment dynamics from rewards and learning the option-value under each agent's preference. MAPTF can be easily combined with existing deep RL and MARL approaches, and experimental results show it significantly boosts the performance of existing methods in both discrete and continuous state spaces., Comment: Accepted by NeurIPS'2021
Published: 2020

66. Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Author: Yang, Yaodong, Hao, Jianye, Chen, Guangyong, Tang, Hongyao, Chen, Yingfeng, Hu, Yujing, Fan, Changjie, and Wei, Zhongyu
Subjects: Computer Science - Multiagent Systems
Abstract: Recently, deep multiagent reinforcement learning (MARL) has become a highly active research area as many real-world problems can be inherently viewed as multiagent systems. A particularly interesting and widely applicable class of problems is the partially observable cooperative multiagent setting, in which a team of agents learns to coordinate their behaviors conditioning on their private observations and commonly shared global reward signals. One natural solution is to resort to the centralized training and decentralized execution paradigm. During centralized training, one key challenge is the multiagent credit assignment: how to allocate the global rewards for individual agent policies for better coordination towards maximizing system-level's benefits. In this paper, we propose a new method called Q-value Path Decomposition (QPD) to decompose the system's global Q-values into individual agents' Q-values. Unlike previous works which restrict the representation relation of the individual Q-values and the global one, we leverage the integrated gradient attribution technique into deep MARL to directly decompose global Q-values along trajectory paths to assign credits for agents. We evaluate QPD on the challenging StarCraft II micromanagement tasks and show that QPD achieves the state-of-the-art performance in both homogeneous and heterogeneous multiagent scenarios compared with existing cooperative MARL algorithms.
Published: 2020

67. Multi-label Relation Modeling in Facial Action Units Detection

Author: Ji, Xianpeng, Ding, Yu, Li, Lincheng, Chen, Yu, and Fan, Changjie
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: This paper describes an approach to the facial action units detections. The involved action units (AU) include AU1 (Inner Brow Raiser), AU2 (Outer Brow Raiser), AU4 (Brow Lowerer), AU6 (Cheek Raise), AU12 (Lip Corner Puller), AU15 (Lip Corner Depressor), AU20 (Lip Stretcher), and AU25 (Lip Part). Our work relies on the dataset released by the FG-2020 Competition: Affective Behavior Analysis In-the-Wild (ABAW). The proposed method consists of the data preprocessing, the feature extraction and the AU classification. The data preprocessing includes the detection of face texture and landmarks. The texture static and landmark dynamic features are extracted through neural networks and then fused as the feature latent representation. Finally, the fused feature is taken as the initial hidden state of a recurrent neural network with a trainable lookup AU table. The output of the RNN is the results of AU classification. The detected accuracy is evaluated with 0.5$\times$accuracy + 0.5$\times$F1. Our method achieve 0.56 with the validation data that is specified by the organization committee.
Published: 2020

68. ASN: action semantics network for multiagent reinforcement learning

Author: Yang, Tianpei, Wang, Weixun, Hao, Jianye, Taylor, Matthew E., Liu, Yong, Hao, Xiaotian, Hu, Yujing, Chen, Yingfeng, Fan, Changjie, Ren, Chunxu, Huang, Ye, Zhu, Jiangcheng, and Gao, Yang
Published: 2023
Full Text: View/download PDF

69. Diverse Behavior Is What Game AI Needs: Generating Varied Human-Like Playing Styles Using Evolutionary Multi-Objective Deep Reinforcement Learning

Author: Shen, Ruimin, Zheng, Yan, Hao, Jianye, Chen, Yinfeng, and Fan, Changjie
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: this paper has been withdrawn, Comment: 1. there is some discrepancy between some contributors with respect to the order of the authors; 2. the paper is rather "raw" - significant effort and improvement in terms of the paper's language and structure are needed to make it ready for publication
Published: 2019

70. From Few to More: Large-scale Dynamic Multiagent Curriculum Learning

Author: Wang, Weixun, Yang, Tianpei, Liu, Yong, Hao, Jianye, Hao, Xiaotian, Hu, Yujing, Chen, Yingfeng, Fan, Changjie, and Gao, Yang
Subjects: Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems
Abstract: A lot of efforts have been devoted to investigating how agents can learn effectively and achieve coordination in multiagent systems. However, it is still challenging in large-scale multiagent settings due to the complex dynamics between the environment and agents and the explosion of state-action space. In this paper, we design a novel Dynamic Multiagent Curriculum Learning (DyMA-CL) to solve large-scale problems by starting from learning on a multiagent scenario with a small size and progressively increasing the number of agents. We propose three transfer mechanisms across curricula to accelerate the learning process. Moreover, due to the fact that the state dimension varies across curricula,, and existing network structures cannot be applied in such a transfer setting since their network input sizes are fixed. Therefore, we design a novel network structure called Dynamic Agent-number Network (DyAN) to handle the dynamic size of the network input. Experimental results show that DyMA-CL using DyAN greatly improves the performance of large-scale multiagent learning compared with state-of-the-art deep reinforcement learning approaches. We also investigate the influence of three transfer mechanisms across curricula through extensive simulations., Comment: Accepted by AAAI2020
Published: 2019

71. Learning Action-Transferable Policy with Action Embedding

Author: Chen, Yu, Chen, Yingfeng, Hu, Zhipeng, Yang, Tianpei, Fan, Changjie, Yu, Yang, and Hao, Jianye
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Transfer learning (TL) is a promising way to improve the sample efficiency of reinforcement learning. However, how to efficiently transfer knowledge across tasks with different state-action spaces is investigated at an early stage. Most previous studies only addressed the inconsistency across different state spaces by learning a common feature space, without considering that similar actions in different action spaces of related tasks share similar semantics. In this paper, we propose a method to learning action embeddings by leveraging this idea, and a framework that learns both state embeddings and action embeddings to transfer policy across tasks with different state and action spaces. Our experimental results on various tasks show that the proposed method can not only learn informative action embeddings but accelerate policy learning.
Published: 2019

72. Face-to-Parameter Translation for Game Character Auto-Creation

Author: Shi, Tianyang, Yuan, Yi, Fan, Changjie, Zou, Zhengxia, Shi, Zhenwei, and Liu, Yong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Character customization system is an important component in Role-Playing Games (RPGs), where players are allowed to edit the facial appearance of their in-game characters with their own preferences rather than using default templates. This paper proposes a method for automatically creating in-game characters of players according to an input face photo. We formulate the above "artistic creation" process under a facial similarity measurement and parameter searching paradigm by solving an optimization problem over a large set of physically meaningful facial parameters. To effectively minimize the distance between the created face and the real one, two loss functions, i.e. a "discriminative loss" and a "facial content loss", are specifically designed. As the rendering process of a game engine is not differentiable, a generative network is further introduced as an "imitator" to imitate the physical behavior of the game engine so that the proposed method can be implemented under a neural style transfer framework and the parameters can be optimized by gradient descent. Experimental results demonstrate that our method achieves a high degree of generation similarity between the input face photo and the created in-game character in terms of both global appearance and local details. Our method has been deployed in a new game last year and has now been used by players over 1 million times., Comment: Accepted by ICCV 2019
Published: 2019

73. Action Semantics Network: Considering the Effects of Actions in Multiagent Systems

Author: Wang, Weixun, Yang, Tianpei, Liu, Yong, Hao, Jianye, Hao, Xiaotian, Hu, Yujing, Chen, Yingfeng, Fan, Changjie, and Gao, Yang
Subjects: Computer Science - Multiagent Systems, Computer Science - Artificial Intelligence
Abstract: In multiagent systems (MASs), each agent makes individual decisions but all of them contribute globally to the system evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the increase in the number of agents. Previous works borrow various multiagent coordination mechanisms into deep learning architecture to facilitate multiagent coordination. However, none of them explicitly consider action semantics between agents that different actions have different influences on other agents. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show ASN significantly improves the performance of state-of-the-art DRL approaches compared with several network architectures., Comment: accepted by ICLR2020
Published: 2019

74. Reinforcement Learning Experience Reuse with Policy Residual Representation

Author: Zhou, Wen-Ji, Yu, Yang, Chen, Yingfeng, Guan, Kai, Lv, Tangjie, Fan, Changjie, and Zhou, Zhi-Hua
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Experience reuse is key to sample-efficient reinforcement learning. One of the critical issues is how the experience is represented and stored. Previously, the experience can be stored in the forms of features, individual models, and the average model, each lying at a different granularity. However, new tasks may require experience across multiple granularities. In this paper, we propose the policy residual representation (PRR) network, which can extract and store multiple levels of experience. PRR network is trained on a set of tasks with a multi-level architecture, where a module in each level corresponds to a subset of the tasks. Therefore, the PRR network represents the experience in a spectrum-like way. When training on a new task, PRR can provide different levels of experience for accelerating the learning. We experiment with the PRR network on a set of grid world navigation tasks, locomotion tasks, and fighting tasks in a video game. The results show that the PRR network leads to better reuse of experience and thus outperforms some state-of-the-art approaches., Comment: Conference version appears in IJCAI 2019
Published: 2019

75. FReeNet: Multi-Identity Face Reenactment

Author: Zhang, Jiangning, Zeng, Xianfang, Wang, Mengmeng, Pan, Yusu, Liu, Liang, Liu, Yong, Ding, Yu, and Fan, Changjie
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper presents a novel multi-identity face reenactment framework, named FReeNet, to transfer facial expressions from an arbitrary source face to a target face with a shared model. The proposed FReeNet consists of two parts: Unified Landmark Converter (ULC) and Geometry-aware Generator (GAG). The ULC adopts an encode-decoder architecture to efficiently convert expression in a latent landmark space, which significantly narrows the gap of the face contour between source and target identities. The GAG leverages the converted landmark to reenact the photorealistic image with a reference image of the target person. Moreover, a new triplet perceptual loss is proposed to force the GAG module to learn appearance and geometry information simultaneously, which also enriches facial details of the reenacted images. Further experiments demonstrate the superiority of our approach for generating photorealistic and expression-alike faces, as well as the flexibility for transferring facial expressions between identities., Comment: Add more experiments; Revise the paper carefully
Published: 2019

76. Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces

Author: Fu, Haotian, Tang, Hongyao, Hao, Jianye, Lei, Zihan, Chen, Yingfeng, and Fan, Changjie
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems, Statistics - Machine Learning
Abstract: Deep Reinforcement Learning (DRL) has been applied to address a variety of cooperative multi-agent problems with either discrete action spaces or continuous action spaces. However, to the best of our knowledge, no previous work has ever succeeded in applying DRL to multi-agent problems with discrete-continuous hybrid (or parameterized) action spaces which is very common in practice. Our work fills this gap by proposing two novel algorithms: Deep Multi-Agent Parameterized Q-Networks (Deep MAPQN) and Deep Multi-Agent Hierarchical Hybrid Q-Networks (Deep MAHHQN). We follow the centralized training but decentralized execution paradigm: different levels of communication between different agents are used to facilitate the training process, while each agent executes its policy independently based on local observations during execution. Our empirical results on several challenging tasks (simulated RoboCup Soccer and game Ghost Story) show that both Deep MAPQN and Deep MAHHQN are effective and significantly outperform existing independent deep parameterized Q-learning method.
Published: 2019

77. Hierarchical Deep Multiagent Reinforcement Learning with Temporal Abstraction

Author: Tang, Hongyao, Hao, Jianye, Lv, Tangjie, Chen, Yingfeng, Zhang, Zongzhang, Jia, Hangtian, Ren, Chunxu, Zheng, Yan, Meng, Zhaopeng, Fan, Changjie, and Wang, Li
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems
Abstract: Multiagent reinforcement learning (MARL) is commonly considered to suffer from non-stationary environments and exponentially increasing policy space. It would be even more challenging when rewards are sparse and delayed over long trajectories. In this paper, we study hierarchical deep MARL in cooperative multiagent problems with sparse and delayed reward. With temporal abstraction, we decompose the problem into a hierarchy of different time scales and investigate how agents can learn high-level coordination based on the independent skills learned at the low level. Three hierarchical deep MARL architectures are proposed to learn hierarchical policies under different MARL paradigms. Besides, we propose a new experience replay mechanism to alleviate the issue of the sparse transitions at the high level of abstraction and the non-stationarity of multiagent learning. We empirically demonstrate the effectiveness of our approaches in two domains with extremely sparse feedback: (1) a variety of Multiagent Trash Collection tasks, and (2) a challenging online mobile game, i.e., Fever Basketball Defense.
Published: 2018

78. PPPNE: Personalized proximity preserved network embedding

Author: Fan, Ge, Geng, Biao, Tao, Jianrong, Wang, Kai, Fan, Changjie, and Zeng, Wei
Published: 2022
Full Text: View/download PDF

79. GBGallery : A benchmark and framework for game testing

Author: Li, Zhuo, Wu, Yuechen, Ma, Lei, Xie, Xiaofei, Chen, Yingfeng, and Fan, Changjie
Published: 2022
Full Text: View/download PDF

80. The MMO Economist: AI Empowers Robust, Healthy, and Sustainable P2W MMO Economies

Author: Zhao, Shiwei, primary, Yuan, Xi, additional, Wu, Runze, additional, Hu, Zhipeng, additional, Liu, Haoyu, additional, Wang, Kai, additional, Hu, Yujing, additional, Lv, Tangjie, additional, Fan, Changjie, additional, Tong, Xin, additional, Han, Jiangze, additional, Zheng, Yan, additional, and Hao, Jianye, additional
Published: 2024
Full Text: View/download PDF

81. Keep You from Leaving: Churn Prediction in Online Games

Author: Zheng, Angyu, Chen, Liang, Xie, Fenfang, Tao, Jianrong, Fan, Changjie, Zheng, Zibin, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Nah, Yunmook, editor, Cui, Bin, editor, Lee, Sang-Won, editor, Yu, Jeffrey Xu, editor, Moon, Yang-Sae, editor, and Whang, Steven Euijong, editor
Published: 2020
Full Text: View/download PDF

82. Asyncflow: A visual programming tool for game artificial intelligence

Author: Hu, Zhipeng, Fan, Changjie, Zheng, Qiwei, Wu, Wei, and Liu, Bai
Published: 2021
Full Text: View/download PDF

83. Facial Action Unit Detection and Intensity Estimation From Self-Supervised Representation.

Author: Ma, Bowen, An, Rudong, Zhang, Wei, Ding, Yu, Zhao, Zeng, Zhang, Rongsheng, Lv, Tangjie, Fan, Changjie, and Hu, Zhipeng
Abstract: As a fine-grained and local expression behavior measurement, facial action unit (FAU) analysis (e.g., detection and intensity estimation) has been documented for its time-consuming, labor-intensive, and error-prone annotation. Thus a long-standing challenge of FAU analysis arises from the data scarcity of manual annotations, limiting the generalization ability of trained models to a large extent. Amounts of previous works have made efforts to alleviate this issue via semi/weakly supervised methods and extra auxiliary information. However, these methods still require domain knowledge and have not yet avoided the high dependency on data annotation. This article introduces a robust facial representation model MAE-Face for AU analysis. Using masked autoencoding as the self-supervised pre-training approach, MAE-Face first learns a high-capacity model from a feasible collection of face images without additional data annotations. Then after being fine-tuned on AU datasets, MAE-Face exhibits convincing performance for both AU detection and AU intensity estimation, achieving a new state-of-the-art on nearly all the evaluation results. Further investigation shows that MAE-Face achieves decent performance even when fine-tuned on only 1% of the AU training set, strongly proving its robustness and generalization performance. The pre-trained model is available at our GitHub repository. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

84. StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads

Author: Wang, Suzhen, primary, Ma, Yifeng, additional, Ding, Yu, additional, Hu, Zhipeng, additional, Fan, Changjie, additional, Lv, Tangjie, additional, Deng, Zhidong, additional, and Yu, Xin, additional
Published: 2024
Full Text: View/download PDF

85. VESPA: A General System for Vision-Based Extrasensory Perception Anticheating in Online FPS Games

Author: Zhao, Shiwei, Qi, Jiaheng, Hu, Zhipeng, Yan, Han, Wu, Runze, Shen, Xudong, Lv, Tangjie, and Fan, Changjie
Abstract: Cheating is widespread in online games, particularly in competitive games, such as first-person shooter (FPS) games. One of the most common types of cheating is extrasensory perception (ESP), which involves illicitly obtaining visual information to gain an unfair advantage over normal players. To protect the gaming experience of legitimate players and the interests of game companies, there is an urgent need for anticheating applications. In this article, we propose a general system for ESP anticheating in online FPS games, considering the business characteristics and industrial applications. We present a vision-based anticheating framework that incorporates both supervised and unsupervised solutions for comprehensive cheating detection. Based on this framework, we design and deploy a dual-audit human-in-the-loop system for industrial gaming anticheating applications. We evaluate our proposed framework from multiple online and offline perspectives and demonstrate its practical significance with superior performance.
Published: 2024
Full Text: View/download PDF

86. Modeling and Control of General Hydraulic Excavator for Human-in-the-loop Automation

Author: Chen, Guangda, primary, Gan, Yinghao, additional, Chen, Jiayi, additional, Shi, Shuanwu, additional, Chen, Wei, additional, Chen, Yingfeng, additional, Xiong, Rong, additional, and Fan, Changjie, additional
Published: 2023
Full Text: View/download PDF

87. Keep You from Leaving: Churn Prediction in Online Games

Author: Zheng, Angyu, primary, Chen, Liang, additional, Xie, Fenfang, additional, Tao, Jianrong, additional, Fan, Changjie, additional, and Zheng, Zibin, additional
Published: 2020
Full Text: View/download PDF

88. Efficient policy detecting and reusing for non-stationarity in Markov games

Author: Zheng, Yan, Hao, Jianye, Zhang, Zongzhang, Meng, Zhaopeng, Yang, Tianpei, Li, Yanran, and Fan, Changjie
Published: 2021
Full Text: View/download PDF

89. A Music-Driven Deep Generative Adversarial Model for Guzheng Playing Animation

Author: Fan Changjie, Ding Yu, Zhimeng Zhang, Zhao Zeng, Gongzheng Li, Zhigang Deng, and Chen Jiali
Subjects: Computer science, business.industry, Generalization, Process (engineering), Deep learning, Musical instrument, Animation, Computer Graphics and Computer-Aided Design, Motion capture, Motion (physics), Human–computer interaction, Signal Processing, Computer Vision and Pattern Recognition, Artificial intelligence, business, Software, Generative grammar
Abstract: To date relatively few efforts have been made on the automatic generation of musical instrument playing animations. This problem is challenging due to the intrinsically complex, temporal relationship between music and human motion as well as the lacking of high quality music-playing motion datasets. In this paper, we propose a fully automatic, deep learning based framework to synthesize realistic upper body animations based on novel guzheng music input. Specifically, based on a recorded audiovisual motion capture dataset, we delicately design a generative adversarial network (GAN) based approach to capture the temporal relationship between the music and the human motion data. In this process, data augmentation is employed to improve the generalization of our approach to handle a variety of guzheng music inputs. Through extensive objective and subjective experiments, we show that our method can generate visually plausible guzheng-playing animations that are well synchronized with the input guzheng music, and it can significantly outperform \uline{the state-of-the-art} methods. In addition, through an ablation study, we validate the contributions of the carefully-designed modules in our framework.
Published: 2023
Full Text: View/download PDF

90. A Data-Driven Decision Support Framework for Player Churn Analysis in Online Games

Author: Xiong, Yu, primary, Wu, Runze, additional, Zhao, Shiwei, additional, Tao, Jianrong, additional, Shen, Xudong, additional, Lyu, Tangjie, additional, Fan, Changjie, additional, and Cui, Peng, additional
Published: 2023
Full Text: View/download PDF

91. RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System

Author: Wang, Kai, primary, Zou, Zhene, additional, Zhao, Minghao, additional, Deng, Qilin, additional, Shang, Yue, additional, Liang, Yile, additional, Wu, Runze, additional, Shen, Xudong, additional, Lyu, Tangjie, additional, and Fan, Changjie, additional
Published: 2023
Full Text: View/download PDF

92. FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping

Author: Zeng, Hao, primary, Zhang, Wei, additional, Fan, Changjie, additional, Lv, Tangjie, additional, Wang, Suzhen, additional, Zhang, Zhimeng, additional, Ma, Bowen, additional, Li, Lincheng, additional, Ding, Yu, additional, and Yu, Xin, additional
Published: 2023
Full Text: View/download PDF

93. DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution Video

Author: Zhang, Zhimeng, primary, Hu, Zhipeng, additional, Deng, Wenjin, additional, Fan, Changjie, additional, Lv, Tangjie, additional, and Ding, Yu, additional
Published: 2023
Full Text: View/download PDF

94. StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles

Author: Ma, Yifeng, primary, Wang, Suzhen, additional, Hu, Zhipeng, additional, Fan, Changjie, additional, Lv, Tangjie, additional, Ding, Yu, additional, Deng, Zhidong, additional, and Yu, Xin, additional
Published: 2023
Full Text: View/download PDF

95. Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement

Author: Qi, Xingqun, primary, Liu, Chen, additional, Sun, Muyi, additional, Li, Lincheng, additional, Fan, Changjie, additional, and Yu, Xin, additional
Published: 2023
Full Text: View/download PDF

96. Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation

Author: Zhao, Rui, primary, Li, Wei, additional, Hu, Zhipeng, additional, Li, Lincheng, additional, Zou, Zhengxia, additional, Shi, Zhenwei, additional, and Fan, Changjie, additional
Published: 2023
Full Text: View/download PDF

97. Towards Unbiased Volume Rendering of Neural Implicit Surfaces with Geometry Priors

Author: Zhang, Yongqiang, primary, Hu, Zhipeng, additional, Wu, Haoqian, additional, Zhao, Minda, additional, Li, Lincheng, additional, Zou, Zhengxia, additional, and Fan, Changjie, additional
Published: 2023
Full Text: View/download PDF

98. Promoting human-AI interaction makes a better adoption of deep reinforcement learning: a real-world application in game industry

Author: Hu, Zhipeng, primary, Liu, Haoyu, additional, Xiong, Yu, additional, Wang, Lizi, additional, Wu, Runze, additional, Guan, Kai, additional, Hu, Yujing, additional, Lyu, Tangjie, additional, and Fan, Changjie, additional
Published: 2023
Full Text: View/download PDF

99. Towards Long-term Annotators: A Supervised Label Aggregation Baseline

Author: Liu, Haoyu, Wang, Fei, Lin, Minmin, Wu, Runze, Zhu, Renyu, Zhao, Shiwei, Wang, Kai, Lv, Tangjie, Fan, Changjie, Liu, Haoyu, Wang, Fei, Lin, Minmin, Wu, Runze, Zhu, Renyu, Zhao, Shiwei, Wang, Kai, Lv, Tangjie, and Fan, Changjie
Abstract: Relying on crowdsourced workers, data crowdsourcing platforms are able to efficiently provide vast amounts of labeled data. Due to the variability in the annotation quality of crowd workers, modern techniques resort to redundant annotations and subsequent label aggregation to infer true labels. However, these methods require model updating during the inference, posing challenges in real-world implementation. Meanwhile, in recent years, many data labeling tasks have begun to require skilled and experienced annotators, leading to an increasing demand for long-term annotators. These annotators could leave substantial historical annotation records on the crowdsourcing platforms, which can benefit label aggregation, but are ignored by previous works. Hereby, in this paper, we propose a novel label aggregation technique, which does not need any model updating during inference and can extensively explore the historical annotation records. We call it SuperLA, a Supervised Label Aggregation method. Inside this model, we design three types of input features and a straightforward neural network structure to merge all the information together and subsequently produce aggregated labels. Based on comparison experiments conducted on 22 public datasets and 11 baseline methods, we find that SuperLA not only outperforms all those baselines in inference performance but also offers significant advantages in terms of efficiency.
Published: 2023

100. See Through the Inside and Outside: Human Body and Anatomical Skeleton Prediction Network

Author: Peng, Zhiheng, primary, Zhao, Kai, additional, Chen, Xiaoran, additional, Chen, Yingfeng, additional, Fan, Changjie, additional, Tang, Bowei, additional, Xia, Siyu, additional, and Shang, Weijian, additional
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

294 results on '"Fan, Changjie"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources