Author: "Zhang, Longfei" / Publication Type: Reports - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhang, Longfei"' showing total 4 results

Start Over Author "Zhang, Longfei" Publication Type Reports

4 results on '"Zhang, Longfei"'

1. Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?

Author: Dai, Yang, Ma, Oubo, Zhang, Longfei, Liang, Xingxing, Hu, Shengchao, Wang, Mengzhu, Ji, Shouling, Huang, Jincai, and Shen, Li
Subjects: Computer Science - Machine Learning
Abstract: Transformer-based trajectory optimization methods have demonstrated exceptional performance in offline Reinforcement Learning (offline RL), yet it poses challenges due to substantial parameter size and limited scalability, which is particularly critical in sequential decision-making scenarios where resources are constrained such as in robots and drones with limited computational power. Mamba, a promising new linear-time sequence model, offers performance on par with transformers while delivering substantially fewer parameters on long sequences. As it remains unclear whether Mamba is compatible with trajectory optimization, this work aims to conduct comprehensive experiments to explore the potential of Decision Mamba in offline RL (dubbed DeMa) from the aspect of data structures and network architectures with the following insights: (1) Long sequences impose a significant computational burden without contributing to performance improvements due to the fact that DeMa's focus on sequences diminishes approximately exponentially. Consequently, we introduce a Transformer-like DeMa as opposed to an RNN-like DeMa. (2) For the components of DeMa, we identify that the hidden attention mechanism is key to its success, which can also work well with other residual structures and does not require position embedding. Extensive evaluations from eight Atari games demonstrate that our specially designed DeMa is compatible with trajectory optimization and surpasses previous state-of-the-art methods, outdoing Decision Transformer (DT) by 80\% with 30\% fewer parameters, and exceeds DT in MuJoCo with only a quarter of the parameters., Comment: 20 pages, 8 figures
Published: 2024

2. Reconstructing editable prismatic CAD from rounded voxel models

Author: Lambourne, Joseph G., Willis, Karl D. D., Jayaraman, Pradeep Kumar, Zhang, Longfei, Sanghi, Aditya, and Malekshan, Kamal Rahimi
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, I.2.10
Abstract: Reverse Engineering a CAD shape from other representations is an important geometric processing step for many downstream applications. In this work, we introduce a novel neural network architecture to solve this challenging task and approximate a smoothed signed distance function with an editable, constrained, prismatic CAD model. During training, our method reconstructs the input geometry in the voxel space by decomposing the shape into a series of 2D profile images and 1D envelope functions. These can then be recombined in a differentiable way allowing a geometric loss function to be defined. During inference, we obtain the CAD data by first searching a database of 2D constrained sketches to find curves which approximate the profile images, then extrude them and use Boolean operations to build the final CAD model. Our method approximates the target shape more closely than other methods and outputs highly editable constrained parametric sketches which are compatible with existing CAD software., Comment: SIGGRAPH Asia 2022 Conference Paper
Published: 2022
Full Text: View/download PDF

3. TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations

Author: Huang, Shiyu, Chen, Wenze, Zhang, Longfei, Xu, Shizhen, Li, Ziyang, Zhu, Fengming, Ye, Deheng, Chen, Ting, and Zhu, Jun
Subjects: Computer Science - Artificial Intelligence
Abstract: Deep reinforcement learning (DRL) has achieved super-human performance on complex video games (e.g., StarCraft II and Dota II). However, current DRL systems still suffer from challenges of multi-agent coordination, sparse rewards, stochastic environments, etc. In seeking to address these challenges, we employ a football video game, e.g., Google Research Football (GRF), as our testbed and develop an end-to-end learning-based AI system (denoted as TiKick) to complete this challenging task. In this work, we first generated a large replay dataset from the self-playing of single-agent experts, which are obtained from league training. We then developed a distributed learning system and new offline algorithms to learn a powerful multi-agent AI from the fixed single-agent dataset. To the best of our knowledge, Tikick is the first learning-based AI system that can take over the multi-agent Google Research Football full game, while previous work could either control a single agent or experiment on toy academic scenarios. Extensive experiments further show that our pre-trained model can accelerate the training process of the modern multi-agent algorithm and our method achieves state-of-the-art performances on various academic scenarios.
Published: 2021

4. Inverse design of glass structure with deep graph neural networks

Author: Wang, Qi and Zhang, Longfei
Subjects: Condensed Matter - Materials Science
Abstract: Directly manipulating the atomic structure to achieve a specific property is a long pursuit in the field of materials. However, hindered by the disordered, non-prototypical glass structure and the complex interplay between structure and property, such inverse design is dauntingly hard for glasses. Here, combining two cutting-edge techniques, graph neural networks and swap Monte Carlo, we develop a data-driven, property-oriented inverse design route that managed to improve the plastic resistance of Cu-Zr metallic glasses in a controllable way. Swap Monte Carlo, as "sampler", effectively explores the glass landscape, and graph neural networks, with high regression accuracy in predicting the plastic resistance, serves as "decider" to guide the search in configuration space. Via an unconventional strengthening mechanism, a geometrically ultra-stable yet energetically meta-stable state is unraveled, contrary to the common belief that the higher the energy, the lower the plastic resistance. This demonstrates a vast configuration space that can be easily overlooked by conventional atomistic simulations. The data-driven techniques, structural search methods and optimization algorithms consolidate to form a toolbox, paving a new way to the design of glassy materials.
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

4 results on '"Zhang, Longfei"'

1. Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?

2. Reconstructing editable prismatic CAD from rounded voxel models

3. TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations

4. Inverse design of glass structure with deep graph neural networks

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Database

4 results on '"Zhang, Longfei"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources