Author: "Yang,Mengjiao" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yang,Mengjiao"' showing total 204 results

Start Over Author "Yang,Mengjiao"

204 results on '"Yang,Mengjiao"'

1. Video Language Planning

Author: Du, Yilun, Yang, Mengjiao, Florence, Pete, Xia, Fei, Wahid, Ayzaan, Ichter, Brian, Sermanet, Pierre, Yu, Tianhe, Abbeel, Pieter, Tenenbaum, Joshua B., Kaelbling, Leslie, Zeng, Andy, and Tompson, Jonathan
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Robotics
Abstract: We are interested in enabling visual planning for complex long-horizon tasks in the space of generated videos and language, leveraging recent advances in large generative models pretrained on Internet-scale data. To this end, we present video language planning (VLP), an algorithm that consists of a tree search procedure, where we train (i) vision-language models to serve as both policies and value functions, and (ii) text-to-video models as dynamics models. VLP takes as input a long-horizon task instruction and current image observation, and outputs a long video plan that provides detailed multimodal (video and language) specifications that describe how to complete the final task. VLP scales with increasing computation budget where more computation time results in improved video plans, and is able to synthesize long-horizon video plans across different robotics domains: from multi-object rearrangement, to multi-camera bi-arm dexterous manipulation. Generated video plans can be translated into real robot actions via goal-conditioned policies, conditioned on each intermediate frame of the generated video. Experiments show that VLP substantially improves long-horizon task success rates compared to prior methods on both simulated and real robots (across 3 hardware platforms)., Comment: https://video-language-planning.github.io/
Published: 2023

2. Influence of Different Sowing Dates on Yield and Quality of Corn Xianyu 335

Author: Song Yanling, Zhou Guangsheng, Guo Jianping, Pan Yaru, Yang Mengjiao, Tian Jinfeng, Li Xiangxue, Meng Xiangyi, Lan Huiting, Jiang Weiguang, Sui Dan, Zhou Lingyu, Shi Junchen, Nie Chang, and Man Yi
Subjects: corn, sowing date, measures to adapt to climate change, Meteorology. Climatology, QC851-999
Abstract: Using data from Yushu Agricultural Meteorological Station of Jilin from 2018 to 2023, the impact of different sowing dates of corn is investigated focusing on its growth and yield composition as well as grain quality under global warming. It is also debated whether adjusting the sowing date of corn could be a measure for agriculture to adapt to climate change. Results show that the utilization efficiency of accumulated temperature during the growing season of corn varies with different sowing dates. The accumulated temperature is the highest in the first sowing date and lowest in the fourth sowing date, with an average decrease of 8.3% compared to the first sowing date. Different sowing dates of corn have an impact on the growing period. The duration of the first sowing date for corn is extended by an average of 7.5 days compared to the normal sowing date, while durations of the third and fourth sowing dates are shortened by 5.7 days and 13.8 days, respectively. Different sowing dates have an impact on the yield structure of corn. In the first sowing date, there is an increase in the weight of 100 grains of corn in 2 years during 6 years, while a decrease by 4.8% and 8.7% in the third and fourth sowing dates compared to the normal sowing date. The average number of grains per plant in the first sowing date increases by 0.2%, while decreases by 6.0% and 9.3% in the third and fourth sowing date. Overall, delaying the corn sowing date by 10 days and 20 days results in an average yield reduction of 10.9% and 17.1%. Sowing corn 10 days earlier could increase the yield in some years. The change in sowing dates has little effect on the quality of grains. Therefore, an early corn sowing date can be utilized as a strategy to adapt to climate change in certain regions of Northeast China.
Published: 2024
Full Text: View/download PDF

3. Probabilistic Adaptation of Text-to-Video Models

Author: Yang, Mengjiao, Du, Yilun, Dai, Bo, Schuurmans, Dale, Tenenbaum, Joshua B., and Abbeel, Pieter
Subjects: Computer Science - Artificial Intelligence
Abstract: Large text-to-video models trained on internet-scale data have demonstrated exceptional capabilities in generating high-fidelity videos from arbitrary textual descriptions. However, adapting these models to tasks with limited domain-specific data, such as animation or robotics videos, poses a significant computational challenge, since finetuning a pretrained large model can be prohibitively expensive. Inspired by how a small modifiable component (e.g., prompts, prefix-tuning) can adapt a large language model to perform new tasks without requiring access to the model weights, we investigate how to adapt a large pretrained text-to-video model to a variety of downstream domains and tasks without finetuning. In answering this question, we propose Video Adapter, which leverages the score function of a large pretrained video diffusion model as a probabilistic prior to guide the generation of a task-specific small video model. Our experiments show that Video Adapter is capable of incorporating the broad knowledge and preserving the high fidelity of a large pretrained video model in a task-specific small video model that is able to generate high-quality yet specialized videos on a variety of tasks such as animation, egocentric modeling, and modeling of simulated and real-world robotics data. More videos can be found on the website https://video-adapter.github.io/., Comment: Project website https://video-adapter.github.io/. First two authors contributed equally
Published: 2023

4. The associations between modifiable risk factors and constipation: a comprehensive mendelian randomization study

Author: Chang, Hong, Ouyang, Jing, Tian, Meng, Yang, Jin, Gao, Jie, Yang, Mengjiao, Zhang, Meng, Yuan, Hao, Zheng, Ya, Wang, Yuping, and Chen, Zhaofeng
Published: 2024
Full Text: View/download PDF

5. Social support mediates social frailty with anxiety and depression

Author: Liu, Yang, Yang, Mengjiao, Zhao, Yangyang, Wang, Ziwei, He, Jie, Wang, Yali, and Anme, Tokie
Published: 2024
Full Text: View/download PDF

6. Identification and prediction of frailty among community-dwelling older Japanese adults based on Bayesian network analysis: a cross-sectional and longitudinal study

Author: Yang, Mengjiao, Liu, Yang, Miura, Kumi Watanabe, Matsumoto, Munenori, Jiao, Dandan, Zhu, Zhu, Li, Xiang, Cui, Mingyu, Zhang, Jinrui, Qian, Meiling, Huang, Lujiao, and Anme, Tokie
Published: 2024
Full Text: View/download PDF

7. Learning Universal Policies via Text-Guided Video Generation

Author: Du, Yilun, Yang, Mengjiao, Dai, Bo, Dai, Hanjun, Nachum, Ofir, Tenenbaum, Joshua B., Schuurmans, Dale, and Abbeel, Pieter
Subjects: Computer Science - Artificial Intelligence
Abstract: A goal of artificial intelligence is to construct an agent that can solve a wide variety of tasks. Recent progress in text-guided image synthesis has yielded models with an impressive ability to generate complex novel images, exhibiting combinatorial generalization across domains. Motivated by this success, we investigate whether such tools can be used to construct more general-purpose agents. Specifically, we cast the sequential decision making problem as a text-conditioned video generation problem, where, given a text-encoded specification of a desired goal, a planner synthesizes a set of future frames depicting its planned actions in the future, after which control actions are extracted from the generated video. By leveraging text as the underlying goal specification, we are able to naturally and combinatorially generalize to novel goals. The proposed policy-as-video formulation can further represent environments with different state and action spaces in a unified space of images, which, for example, enables learning and generalization across a variety of robot manipulation tasks. Finally, by leveraging pretrained language embeddings and widely available videos from the internet, the approach enables knowledge transfer through predicting highly realistic video plans for real robots., Comment: NeurIPS 2023, Project Website: https://universal-policy.github.io/
Published: 2023

8. Dichotomy of Control: Separating What You Can Control from What You Cannot

Author: Yang, Mengjiao, Schuurmans, Dale, Abbeel, Pieter, and Nachum, Ofir
Subjects: Computer Science - Machine Learning
Abstract: Future- or return-conditioned supervised learning is an emerging paradigm for offline reinforcement learning (RL), where the future outcome (i.e., return) associated with an observed action sequence is used as input to a policy trained to imitate those same actions. While return-conditioning is at the heart of popular algorithms such as decision transformer (DT), these methods tend to perform poorly in highly stochastic environments, where an occasional high return can arise from randomness in the environment rather than the actions themselves. Such situations can lead to a learned policy that is inconsistent with its conditioning inputs; i.e., using the policy to act in the environment, when conditioning on a specific desired return, leads to a distribution of real returns that is wildly different than desired. In this work, we propose the dichotomy of control (DoC), a future-conditioned supervised learning framework that separates mechanisms within a policy's control (actions) from those beyond a policy's control (environment stochasticity). We achieve this separation by conditioning the policy on a latent variable representation of the future, and designing a mutual information constraint that removes any information from the latent variable associated with randomness in the environment. Theoretically, we show that DoC yields policies that are consistent with their conditioning inputs, ensuring that conditioning a learned policy on a desired high-return future outcome will correctly induce high-return behavior. Empirically, we show that DoC is able to achieve significantly better performance than DT on environments that have highly stochastic rewards and transition
Published: 2022

9. Research Progress of ADAM17 in Gastric Cancer

Author: YANG Mengjiao, YUAN Hao, ZHENG Ya, WANG Yuping, and GUO Qinghong
Subjects: gastric cancer, adam17, mechanism of action, targeted therapy, prognosis, Medicine
Abstract: Gastric cancer is one of the most common malignant tumors in the world. Patients with gastric cancer are often treated by surgery, radiotherapy, chemotherapy or immunotherapy, but the clinical efficacy and prognosis are poor. As an important member of ADAMs family, a disintegrin and metalloprotease 17 (ADAM17) is significantly more highly expressed in gastric cancer than in adjacent tissues. It participates in the occurrence and development of gastric cancer by mediating EGFR, TNF-α, TGF-β/Smad, Notch and Wnt, FoxM1-ADAM17 and EGFR/ERK/SP1. The high expression of ADAM17 is also closely related to the poor prognosis of gastric cancer, suggesting that ADAM17 can be used as a biological index to predict the development and prognosis of gastric cancer and has great potential to become a new therapeutic target for gastric cancer. In this paper, the mechanism, treatment and prognosis of ADAM17 in the development of gastric cancer are reviewed, in order to provide new ideas for clinical diagnosis and treatment of gastric cancer.
Published: 2024
Full Text: View/download PDF

10. Making Linear MDPs Practical via Contrastive Representation Learning

Author: Zhang, Tianjun, Ren, Tongzheng, Yang, Mengjiao, Gonzalez, Joseph E., Schuurmans, Dale, and Dai, Bo
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: It is common to address the curse of dimensionality in Markov decision processes (MDPs) by exploiting low-rank representations. This motivates much of the recent theoretical study on linear MDPs. However, most approaches require a given representation under unrealistic assumptions about the normalization of the decomposition or introduce unresolved computational challenges in practice. Instead, we consider an alternative definition of linear MDPs that automatically ensures normalization while allowing efficient representation learning via contrastive estimation. The framework also admits confidence-adjusted index algorithms, enabling an efficient and principled approach to incorporating optimism or pessimism in the face of uncertainty. To the best of our knowledge, this provides the first practical representation learning method for linear MDPs that achieves both strong theoretical guarantees and empirical performance. Theoretically, we prove that the proposed algorithm is sample efficient in both the online and offline settings. Empirically, we demonstrate superior performance over existing state-of-the-art model-based and model-free algorithms on several benchmarks., Comment: ICML 2022. The first two authors contribute equally
Published: 2022

11. Offline RL for Natural Language Generation with Implicit Language Q Learning

Author: Snell, Charlie, Kostrikov, Ilya, Su, Yi, Yang, Mengjiao, and Levine, Sergey
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large language models distill broad knowledge from text corpora. However, they can be inconsistent when it comes to completing user specified tasks. This issue can be addressed by finetuning such models via supervised learning on curated datasets, or via reinforcement learning. In this work, we propose a novel offline RL method, implicit language Q-learning (ILQL), designed for use on language models, that combines both the flexible utility maximization framework of RL algorithms with the ability of supervised learning to leverage previously collected data, as well as its simplicity and stability. Our method employs a combination of value conservatism alongside an implicit dataset support constraint in learning value functions, which are then used to guide language model generations towards maximizing user-specified utility functions. In addition to empirically validating ILQL, we present a detailed empirical analysis of situations where offline RL can be useful in natural language generation settings, demonstrating how it can be a more effective utility optimizer than prior approaches for end-to-end dialogue, and how it can effectively optimize high variance reward functions based on subjective judgement, such as whether to label a comment as toxic or not.
Published: 2022

12. Multi-Game Decision Transformers

Author: Lee, Kuang-Huei, Nachum, Ofir, Yang, Mengjiao, Lee, Lisa, Freeman, Daniel, Xu, Winnie, Guadarrama, Sergio, Fischer, Ian, Jang, Eric, Michalewski, Henryk, and Mordatch, Igor
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: A longstanding goal of the field of AI is a method for learning a highly capable, generalist agent from diverse experience. In the subfields of vision and language, this was largely achieved by scaling up transformer-based models and training them on large, diverse datasets. Motivated by this progress, we investigate whether the same strategy can be used to produce generalist reinforcement learning agents. Specifically, we show that a single transformer-based model - with a single set of weights - trained purely offline can play a suite of up to 46 Atari games simultaneously at close-to-human performance. When trained and evaluated appropriately, we find that the same trends observed in language and vision hold, including scaling of performance with model size and rapid adaptation to new games via fine-tuning. We compare several approaches in this multi-game setting, such as online and offline RL methods and behavioral cloning, and find that our Multi-Game Decision Transformer models offer the best scalability and performance. We release the pre-trained models and code to encourage further research in this direction., Comment: NeurIPS 2022. 24 pages, 16 figures. Additional information, videos and code can be seen at https://sites.google.com/view/multi-game-transformers
Published: 2022

13. Chain of Thought Imitation with Procedure Cloning

Author: Yang, Mengjiao, Schuurmans, Dale, Abbeel, Pieter, and Nachum, Ofir
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Imitation learning aims to extract high-performance policies from logged demonstrations of expert behavior. It is common to frame imitation learning as a supervised learning problem in which one fits a function approximator to the input-output mapping exhibited by the logged demonstrations (input observations to output actions). While the framing of imitation learning as a supervised input-output learning problem allows for applicability in a wide variety of settings, it is also an overly simplistic view of the problem in situations where the expert demonstrations provide much richer insight into expert behavior. For example, applications such as path navigation, robot manipulation, and strategy games acquire expert demonstrations via planning, search, or some other multi-step algorithm, revealing not just the output action to be imitated but also the procedure for how to determine this action. While these intermediate computations may use tools not available to the agent during inference (e.g., environment simulators), they are nevertheless informative as a way to explain an expert's mapping of state to actions. To properly leverage expert procedure information without relying on the privileged tools the expert may have used to perform the procedure, we propose procedure cloning, which applies supervised sequence prediction to imitate the series of expert computations. This way, procedure cloning learns not only what to do (i.e., the output action), but how and why to do it (i.e., the procedure). Through empirical analysis on navigation, simulated robotic manipulation, and game-playing environments, we show that imitating the intermediate computations of an expert's behavior enables procedure cloning to learn policies exhibiting significant generalization to unseen environment configurations, including those configurations for which running the expert's procedure directly is infeasible.
Published: 2022

14. CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Author: Verma, Siddharth, Fu, Justin, Yang, Mengjiao, and Levine, Sergey
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Conventionally, generation of natural language for dialogue agents may be viewed as a statistical learning problem: determine the patterns in human-provided data and generate appropriate responses with similar statistical properties. However, dialogue can also be regarded as a goal directed process, where speakers attempt to accomplish a specific task. Reinforcement learning (RL) algorithms are designed specifically for solving such goal-directed problems, but the most direct way to apply RL -- through trial-and-error learning in human conversations, -- is costly. In this paper, we study how offline reinforcement learning can instead be used to train dialogue agents entirely using static datasets collected from human speakers. Our experiments show that recently developed offline RL methods can be combined with language models to yield realistic dialogue agents that better accomplish task goals.
Published: 2022

15. Context-Aware Language Modeling for Goal-Oriented Dialogue Systems

Author: Snell, Charlie, Yang, Mengjiao, Fu, Justin, Su, Yi, and Levine, Sergey
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Goal-oriented dialogue systems face a trade-off between fluent language generation and task-specific control. While supervised learning with large language models is capable of producing realistic text, how to steer such responses towards completing a specific task without sacrificing language quality remains an open question. In this work, we formulate goal-oriented dialogue as a partially observed Markov decision process, interpreting the language model as a representation of both the dynamics and the policy. This view allows us to extend techniques from learning-based control, such as task relabeling, to derive a simple and effective method to finetune language models in a goal-aware way, leading to significantly improved task performance. We additionally introduce a number of training strategies that serve to better focus the model on the task at hand. We evaluate our method, Context-Aware Language Models (CALM), on a practical flight-booking task using AirDialogue. Empirically, CALM outperforms the state-of-the-art method by 7% in terms of task success, matching human-level task performance.
Published: 2022

16. Establishment and Validation of A Prediction Model for the Occurrence of Acute Kidney Disease in Patients with Liver Cirrhosis

Author: OUYANG Jing, CHANG Hong, YANG Mengjiao, ZHANG Meng, TIAN Meng, ZHENG Ya, WANG Yuping, and CHEN Zhaofeng
Subjects: liver cirrhosis, acute kidney disease, nomogram, predictive model, Medicine
Abstract: Objective To establish a model that can predict the occurrence of acute kidney disease (AKD) in liver cirrhotic patients and evaluate its performance. Methods Liver cirrhotic patients who hospitalized in the department of gastroenterology of the First Hospital of Lanzhou University from January 2017 to January 2022 were retrospectively included. They were divided into AKD and non-AKD groups according to whether they were combined with AKD during hospitalization, and were randomized into training and validation sets in a 7∶3 ratio. The clinical data of patients in the two groups were collected, and LASSO regression and multifactorial Logistic regression were used to screen the influencing factors for the occurrence of AKD in patients with liver cirrhosis and to establish a prediction model. The model was then evaluated by using the receiver operating characteristic curve, the calibration curve and the clinical decision curve. Results A total of 796 cases of liver cirrhotic patients who met the inclusion and exclusion criteria were enrolled. Among them, 103 cases were in the AKD group and 693 cases were in the non-AKD group; 561 cases were in the training set and 235 cases were in the validation set. The results of LASSO regression and multifactorial Logistic regression showed that a history of diabetes (OR=2.922, 95% CI: 1.290-6.564, P=0.009), hepatic encephalopathy (OR=6.210, 95% CI: 2.278-17.479, P < 0.001), gastrointestinal bleeding (OR=2.501, 95% CI: 1.236-5.073, P=0.011), ascites (OR=3.219, 95% CI: 1.664-6.539, P < 0.001), male (OR=0.477, 95% CI: 0.254-0.879, P=0.019), hemoglobin (OR=0.987, 95% CI: 0.975-0.999, P=0.044), albumin (OR=0.952, 95% CI: 0.911-0.991, P=0.023), and prothrombin time (OR=0.865, 95% CI: 0.779-0.920, P < 0.001) were the independent influences on the occurrence of AKD in liver cirrhotic patients, and were used to construct a prediction model. The area under the curve of the model in the training set and validation set for predicting the occurrence of AKD in liver cirrhotic patients was 0.895 (95% CI: 0.865-0.925) and 0.869 (95% CI: 0.807-0.930), respectively. The calibration curves showed that the model had good fit and consistency and the clinical decision curves showed that the use of the model for predicting the risk of AKD could benefit liver cirrhotic patients overall. Conclusions A prediction model for the occurrence of AKD in liver cirrhotic patients was established based on eight influencing factors, including gender, history of diabetes, and hepatic encephalopathy. It was validated to have good discrimination, calibration, and clinical utility, and is expected to assist in the clinical early screening and identification of liver cirrhosis-associated AKD.
Published: 2024
Full Text: View/download PDF

17. Research on improvement strategies for a lightweight multi-object weed detection network based on YOLOv5

Author: Sun, Jiandong, You, Jinlong, Li, Fengmei, Sun, Jianhong, Yang, Mengjiao, Zhao, Xueguan, Jin, Ning, and Bai, Haoran
Published: 2024
Full Text: View/download PDF

18. TRAIL: Near-Optimal Imitation Learning with Suboptimal Data

Author: Yang, Mengjiao, Levine, Sergey, and Nachum, Ofir
Subjects: Computer Science - Machine Learning
Abstract: The aim in imitation learning is to learn effective policies by utilizing near-optimal expert demonstrations. However, high-quality demonstrations from human experts can be expensive to obtain in large numbers. On the other hand, it is often much easier to obtain large quantities of suboptimal or task-agnostic trajectories, which are not useful for direct imitation, but can nevertheless provide insight into the dynamical structure of the environment, showing what could be done in the environment even if not what should be done. We ask the question, is it possible to utilize such suboptimal offline datasets to facilitate provably improved downstream imitation learning? In this work, we answer this question affirmatively and present training objectives that use offline datasets to learn a factored transition model whose structure enables the extraction of a latent action space. Our theoretical analysis shows that the learned latent action space can boost the sample-efficiency of downstream imitation learning, effectively reducing the need for large near-optimal expert datasets through the use of auxiliary non-expert data. To learn the latent action space in practice, we propose TRAIL (Transition-Reparametrized Actions for Imitation Learning), an algorithm that learns an energy-based transition model contrastively, and uses the transition model to reparametrize the action space for sample-efficient imitation learning. We evaluate the practicality of our objective through experiments on a set of navigation and locomotion tasks. Our results verify the benefits suggested by our theory and show that TRAIL is able to improve baseline imitation learning by up to 4x in performance.
Published: 2021

19. Study on the effects of inorganic salts and ionic surfactants on the wettability of coal based on the experimental and molecular dynamics investigations

Author: Wang, Kai, Xu, Min, Zhou, Biao, Yang, Mengjiao, and Liu, Ruiping
Published: 2024
Full Text: View/download PDF

20. Combiner: Full Attention Transformer with Sparse Computation Cost

Author: Ren, Hongyu, Dai, Hanjun, Dai, Zihang, Yang, Mengjiao, Leskovec, Jure, Schuurmans, Dale, and Dai, Bo
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Transformers provide a class of expressive architectures that are extremely effective for sequence modeling. However, the key limitation of transformers is their quadratic memory and time complexity $\mathcal{O}(L^2)$ with respect to the sequence length in attention layers, which restricts application in extremely long sequences. Most existing approaches leverage sparsity or low-rank assumptions in the attention matrix to reduce cost, but sacrifice expressiveness. Instead, we propose Combiner, which provides full attention capability in each attention head while maintaining low computation and memory complexity. The key idea is to treat the self-attention mechanism as a conditional expectation over embeddings at each location, and approximate the conditional distribution with a structured factorization. Each location can attend to all other locations, either via direct attention, or through indirect attention to abstractions, which are again conditional expectations of embeddings from corresponding local regions. We show that most sparse attention patterns used in existing sparse transformers are able to inspire the design of such factorization for full attention, resulting in the same sub-quadratic cost ($\mathcal{O}(L\log(L))$ or $\mathcal{O}(L\sqrt{L})$). Combiner is a drop-in replacement for attention layers in existing transformers and can be easily implemented in common frameworks. An experimental evaluation on both autoregressive and bidirectional sequence tasks demonstrates the effectiveness of this approach, yielding state-of-the-art results on several image and text modeling tasks., Comment: NeurIPS 2021 spotlight
Published: 2021

21. Provable Representation Learning for Imitation with Contrastive Fourier Features

Author: Nachum, Ofir and Yang, Mengjiao
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: In imitation learning, it is common to learn a behavior policy to match an unknown target policy via max-likelihood training on a collected set of target demonstrations. In this work, we consider using offline experience datasets - potentially far from the target distribution - to learn low-dimensional state representations that provably accelerate the sample-efficiency of downstream imitation learning. A central challenge in this setting is that the unknown target policy itself may not exhibit low-dimensional behavior, and so there is a potential for the representation learning objective to alias states in which the target policy acts differently. Circumventing this challenge, we derive a representation learning objective that provides an upper bound on the performance difference between the target policy and a lowdimensional policy trained with max-likelihood, and this bound is tight regardless of whether the target policy itself exhibits low-dimensional structure. Moving to the practicality of our method, we show that our objective can be implemented as contrastive learning, in which the transition dynamics are approximated by either an implicit energy-based model or, in some special cases, an implicit linear model with representations given by random Fourier features. Experiments on both tabular environments and high-dimensional Atari games provide quantitative evidence for the practical benefits of our proposed objective.
Published: 2021

22. Benchmarks for Deep Off-Policy Evaluation

Author: Fu, Justin, Norouzi, Mohammad, Nachum, Ofir, Tucker, George, Wang, Ziyu, Novikov, Alexander, Yang, Mengjiao, Zhang, Michael R., Chen, Yutian, Kumar, Aviral, Paduraru, Cosmin, Levine, Sergey, and Paine, Tom Le
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Off-policy evaluation (OPE) holds the promise of being able to leverage large, offline datasets for both evaluating and selecting complex policies for decision making. The ability to learn offline is particularly important in many real-world domains, such as in healthcare, recommender systems, or robotics, where online data collection is an expensive and potentially dangerous process. Being able to accurately evaluate and select high-performing policies without requiring online interaction could yield significant benefits in safety, time, and cost for these applications. While many OPE methods have been proposed in recent years, comparing results between papers is difficult because currently there is a lack of a comprehensive and unified benchmark, and measuring algorithmic progress has been challenging due to the lack of difficult evaluation tasks. In order to address this gap, we present a collection of policies that in conjunction with existing offline datasets can be used for benchmarking off-policy evaluation. Our tasks include a range of challenging high-dimensional continuous control problems, with wide selections of datasets and policies for performing policy selection. The goal of our benchmark is to provide a standardized measure of progress that is motivated from a set of principles designed to challenge and test the limits of existing OPE methods. We perform an evaluation of state-of-the-art algorithms and provide open-source access to our data and code to foster future research in this area., Comment: ICLR 2021 paper. Policies and evaluation code are available at https://github.com/google-research/deep_ope
Published: 2021

23. Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach

Author: Jiang, Haoming, Dai, Bo, Yang, Mengjiao, Zhao, Tuo, and Wei, Wei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Reliable automatic evaluation of dialogue systems under an interactive environment has long been overdue. An ideal environment for evaluating dialog systems, also known as the Turing test, needs to involve human interaction, which is usually not affordable for large-scale experiments. Though researchers have attempted to use metrics (e.g., perplexity, BLEU) in language generation tasks or some model-based reinforcement learning methods (e.g., self-play evaluation) for automatic evaluation, these methods only show a very weak correlation with the actual human evaluation in practice. To bridge such a gap, we propose a new framework named ENIGMA for estimating human evaluation scores based on recent advances of off-policy evaluation in reinforcement learning. ENIGMA only requires a handful of pre-collected experience data, and therefore does not involve human interaction with the target policy during the evaluation, making automatic evaluations feasible. More importantly, ENIGMA is model-free and agnostic to the behavior policies for collecting the experience data (see details in Section 2), which significantly alleviates the technical difficulties of modeling complex dialogue environments and human behaviors. Our experiments show that ENIGMA significantly outperforms existing methods in terms of correlation with human evaluation scores., Comment: Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Published: 2021

24. Representation Matters: Offline Pretraining for Sequential Decision Making

Author: Yang, Mengjiao and Nachum, Ofir
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The recent success of supervised learning methods on ever larger offline datasets has spurred interest in the reinforcement learning (RL) field to investigate whether the same paradigms can be translated to RL algorithms. This research area, known as offline RL, has largely focused on offline policy optimization, aiming to find a return-maximizing policy exclusively from offline data. In this paper, we consider a slightly different approach to incorporating offline data into sequential decision-making. We aim to answer the question, what unsupervised objectives applied to offline datasets are able to learn state representations which elevate performance on downstream tasks, whether those downstream tasks be online RL, imitation learning from expert demonstrations, or even offline policy optimization based on the same offline dataset? Through a variety of experiments utilizing standard offline RL datasets, we find that the use of pretraining with unsupervised learning objectives can dramatically improve the performance of policy learning algorithms that otherwise yield mediocre performance on their own. Extensive ablations further provide insights into what components of these unsupervised objectives -- e.g., reward prediction, continuous or discrete representations, pretraining or finetuning -- are most important and in which settings.
Published: 2021

25. Offline Policy Selection under Uncertainty

Author: Yang, Mengjiao, Dai, Bo, Nachum, Ofir, Tucker, George, and Schuurmans, Dale
Subjects: Computer Science - Machine Learning
Abstract: The presence of uncertainty in policy evaluation significantly complicates the process of policy ranking and selection in real-world settings. We formally consider offline policy selection as learning preferences over a set of policy prospects given a fixed experience dataset. While one can select or rank policies based on point estimates of their policy values or high-confidence intervals, access to the full distribution over one's belief of the policy value enables more flexible selection algorithms under a wider range of downstream evaluation metrics. We propose BayesDICE for estimating this belief distribution in terms of posteriors of distribution correction ratios derived from stochastic constraints (as opposed to explicit likelihood, which is not available). Empirically, BayesDICE is highly competitive to existing state-of-the-art approaches in confidence interval estimation. More importantly, we show how the belief distribution estimated by BayesDICE may be used to rank policies with respect to any arbitrary downstream policy selection metric, and we empirically demonstrate that this selection procedure significantly outperforms existing approaches, such as ranking policies according to mean or high-confidence lower bound value estimates.
Published: 2020

26. Off-Policy Evaluation via the Regularized Lagrangian

Author: Yang, Mengjiao, Nachum, Ofir, Dai, Bo, Li, Lihong, and Schuurmans, Dale
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: The recently proposed distribution correction estimation (DICE) family of estimators has advanced the state of the art in off-policy evaluation from behavior-agnostic data. While these estimators all perform some form of stationary distribution correction, they arise from different derivations and objective functions. In this paper, we unify these estimators as regularized Lagrangians of the same linear program. The unification allows us to expand the space of DICE estimators to new alternatives that demonstrate improved performance. More importantly, by analyzing the expanded space of estimators both mathematically and empirically we find that dual solutions offer greater flexibility in navigating the tradeoff between optimization stability and estimation bias, and generally provide superior estimates in practice.
Published: 2020

27. Energy-Based Processes for Exchangeable Data

Author: Yang, Mengjiao, Dai, Bo, Dai, Hanjun, and Schuurmans, Dale
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Recently there has been growing interest in modeling sets with exchangeability such as point clouds. A shortcoming of current approaches is that they restrict the cardinality of the sets considered or can only express limited forms of distribution over unobserved data. To overcome these limitations, we introduce Energy-Based Processes (EBPs), which extend energy based models to exchangeable data while allowing neural network parameterizations of the energy function. A key advantage of these models is the ability to express more flexible distributions over sets without restricting their cardinality. We develop an efficient training procedure for EBPs that demonstrates state-of-the-art performance on a variety of tasks such as point cloud generation, classification, denoising, and image completion.
Published: 2020

28. Benchmarking Attribution Methods with Relative Feature Importance

Author: Yang, Mengjiao and Kim, Been
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Interpretability is an important area of research for safe deployment of machine learning systems. One particular type of interpretability method attributes model decisions to input features. Despite active development, quantitative evaluation of feature attribution methods remains difficult due to the lack of ground truth: we do not know which input features are in fact important to a model. In this work, we propose a framework for Benchmarking Attribution Methods (BAM) with a priori knowledge of relative feature importance. BAM includes 1) a carefully crafted dataset and models trained with known relative feature importance and 2) three complementary metrics to quantitatively evaluate attribution methods by comparing feature attributions between pairs of models and pairs of inputs. Our evaluation on several widely-used attribution methods suggests that certain methods are more likely to produce false positive explanations---features that are incorrectly attributed as more important to model prediction. We open source our dataset, models, and metrics.
Published: 2019

29. GraphIt: A High-Performance DSL for Graph Analytics

Author: Zhang, Yunming, Yang, Mengjiao, Baghdadi, Riyadh, Kamil, Shoaib, Shun, Julian, and Amarasinghe, Saman
Subjects: Computer Science - Programming Languages
Abstract: The performance bottlenecks of graph applications depend not only on the algorithm and the underlying hardware, but also on the size and structure of the input graph. Programmers must try different combinations of a large set of techniques to develop the best implementation for a specific algorithm and type of graph. Existing graph frameworks lack flexibility, supporting only a limited set of optimizations. This paper introduces GraphIt, a new DSL for graph computations that generates fast implementations for algorithms with different performance characteristics running on graphs with different sizes and structures. GraphIt separates what is computed (algorithm) from how it is computed (schedule). Programmers specify the algorithm using an algorithm language, and performance optimizations are specified using a scheduling language. The algorithm language simplifies expressing the algorithms. We formulate graph optimizations, including edge traversal direction, data layout, parallelization, cache, NUMA, and kernel fusion optimizations, as tradeoffs among locality, parallelism, and work-efficiency. The scheduling language enables programmers to easily search through this complicated tradeoff space by composing together optimizations. We also built an autotuner to automatically find high-performance schedules. The compiler uses a new scheduling representation, the graph iteration space, to model, compose, and ensure the validity of the large number of optimizations. GraphIt outperforms the next fastest of six state-of-the-art shared-memory frameworks (Ligra, Green-Marl, GraphMat, Galois, Gemini, and Grazelle) on 24 out of 32 experiments by up to 4.8$\times$, and is never more than 43% slower than the fastest framework on the other experiments. GraphIt also reduces the lines of code by up to an order of magnitude compared to the next fastest framework., Comment: Paper Accepted at OOPSLA 2018
Published: 2018

30. Epigenetic drug (XL019) JAK2 inhibitor increases mitochondrial function in brown adipocytes by upregulating mitochondrial uncoupling protein 1 (UCP1), screening of epigenetic drug libraries, cell viability, and in-silico studies

Author: Reyad-ul-Ferdous, Md., Abdalla, Mohnad, Yang, Mengjiao, Xiaoling, Li, Bian, Wenbo, Xie, Jin, and Song, Yongfeng
Published: 2022
Full Text: View/download PDF

31. Shenling Baizhu San ameliorates ulcerative colitis by regulating the gut microbiota and its tryptophan metabolites: A complementary medicine to mesalamine

Author: Jiao, Chunhua, Zhang, Qianwen, Yang, Mengjiao, Ma, Jingjing, Zhao, Xiaojing, Tang, Nana, Dai, Mingxin, Li, Qingyu, Jiang, Zhenzhou, Huang, Xin, Zhang, Hongjie, and Sun, Lixin
Published: 2022
Full Text: View/download PDF

32. Melatonin Rinsing Treatment Associated with Storage in a Controlled Atmosphere Improves the Antioxidant Capacity and Overall Quality of Lemons.

Author: Yang, Mengjiao, Zheng, Enlan, Lin, Ziqin, Miao, Ze, Li, Yuhang, Hu, Shiting, Gao, Yanan, Jiang, Yuqian, Pang, Lingling, and Li, Xihong
Subjects: OXIDANT status, POLYPHENOL oxidase, VITAMIN C, PEROXIDASE, STOMATA, LEMON
Abstract: Antioxidant capacity is one of the most important biological activities in fruits and vegetables and is closely related to human health. In this study, 'Eureka' lemons were used as experimental materials and stored at 7–8 °C MT (melatonin, 200 μmol, soaked for 15 min) and CA (controlled atmosphere, 2–3% O2 + 15–16% CO2) individually or in combination for 30 d. The changes in lemon fruits' basic physicochemical properties, enzyme activities, and antioxidant capacities were studied. Comparing the combined treatment to the control, the outcomes demonstrated a significant reduction in weight loss, firmness, stomatal opening, and inhibition of polyphenol oxidase (PPO) and peroxidase (POD) activities. Additionally, the combined treatment maintained high levels of titratable acidity (TA), vitamin C (VC), total phenolic content (TPC), and antioxidant capacity and preserved the lemon aroma. Meanwhile, the correlation between fruit color, aroma compounds, and antioxidant capacity was revealed, providing valuable insights into the postharvest preservation of lemons. In conclusion, the combined treatment (MT + CA) was effective in maintaining the quality and antioxidant capacity of lemons. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. Wheat Yield Estimation using Machine Learning Method Based on UAV Remote Sensing Data

Author: Yang, Shurong, primary, Li, Lei, additional, Fei, Shuaipeng, additional, Yang, Mengjiao, additional, Tao, Zhiqiang, additional, Meng, Yaxiong, additional, and Xiao, Yonggui, additional
Published: 2024
Full Text: View/download PDF

34. Association of Frailty with Patient-Report Outcomes and Major Clinical Determinants in Patients with Acute Exacerbation of Chronic Obstructive Pulmonary Disease

Author: Yang, Mengjiao, primary, Liu, Yang, additional, Zhao, Yangyang, additional, Wang, Ziwei, additional, He, Jie, additional, Wang, Yali, additional, and Anme, Tokie, additional
Published: 2024
Full Text: View/download PDF

35. A long-term record (1995–2019) of the dynamics of land desertification in the middle reaches of Yarlung Zangbo River basin derived from Landsat data

Author: Zhan, Qiqi, Zhao, Wei, Yang, Mengjiao, and Xiong, Donghong
Published: 2021
Full Text: View/download PDF

36. Association of Frailty with Patient-Report Outcomes and Major Clinical Determinants in Patients with Acute Exacerbation of Chronic Obstructive Pulmonary Disease

Author: Yang,Mengjiao, Liu,Yang, Zhao,Yangyang, Wang,Ziwei, He,Jie, Wang,Yali, Anme,Tokie, Yang,Mengjiao, Liu,Yang, Zhao,Yangyang, Wang,Ziwei, He,Jie, Wang,Yali, and Anme,Tokie
Abstract: Mengjiao Yang,1,2,&ast; Yang Liu,1,&ast; Yangyang Zhao,3,&ast; Ziwei Wang,4 Jie He,5 Yali Wang,2 Tokie Anme6 1Graduate School of Comprehensive Human Science, University of Tsukuba, Tsukuba, Japan; 2Department of Cardiovascular Surgery, Affiliated Hospital of North Sichuan Medical College, Nanchong City, Sichuan Province, Peopleâs Republic of China; 3Department of Blood Transfusion, Affiliated Hospital of North Sichuan Medical College, Nanchong City, Sichuan Province, Peopleâs Republic of China; 4Department of Respiratory and Critical Care Medicine, Affiliated Hospital of North Sichuan Medical College, Nanchong City, Sichuan Province, Peopleâs Republic of China; 5Department of Nursing, Affiliated Hospital of North Sichuan Medical College, Nanchong City, Sichuan Province, Peopleâs Republic of China; 6Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, Japan&ast;These authors contributed equally to this workCorrespondence: Tokie Anme, Faculty of Medicine, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki, 3058577, Japan, TelFax +81-29-853-3436, Email tokieanme@gmail.comPurpose: This study aimed to explore the correlation of frailty status with disease characteristics and patient-reported outcomes (PROs) in patients with acute exacerbations of chronic obstructive pulmonary disease (AECOPD) and determine the sensitivity and specificity of modified COPD PRO scale (mCOPD-PRO) for detecting frailty.Patients and Methods: This cross-sectional study surveyed 315 inpatients with AECOPD from a tertiary hospital in China from August 2022 to June 2023. Patient frailty and PROs were assessed using the validated FRAIL scale and mCOPD-PRO, respectively. Spearmanâs Ï was used to assess the relevance of lung disease indicators commonly used in clinical practice, and ordinal logistic regression analyses were used to identify the variables associated with frailty status. The validity of mCOPD-PRO in discriminating frail or non-frail individuals
Published: 2024

37. Study on the process for preserving the green and crispness of young cantaloupe prefabricated food.

Author: LI Jiao, LYU Fange, YANG Mengjiao, LIN Ziqin, LI Dandan, JIANG Yuqian, and LI Xihong
Subjects: PHYTIC acid, CITRIC acid, SODIUM alginate, MUSKMELON, FOOD preservation
Abstract: To prevent the browning and softening of young cantaloupe prefabricated food during processing, this study utilized young cantaloupe as the raw material, and optimized through a combination of single factor test and orthogonal test for NaHCO3 blanching conditions, green-preserving reagents (citric acid, D-sodium erythorbate and phytic acid) and crispness-preserving reagents (CaCl2, calcium lactate and sodium alginate) formula, and the Δa*, PPO activity, POD activity, and hardness were analyzed. The results indicated that the optimal blanching conditions were NaHCO3 mass fraction 0. 08%, and a blanching time of 60 seconds. The optimal green and crispness preserving reagents formula were 0. 3% citric acid, 0. 4% D-ascorbic acid, 0. 3% phytic acid, 0.4% CaCl2, 0. 6% calcium lactate and 0. 3% sodium alginate. Under this formula for 20 minutes, Δa* of young cantaloupe prefabricated food was 0. 93, the hardness was 5 188. 75 g, PPO activity was 11. 35 U/ g, POD activity was 8. 49 U/ g, and sensory score was 89. 19 points. The product exhibited uniform color, crispness and delicious, and good quality. After 30 days of storage, the sensory score of the young cantaloupe prefabricated food were maintained at 84~89 points, indicating that it possessed better storage stability and market promotion potential. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. Wheat Yield Prediction Using Machine Learning Method Based on UAV Remote Sensing Data.

Author: Yang, Shurong, Li, Lei, Fei, Shuaipeng, Yang, Mengjiao, Tao, Zhiqiang, Meng, Yaxiong, and Xiao, Yonggui
Published: 2024
Full Text: View/download PDF

39. QTL mapping of root traits in wheat under different phosphorus levels using hydroponic culture

Author: Yang, Mengjiao, Wang, Cairong, Hassan, Muhammad Adeel, Li, Faji, Xia, Xianchun, Shi, Shubing, Xiao, Yonggui, and He, Zhonghu
Published: 2021
Full Text: View/download PDF

40. Genome-wide association analysis of stem water-soluble carbohydrate content in bread wheat

Author: Fu, Luping, Wu, Jingchun, Yang, Shurong, Jin, Yirong, Liu, Jindong, Yang, Mengjiao, Rasheed, Awais, Zhang, Yong, Xia, Xianchun, Jing, Ruilian, He, Zhonghu, and Xiao, Yonggui
Published: 2020
Full Text: View/download PDF

41. Identity-based proxy blind signature scheme on lattice in high-speed network environment

Author: Yang, Mengjiao, primary, Ke, Dongmei, additional, and Li, Fengyin, additional
Published: 2023
Full Text: View/download PDF

42. Causal association between inflammatory bowel disease and type 2 diabetes: a bidirectional two-sample Mendelian randomization study

Author: Yang, Mengjiao, primary, Ye, Yongjuan, additional, Zhang, Meng, additional, Chang, Hong, additional, Ouyang, Jing, additional, Zheng, Ya, additional, Wang, Yuping, additional, and Guo, Qinghong, additional
Published: 2023
Full Text: View/download PDF

43. Assessment of Water Retention Capacity of Non-Ionic and Anionic Fluorinated Dust Suppressants on Coal Dust

Author: Wang, Kai, primary, Xu, Min, additional, Zhou, Biao, additional, Yang, Mengjiao, additional, Li, Xiaoxuan, additional, and Yue, Qihang, additional
Published: 2023
Full Text: View/download PDF

44. Post-Quantum Secure Identity-Based Proxy Blind Signature Scheme on a Lattice

Author: Li, Fengyin, primary, Yang, Mengjiao, additional, Song, Zhihao, additional, Wang, Ping, additional, and Li, Guoping, additional
Published: 2023
Full Text: View/download PDF

45. 3-D numerical prediction of vortices group in multiple Francis turbines based on RNG-ELES approach

Author: Yang, Mengjiao, primary, Zhang, Minglu, additional, Zhang, Changbing, additional, and Chen, Duan, additional
Published: 2018
Full Text: View/download PDF

46. Learning Interactive Real-World Simulators

Author: Yang, Mengjiao, Du, Yilun, Ghasemipour, Kamyar, Tompson, Jonathan, Kaelbling, Leslie, Schuurmans, Dale, Abbeel, Pieter, Yang, Mengjiao, Du, Yilun, Ghasemipour, Kamyar, Tompson, Jonathan, Kaelbling, Leslie, Schuurmans, Dale, and Abbeel, Pieter
Abstract: Generative models trained on internet data have revolutionized how text, image, and video content can be created. Perhaps the next milestone for generative models is to simulate realistic experience in response to actions taken by humans, robots, and other interactive agents. Applications of a real-world simulator range from controllable content creation in games and movies, to training embodied agents purely in simulation that can be directly deployed in the real world. We explore the possibility of learning a universal simulator of real-world interaction through generative modeling. We first make the important observation that natural datasets available for learning a real-world simulator are often rich along different dimensions (e.g., abundant objects in image data, densely sampled actions in robotics data, and diverse movements in navigation data). With careful orchestration of diverse datasets, each providing a different aspect of the overall experience, we can simulate the visual outcome of both high-level instructions such as ``open the drawer'' and low-level controls such as "move by x, y" from otherwise static scenes and objects. We use the simulator to train both high-level vision-language policies and low-level reinforcement learning policies, each of which can be deployed in the real world in zero shot after training purely in simulation. We also show that other types of intelligence such as video captioning models can benefit from training with simulated experience, opening up even wider applications. Video demos can be found at https://universal-simulator.github.io., Comment: https://universal-simulator.github.io
Published: 2023

47. Accuracy assessment of plant height using an unmanned aerial vehicle for quantitative genomic analysis in bread wheat

Author: Hassan, Muhammad Adeel, Yang, Mengjiao, Fu, Luping, Rasheed, Awais, Zheng, Bangyou, Xia, Xianchun, Xiao, Yonggui, and He, Zhonghu
Published: 2019
Full Text: View/download PDF

48. Evaluation of Consistency among MODIS Land Surface Temperature Products for Monitoring Surface Warming Trend over the Tibetan Plateau

Author: Yang, Mengjiao, primary, Zhao, Wei, additional, Cai, Junfei, additional, Yang, Yujia, additional, and Fu, Hao, additional
Published: 2023
Full Text: View/download PDF

49. Associations between a polygenic risk score and the risk of gestational diabetes mellitus in a Chinese population: a case-control study

Author: Li, Ying, primary, Yang, Mengjiao, additional, Yuan, Lu, additional, Li, Ting, additional, Zhong, Xinli, additional, and Guo, Yanying, additional
Published: 2023
Full Text: View/download PDF

50. Role of Multifaceted Social Relationships on the Association of Loneliness with Depression Symptoms: A Moderated Mediation Analysis

Author: Liu, Yang, primary, Jiao, Dandan, additional, Yang, Mengjiao, additional, Cui, Mingyu, additional, Li, Xiang, additional, Zhu, Zhu, additional, Sawada, Yuko, additional, Watanabe Miura, Kumi, additional, Watanabe, Taeko, additional, Tanaka, Emiko, additional, and Anme, Tokie, additional
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

204 results on '"Yang,Mengjiao"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources