19,315 results on '"An-Fei Li"'
Search Results
2. ARCap: Collecting High-quality Human Demonstrations for Robot Learning with Augmented Reality Feedback
- Author
-
Chen, Sirui, Wang, Chen, Nguyen, Kaden, Fei-Fei, Li, and Liu, C. Karen
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence - Abstract
Recent progress in imitation learning from human demonstrations has shown promising results in teaching robots manipulation skills. To further scale up training datasets, recent works start to use portable data collection devices without the need for physical robot hardware. However, due to the absence of on-robot feedback during data collection, the data quality depends heavily on user expertise, and many devices are limited to specific robot embodiments. We propose ARCap, a portable data collection system that provides visual feedback through augmented reality (AR) and haptic warnings to guide users in collecting high-quality demonstrations. Through extensive user studies, we show that ARCap enables novice users to collect robot-executable data that matches robot kinematics and avoids collisions with the scenes. With data collected from ARCap, robots can perform challenging tasks, such as manipulation in cluttered environments and long-horizon cross-embodiment manipulation. ARCap is fully open-source and easy to calibrate; all components are built from off-the-shelf products. More details and results can be found on our website: https://stanford-tml.github.io/ARCap, Comment: 8 pages, 8 Figures, submitted to ICRA 2025
- Published
- 2024
3. Automated Creation of Digital Cousins for Robust Policy Learning
- Author
-
Dai, Tianyuan, Wong, Josiah, Jiang, Yunfan, Wang, Chen, Gokmen, Cem, Zhang, Ruohan, Wu, Jiajun, and Fei-Fei, Li
- Subjects
Computer Science - Robotics - Abstract
Training robot policies in the real world can be unsafe, costly, and difficult to scale. Simulation serves as an inexpensive and potentially limitless source of training data, but suffers from the semantics and physics disparity between simulated and real-world environments. These discrepancies can be minimized by training in digital twins, which serve as virtual replicas of a real scene but are expensive to generate and cannot produce cross-domain generalization. To address these limitations, we propose the concept of digital cousins, a virtual asset or scene that, unlike a digital twin, does not explicitly model a real-world counterpart but still exhibits similar geometric and semantic affordances. As a result, digital cousins simultaneously reduce the cost of generating an analogous virtual environment while also facilitating better robustness during sim-to-real domain transfer by providing a distribution of similar training scenes. Leveraging digital cousins, we introduce a novel method for their automated creation, and propose a fully automated real-to-sim-to-real pipeline for generating fully interactive scenes and training robot policies that can be deployed zero-shot in the original scene. We find that digital cousin scenes that preserve geometric and semantic affordances can be produced automatically, and can be used to train policies that outperform policies trained on digital twins, achieving 90% vs. 25% success rates under zero-shot sim-to-real transfer. Additional details are available at https://digital-cousins.github.io/., Comment: CoRL 2024
- Published
- 2024
4. Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
- Author
-
Li, Manling, Zhao, Shiyu, Wang, Qineng, Wang, Kangrui, Zhou, Yu, Srivastava, Sanjana, Gokmen, Cem, Lee, Tony, Li, Li Erran, Zhang, Ruohan, Liu, Weiyu, Liang, Percy, Fei-Fei, Li, Mao, Jiayuan, and Wu, Jiajun
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Robotics - Abstract
We aim to evaluate Large Language Models (LLMs) for embodied decision making. While a significant body of work has been leveraging LLMs for decision making in embodied environments, we still lack a systematic understanding of their performance because they are usually applied in different domains, for different purposes, and built based on different inputs and outputs. Furthermore, existing evaluations tend to rely solely on a final success rate, making it difficult to pinpoint what ability is missing in LLMs and where the problem lies, which in turn blocks embodied agents from leveraging LLMs effectively and selectively. To address these limitations, we propose a generalized interface (Embodied Agent Interface) that supports the formalization of various types of tasks and input-output specifications of LLM-based modules. Specifically, it allows us to unify 1) a broad set of embodied decision-making tasks involving both state and temporally extended goals, 2) four commonly-used LLM-based modules for decision making: goal interpretation, subgoal decomposition, action sequencing, and transition modeling, and 3) a collection of fine-grained metrics which break down evaluation into various types of errors, such as hallucination errors, affordance errors, various types of planning errors, etc. Overall, our benchmark offers a comprehensive assessment of LLMs' performance for different subtasks, pinpointing the strengths and weaknesses in LLM-powered embodied AI systems, and providing insights for effective and selective use of LLMs in embodied decision making., Comment: Accepted for oral presentation at NeurIPS 2024 in the Datasets and Benchmarks track
- Published
- 2024
5. ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
- Author
-
Huang, Wenlong, Wang, Chen, Li, Yunzhu, Zhang, Ruohan, and Fei-Fei, Li
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Representing robotic manipulation tasks as constraints that associate the robot and the environment is a promising way to encode desired robot behaviors. However, it remains unclear how to formulate the constraints such that they are 1) versatile to diverse tasks, 2) free of manual labeling, and 3) optimizable by off-the-shelf solvers to produce robot actions in real-time. In this work, we introduce Relational Keypoint Constraints (ReKep), a visually-grounded representation for constraints in robotic manipulation. Specifically, ReKep is expressed as Python functions mapping a set of 3D keypoints in the environment to a numerical cost. We demonstrate that by representing a manipulation task as a sequence of Relational Keypoint Constraints, we can employ a hierarchical optimization procedure to solve for robot actions (represented by a sequence of end-effector poses in SE(3)) with a perception-action loop at a real-time frequency. Furthermore, in order to circumvent the need for manual specification of ReKep for each new task, we devise an automated procedure that leverages large vision models and vision-language models to produce ReKep from free-form language instructions and RGB-D observations. We present system implementations on a wheeled single-arm platform and a stationary dual-arm platform that can perform a large variety of manipulation tasks, featuring multi-stage, in-the-wild, bimanual, and reactive behaviors, all without task-specific data or environment models. Website at https://rekep-robot.github.io.
- Published
- 2024
6. A Novel Approach to Clinical Thinking Training for Medical Students: The Combined World Cafe Discussion and Case-Based Learning Experience Introduction
- Author
-
Ying Guo, Xia Li, Heng Tan, Jianping Xie, Haiyun Luo, and Fei Li
- Abstract
It is essential for modern medical students to continuously enhance their clinical thinking abilities. This study aims to evaluate the efficacy of the combined World Café discussion and case-based learning (CBL) approach within the clinical thinking training course. The clinical thinking training course incorporated the combined World Café discussion and CBL approach. The assessment of the accuracy and rationality of clinical symptoms, medical examination, pathological processes, diagnostic results, diagnostic basis, and drug use was conducted through case-related queries. Feedback from students and instructors regarding the teaching content, teaching process, and teaching effect was gathered through questionnaires. The findings indicate that the students achieved high marks in all assessed areas, including clinical symptoms, medical examination, pathological processes, diagnostic results, diagnostic basis, and drug use. The feedback from students and instructors on the teaching content, teaching process, and teaching effect was positive. Medical educators can use our findings to implement the combined World Café discussion and CBL mode to enhance student engagement.
- Published
- 2024
- Full Text
- View/download PDF
7. Deep learning and ensemble stacking technique for differentiating polypoidal choroidal vasculopathy from neovascular age-related macular degeneration
- Author
-
Yu-Bai Chou, Chung-Hsuan Hsu, Wei-Shiang Chen, Shih-Jen Chen, De-Kuang Hwang, Yi-Ming Huang, An-Fei Li, and Henry Horng-Shing Lu
- Subjects
Medicine ,Science - Abstract
Abstract Polypoidal choroidal vasculopathy (PCV) and neovascular age-related macular degeneration (nAMD) share some similarity in clinical imaging manifestations. However, their disease entity and treatment strategy as well as visual outcomes are very different. To distinguish these two vision-threatening diseases is somewhat challenging but necessary. In this study, we propose a new artificial intelligence model using an ensemble stacking technique, which combines a color fundus photograph-based deep learning (DL) model and optical coherence tomography-based biomarkers, for differentiation of PCV from nAMD. Furthermore, we introduced multiple correspondence analysis, a method of transforming categorical data into principal components, to handle the dichotomous data for combining with another image DL system. This model achieved a robust performance with an accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve of 83.67%, 80.76%, 84.72%, and 88.57%, respectively, by training nearly 700 active cases with suitable imaging quality and transfer learning architecture. This work could offer an alternative method of developing a multimodal DL model, improve its efficiency for distinguishing different diseases, and facilitate the broad application of medical engineering in a DL model design.
- Published
- 2021
- Full Text
- View/download PDF
8. OccFusion: Rendering Occluded Humans with Generative Diffusion Priors
- Author
-
Sun, Adam, Xiang, Tiange, Delp, Scott, Fei-Fei, Li, and Adeli, Ehsan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Most existing human rendering methods require every part of the human to be fully visible throughout the input video. However, this assumption does not hold in real-life settings where obstructions are common, resulting in only partial visibility of the human. Considering this, we present OccFusion, an approach that utilizes efficient 3D Gaussian splatting supervised by pretrained 2D diffusion models for efficient and high-fidelity human rendering. We propose a pipeline consisting of three stages. In the Initialization stage, complete human masks are generated from partial visibility masks. In the Optimization stage, 3D human Gaussians are optimized with additional supervision by Score-Distillation Sampling (SDS) to create a complete geometry of the human. Finally, in the Refinement stage, in-context inpainting is designed to further improve rendering quality on the less observed human body parts. We evaluate OccFusion on ZJU-MoCap and challenging OcMotion sequences and find that it achieves state-of-the-art performance in the rendering of occluded humans.
- Published
- 2024
9. Few-Shot Classification of Interactive Activities of Daily Living (InteractADL)
- Author
-
Durante, Zane, Harries, Robathan, Vendrow, Edward, Luo, Zelun, Kyuragi, Yuta, Kozuka, Kazuki, Fei-Fei, Li, and Adeli, Ehsan
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Understanding Activities of Daily Living (ADLs) is a crucial step for different applications including assistive robots, smart homes, and healthcare. However, to date, few benchmarks and methods have focused on complex ADLs, especially those involving multi-person interactions in home environments. In this paper, we propose a new dataset and benchmark, InteractADL, for understanding complex ADLs that involve interaction between humans (and objects). Furthermore, complex ADLs occurring in home environments comprise a challenging long-tailed distribution due to the rarity of multi-person interactions, and pose fine-grained visual recognition tasks due to the presence of semantically and visually similar classes. To address these issues, we propose a novel method for fine-grained few-shot video classification called Name Tuning that enables greater semantic separability by learning optimal class name vectors. We show that Name Tuning can be combined with existing prompt tuning strategies to learn the entire input text (rather than only learning the prompt or class names) and demonstrate improved performance for few-shot classification on InteractADL and 4 other fine-grained visual classification benchmarks. For transparency and reproducibility, we release our code at https://github.com/zanedurante/vlm_benchmark.
- Published
- 2024
10. Joint Effects of Offset Effort Beliefs and Biomedical Causal Attributions on Pre-Service Teachers' Stigma of Children with ADHD-Related Symptoms
- Author
-
Boby Ho-Hong Ching, Yuan Hua Li, and Xiao Fei Li
- Abstract
This experimental study examined ways to reduce stigma against children with attention-deficit/hyperactivity disorder (ADHD) symptoms. We randomly assigned 220 Chinese pre-service teachers to one of the four experimental groups in which they read a vignette describing a student with ADHD symptoms. The contents of the vignettes differed from one another on two dimensions: (a) the causes of the symptoms (biomedical versus social) and (b) the extent to which the student has exerted effort to improve (high effort versus low effort). Participants who received biomedical explanations of ADHD ascribed less blame to the target compared with those who received social explanations. However, the group difference was only significant in the "low-effort" group, but not in the "high-effort" group. Similarly, the "biomedical" group indicated higher levels of entity beliefs than the "social" group, but the group difference was only significant in the "low-effort" group but not in the "high-effort" group. Finally, participants in the "high-effort" condition reported a stronger intention to interact with the target compared with participants in the "low-effort" condition, whereas onset causal attributions did not affect participants' ratings on behavioural intention. These findings may have implications for reducing ADHD stigma and bear relevance to other kinds of social stigma.
- Published
- 2024
- Full Text
- View/download PDF
11. TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
- Author
-
Jiang, Yunfan, Wang, Chen, Zhang, Ruohan, Wu, Jiajun, and Fei-Fei, Li
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Learning in simulation and transferring the learned policy to the real world has the potential to enable generalist robots. The key challenge of this approach is to address simulation-to-reality (sim-to-real) gaps. Previous methods often require domain-specific knowledge a priori. We argue that a straightforward way to obtain such knowledge is by asking humans to observe and assist robot policy execution in the real world. The robots can then learn from humans to close various sim-to-real gaps. We propose TRANSIC, a data-driven approach to enable successful sim-to-real transfer based on a human-in-the-loop framework. TRANSIC allows humans to augment simulation policies to overcome various unmodeled sim-to-real gaps holistically through intervention and online correction. Residual policies can be learned from human corrections and integrated with simulation policies for autonomous execution. We show that our approach can achieve successful sim-to-real transfer in complex and contact-rich manipulation tasks such as furniture assembly. Through synergistic integration of policies learned in simulation and from humans, TRANSIC is effective as a holistic approach to addressing various, often coexisting sim-to-real gaps. It displays attractive properties such as scaling with human effort. Videos and code are available at https://transic-robot.github.io/, Comment: 8th Conference on Robot Learning (CoRL 2024), Munich, Germany. Project website: https://transic-robot.github.io/
- Published
- 2024
12. BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation
- Author
-
Ge, Yunhao, Tang, Yihe, Xu, Jiashu, Gokmen, Cem, Li, Chengshu, Ai, Wensi, Martinez, Benjamin Jose, Aydin, Arman, Anvari, Mona, Chakravarthy, Ayush K, Yu, Hong-Xing, Wong, Josiah, Srivastava, Sanjana, Lee, Sharon, Zha, Shengxin, Itti, Laurent, Li, Yunzhu, Martín-Martín, Roberto, Liu, Miao, Zhang, Pengchuan, Zhang, Ruohan, Fei-Fei, Li, and Wu, Jiajun
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The systematic evaluation and understanding of computer vision models under varying conditions require large amounts of data with comprehensive and customized labels, which real-world vision datasets rarely satisfy. While current synthetic data generators offer a promising alternative, particularly for embodied AI tasks, they often fall short for computer vision tasks due to low asset and rendering quality, limited diversity, and unrealistic physical properties. We introduce the BEHAVIOR Vision Suite (BVS), a set of tools and assets to generate fully customized synthetic data for systematic evaluation of computer vision models, based on the newly developed embodied AI benchmark, BEHAVIOR-1K. BVS supports a large number of adjustable parameters at the scene level (e.g., lighting, object placement), the object level (e.g., joint configuration, attributes such as "filled" and "folded"), and the camera level (e.g., field of view, focal length). Researchers can arbitrarily vary these parameters during data generation to perform controlled experiments. We showcase three example application scenarios: systematically evaluating the robustness of models across different continuous axes of domain shift, evaluating scene understanding models on the same set of images, and training and evaluating simulation-to-real transfer for a novel vision task: unary and binary state prediction. Project website: https://behavior-vision-suite.github.io/, Comment: CVPR 2024 (Highlight). Project website: https://behavior-vision-suite.github.io/
- Published
- 2024
13. BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation
- Author
-
Li, Chengshu, Zhang, Ruohan, Wong, Josiah, Gokmen, Cem, Srivastava, Sanjana, Martín-Martín, Roberto, Wang, Chen, Levine, Gabrael, Ai, Wensi, Martinez, Benjamin, Yin, Hang, Lingelbach, Michael, Hwang, Minjune, Hiranaka, Ayano, Garlanka, Sujay, Aydin, Arman, Lee, Sharon, Sun, Jiankai, Anvari, Mona, Sharma, Manasi, Bansal, Dhruva, Hunter, Samuel, Kim, Kyu-Young, Lou, Alan, Matthews, Caleb R, Villa-Renteria, Ivan, Tang, Jerry Huayang, Tang, Claire, Xia, Fei, Li, Yunzhu, Savarese, Silvio, Gweon, Hyowon, Liu, C. Karen, Wu, Jiajun, and Fei-Fei, Li
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence - Abstract
We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an extensive survey on "what do you want robots to do for you?". The first is the definition of 1,000 everyday activities, grounded in 50 scenes (houses, gardens, restaurants, offices, etc.) with more than 9,000 objects annotated with rich physical and semantic properties. The second is OMNIGIBSON, a novel simulation environment that supports these activities via realistic physics simulation and rendering of rigid bodies, deformable bodies, and liquids. Our experiments indicate that the activities in BEHAVIOR-1K are long-horizon and dependent on complex manipulation skills, both of which remain a challenge for even state-of-the-art robot learning solutions. To calibrate the simulation-to-reality gap of BEHAVIOR-1K, we provide an initial study on transferring solutions learned with a mobile manipulator in a simulated apartment to its real-world counterpart. We hope that BEHAVIOR-1K's human-grounded nature, diversity, and realism make it valuable for embodied AI and robot learning research. Project website: https://behavior.stanford.edu., Comment: A preliminary version was published at 6th Conference on Robot Learning (CoRL 2022)
- Published
- 2024
14. DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation
- Author
-
Wang, Chen, Shi, Haochen, Wang, Weizhuo, Zhang, Ruohan, Fei-Fei, Li, and Liu, C. Karen
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Imitation learning from human hand motion data presents a promising avenue for imbuing robots with human-like dexterity in real-world manipulation tasks. Despite this potential, substantial challenges persist, particularly with the portability of existing hand motion capture (mocap) systems and the complexity of translating mocap data into effective robotic policies. To tackle these issues, we introduce DexCap, a portable hand motion capture system, alongside DexIL, a novel imitation algorithm for training dexterous robot skills directly from human hand mocap data. DexCap offers precise, occlusion-resistant tracking of wrist and finger motions based on SLAM and electromagnetic field together with 3D observations of the environment. Utilizing this rich dataset, DexIL employs inverse kinematics and point cloud-based imitation learning to seamlessly replicate human actions with robot hands. Beyond direct learning from human motion, DexCap also offers an optional human-in-the-loop correction mechanism during policy rollouts to refine and further improve task performance. Through extensive evaluation across six challenging dexterous manipulation tasks, our approach not only demonstrates superior performance but also showcases the system's capability to effectively learn from in-the-wild mocap data, paving the way for future data collection methods in the pursuit of human-level robot dexterity. More details can be found at https://dex-cap.github.io
- Published
- 2024
15. Data-driven discovery of movement-linked heterogeneity in neurodegenerative diseases
- Author
-
Endo, Mark, Nerrise, Favour, Zhao, Qingyu, Sullivan, Edith V., Fei-Fei, Li, Henderson, Victor W., Pohl, Kilian M., Poston, Kathleen L., and Adeli, Ehsan
- Published
- 2024
- Full Text
- View/download PDF
16. Flow signal change in polyps after anti-vascular endothelial growth factor therapy.
- Author
-
Chia-Jui Chang, Yi-Ming Huang, Ming-Hung Hsieh, An-Fei Li, and Shih-Jen Chen
- Subjects
Medicine ,Science - Abstract
Optical coherence tomography angiography (OCTA) is a novel, non-invasive imaging tool used to detect vascular flow. The absence of a flow signal in OCTA in polyps revealed by indocyanine green angiography (ICGA) in patients with polypoidal choroidal vasculopathy (PCV) may indicate slow or compromised filling of blood flow from choroidal vessels. Naïve patients with PCV treated with intravitreal injections of aflibercept (IVI-A) were enrolled in this study to validate the hypothesis that baseline flow may affect the outcome of polyp regression in ICGA. The flow signal of polyps in OCTA was detected by manual segmentation in the corresponding location by ICGA. Polyps were defined as high-flow if both OCTA and ICGA showed positive findings, and low-flow if OCTA showed a negative flow signal in 3 consecutive horizontal scans at the polyp area shown in ICGA. A total of 24 polyps were identified in 13 PCV patients at baseline. Of these 24 polyps, 22 (91.7%) were high-flow and 2 (8.3%) were low-flow. After 3 monthly IVI-A, all low-flow polyps had complete regression in ICGA. Among 17 (77%) high-flow polyps at baseline that had regression after treatment, 10 (58.8%) became low-flow, while 5 (22.7%) persistent polyps remained high-flow. Flow signal of polyps as detected by OCTA could be a predictive factor for treatment response in patients with PCV. Monitoring changes in flow signal after treatment is clinically relevant.
- Published
- 2020
- Full Text
- View/download PDF
17. Multi-functional roles of TaSSI2 involved in Fusarium head blight and powdery mildew resistance and drought tolerance
- Author
-
Li-qin HU, Jing-jing MU, Pei-sen SU, Hong-yan WU, Guang-hui YU, Gui-ping WANG, Liang WANG, Xin MA, An-fei LI, Hong-wei WANG, Lan-fei ZHAO, and Ling-rang KONG
- Subjects
TaSSI2 ,powdery mildew ,Fusarium head blight ,drought ,Agriculture (General) ,S1-972 - Abstract
The mutation of the gene encoding a stearoyl-acyl carrier protein fatty acid desaturase (ssi2) has been proved to enhance pathogen resistance in several plants, while it's potential to regulate biotic and abiotic stresses in wheat is still unclear. In this study, we cloned TaSSI2 gene in wheat and provided several evidences of its involvement in multiple biological functions. By using barley stripe mosaic virus (BSMV)-induced gene silencing (VIGS) in wheat, it was found that TaSSI2 negatively regulated both powdery mildew and Fusarium head blight (FHB) resistance, which was consistent with the phenotype observed in knock-out mutants of Kronos. The expression of TaSSI2 was down-regulated by in vitro treatments of methyl jasmonate (MeJA), but positively regulated by salicylic acid (SA) and abscisic acid (ABA), implying the cross-talk between different hormone signaling pathways involved in wheat to regulate biotic stresses is still to be elucidated. Furthermore, the up-regulated expression of PR4 and PR5 indicated that TaSSI2 probably regulated FHB resistance by depressing the SA signaling pathway in wheat. In addition, the over-expression of TaSSI2 increased the content of linolenic acid (18:3) and subsequently enhanced drought tolerance of transgenic Brachypodium. This phenomenon might be associated with its subcellular localization in the whole cytosol, partly overlapping with Golgi apparatus and the secreted vesicles. As a stearoyl-acyl carrier protein fatty acid desaturase, TaSSI2 was proposed to be involved in cell lipid metabolism and carried targets out of the cell from membrane or wax synthesis, resulting in enhanced drought tolerance in plant.
- Published
- 2018
- Full Text
- View/download PDF
18. Position Paper: Agent AI Towards a Holistic Intelligence
- Author
-
Huang, Qiuyuan, Wake, Naoki, Sarkar, Bidipta, Durante, Zane, Gong, Ran, Taori, Rohan, Noda, Yusuke, Terzopoulos, Demetri, Kuno, Noboru, Famoti, Ade, Llorens, Ashley, Langford, John, Vo, Hoi, Fei-Fei, Li, Ikeuchi, Katsu, and Gao, Jianfeng
- Subjects
Computer Science - Artificial Intelligence - Abstract
Recent advancements in large foundation models have remarkably enhanced our understanding of sensory information in open-world environments. In leveraging the power of foundation models, it is crucial for AI research to pivot away from excessive reductionism and toward an emphasis on systems that function as cohesive wholes. Specifically, we emphasize developing Agent AI -- an embodied system that integrates large foundation models into agent actions. The emerging field of Agent AI spans a wide range of existing embodied and agent-based multimodal interactions, including robotics, gaming, and healthcare systems, etc. In this paper, we propose a novel large action model to achieve embodied intelligent behavior, the Agent Foundation Model. On top of this idea, we discuss how agent AI exhibits remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. Furthermore, we discuss the potential of Agent AI from an interdisciplinary perspective, underscoring AI cognition and consciousness within scientific discourse. We believe that those discussions serve as a basis for future research directions and encourage broader societal engagement., Comment: 22 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2401.03568
- Published
- 2024
19. An Interactive Agent Foundation Model
- Author
-
Durante, Zane, Sarkar, Bidipta, Gong, Ran, Taori, Rohan, Noda, Yusuke, Tang, Paul, Adeli, Ehsan, Lakshmikanth, Shrinidhi Kowshika, Schulman, Kevin, Milstein, Arnold, Terzopoulos, Demetri, Famoti, Ade, Kuno, Noboru, Llorens, Ashley, Vo, Hoi, Ikeuchi, Katsu, Fei-Fei, Li, Gao, Jianfeng, Wake, Naoki, and Huang, Qiuyuan
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Robotics - Abstract
The development of artificial intelligence systems is transitioning from creating static, task-specific models to dynamic, agent-based systems capable of performing well in a wide range of applications. We propose an Interactive Agent Foundation Model that uses a novel multi-task agent training paradigm for training AI agents across a wide range of domains, datasets, and tasks. Our training paradigm unifies diverse pre-training strategies, including visual masked auto-encoders, language modeling, and next-action prediction, enabling a versatile and adaptable AI framework. We demonstrate the performance of our framework across three separate domains -- Robotics, Gaming AI, and Healthcare. Our model demonstrates its ability to generate meaningful and contextually relevant outputs in each area. The strength of our approach lies in its generality, leveraging a variety of data sources such as robotics sequences, gameplay data, large-scale video datasets, and textual information for effective multimodal and multi-task learning. Our approach provides a promising avenue for developing generalist, action-taking, multimodal systems.
- Published
- 2024
20. Agent AI: Surveying the Horizons of Multimodal Interaction
- Author
-
Durante, Zane, Huang, Qiuyuan, Wake, Naoki, Gong, Ran, Park, Jae Sung, Sarkar, Bidipta, Taori, Rohan, Noda, Yusuke, Terzopoulos, Demetri, Choi, Yejin, Ikeuchi, Katsushi, Vo, Hoi, Fei-Fei, Li, and Gao, Jianfeng
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Human-Computer Interaction ,Computer Science - Machine Learning - Abstract
Multi-modal AI systems will likely become a ubiquitous presence in our everyday lives. A promising approach to making these systems more interactive is to embody them as agents within physical and virtual environments. At present, systems leverage existing foundation models as the basic building blocks for the creation of embodied agents. Embedding agents within such environments facilitates the ability of models to process and interpret visual and contextual data, which is critical for the creation of more sophisticated and context-aware AI systems. For example, a system that can perceive user actions, human behavior, environmental objects, audio expressions, and the collective sentiment of a scene can be used to inform and direct agent responses within the given environment. To accelerate research on agent-based multimodal intelligence, we define "Agent AI" as a class of interactive systems that can perceive visual stimuli, language inputs, and other environmentally-grounded data, and can produce meaningful embodied actions. In particular, we explore systems that aim to improve agents based on next-embodied action prediction by incorporating external knowledge, multi-sensory inputs, and human feedback. We argue that by developing agentic AI systems in grounded environments, one can also mitigate the hallucinations of large foundation models and their tendency to generate environmentally incorrect outputs. The emerging field of Agent AI subsumes the broader embodied and agentic aspects of multimodal interactions. Beyond agents acting and interacting in the physical world, we envision a future where people can easily create any virtual reality or simulated scene and interact with agents embodied within the virtual environment.
- Published
- 2024
21. Wild2Avatar: Rendering Humans Behind Occlusions
- Author
-
Xiang, Tiange, Sun, Adam, Delp, Scott, Kozuka, Kazuki, Fei-Fei, Li, and Adeli, Ehsan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Rendering the visual appearance of moving humans from occluded monocular videos is a challenging task. Most existing research renders 3D humans under ideal conditions, requiring a clear and unobstructed scene. Those methods cannot be used to render humans in real-world scenes where obstacles may block the camera's view and lead to partial occlusions. In this work, we present Wild2Avatar, a neural rendering approach catered for occluded in-the-wild monocular videos. We propose occlusion-aware scene parameterization for decoupling the scene into three parts - occlusion, human, and background. Additionally, extensive objective functions are designed to help enforce the decoupling of the human from both the occlusion and the background and to ensure the completeness of the human model. We verify the effectiveness of our approach with experiments on in-the-wild videos.
- Published
- 2023
22. Model-Based Control with Sparse Neural Dynamics
- Author
-
Liu, Ziang, Zhou, Genggeng, He, Jeff, Marcucci, Tobia, Fei-Fei, Li, Wu, Jiajun, and Li, Yunzhu
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Learning predictive models from observations using deep neural networks (DNNs) is a promising new approach to many real-world planning and control problems. However, common DNNs are too unstructured for effective planning, and current control methods typically rely on extensive sampling or local gradient descent. In this paper, we propose a new framework for integrated model learning and predictive control that is amenable to efficient optimization algorithms. Specifically, we start with a ReLU neural model of the system dynamics and, with minimal losses in prediction accuracy, we gradually sparsify it by removing redundant neurons. This discrete sparsification process is approximated as a continuous problem, enabling an end-to-end optimization of both the model architecture and the weight parameters. The sparsified model is subsequently used by a mixed-integer predictive controller, which represents the neuron activations as binary variables and employs efficient branch-and-bound algorithms. Our framework is applicable to a wide variety of DNNs, from simple multilayer perceptrons to complex graph neural dynamics. It can efficiently handle tasks involving complicated contact dynamics, such as object pushing, compositional object sorting, and manipulation of deformable objects. Numerical and hardware experiments show that, despite the aggressive sparsification, our framework can deliver better closed-loop performance than existing state-of-the-art methods., Comment: Accepted at NeurIPS 2023. For tutorial code and additional visualizations, see https://robopil.github.io/Sparse-Dynamics/
- Published
- 2023
23. Photorealistic Video Generation with Diffusion Models
- Author
-
Gupta, Agrim, Yu, Lijun, Sohn, Kihyuk, Gu, Xiuye, Hahn, Meera, Fei-Fei, Li, Essa, Irfan, Jiang, Lu, and Lezama, José
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
We present W.A.L.T, a transformer-based approach for photorealistic video generation via diffusion modeling. Our approach has two key design decisions. First, we use a causal encoder to jointly compress images and videos within a unified latent space, enabling training and generation across modalities. Second, for memory and training efficiency, we use a window attention architecture tailored for joint spatial and spatiotemporal generative modeling. Taken together these design decisions enable us to achieve state-of-the-art performance on established video (UCF-101 and Kinetics-600) and image (ImageNet) generation benchmarks without using classifier free guidance. Finally, we also train a cascade of three models for the task of text-to-video generation consisting of a base latent video diffusion model, and two video super-resolution diffusion models to generate videos of $512 \times 896$ resolution at $8$ frames per second., Comment: Project website https://walt-video-diffusion.github.io/
- Published
- 2023
24. Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
- Author
-
Li, Chengshu, Liang, Jacky, Zeng, Andy, Chen, Xinyun, Hausman, Karol, Sadigh, Dorsa, Levine, Sergey, Fei-Fei, Li, Xia, Fei, and Ichter, Brian
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Robotics - Abstract
Code provides a general syntactic structure to build complex programs and perform precise computations when paired with a code interpreter - we hypothesize that language models (LMs) can leverage code-writing to improve Chain of Thought reasoning not only for logic and arithmetic tasks, but also for semantic ones (and in particular, those that are a mix of both). For example, consider prompting an LM to write code that counts the number of times it detects sarcasm in an essay: the LM may struggle to write an implementation for "detect_sarcasm(string)" that can be executed by the interpreter (handling the edge cases would be insurmountable). However, LMs may still produce a valid solution if they not only write code, but also selectively "emulate" the interpreter by generating the expected output of "detect_sarcasm(string)". In this work, we propose Chain of Code (CoC), a simple yet surprisingly effective extension that improves LM code-driven reasoning. The key idea is to encourage LMs to format semantic sub-tasks in a program as flexible pseudocode that the interpreter can explicitly catch undefined behaviors and hand off to simulate with an LM (as an "LMulator"). Experiments demonstrate that Chain of Code outperforms Chain of Thought and other baselines across a variety of benchmarks; on BIG-Bench Hard, Chain of Code achieves 84%, a gain of 12% over Chain of Thought. In a nutshell, CoC broadens the scope of reasoning questions that LMs can answer by "thinking in code"., Comment: ICML 2024 Oral; Project webpage: https://chain-of-code.github.io
- Published
- 2023
25. Exploring the Digital Twin System in Slope Engineering
- Author
-
Shu-yu, Wu, Zheng-Gang, Zhan, Huan-Chun, Zhu, Yong-Fu, Hu, Peng-Fei, Li, Yong-Jun, Deng, di Prisco, Marco, Series Editor, Chen, Sheng-Hong, Series Editor, Vayas, Ioannis, Series Editor, Kumar Shukla, Sanjay, Series Editor, Sharma, Anuj, Series Editor, Kumar, Nagesh, Series Editor, Wang, Chien Ming, Series Editor, Cui, Zhen-Dong, Series Editor, Lu, Xinzheng, Series Editor, Zheng, Sheng’an, editor, Taylor, Richard M., editor, Wu, Wenhao, editor, Nilsen, Bjorn, editor, and Zhao, Gensheng, editor
- Published
- 2025
- Full Text
- View/download PDF
26. Modeling Quality and Prestige in Applied Linguistics Journals: A Bibliometric and Synthetic Analysis
- Author
-
Yiran Xu, Jingyuan Zhuang, Ryan Blair, Amy I. Kim, Fei Li, Rachel Thorson Hernández, and Luke Plonsky
- Abstract
The importance of academic journals in second language (L2) research is evident on at least two levels. Journals are, first of all, central to the process of disseminating scientific findings. Journals are also critical on a professional level as most L2 researchers must publish articles to advance their careers. However, not all journals are perceived as equal; some may be considered more prestigious or of higher quality and may, therefore, achieve a greater impact on the field. It is therefore necessary that we understand the identity and quality of L2 research journals, yet very little research (e.g., Egbert, 2007; VanPatten & Williams, 2002) has considered these issues to date. The current study sought to explore L2 journal identity and quality, and the relationship between these constructs. In order to do so, a database was compiled based on three different types of sources: (1) a questionnaire eliciting L2 researchers' perceptions of the quality and prestige of 27 journals that publish L2 research (N = 327); (2) manual coding of different types of articles (e.g., empirical studies, review papers), data (quantitative, qualitative, mixed), research settings, and authorship patterns (K = 2,024) using the same 27 journals; and (3) bibliometric and submission data such as impact factors, citation counts, and acceptance rates. Descriptive statistics were applied to explore overall quality and prestige ratings as well as publication trends found in each journal. The relationships between those patterns and subjective ratings were also examined. In addition, regression models were built to determine the extent to which perceptions of journal quality and prestige could be explained as a function of journal and article features. We discuss the findings of the study in terms of on-going debates concerning publication practices, study quality, impact factors, journal selection, and the "journal culture" in applied linguistics.
- Published
- 2023
27. Effect of earth-air on water transport in the vadose zone of the loess plateau
- Author
-
Fei, Li, Hongshou, Li, Wenjun, Liu, Shunren, Wang, Shengli, Sun, Hongtao, Zhan, Xiaozhu, Wang, and Xiaowei, Wang
- Published
- 2024
- Full Text
- View/download PDF
28. Clinical evaluation of droplet digital pcr for suspected ascites infection in patients with liver cirrhosis
- Author
-
Han, Jie, Wei, Fei-li, Wu, Hao-xin, Guo, Lu-yao, Guo, Shan, Han, Ying, Sun, Ya-nan, Hou, Wei, and Hu, Zhong-jie
- Published
- 2024
- Full Text
- View/download PDF
29. Holistic Evaluation of Text-To-Image Models
- Author
-
Lee, Tony, Yasunaga, Michihiro, Meng, Chenlin, Mai, Yifan, Park, Joon Sung, Gupta, Agrim, Zhang, Yunzhi, Narayanan, Deepak, Teufel, Hannah Benita, Bellagente, Marco, Kang, Minguk, Park, Taesung, Leskovec, Jure, Zhu, Jun-Yan, Fei-Fei, Li, Wu, Jiajun, Ermon, Stefano, and Liang, Percy
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
The stunning qualitative improvement of recent text-to-image models has led to their widespread attention and adoption. However, we lack a comprehensive quantitative understanding of their capabilities and risks. To fill this gap, we introduce a new benchmark, Holistic Evaluation of Text-to-Image Models (HEIM). Whereas previous evaluations focus mostly on text-image alignment and image quality, we identify 12 aspects, including text-image alignment, image quality, aesthetics, originality, reasoning, knowledge, bias, toxicity, fairness, robustness, multilinguality, and efficiency. We curate 62 scenarios encompassing these aspects and evaluate 26 state-of-the-art text-to-image models on this benchmark. Our results reveal that no single model excels in all aspects, with different models demonstrating different strengths. We release the generated images and human evaluation results for full transparency at https://crfm.stanford.edu/heim/v1.1.0 and the code at https://github.com/stanford-crfm/helm, which is integrated with the HELM codebase., Comment: NeurIPS 2023. First three authors contributed equally
- Published
- 2023
30. NOIR: Neural Signal Operated Intelligent Robots for Everyday Activities
- Author
-
Zhang, Ruohan, Lee, Sharon, Hwang, Minjune, Hiranaka, Ayano, Wang, Chen, Ai, Wensi, Tan, Jin Jie Ryan, Gupta, Shreya, Hao, Yilun, Levine, Gabrael, Gao, Ruohan, Norcia, Anthony, Fei-Fei, Li, and Wu, Jiajun
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence - Abstract
We present Neural Signal Operated Intelligent Robots (NOIR), a general-purpose, intelligent brain-robot interface system that enables humans to command robots to perform everyday activities through brain signals. Through this interface, humans communicate their intended objects of interest and actions to the robots using electroencephalography (EEG). Our novel system demonstrates success in an expansive array of 20 challenging, everyday household activities, including cooking, cleaning, personal care, and entertainment. The effectiveness of the system is improved by its synergistic integration of robot learning algorithms, allowing for NOIR to adapt to individual users and predict their intentions. Our work enhances the way humans interact with robots, replacing traditional channels of interaction with direct, neural communication. Project website: https://noir-corl.github.io/.
- Published
- 2023
31. ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image
- Author
-
Sargent, Kyle, Li, Zizhang, Shah, Tanmay, Herrmann, Charles, Yu, Hong-Xing, Zhang, Yunzhi, Chan, Eric Ryan, Lagun, Dmitry, Fei-Fei, Li, Sun, Deqing, and Wu, Jiajun
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Graphics - Abstract
We introduce a 3D-aware diffusion model, ZeroNVS, for single-image novel view synthesis for in-the-wild scenes. While existing methods are designed for single objects with masked backgrounds, we propose new techniques to address challenges introduced by in-the-wild multi-object scenes with complex backgrounds. Specifically, we train a generative prior on a mixture of data sources that capture object-centric, indoor, and outdoor scenes. To address issues from data mixture such as depth-scale ambiguity, we propose a novel camera conditioning parameterization and normalization scheme. Further, we observe that Score Distillation Sampling (SDS) tends to truncate the distribution of complex backgrounds during distillation of 360-degree scenes, and propose "SDS anchoring" to improve the diversity of synthesized novel views. Our model sets a new state-of-the-art result in LPIPS on the DTU dataset in the zero-shot setting, even outperforming methods specifically trained on DTU. We further adapt the challenging Mip-NeRF 360 dataset as a new benchmark for single-image novel view synthesis, and demonstrate strong performance in this setting. Our code and data are at http://kylesargent.github.io/zeronvs/, Comment: Accepted to CVPR 2024. 12 pages
- Published
- 2023
32. Open X-Embodiment: Robotic Learning Datasets and RT-X Models
- Author
-
Collaboration, Open X-Embodiment, O'Neill, Abby, Rehman, Abdul, Gupta, Abhinav, Maddukuri, Abhiram, Gupta, Abhishek, Padalkar, Abhishek, Lee, Abraham, Pooley, Acorn, Gupta, Agrim, Mandlekar, Ajay, Jain, Ajinkya, Tung, Albert, Bewley, Alex, Herzog, Alex, Irpan, Alex, Khazatsky, Alexander, Rai, Anant, Gupta, Anchit, Wang, Andrew, Kolobov, Andrey, Singh, Anikait, Garg, Animesh, Kembhavi, Aniruddha, Xie, Annie, Brohan, Anthony, Raffin, Antonin, Sharma, Archit, Yavary, Arefeh, Jain, Arhan, Balakrishna, Ashwin, Wahid, Ayzaan, Burgess-Limerick, Ben, Kim, Beomjoon, Schölkopf, Bernhard, Wulfe, Blake, Ichter, Brian, Lu, Cewu, Xu, Charles, Le, Charlotte, Finn, Chelsea, Wang, Chen, Xu, Chenfeng, Chi, Cheng, Huang, Chenguang, Chan, Christine, Agia, Christopher, Pan, Chuer, Fu, Chuyuan, Devin, Coline, Xu, Danfei, Morton, Daniel, Driess, Danny, Chen, Daphne, Pathak, Deepak, Shah, Dhruv, Büchler, Dieter, Jayaraman, Dinesh, Kalashnikov, Dmitry, Sadigh, Dorsa, Johns, Edward, Foster, Ethan, Liu, Fangchen, Ceola, Federico, Xia, Fei, Zhao, Feiyu, Frujeri, Felipe Vieira, Stulp, Freek, Zhou, Gaoyue, Sukhatme, Gaurav S., Salhotra, Gautam, Yan, Ge, Feng, Gilbert, Schiavi, Giulio, Berseth, Glen, Kahn, Gregory, Yang, Guangwen, Wang, Guanzhi, Su, Hao, Fang, Hao-Shu, Shi, Haochen, Bao, Henghui, Amor, Heni Ben, Christensen, Henrik I, Furuta, Hiroki, Bharadhwaj, Homanga, Walke, Homer, Fang, Hongjie, Ha, Huy, Mordatch, Igor, Radosavovic, Ilija, Leal, Isabel, Liang, Jacky, Abou-Chakra, Jad, Kim, Jaehyung, Drake, Jaimyn, Peters, Jan, Schneider, Jan, Hsu, Jasmine, Vakil, Jay, Bohg, Jeannette, Bingham, Jeffrey, Wu, Jeffrey, Gao, Jensen, Hu, Jiaheng, Wu, Jiajun, Wu, Jialin, Sun, Jiankai, Luo, Jianlan, Gu, Jiayuan, Tan, Jie, Oh, Jihoon, Wu, Jimmy, Lu, Jingpei, Yang, Jingyun, Malik, Jitendra, Silvério, João, Hejna, Joey, Booher, Jonathan, Tompson, Jonathan, Yang, Jonathan, Salvador, Jordi, Lim, Joseph J., Han, Junhyek, Wang, Kaiyuan, Rao, Kanishka, Pertsch, Karl, Hausman, Karol, Go, Keegan, Gopalakrishnan, Keerthana, Goldberg, Ken, Byrne, Kendra, Oslund, Kenneth, Kawaharazuka, Kento, Black, Kevin, Lin, Kevin, Zhang, Kevin, Ehsani, Kiana, Lekkala, Kiran, Ellis, Kirsty, Rana, Krishan, Srinivasan, Krishnan, Fang, Kuan, Singh, Kunal Pratap, Zeng, Kuo-Hao, Hatch, Kyle, Hsu, Kyle, Itti, Laurent, Chen, Lawrence Yunliang, Pinto, Lerrel, Fei-Fei, Li, Tan, Liam, Fan, Linxi "Jim", Ott, Lionel, Lee, Lisa, Weihs, Luca, Chen, Magnum, Lepert, Marion, Memmel, Marius, Tomizuka, Masayoshi, Itkina, Masha, Castro, Mateo Guaman, Spero, Max, Du, Maximilian, Ahn, Michael, Yip, Michael C., Zhang, Mingtong, Ding, Mingyu, Heo, Minho, Srirama, Mohan Kumar, Sharma, Mohit, Kim, Moo Jin, Kanazawa, Naoaki, Hansen, Nicklas, Heess, Nicolas, Joshi, Nikhil J, Suenderhauf, Niko, Liu, Ning, Di Palo, Norman, Shafiullah, Nur Muhammad Mahi, Mees, Oier, Kroemer, Oliver, Bastani, Osbert, Sanketi, Pannag R, Miller, Patrick "Tree", Yin, Patrick, Wohlhart, Paul, Xu, Peng, Fagan, Peter David, Mitrano, Peter, Sermanet, Pierre, Abbeel, Pieter, Sundaresan, Priya, Chen, Qiuyu, Vuong, Quan, Rafailov, Rafael, Tian, Ran, Doshi, Ria, Mart'in-Mart'in, Roberto, Baijal, Rohan, Scalise, Rosario, Hendrix, Rose, Lin, Roy, Qian, Runjia, Zhang, Ruohan, Mendonca, Russell, Shah, Rutav, Hoque, Ryan, Julian, Ryan, Bustamante, Samuel, Kirmani, Sean, Levine, Sergey, Lin, Shan, Moore, Sherry, Bahl, Shikhar, Dass, Shivin, Sonawani, Shubham, Tulsiani, Shubham, Song, Shuran, Xu, Sichun, Haldar, Siddhant, Karamcheti, Siddharth, Adebola, Simeon, Guist, Simon, Nasiriany, Soroush, Schaal, Stefan, Welker, Stefan, Tian, Stephen, Ramamoorthy, Subramanian, Dasari, Sudeep, Belkhale, Suneel, Park, Sungjae, Nair, Suraj, Mirchandani, Suvir, Osa, Takayuki, Gupta, Tanmay, Harada, Tatsuya, Matsushima, Tatsuya, Xiao, Ted, Kollar, Thomas, Yu, Tianhe, Ding, Tianli, Davchev, Todor, Zhao, Tony Z., Armstrong, Travis, Darrell, Trevor, Chung, Trinity, Jain, Vidhi, Kumar, Vikash, Vanhoucke, Vincent, Zhan, Wei, Zhou, Wenxuan, Burgard, Wolfram, Chen, Xi, Chen, Xiangyu, Wang, Xiaolong, Zhu, Xinghao, Geng, Xinyang, Liu, Xiyuan, Liangwei, Xu, Li, Xuanlin, Pang, Yansong, Lu, Yao, Ma, Yecheng Jason, Kim, Yejin, Chebotar, Yevgen, Zhou, Yifan, Zhu, Yifeng, Wu, Yilin, Xu, Ying, Wang, Yixuan, Bisk, Yonatan, Dou, Yongqiang, Cho, Yoonyoung, Lee, Youngwoon, Cui, Yuchen, Cao, Yue, Wu, Yueh-Hua, Tang, Yujin, Zhu, Yuke, Zhang, Yunchu, Jiang, Yunfan, Li, Yunshuang, Li, Yunzhu, Iwasawa, Yusuke, Matsuo, Yutaka, Ma, Zehan, Xu, Zhuo, Cui, Zichen Jeff, Zhang, Zichen, Fu, Zipeng, and Lin, Zipeng
- Subjects
Computer Science - Robotics - Abstract
Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning methods train a separate model for every application, every robot, and even every environment. Can we instead train generalist X-robot policy that can be adapted efficiently to new robots, tasks, and environments? In this paper, we provide datasets in standardized data formats and models to make it possible to explore this possibility in the context of robotic manipulation, alongside experimental results that provide an example of effective X-robot policies. We assemble a dataset from 22 different robots collected through a collaboration between 21 institutions, demonstrating 527 skills (160266 tasks). We show that a high-capacity model trained on this data, which we call RT-X, exhibits positive transfer and improves the capabilities of multiple robots by leveraging experience from other platforms. More details can be found on the project website https://robotics-transformer-x.github.io., Comment: Project website: https://robotics-transformer-x.github.io
- Published
- 2023
33. Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI
- Author
-
Jin, Emily, Hu, Jiaheng, Huang, Zhuoyi, Zhang, Ruohan, Wu, Jiajun, Fei-Fei, Li, and Martín-Martín, Roberto
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Robotics - Abstract
We present Mini-BEHAVIOR, a novel benchmark for embodied AI that challenges agents to use reasoning and decision-making skills to solve complex activities that resemble everyday human challenges. The Mini-BEHAVIOR environment is a fast, realistic Gridworld environment that offers the benefits of rapid prototyping and ease of use while preserving a symbolic level of physical realism and complexity found in complex embodied AI benchmarks. We introduce key features such as procedural generation, to enable the creation of countless task variations and support open-ended learning. Mini-BEHAVIOR provides implementations of various household tasks from the original BEHAVIOR benchmark, along with starter code for data collection and reinforcement learning agent training. In essence, Mini-BEHAVIOR offers a fast, open-ended benchmark for evaluating decision-making and planning solutions in embodied AI. It serves as a user-friendly entry point for research and facilitates the evaluation and development of solutions, simplifying their assessment and development while advancing the field of embodied AI. Code is publicly available at https://github.com/StanfordVL/mini_behavior.
- Published
- 2023
34. D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
- Author
-
Wang, Yixuan, Zhang, Mingtong, Li, Zhuoran, Kelestemur, Tarik, Driggs-Campbell, Katherine, Wu, Jiajun, Fei-Fei, Li, and Li, Yunzhu
- Subjects
Computer Science - Robotics ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Scene representation is a crucial design choice in robotic manipulation systems. An ideal representation is expected to be 3D, dynamic, and semantic to meet the demands of diverse manipulation tasks. However, previous works often lack all three properties simultaneously. In this work, we introduce D$^3$Fields -- dynamic 3D descriptor fields. These fields are implicit 3D representations that take in 3D points and output semantic features and instance masks. They can also capture the dynamics of the underlying 3D environments. Specifically, we project arbitrary 3D points in the workspace onto multi-view 2D visual observations and interpolate features derived from visual foundational models. The resulting fused descriptor fields allow for flexible goal specifications using 2D images with varied contexts, styles, and instances. To evaluate the effectiveness of these descriptor fields, we apply our representation to rearrangement tasks in a zero-shot manner. Through extensive evaluation in real worlds and simulations, we demonstrate that D$^3$Fields are effective for zero-shot generalizable rearrangement tasks. We also compare D$^3$Fields with state-of-the-art implicit 3D representations and show significant improvements in effectiveness and efficiency., Comment: Accepted to Conference on Robot Learning (CoRL 2024) as Oral Presentation. The first three authors contributed equally. Project Page: https://robopil.github.io/d3fields/
- Published
- 2023
35. MindAgent: Emergent Gaming Interaction
- Author
-
Gong, Ran, Huang, Qiuyuan, Ma, Xiaojian, Vo, Hoi, Durante, Zane, Noda, Yusuke, Zheng, Zilong, Zhu, Song-Chun, Terzopoulos, Demetri, Fei-Fei, Li, and Gao, Jianfeng
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Human-Computer Interaction ,Computer Science - Multiagent Systems - Abstract
Large Language Models (LLMs) have the capacity of performing complex scheduling in a multi-agent system and can coordinate these agents into completing sophisticated tasks that require extensive collaboration. However, despite the introduction of numerous gaming frameworks, the community has insufficient benchmarks towards building general multi-agents collaboration infrastructure that encompass both LLM and human-NPCs collaborations. In this work, we propose a novel infrastructure - MindAgent - to evaluate planning and coordination emergent capabilities for gaming interaction. In particular, our infrastructure leverages existing gaming framework, to i) require understanding of the coordinator for a multi-agent system, ii) collaborate with human players via un-finetuned proper instructions, and iii) establish an in-context learning on few-shot prompt with feedback. Furthermore, we introduce CUISINEWORLD, a new gaming scenario and related benchmark that dispatch a multi-agent collaboration efficiency and supervise multiple agents playing the game simultaneously. We conduct comprehensive evaluations with new auto-metric CoS for calculating the collaboration efficiency. Finally, our infrastructure can be deployed into real-world gaming scenarios in a customized VR version of CUISINEWORLD and adapted in existing broader Minecraft gaming domain. We hope our findings on LLMs and the new infrastructure for general-purpose scheduling and coordination can help shed light on how such skills can be obtained by learning from large language corpora., Comment: The first three authors contributed equally. 28 pages
- Published
- 2023
36. Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation
- Author
-
Chen, Yuanpei, Wang, Chen, Fei-Fei, Li, and Liu, C. Karen
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Many real-world manipulation tasks consist of a series of subtasks that are significantly different from one another. Such long-horizon, complex tasks highlight the potential of dexterous hands, which possess adaptability and versatility, capable of seamlessly transitioning between different modes of functionality without the need for re-grasping or external tools. However, the challenges arise due to the high-dimensional action space of dexterous hand and complex compositional dynamics of the long-horizon tasks. We present Sequential Dexterity, a general system based on reinforcement learning (RL) that chains multiple dexterous policies for achieving long-horizon task goals. The core of the system is a transition feasibility function that progressively finetunes the sub-policies for enhancing chaining success rate, while also enables autonomous policy-switching for recovery from failures and bypassing redundant stages. Despite being trained only in simulation with a few task objects, our system demonstrates generalization capability to novel object shapes and is able to zero-shot transfer to a real-world robot equipped with a dexterous hand. Code and videos are available at https://sequential-dexterity.github.io, Comment: 7th Conference on Robot Learning (CoRL 2023)
- Published
- 2023
37. Dynamics of agricultural system vulnerability to climate change and the externalities of its mitigation in China
- Author
-
Yingying, Wang, Yibin, Wang, and Fei, Li
- Published
- 2024
- Full Text
- View/download PDF
38. Expression of NLRP3 in serum and induced sputum of children with asthma and their relationship with disease severity
- Author
-
Fei Li and Zhiping Liu
- Subjects
Asthma ,Lung function ,NLRP3 ,Disease severity ,Cytokines ,Medicine - Abstract
Abstract Objectives The aim of this cross-sectional study is to investigate the levels of NLRP3 in the serum and induced sputum of children with asthma and their potential association with lung function and disease severity. Methods This cross-sectional study included 83 children with bronchial asthma who sought medical care at our hospital from May 2023 to February 2024. Portable spirometry was used to monitor lung function parameters, including forced vital capacity, forced expiratory volume in 1 s, peak expiratory flow. The expression of C-reactive protein (CRP), interleukin (IL)-6, IL-1β, TNF-α, and Nod-like receptor family pyrin domain-containing 3 (NLRP3) in the serum and induced sputum were measured by enzyme-linked immunosorbent assay. Data analysis was performed using SPSS 25.0 and differences with P
- Published
- 2024
- Full Text
- View/download PDF
39. Stability of Cauchy horizon in Einstein–Power–Maxwell–de Sitter black holes
- Author
-
Lu Chen and Fei Li
- Subjects
Astrophysics ,QB460-466 ,Nuclear and particle physics. Atomic energy. Radioactivity ,QC770-798 - Abstract
Abstract We investigated the Strong Cosmic Censorship (SCC) conjecture in Einstein–Power–Maxwell–de Sitter (EPMdS) black holes by analyzing the quasinormal modes (QNMs) of a massless neutral scalar field perturbation. Using the pseudospectral method, we calculated the QNM frequencies across various cosmological constants and nonlinear electromagnetic parameters. Our results show that for black holes far from extremality, the lowest QNM frequency $$-\text {Im} (\omega )/\kappa _-$$ - Im ( ω ) / κ - remains below 1/2. However, as the black holes approach extremality, the lowest mode’s $$-\text {Im} (\omega )/\kappa _-$$ - Im ( ω ) / κ - consistently exceeds 1/2, leading to a violation of SCC. Moreover, with a fixed cosmological constant, as the nonlinearity parameter $$\alpha $$ α increases, the interval of SCC violation under the charge extremality ratio narrows. Considering that $$k = 1/2 + 1/\alpha $$ k = 1 / 2 + 1 / α is inversely proportional to $$\alpha $$ α , our results indicate that increasing the order k of the non-linear electromagnetic field can effectively mitigate SCC violations.
- Published
- 2024
- Full Text
- View/download PDF
40. CRISPR/Cas9-mediated mutation of Mstn confers growth performance in Culter alburnus juveniles
- Author
-
Jianbo Zheng, Shili Liu, Wenping Jiang, Fei Li, Meili Chi, Shun Cheng, and Yinuo Liu
- Subjects
mstn ,Culter alburnus ,CRISPR/Cas9 ,Growth performance ,Aquaculture. Fisheries. Angling ,SH1-691 - Abstract
Myostatin is a member of the TGF-β superfamily and functions as a negative regulator for skeletal muscle development and growth. It has become the most targeted gene in aquaculture that used for selective breeding. Previous studies involved in genome editing in several fish species confirmed that CRISPR/Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats) system was highly efficient with lower off-target effect, however, no reports were raised in Culter alburnus. In this study, we employed CRISPR/Cas9 gene editing system to successfully disrupt mstn gene by co-injection with Cas9 protein and the targeted sgRNA in C. alburnus. Various Indel mutations were obtained with 82% knockout efficiency in the F0 generation by PCR sequencing. In addition, mutations in mstn that induced by CRISPR/Cas9 were detected in the F1 generation by individually mating the wild-type female with the F0 generation of mstn-KO male at sexual maturity. More importantly, the body weight and length were significantly elevated in mstn ± group when compared to those of the control. As expected in mstn ± group, the expression level of mstn was sharply reduced, whereas a slight increase was observed in two growth-related genes (myod and myog). Moreover, higher numbers of muscle fibers were observed in mstn ± group, meaning that growth performance in mstn ± individuals might be represented by increasing the number of muscle fibers. Taken together, our current study successfully obtained a site-specific modification of mstn using CRISPR/Cas9 technology, and these results provided a new insight for facilitating topmouth culter genetic studies and breeding.
- Published
- 2024
- Full Text
- View/download PDF
41. Bibliometric analysis of microphthalmos and anophthalmos over 20 years: from 2004 to 2023
- Author
-
Ming-Hui Wang, Gong-Fei Li, and Ju Zhang
- Subjects
bibliometric analysis ,network analysis ,microphthalmos ,anophthalmos ,Ophthalmology ,RE1-994 - Abstract
AIM: To conduct a bibliometric analysis of studies on microphthalmos and anophthalmos (M/A), explore research hotspots, and provide information on future research interests in this field to benefit clinicians and researchers. METHODS: Totally 751 publications related to M/A from the year 2004 to 2023 were collected from the Web of Science Core Collection database. These publications consist of both original and review articles, that are composed in English. The contributions of different countries, institutions, journals, and authors were analyzed, and network analysis was conducted by using Microsoft Excel 2021, VOSviewer, and R Studio to visualize research hotspots. RESULTS: Among all publications included, the highest number of publications came from USA (218, 29.03%). China followed with 99 publications (13.18%), and England with 86 publications (11.45%). The publications from the USA had the highest frequency of citations, with 16 699 citations, and the highest H-index of 49. The American Journal of Medical Genetics Part A (43, 5.73%) published the largest number of papers, and the University of London had the most publications (41, 5.46%). The genetic and molecular mechanisms of M/A were still unclear and the clinical intervention for M/A had gained a lot of attention as an emerging area of interest. CONCLUSION: Data have been gathered on the yearly count of published materials and citations, as well as the rise in publication trends, the efficiency of regions or countries, authors, journals, and organizations, along with the high-cited publications in M/A. The recent trend of research has shifted from genetic mechanisms to different clinical phenotypes and corresponding clinical interventions, which can give direction to future research.
- Published
- 2024
- Full Text
- View/download PDF
42. Practice and Application of Regrinding and Re-election of Swept Concentrate from a Low-grade Difficult Gold Ore in Qinghai Province
- Author
-
Pingtian MING, Fei LI, Ziqiang CHEN, Zhaohua XIONG, and Mengzhong HU
- Subjects
mining processing engineering ,high arsenic and high carbon ,monomer dissociation degree ,scavenging concentrate ,recovery rate ,technological innovation ,Mining engineering. Metallurgy ,TN1-997 - Abstract
This is an article in the field of mining processing engineering. The gold grade of a low-grade, high-arsenic, high-carbon refractory gold ore in Qinghai is about 2.38 g/t, and the industrial recovery rate of gold is about 77%. Through process flow investigation and mineralogical analysis, the gold grade of the mine's concentrator scavenging concentrate is about 3~6 g/t. The main gold-bearing minerals, arsenopyrite and pyrite, have low dissociation degrees of 76.23% and 78.74%, respectively. When floated separately, the gold recovery rate is about 30%~50%. In order to further improve the recovery rate of gold in this refractory gold mine, the mine conducted laboratory simulation of the process flow of the concentrator to carry out the experimental study of regrinding and re-separation of the scavenging concentrate, formulated the process flow scheme of returning the scavenging concentrate to the second stage grading pump pool in a centralized manner, and completed the process technology improvement design and process technology improvement practice of the concentrator. The production application results after the technological transformation showed that the single dissociation degree of the main gold-bearing minerals, arsenopyrite and pyrite, increased to 78.03% and 80.63% respectively when the scavenging concentrate was returned in a centralized manner, compared to the sequential return. The regrinding and re-separation process of the scavenging concentrate did not affect the grade of the concentrate under the premise of centralized return. The recovery rate of gold in the concentrator was increased from 77.14% to 81.13%, effectively improving the gold recovery index of this refractory gold mine.
- Published
- 2024
- Full Text
- View/download PDF
43. Ruxolitinib plus steroids for acute graft versus host disease: a multicenter, randomized, phase 3 trial
- Author
-
Liping Dou, Yanli Zhao, Jingjing Yang, Lei Deng, Nan Wang, Xiawei Zhang, Qingyang Liu, Yan Yang, Zhijie Wei, Fuxu Wang, Yifan Jiao, Fei Li, Songhua Luan, Liangding Hu, Sujun Gao, Chuanfang Liu, Xiangjun Liu, Jinsong Yan, Xuejun Zhang, Fang Zhou, Peihua Lu, and Daihong Liu
- Subjects
Medicine ,Biology (General) ,QH301-705.5 - Abstract
Abstract Newly diagnosed patients with high-risk acute graft-versus-host disease (aGVHD) often experience poor clinical outcomes and low complete remission rates. Ruxolitinib with corticosteroids showed promising efficacy in improving response and failure free survival in our phase I study. This study (ClinicalTrials.gov: NCT04061876) sought to evaluate the safety and effectiveness of combining ruxolitinib (RUX, 5 mg/day) with corticosteroids (1 mg/kg/day methylprednisolone, RUX/steroids combined group) versus using methylprednisolone alone (2 mg/kg/day, steroids-only group). Newly diagnosed patients with intermediate- or high-risk aGVHD were included, with risk levels classified by either the Minnesota aGVHD Risk Score or biomarker assessment. Patients were randomized in a ratio of 1:1 into 2 groups: 99 patients received RUX combined with methylprednisolone, while the other 99 received methylprednisolone alone as the initial treatment. The RUX/steroids group showed a significantly higher overall response rate (ORR) on day 28 (92.9%) compared to the steroids-only group (70.7%, Odds Ratio [OR] = 5.8; 95% Confidence Interval [CI], 2.4–14.0; P
- Published
- 2024
- Full Text
- View/download PDF
44. Large-Scale Surface Modification of Decellularized Matrix with Erythrocyte Membrane for Promoting In Situ Regeneration of Heart Valve
- Author
-
Yuqi Liu, Pengning Fan, Yin Xu, Junwei Zhang, Li Xu, Jinsheng Li, Shijie Wang, Fei Li, Si Chen, Jiawei Shi, Weihua Qiao, and Nianguo Dong
- Subjects
In situ tissue engineering heart valves ,Red blood cell membrane ,Endothelialization ,Hemocompatibility ,Immunomodulation ,Engineering (General). Civil engineering (General) ,TA1-2040 - Abstract
In situ regeneration is a promising strategy for constructing tissue engineering heart valves (TEHVs). Currently, the decellularized heart valve (DHV) is extensively employed as a TEHV scaffold. Nevertheless, DHV exhibits limited blood compatibility and notable difficulties in endothelialization, resulting in thrombosis and graft failure. The red blood cell membrane (RBCM) exhibits excellent biocompatibility and prolonged circulation stability and is extensively applied in the camouflage of nanoparticles for drug delivery; however, there is no report on its application for large-scale modification of decellularized extracellular matrix (ECM). For the first time, we utilized a layer-by-layer assembling strategy to immobilize RBCM on the surface of DHV and construct an innovative TEHV scaffold. Our findings demonstrated that the scaffold significantly improved the hemocompatibility of DHV by effectively preventing plasma protein adsorption, activated platelet adhesion, and erythrocyte aggregation, and induced macrophage polarization toward the M2 phenotype in vitro. Moreover, RBCM modification significantly enhanced the mechanical properties and enzymatic stability of DHV. The rat models of subcutaneous embedding and abdominal aorta implantation showed that the scaffold regulated the polarization of macrophages into the anti-inflammatory and pro-modeling M2 phenotype and promoted endothelialization and ECM remodeling in the early stage without thrombosis and calcification. The novel TEHV exhibits excellent performance and can overcome the limitations of commonly used clinical prostheses.
- Published
- 2024
- Full Text
- View/download PDF
45. Prognostic significance of preoperative nutritional status for heart transplantation patients
- Author
-
Dingyi Yao, Shirui Qian, Li Xu, Lin Fan, Fei Li, Si Chen, Jiawei Shi, and Nianguo Dong
- Subjects
Malnutrition ,Heart transplantation ,Prognosis ,Infection ,Diseases of the circulatory (Cardiovascular) system ,RC666-701 - Abstract
Abstract Background The association between malnutrition and outcomes of heart transplantation (HTx) has not been well studied. The purpose of this article was to evaluate the prognostic value of three different nutrition indices in HTx, including CONUT (Controlling Nutritional Status), NRI (Nutritional Risk Index) and GNRI (Geriatric Nutritional Risk Index). Methods A total of 438 patients who underwent THx from January 2015 to December 2020 were included in this study. The nutritional status of the patients was evaluated by CONUT, NRI and GNRI. Kaplan-Meier (KM) curves were constructed to compare the difference in overall survival (OS) between the normal and malnutrition groups in each index. Cox regression analysis was used to identify the independent risk factors of OS. The predictive power was compared by time-dependent ROC and time-dependent ccurves. Logistic regression model was used to evaluate the relationship between these three nutrition indices and postoperative clinical events. Results 336 (76.7%), 183 (43.8%), and 190 (43.4%) patients had malnutrition according to CONUT, NRI and GNRI calculations. 102 (23.3%) patients had died at the end of follow-up. After adjustment for confounding variables, multivariate Cox analysis showed that CONUT [HR 1.286 (95%CI 1.166 ~ 1.419); p
- Published
- 2024
- Full Text
- View/download PDF
46. Enhanced energy storage performance in NBT-based MLCCs via cooperative optimization of polarization and grain alignment
- Author
-
Yang Li, Ningbo Fan, Jie Wu, Bin Xu, Xuexin Li, Xuechen Liu, Yizhou Xiao, Dingwei Hou, Xinya Feng, Jinjing Zhang, Shujun Zhang, Jinglei Li, and Fei Li
- Subjects
Science - Abstract
Abstract Dielectric ceramics possess a unique competitive advantage in electronic systems due to their high-power density and excellent reliability. Na1/2Bi1/2TiO3-based ceramics, one type of extensively studied energy storage dielectric, however, often experience A-site element volatilization and Ti4+ reduction during high-temperature sintering. These issues may result in increased energy loss, reduced polarization and low dielectric breakdown electric field, ultimately making it challenging to achieve both high energy storage density and efficiency. To address these issues, we introduce a synergistic optimization strategy that combine polarization engineering and grain alignment engineering. First principles calculations and experimental analyses show that the doping of Mn2+ can suppress the reduction of Ti4+ in Na1/2Bi1/2TiO3-based ceramics and enhance ion off-centering displacements, thereby reducing energy loss and improving polarization. In addition, we prepared multilayer ceramic capacitors with grains oriented along the direction using the template grain growth method. This approach effectively reduces electric-field-induced strain by 37% and markedly enhances breakdown electric field by 42% when compared with nontextured counterpart. As a result of this comprehensive strategy, -textured Na1/2Bi1/2TiO3-based multilayer ceramic capacitors achieve an ultra-high energy density of 15.7 J·cm−3 and an excellent efficiency beyond 95% at 850 kV·cm−1, exhibiting a superior overall energy storage performance.
- Published
- 2024
- Full Text
- View/download PDF
47. Ultrahigh thermal stability and piezoelectricity of lead-free KNN-based texture piezoceramics
- Author
-
Lihui Xu, Jinfeng Lin, Yuxuan Yang, Zhihao Zhao, Xiaoming Shi, Guanglong Ge, Jin Qian, Cheng Shi, Guohui Li, Simin Wang, Yang Zhang, Peng Li, Bo Shen, Zhengqian Fu, Haijun Wu, Houbing Huang, Fei Li, Xiangdong Ding, Jun Sun, and Jiwei Zhai
- Subjects
Science - Abstract
Abstract The contradiction between high piezoelectricity and uniquely poor temperature stability generated by polymorphic phase boundary is a huge obstacle to high-performance (K, Na)NbO3 -based ceramics entering the application market as Pb-based substitutes. We possess the phase boundary by mimicking Pb(Zr, Ti)O3’s morphotropic phase boundary structure via the synergistic optimization of diffusion phase boundary and crystal orientation in 0.94(Na0.56K0.44)NbO3−0.03Bi0.5Na0.5ZrO3−0.03(Bi0.5K0.5)HfO3 textured ceramics. As a result, a prominent comprehensive performance is obtained, including giant d 33 of 550 ± 30 pC/N and ultrahigh temperature stability (d 33 change rate less than 1.2% within 25-150 °C), representing a significant breakthrough in lead-free piezoceramics, even surpassing the Pb-based piezoelectric ceramics. Within the same temperature range, the d 33 change rate of the commercial Pb(Zr, Ti)O3−5 ceramics is only about 10%, and more importantly, its d 33 (~ 350 pC/N) is much lower than that of the (K, Na)NbO3-based ceramics in this work. This study demonstrates a strategy for constructing the phase boundary with MPB feature, settling the problem of temperature instability in (K, Na)NbO3-based ceramics.
- Published
- 2024
- Full Text
- View/download PDF
48. NLRP10 maintains epidermal homeostasis by promoting keratinocyte survival and P63-dependent differentiation and barrier function
- Author
-
Yeonhee Cho, Zhongzheng Cao, Xin Luo, Jennifer J. Tian, Renee R. Hukkanen, Rajaa Hussien, Belinda Cancilla, Priyanka Chowdhury, Fei Li, Shining Ma, Edward L. LaGory, Mark Schroeder, Amanda Dusenberry, Leslie Marshall, Jenn Hawkins, Menno van Lookeren Campagne, and Yi Zhou
- Subjects
Cytology ,QH573-671 - Abstract
Abstract Atopic dermatitis (AD) is a common chronic inflammatory skin disorder characterized by disrupted epidermal barrier function and aberrant immune responses. Despite recent developments in new therapeutics for AD, there is still a large unmet medical need for disease management due to the complex and multifactorial nature of AD. Recent genome-wide association studies (GWAS) have identified NLRP10 as a susceptible gene for AD but the physiological role of NLRP10 in skin homeostasis and AD remains unknown. Here we show that NLRP10 is downregulated in AD skin samples. Using an air-lift human skin equivalent culture, we demonstrate that NLRP10 promotes keratinocyte survival and is required for epidermal differentiation and barrier function. Mechanistically, NLRP10 limits cell death by preventing the recruitment of caspase-8 to the death inducing signaling complex (DISC) and by inhibiting its subsequent activation. NLRP10 also stabilizes p63, the master regulator of keratinocyte differentiation, to drive proper keratinocyte differentiation and to reinforce the barrier function. Our findings underscore NLRP10 as a key player in atopic dermatitis pathogenesis, highlighting NLRP10 as a potential target for therapeutic intervention to restore skin barrier function and homeostasis in AD.
- Published
- 2024
- Full Text
- View/download PDF
49. Impact of Q-balls formed by first-order phase transition on sterile neutrino dark matter
- Author
-
Jiucheng Ma, Siyu Jiang, and Xiu-Fei Li
- Subjects
Astrophysics ,QB460-466 ,Nuclear and particle physics. Atomic energy. Radioactivity ,QC770-798 - Abstract
Abstract We explore the mechanism that can explain the production of lepton asymmetry and two types of sterile neutrino dark matter. The first type involves heavy sterile dark matter produced directly by the decay of Q-balls which are formed by first-order phase transition in the early universe; the second consists of keV sterile neutrino dark matter, produced resonantly with the aid of lepton asymmetry from Q-ball decay. Besides, gravitational waves from cosmic strings generated during the phase transition process could be detected at future interferometers.
- Published
- 2024
- Full Text
- View/download PDF
50. Advancing Safety in Mining: Machine Learning Approaches for Predicting and Classifying Seismic Bump-Associated Hazardous States
- Author
-
Fei Li
- Subjects
risk management ,seismic hazard prediction ,underground coal mines ,hazardous and non-hazardous states classification ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Chemical engineering ,TP155-156 ,Physics ,QC1-999 - Abstract
The foundation and presumption of underlying risk management in underground coal mines is hazard identification. Even though hazard identification techniques used in underground coal mines have been extensively studied, there is still room for improvement. Because they are experience-based or limited to a single incident or event, traditional hazard identification techniques lack a systematic and all-encompassing identification framework. The material offered explores the intricate problem of predicting high-energy seismic bumps in coal mines that are more than 10^4 Joules. The study uses 2 single predictive models (Random Forest (RF) and Support Vector Classification (SVC)) along with 2 optimization strategies (Artificial Hummingbird Algorithm (AHA) and Turbulent Flow of Water-based Optimization Algorithm (TFWOA)) to tackle this problem. These techniques are applied to improve forecast accuracy. Once the dataset has been divided into hazardous groups and those that are not, a careful analysis of the numerical results is carried out. After a thorough analysis, the most efficient model is the RFC + TFWOA (RFTF) model, which uses Random Forest Classification (RFC) optimized by Turbulent Flow of Water-based Optimization. Notably, the RFTF model attains an astounding accuracy of 0.898 throughout the training phase. This result demonstrates that the RFTF model is more effective than other models at correctly classifying states as hazardous or non-hazardous.
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.