Author: "Chen, Jindong" / Search Limiters: Academic (Peer-Reviewed) Journals - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chen, Jindong"' showing total 1,086 results

1. WebQuest: A Benchmark for Multimodal QA on Web Page Sequences

Author: Wang, Maria, Sunkara, Srinivas, Baechler, Gilles, Lin, Jason, Zhu, Yun, Zubach, Fedir, Shu, Lei, and Chen, Jindong
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: The rise of powerful multimodal LLMs has enhanced the viability of building web agents which can, with increasing levels of autonomy, assist users to retrieve information and complete tasks on various human-computer interfaces. It is hence necessary to build challenging benchmarks that span a wide variety of use cases reflecting real-world usage. In this work, we present WebQuest, a multi-page question-answering dataset that requires reasoning across multiple related web pages. In contrast to existing UI benchmarks that focus on multi-step web navigation and task completion, our dataset evaluates information extraction, multimodal retrieval and composition of information from many web pages. WebQuest includes three question categories: single-screen QA, multi-screen QA, and QA based on navigation traces. We evaluate leading proprietary multimodal models like GPT-4V, Gemini Flash, Claude 3, and open source models like InstructBLIP, PaliGemma on our dataset, revealing a significant gap between single-screen and multi-screen reasoning. Finally, we investigate inference time techniques like Chain-of-Thought prompting to improve model capabilities on multi-screen reasoning.
Published: 2024

2. Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

Author: Zhu, Yun, Gu, Jia-Chen, Sikora, Caitlin, Ko, Ho, Liu, Yinxiao, Lin, Chu-Cheng, Shu, Lei, Luo, Liangchen, Meng, Lei, Liu, Bang, and Chen, Jindong
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) augmented with retrieval exhibit robust performance and extensive versatility by incorporating external contexts. However, the input length grows linearly in the number of retrieved documents, causing a dramatic increase in latency. In this paper, we propose a novel paradigm named Sparse RAG, which seeks to cut computation costs through sparsity. Specifically, Sparse RAG encodes retrieved documents in parallel, which eliminates latency introduced by long-range attention of retrieved documents. Then, LLMs selectively decode the output by only attending to highly relevant caches auto-regressively, which are chosen via prompting LLMs with special control tokens. It is notable that Sparse RAG combines the assessment of each individual document and the generation of the response into a single process. The designed sparse mechanism in a RAG system can facilitate the reduction of the number of documents loaded during decoding for accelerating the inference of the RAG system. Additionally, filtering out undesirable contexts enhances the model's focus on relevant context, inherently improving its generation quality. Evaluation results of two datasets show that Sparse RAG can strike an optimal balance between generation quality and computational efficiency, demonstrating its generalizability across both short- and long-form generation tasks.
Published: 2024

3. Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs

Author: Carbune, Victor, Mansoor, Hassan, Liu, Fangyu, Aralikatte, Rahul, Baechler, Gilles, Chen, Jindong, and Sharma, Abhanshu
Subjects: Computer Science - Computation and Language
Abstract: Vision-language models (VLMs) are achieving increasingly strong performance on multimodal tasks. However, reasoning capabilities remain limited, particularly for smaller VLMs, while those of large language models (LLMs) have seen numerous improvements. We propose a technique to transfer capabilities from LLMs to VLMs. On the recently introduced ChartQA, our method obtains state-of-the-art performance when applied on the PaLI3-5B VLM by Chen et al. (2023), while also enabling much better performance on PlotQA and FigureQA. We first improve the chart representation by continuing the pre-training stage using an improved version of the chart-to-table translation task by Liu et al. (2023). We then propose constructing a 20x larger dataset than the original training set. To improve general reasoning capabilities and improve numerical operations, we synthesize reasoning traces using the table representation of charts. Lastly, our model is fine-tuned using the multitask loss introduced by Hsieh et al. (2023). Our variant ChartPaLI-5B outperforms even 10x larger models such as PaLIX-55B without using an upstream OCR system, while keeping inference time constant compared to the PaLI3-5B baseline. When rationales are further refined with a simple program-of-thought prompt (Chen et al., 2023), our model outperforms the recently introduced Gemini Ultra and GPT-4V.
Comment: Findings of NAACL 2024
Published: 2024

4. N-3 PUFA supplementation alleviates anxiety symptoms by manipulating erythrocyte fatty acid levels in depression

Author: Wang, Lu, Liu, Ting, Guo, Jimin, Zhao, Tingyu, Tang, Hui, Wang, Feifei, Dong, Fang, Chen, Jindong, and Tang, Mimi
Published: 2024

5. Potential of Mobile Technology to Relieve the Urgent Mental Health Needs in China: Web-Based Survey

Author: Tan, Yuxi, Teng, Ziwei, Qiu, Yan, Tang, Hui, Xiang, Hui, and Chen, Jindong
Subjects: Information technology, T58.5-58.64, Public aspects of medicine, RA1-1270
Abstract: Background: With the rapid development of information technology and mobile devices, an increasing number of mobile medical services and platforms have emerged. However, China’s current mental health situation necessitates further discussion and research on how to provide more patient-centered services in the face of many challenges and opportunities. Objective: This study aims to explore the attitudes and preferences of mental health service stakeholders regarding mobile mental health services and discuss the challenges and opportunities faced by mobile technology developers in China. Methods: A web-based survey was conducted by following the Checklist for Reporting Results of Internet E-Surveys (CHERRIES) checklist. A total of 586 valid questionnaires were collected. Respondents included 184 patients or their family members, 225 mental health professionals, and 177 people from the general population. Data analysis was completed using SPSS 24.0. Results: Among the various problems perceived regarding the current mental health medical environment, difficulty in finding appropriate psychologists and limited visit times ranked highest. Social media (n=380/586, 64.9%) was the most preferred platform among all participants, whereas professionals showed a higher preference for smartphone apps (n=169/225, 75.1%). Professional instruction, psychological consultation, and mental health education (ranked top 3) were the most commonly identified needs. Mental health professionals generally emphasized more on treatment-related mobile mental health service needs, especially medication reminders (χ²₂=70.7; P
Published: 2020

6. ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Author: Baechler, Gilles, Sunkara, Srinivas, Wang, Maria, Zubach, Fedir, Mansoor, Hassan, Etter, Vincent, Cărbune, Victor, Lin, Jason, Chen, Jindong, and Sharma, Abhanshu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Screen user interfaces (UIs) and infographics, sharing similar visual language and design principles, play important roles in human communication and human-machine interaction. We introduce ScreenAI, a vision-language model that specializes in UI and infographics understanding. Our model improves upon the PaLI architecture with the flexible patching strategy of pix2struct and is trained on a unique mixture of datasets. At the heart of this mixture is a novel screen annotation task in which the model has to identify the type and location of UI elements. We use these text annotations to describe screens to Large Language Models and automatically generate question-answering (QA), UI navigation, and summarization training datasets at scale. We run ablation studies to demonstrate the impact of these design choices. At only 5B parameters, ScreenAI achieves new state-of-the-art results on UI- and infographics-based tasks (Multi-page DocVQA, WebSRC, MoTIF and Widget Captioning), and new best-in-class performance on others (Chart QA, DocVQA, and InfographicVQA) compared to models of similar size. Finally, we release three new datasets: one focused on the screen annotation task and two others focused on question answering.
Comment: Accepted to International Joint Conference on Artificial Intelligence (IJCAI), 2024. Revision Notes: full version of the paper, including 1) Camera-ready version for IJCAI-24; 2) Appendices that are mentioned, but not included in 1)
Published: 2024

7. Effect of propranolol on pharmacokinetics of clozapine in schizophrenic patients: a meta-analysis

Author: Yang, Xiding, Yan, Qiangyong, Yang, Lingfeng, Li, Jingjing, Fan, Xiao, Chen, Jindong, Wu, Haishan, Yang, Yongyu, Zhu, Ronghua, and Fang, Pingfei
Published: 2024

8. Preparation, Evaluation and Field Application of Thermally Induced Crosslinked Polymer Gel Leakage Plugging Agent

Author: Chen, Jindong, Qiu, Hengbin, Djouonkep, Lesly Dasilva Wandji, Lv, Junxian, and Xie, Binqiang
Published: 2024

9. Fusion-Eval: Integrating Assistant Evaluators with LLMs

Author: Shu, Lei, Wichers, Nevan, Luo, Liangchen, Zhu, Yun, Liu, Yinxiao, Chen, Jindong, and Meng, Lei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Evaluating natural language systems poses significant challenges, particularly in the realms of natural language understanding and high-level reasoning. In this paper, we introduce 'Fusion-Eval', an innovative approach that leverages Large Language Models (LLMs) to integrate insights from various assistant evaluators. The LLM is given the example to evaluate along with scores from the assistant evaluators. Each of these evaluators specializes in assessing distinct aspects of responses. Fusion-Eval achieves a 0.962 system-level Kendall-Tau correlation with humans on SummEval and a 0.744 turn-level Spearman correlation on TopicalChat, which is significantly higher than baseline methods. These results highlight Fusion-Eval's significant potential in the realm of natural language system evaluation.
Published: 2023

10. SiRA: Sparse Mixture of Low Rank Adaptation

Author: Zhu, Yun, Wichers, Nevan, Lin, Chu-Cheng, Wang, Xinyi, Chen, Tianlong, Shu, Lei, Lu, Han, Liu, Canoee, Luo, Liangchen, Chen, Jindong, and Meng, Lei
Subjects: Computer Science - Computation and Language
Abstract: Parameter Efficient Tuning has been a prominent approach to adapt the Large Language Model to downstream tasks. Most previous work considers adding dense trainable parameters, where all parameters are used to adapt to a certain task. We found this less effective empirically: using the example of LoRA, introducing more trainable parameters does not help. Motivated by this, we investigate the importance of leveraging "sparse" computation and propose SiRA: sparse mixture of low rank adaptation. SiRA leverages the Sparse Mixture of Experts (SMoE) to boost the performance of LoRA. Specifically, it enforces top-k expert routing with a capacity limit restricting the maximum number of tokens each expert can process. We propose a novel and simple expert dropout on top of the gating network to reduce the over-fitting issue. Through extensive experiments, we verify that SiRA performs better than LoRA and other mixture-of-experts approaches across different single-task and multitask settings.
Published: 2023

11. Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer

Author: Tan, Bowen, Zhu, Yun, Liu, Lijuan, Xing, Eric, Hu, Zhiting, and Chen, Jindong
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Large language models (LLMs) such as T0, FLAN, and OPT-IML, excel in multi-tasking under a unified instruction-following paradigm, where they also exhibit remarkable generalization abilities to unseen tasks. Despite their impressive performance, these LLMs, with sizes ranging from several billion to hundreds of billions of parameters, demand substantial computational resources, making their training and inference expensive and inefficient. Furthermore, adapting these models to downstream applications, particularly complex tasks, is often unfeasible due to the extensive hardware requirements for finetuning, even when utilizing parameter-efficient approaches such as prompt tuning. Additionally, the most powerful multi-task LLMs, such as OPT-IML-175B and FLAN-PaLM-540B, are not publicly accessible, severely limiting their customization potential. To address these challenges, we introduce a pretrained small scorer, Cappy, designed to enhance the performance and efficiency of multi-task LLMs. With merely 360 million parameters, Cappy either functions independently on classification tasks or serves as an auxiliary component for LLMs, boosting their performance. Moreover, Cappy enables efficiently integrating downstream supervision without requiring LLM finetuning or access to their parameters. Our experiments demonstrate that, when working independently on 11 language understanding tasks from PromptSource, Cappy outperforms LLMs that are several orders of magnitude larger. Besides, on 45 complex tasks from BIG-Bench, Cappy boosts the performance of the advanced multi-task LLM, FLAN-T5, by a large margin. Furthermore, Cappy is flexible to cooperate with other LLM adaptations, including finetuning and in-context learning, offering additional performance enhancement.
Comment: In proceedings of NeurIPS 2023; Code and model available at https://github.com/tanyuqian/cappy and https://huggingface.co/btan2/cappy-large, respectively
Published: 2023

12. RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

Author: Tan, Bowen, Zhu, Yun, Liu, Lijuan, Wang, Hongyi, Zhuang, Yonghao, Chen, Jindong, Xing, Eric, and Hu, Zhiting
Subjects: Computer Science - Machine Learning
Abstract: The recent progress of AI can be largely attributed to large language models (LLMs). However, their escalating memory requirements introduce challenges for machine learning (ML) researchers and engineers. Addressing this requires developers to partition a large model to distribute it across multiple GPUs or TPUs. This necessitates considerable coding and intricate configuration efforts with existing model parallel tools, such as Megatron-LM, DeepSpeed, and Alpa. These tools require users' expertise in machine learning systems (MLSys), creating a bottleneck in LLM development, particularly for developers without MLSys background. In this work, we present RedCoast (Redco), a lightweight and user-friendly tool crafted to automate distributed training and inference for LLMs, as well as to simplify ML pipeline development. The design of Redco emphasizes two key aspects. Firstly, to automate model parallelism, our study identifies two straightforward rules to generate tensor parallel strategies for any given LLM. Integrating these rules into Redco facilitates effortless distributed LLM training and inference, eliminating the need for additional coding or complex configurations. We demonstrate the effectiveness by applying Redco on a set of LLM architectures, such as GPT-J, LLaMA, T5, and OPT, up to the size of 66B. Secondly, we propose a mechanism that allows for the customization of diverse ML pipelines through the definition of merely three functions, avoiding redundant and formulaic code like multi-host related processing. This mechanism proves adaptable across a spectrum of ML algorithms, from foundational language modeling to complex algorithms like meta-learning and reinforcement learning. As a result, Redco implementations exhibit significantly fewer lines of code compared to their official counterparts.
Comment: RedCoast (Redco) has been released under Apache License 2.0 at https://github.com/tanyuqian/redco
Published: 2023

13. Towards an On-device Agent for Text Rewriting

Author: Zhu, Yun, Liu, Yinxiao, Stahlberg, Felix, Kumar, Shankar, Chen, Yu-hui, Luo, Liangchen, Shu, Lei, Liu, Renjie, Chen, Jindong, and Meng, Lei
Subjects: Computer Science - Computation and Language
Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities for text rewriting. Nonetheless, the large sizes of these models make them impractical for on-device inference, which would otherwise allow for enhanced privacy and economical inference. Creating a smaller yet potent language model for text rewriting presents a formidable challenge because it requires balancing the need for a small size with the need to retain the emergent capabilities of the LLM, which requires costly data collection. To address the above challenge, we introduce a new instruction tuning approach for building a mobile-centric text rewriting model. Our strategies enable the generation of high quality training data without any human labeling. In addition, we propose a heuristic reinforcement learning framework which substantially enhances performance without requiring preference data. To further bridge the performance gap with the larger server-side model, we propose an effective approach that combines the mobile rewrite agent with the server model using a cascade. To tailor the text rewriting tasks to mobile scenarios, we introduce MessageRewriteEval, a benchmark that focuses on text rewriting for messages through natural language instructions. Through empirical experiments, we demonstrate that our on-device model surpasses the current state-of-the-art LLMs in text rewriting while maintaining a significantly reduced model size. Notably, we show that our proposed cascading approach improves model performance.
Published: 2023

14. The relationship between childhood trauma and mental health status among Chinese vocational high school adolescents: the mediating effect of poor self-control and internet addiction

Author: Su, Yuhan, Chen, Zhuohui, Teng, Ziwei, Liu, Mengdong, Yang, Yuanguang, Chen, Jindong, Liu, Xiaoman, and Huang, Jing
Published: 2024

15. Clinical characteristics of 217 Chinese cases with depersonalization/derealization disorder

Author: Song, Mingkang, Zheng, Sisi, Song, Nan, Zhu, Hong, Jia, Yuan, Dai, Zhiqing, Liu, Xinzi, Wu, Ziyao, Duan, Yuhang, Huang, Zhiyuan, Chen, Jindong, and Jia, Hongxiao
Published: 2024

16. Association among abnormal glycolipids, reproductive hormones, and cognitive dysfunction in female patients with bipolar disorder

Author: Zou, Tianxiang, Yang, Min, Chen, Zhuohui, Xie, Haiqing, Huang, Jing, Qin, Yue, Liu, Furu, Chen, Haiyu, Xu, Xuelei, Chen, Jindong, Tang, Hui, Xiang, Hui, Wu, Haishan, Liu, MingHui, Luo, Wenbo, Liu, Jieyu, and Teng, Ziwei
Published: 2024

17. Association of childhood trauma, social support, cognition, and suicidality in females with bipolar disorder

Author: Yang, Min, Li, Jiaxin, Fu, Yaqian, Wang, Guotao, Liu, Minghui, Chen, Jindong, and Liu, Jieyu
Published: 2024

18. Prevalence of thalassemia-carrier couples and fertility risk assessment

Author: Wu, Liu-Song, Luo, Xi, Tan, Mei, Zhang, Li-Jun, Luo, Hong-Fang, Huang, Ge, Huang, Pei, Chen, Jindong, and Chen, Yan
Published: 2024

19. RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting

Author: Shu, Lei, Luo, Liangchen, Hoskere, Jayakumar, Zhu, Yun, Liu, Yinxiao, Tong, Simon, Chen, Jindong, and Meng, Lei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in creative tasks such as storytelling and E-mail generation. However, as LLMs are primarily trained on final text results rather than intermediate revisions, it might be challenging for them to perform text rewriting tasks. Most studies in the rewriting tasks focus on a particular transformation type within the boundaries of single sentences. In this work, we develop new strategies for instruction tuning and reinforcement learning to better align LLMs for cross-sentence rewriting tasks using diverse wording and structures expressed through natural languages, including 1) generating rewriting instruction data from Wiki edits and public corpus through instruction generation and chain-of-thought prompting; 2) collecting comparison data for reward model training through a new ranking function. To facilitate this research, we introduce OpenRewriteEval, a novel benchmark that covers a wide variety of rewriting types expressed through natural language instructions. Our results show significant improvements over a variety of baselines. The public repository is available on GitHub under Google Research (https://github.com/google-research/google-research/tree/master/rewritelm).
Published: 2023

20. ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Author: Hsiao, Yu-Chung, Zubach, Fedir, Baechler, Gilles, Carbune, Victor, Lin, Jason, Wang, Maria, Sunkara, Srinivas, Zhu, Yun, and Chen, Jindong
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction
Abstract: We present a new benchmark and dataset, ScreenQA, for screen content understanding via question answering. The existing screen datasets are focused either on structure and component-level understanding, or on a much higher-level composite task such as navigation and task completion. We attempt to bridge the gap between these two by annotating 86K question-answer pairs over the RICO dataset in the hope of benchmarking screen reading comprehension capacity. This work is also the first to annotate answers for different application scenarios, including both full sentences and short forms, as well as supporting UI contents on screen and their bounding boxes. With the rich annotation, we discuss and define the evaluation metrics of the benchmark, show applications of the dataset, and provide a few baselines using closed and open source models.
Published: 2022

21. DETR++: Taming Your Multi-Scale Detection Transformer

Author: Zhang, Chi, Liu, Lijuan, Zang, Xiaoxue, Liu, Frederick, Zhang, Hao, Song, Xinying, and Chen, Jindong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Convolutional Neural Networks (CNN) have dominated the field of detection ever since the success of AlexNet in ImageNet classification [12]. With the sweeping reform of Transformers [27] in natural language processing, Carion et al. [2] introduce the Transformer-based detection method, i.e., DETR. However, due to the quadratic complexity in the self-attention mechanism in the Transformer, DETR is never able to incorporate multi-scale features as performed in existing CNN-based detectors, leading to inferior results in small object detection. To mitigate this issue and further improve performance of DETR, in this work, we investigate different methods to incorporate multi-scale features and find that a Bi-directional Feature Pyramid (BiFPN) works best with DETR in further raising the detection precision. With this discovery, we propose DETR++, a new architecture that improves detection results by 1.9% AP on MS COCO 2017, 11.5% AP on RICO icon detection, and 9.1% AP on RICO layout extraction over existing baselines.
Comment: T4V: Transformers for Vision workshop @ CVPR 2022
Published: 2022

22. Preparation and evaluation of an ultra-high temperature resistant zwitterionic polymer viscosity reducer

Author: Deng, Hu, Tao, Huaizhi, Ai, Jiawei, Chen, Jindong, Xie, Binqiang, and Djouonkep, Lesly Dasilva Wandji
Published: 2024

23. Olanzapine enhances early brain maturation through activation of the NODAL/FOXH1 axis

Author: Teng, Ziwei, Chen, Zhuohui, Zou, Tianxiang, Su, Yuhan, Zeng, Hongtao, Sun, Shixiong, Chen, Haiyu, Liu, Jieyu, Qin, Yue, Wu, Haishan, Chen, Jindong, Liu, Fangkun, Wu, Renrong, and Huang, Jing
Published: 2024

24. Quality improvement of BaGa4Se7 crystal by annealing in BaSe vapor atmosphere

Author: Sun, Mengran, Shi, Jinlong, Chen, Jindong, Li, Chunxiao, and Yao, Jiyong
Published: 2024

25. Differences in cognitive impairment and its correlation with circulating cell-free mitochondrial DNA in medication-free depression and bipolar depression patients

Author: Jin, Kun, Teng, Ziwei, Li, Jiaxin, Qiu, Yan, Li, Sujuan, Xu, Xuelei, Wang, Lu, Chen, Jindong, Huang, Jing, Xiang, Hui, Wu, Haishan, and Tang, Hui
Published: 2025

26. Metabolic profiling of lumateperone in vitro and in vivo by UPLC-Q Exactive Orbitrap HRMS, and its pharmacokinetic study in rat plasma by LC-MS/MS

Author: Qiu, Yifan, Guo, Jing, Chen, Jindong, Zhang, Wenjing, and Wang, Wenyan
Published: 2024

27. Intervertebral disc injury triggers neurogenic inflammation of adjacent healthy discs

Author: Li, Yongchao, Dai, Chen, Wu, Bing, Yang, Liang, Yan, Xiujie, Liu, Tanghua, Chen, Jindong, Zheng, Zhaomin, and Peng, Baogan
Published: 2024

28. CD24 promotes metastasis and chemoresistance by directly targeting Arf6-ERK pathway in esophageal squamous cell carcinoma

Author: Hong, Pan, Xu, Taoyang, Xu, Jiaojiao, Chen, Wenyou, Hu, Huifang, Chen, Jindong, Li, Lan, Zheng, Cancan, Li, Bin, Liu, Jun, Dai, Wei, Li, Enmin, Zhang, Fan, and Xu, Wenwen
Published: 2024

29. Determining of 18 amino acids in plasma of pregnant women with sleep disorders by UHPLC-MS/MS

Author: Chen, Jindong, Qiu, Yifan, Guo, Jing, Shan, Ligang, Chen, Guangxue, Wang, Fan, and Wang, Wenyan
Published: 2024

30. Thermo-thickening/amphoteric polymer nanocomposite incorporating vinyl-functionalized nano-silica as a viscosifier for high-salt and ultra-high temperature water-based drilling fluids

Author: Wandji Djouonkep, Lesly Dasilva, Xie, Binqiang, Tao, Huaizhi, Chen, Jindong, Zhuo, Lvyan, Songwe Selabi, Naomie Beolle, Tchameni, Alain Pierre, and Zhao, Lin
Published: 2024

31. Increased circulating cell-free mitochondrial DNA in plasma of first-diagnosed drug-naïve bipolar disorder patients: A case-control and 4-week follow-up study

Author: Teng, Ziwei, Xu, Xuelei, Chen, Xiaoqin, Qiu, Yan, Li, Sujuan, Chen, Jindong, Tang, Hui, Xiang, Hui, Wang, Bolun, Tan, Yuxi, and Wu, Haishan
Published: 2024

32. Safety and effectiveness of oral medium to high dose blonanserin in patients with schizophrenia: subgroup analysis from a prospective, multicenter, post-marketing surveillance study in mainland China

Author: Yang, Yuan, Ge, Hongmin, Wang, Xijin, Liu, Xuejun, Li, Keqing, Wang, Gang, Yang, Xiaodong, Deng, Huaili, Sun, Meijuan, Zhang, Ruiling, Chen, Jindong, Cai, Duanfang, Sang, Hong, Liu, Xianglai, Zhan, Guilai, Zhao, Guijun, Li, Haiyun, and Xun, Zhiyuan
Published: 2023

33. Risk factors for osteoporosis in chronic schizophrenia on long-term treatment with antipsychotics: a cross-sectional study

Author: Liu, Furu, Wu, Ying, Chen, Jindong, Zou, Tianxiang, Qin, Yue, Teng, Ziwei, Su, Yuhan, Wu, Renrong, and Huang, Jing
Published: 2023

34. A phase 3, multicenter, double-blind, randomized, placebo-controlled clinical trial to verify the efficacy and safety of ansofaxine (LY03005) for major depressive disorder

Author: Mi, Weifeng, Di, Xiaolan, Wang, Yiming, Li, Huafang, Xu, Xiufeng, Li, Lehua, Wang, Huaning, Wang, Guoqiang, Zhang, Kerang, Tian, Feng, Luo, Jiong, Yang, Chanjuan, Zhou, Yunfei, Xie, Shiping, Zhong, Hua, Wu, Bin, Yang, Dong, Chen, Zhenhua, Li, Yi, Chen, Jindong, Lv, Shuyun, Yi, Qizhong, Jiang, Zhiwei, Tian, Jingwei, and Zhang, Hongyan
Published: 2023

35. Effectiveness and safety of blonanserin in young and middle-aged female patients with schizophrenia: data from a post-marketing surveillance

Author: Bo, Qijing, Wang, Xijin, Liu, Xuejun, Sang, Hong, Xun, Zhiyuan, Zhang, Ruiling, Yang, Xiaodong, Deng, Huaili, Li, Keqing, Chen, Jindong, Sun, Meijuan, Zhao, Guijun, Liu, Xianglai, Cai, Duanfang, Zhan, Guilai, Li, Juhong, Li, Haiyun, and Wang, Gang
Published: 2023

36. UIBert: Learning Generic Multimodal Representations for UI Understanding

Author: Bai, Chongyang, Zang, Xiaoxue, Xu, Ying, Sunkara, Srinivas, Rastogi, Abhinav, Chen, Jindong, and Arcas, Blaise Aguera y
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: To improve the accessibility of smart devices and to simplify their usage, building models which understand user interfaces (UIs) and assist users to complete their tasks is critical. However, unique challenges are posed by UI-specific characteristics, such as how to effectively leverage multimodal UI features that involve image, text, and structural metadata and how to achieve good performance when high-quality labeled data is unavailable. To address such challenges we introduce UIBert, a transformer-based joint image-text model trained through novel pre-training tasks on large-scale unlabeled UI data to learn generic feature representations for a UI and its components. Our key intuition is that the heterogeneous features in a UI are self-aligned, i.e., the image and text features of UI components are predictive of each other. We propose five pretraining tasks utilizing this self-alignment among different features of a UI component and across various components in the same UI. We evaluate our method on nine real-world downstream UI tasks where UIBert outperforms strong multimodal baselines by up to 9.26% accuracy.
Comment: 8 pages, IJCAI 2021
Published: 2021

37. Multimodal Icon Annotation For Mobile Applications

Author: Zang, Xiaoxue, Xu, Ying, and Chen, Jindong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction
Abstract: Annotating user interfaces (UIs), which involves localization and classification of meaningful UI elements on a screen, is a critical step for many mobile applications such as screen readers and voice control of devices. Annotating object icons, such as menu, search, and arrow backward, is especially challenging due to the lack of explicit labels on screens, their similarity to pictures, and their diverse shapes. Existing studies either use view hierarchy or pixel based methods to tackle the task. Pixel based approaches are more popular as view hierarchy features on mobile platforms are often incomplete or inaccurate; however, they leave out instructional information in the view hierarchy such as resource-ids or content descriptions. We propose a novel deep learning based multi-modal approach that combines the benefits of both pixel and view hierarchy features as well as leverages the state-of-the-art object detection techniques. In order to demonstrate the utility provided, we create a high quality UI dataset by manually annotating the most commonly used 29 icons in Rico, a large scale mobile design dataset consisting of 72k UI screenshots. The experimental results indicate the effectiveness of our multi-modal approach. Our model not only outperforms a widely used object classification baseline but also pixel based object detection models. Our study sheds light on how to combine view hierarchy with pixel features for annotating UI elements.
Comment: 11 pages, MobileHCI 2021
Published: 2021

38. PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling

Author: Zang, Xiaoxue, Liu, Lijuan, Wang, Maria, Song, Yang, Zhang, Hao, and Chen, Jindong
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: We present a new human-human dialogue dataset - PhotoChat, the first dataset that casts light on the photo sharing behavior in online messaging. PhotoChat contains 12k dialogues, each of which is paired with a user photo that is shared during the conversation. Based on this dataset, we propose two tasks to facilitate research on image-text modeling: a photo-sharing intent prediction task that predicts whether one intends to share a photo in the next conversation turn, and a photo retrieval task that retrieves the most relevant photo according to the dialogue context. In addition, for both tasks, we provide baseline models using the state-of-the-art models and report their benchmark performances. The best image retrieval model achieves 10.4% recall@1 (out of 1000 candidates) and the best photo intent prediction model achieves 58.1% F1 score, indicating that the dataset presents interesting yet challenging real-world problems. We are releasing PhotoChat to facilitate future research work among the community.
Published: 2021

39. Enhanced amphoteric polymer filtration reducer with vinyl-functionalized nanosilica for high-salt and ultra-high temperature water-based drilling environments

Author: Wandji Djouonkep, Lesly Dasilva, Xie, Binqiang, Tao, Huaizhi, Chen, Jindong, Zhuo, Lvyan, Selabi, Naomie Beolle Songwe, and Zhao, Lin
Published: 2024
Full Text: View/download PDF

40. Low-carbon environment-friendly economic optimal scheduling of multi-energy microgrid with integrated demand response considering waste heat utilization

Author: Chen, Haipeng, Yang, Shuoshi, Chen, Jindong, Wang, Xingyu, Li, Yang, Shui, Siyuan, and Yu, Hao
Published: 2024
Full Text: View/download PDF

41. Omega-3 polyunsaturated fatty acids supplementation improves memory in first-diagnosed, drug-naïve patients with depression: Secondary analysis of data from a randomized controlled trial

Author: Xue, Ying, Wang, Lu, Liu, Ting, Zhao, Tingyu, Xie, Kaiqiang, Guo, Jimin, Chen, Jindong, Tang, Hui, and Tang, Mimi
Published: 2024
Full Text: View/download PDF

42. Osteoporosis and low bone mass among schizophrenia and bipolar disorder: A cross-sectional study with newly diagnosed, drug-naïve subjects

Author: Li, Sujuan, Chen, Xiaoqin, Qiu, Yan, Teng, Ziwei, Xu, Xuelei, Tang, Hui, Xiang, Hui, Wang, Bolun, Chen, Jindong, Yuan, Hui, and Wu, Haishan
Published: 2024
Full Text: View/download PDF

43. ActionBert: Leveraging User Actions for Semantic Understanding of User Interfaces

Author: He, Zecheng, Sunkara, Srinivas, Zang, Xiaoxue, Xu, Ying, Liu, Lijuan, Wichers, Nevan, Schubiner, Gabriel, Lee, Ruby, Chen, Jindong, and Arcas, Blaise Agüera y
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: As mobile devices are becoming ubiquitous, regularly interacting with a variety of user interfaces (UIs) is a common aspect of daily life for many people. To improve the accessibility of these devices and to enable their usage in a variety of settings, building models that can assist users and accomplish tasks through the UI is vitally important. However, there are several challenges in achieving this. First, UI components of similar appearance can have different functionalities, making understanding their function more important than just analyzing their appearance. Second, domain-specific features like Document Object Model (DOM) in web pages and View Hierarchy (VH) in mobile applications provide important signals about the semantics of UI elements, but these features are not in a natural language format. Third, owing to a large diversity in UIs and absence of standard DOM or VH representations, building a UI understanding model with high coverage requires large amounts of training data. Inspired by the success of pre-training based approaches in NLP for tackling a variety of problems in a data-efficient way, we introduce a new pre-trained UI representation model called ActionBert. Our methodology is designed to leverage visual, linguistic and domain-specific features in user interaction traces to pre-train generic feature representations of UIs and their components. Our key intuition is that user actions, e.g., a sequence of clicks on different UI components, reveal important information about their functionality. We evaluate the proposed model on a wide variety of downstream tasks, ranging from icon classification to UI component retrieval based on its natural language description. Experiments show that the proposed ActionBert model outperforms multi-modal baselines across all downstream tasks by up to 15.5%. Comment: Accepted to AAAI Conference on Artificial Intelligence (AAAI-21)
Published: 2020

44. A Novel Approach for Fake Review Detection Based on Reviewing Behavior and BERT Fused with Cosine Similarity

Author: Wang, Junren, Chen, Jindong, and Zhang, Wen; Editorial Board Members: Filipe, Joaquim, Ghosh, Ashish, Prates, Raquel Oliveira, and Zhou, Lizhu; Editors: Chen, Jian, Huynh, Van-Nam, Tang, Xijin, and Wu, Jiangning
Published: 2023
Full Text: View/download PDF

45. End-to-End Aspect-Based Sentiment Analysis Based on IDCNN-BLSA Feature Fusion

Author: Liu, Xinyuan, Chen, Jindong, and Zhang, Wen; Editorial Board Members: Filipe, Joaquim, Ghosh, Ashish, Prates, Raquel Oliveira, and Zhou, Lizhu; Editors: Chen, Jian, Huynh, Van-Nam, Tang, Xijin, and Wu, Jiangning
Published: 2023
Full Text: View/download PDF

46. Perspective Chapter: An Update on Renal Cell Carcinoma

Author: Chen, Jindong, primary
Published: 2023
Full Text: View/download PDF

47. MultiWOZ 2.2: A Dialogue Dataset with Additional Annotation Corrections and State Tracking Baselines

Author: Zang, Xiaoxue, Rastogi, Abhinav, Sunkara, Srinivas, Gupta, Raghav, Zhang, Jianguo, and Chen, Jindong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: MultiWOZ is a well-known task-oriented dialogue dataset containing over 10,000 annotated dialogues spanning 8 domains. It is extensively used as a benchmark for dialogue state tracking. However, recent works have reported the presence of substantial noise in the dialogue state annotations. MultiWOZ 2.1 identified and fixed many of these erroneous annotations and user utterances, resulting in an improved version of this dataset. This work introduces MultiWOZ 2.2, which is yet another improved version of this dataset. Firstly, we identify and fix dialogue state annotation errors across 17.3% of the utterances on top of MultiWOZ 2.1. Secondly, we redefine the ontology by disallowing vocabularies of slots with a large number of possible values (e.g., restaurant name, time of booking). In addition, we introduce slot span annotations for these slots to standardize them across recent models, which previously used custom string matching heuristics to generate them. We also benchmark a few state-of-the-art dialogue state tracking models on the corrected dataset to facilitate comparison for future work. In the end, we discuss best practices for dialogue data collection that can help avoid annotation errors.
Published: 2020

48. Prevalence and characteristics of polycystic ovarian syndrome in patients with bipolar disorder

Author: Liu, Jieyu, Teng, Ziwei, Xie, Haiqing, Yuan, Hui, Liu, Minghui, Chen, Jindong, Tang, Hui, Xiang, Hui, Wu, Haishan, and Huang, Jing
Published: 2023
Full Text: View/download PDF

49. Safety and effectiveness of oral blonanserin for schizophrenia in adolescents in China: Data from a 12-week post-marketing surveillance

Author: Yu, Yun, Wang, Xijin, Zhang, Ruiling, Li, Keqing, Wang, Gang, Chen, Jindong, Sun, Meijuan, Xun, Zhiyuan, Yang, Xiaodong, Zhao, Guijun, Li, Haiyun, and Liu, Xuejun
Published: 2023
Full Text: View/download PDF

50. Learning Question-Guided Video Representation for Multi-Turn Video Question Answering

Author: Chao, Guan-Lin, Rastogi, Abhinav, Yavuz, Semih, Hakkani-Tür, Dilek, Chen, Jindong, and Lane, Ian
Subjects: Computer Science - Computation and Language
Abstract: Understanding and conversing about dynamic scenes is one of the key capabilities of AI agents that navigate the environment and convey useful information to humans. Video question answering is a specific scenario of such AI-human interaction where an agent generates a natural language response to a question regarding the video of a dynamic scene. Incorporating features from multiple modalities, which often provide supplementary information, is one of the challenging aspects of video question answering. Furthermore, a question often concerns only a small segment of the video, hence encoding the entire video sequence using a recurrent neural network is not computationally efficient. Our proposed question-guided video representation module efficiently generates the token-level video summary guided by each word in the question. The learned representations are then fused with the question to generate the answer. Through empirical evaluation on the Audio Visual Scene-aware Dialog (AVSD) dataset, our proposed models in single-turn and multi-turn question answering achieve state-of-the-art performance on several automatic natural language generation evaluation metrics. Comment: Accepted at SIGDIAL 2019
Published: 2019
