Author: "Zhou, Ben" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhou, Ben"' showing total 640 results

Start Over Author "Zhou, Ben"

640 results on '"Zhou, Ben"'

1. ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning

Author: Yu, Xiaodong, Zhou, Ben, Cheng, Hao, and Roth, Dan
Subjects: Computer Science - Artificial Intelligence
Abstract: Existing math datasets evaluate the reasoning abilities of large language models (LLMs) by either using the final answer or the intermediate reasoning steps derived from static examples. However, the former approach fails to surface model's uses of shortcuts and wrong reasoning while the later poses challenges in accommodating alternative solutions. In this work, we seek to use symbolic programs as a means for automated evaluation if a model can consistently produce correct final answers across various inputs to the program. We begin by extracting programs for popular math datasets (GSM8K and MATH) using GPT4-o. For those executable programs verified using the original input-output pairs, they are found to encapsulate the proper reasoning required to solve the original text questions. We then prompt GPT4-o to generate new questions using alternative input-output pairs based the extracted program. We apply the resulting datasets to evaluate a collection of LLMs. In our experiments, we observe significant accuracy drops using our proposed evaluation compared with original static examples, suggesting the fragility of math reasoning in state-of-the-art LLMs.
Published: 2024

2. ToW: Thoughts of Words Improve Reasoning in Large Language Models

Author: Xu, Zhikun, Shen, Ming, Dineen, Jacob, Li, Zhaonan, Ye, Xiao, Lu, Shijie, RRV, Aswin, Baral, Chitta, and Zhou, Ben
Subjects: Computer Science - Computation and Language
Abstract: We introduce thoughts of words (ToW), a novel training-time data-augmentation method for next-word prediction. ToW views next-word prediction as a core reasoning task and injects fine-grained thoughts explaining what the next word should be and how it is related to the previous contexts in pre-training texts. Our formulation addresses two fundamental drawbacks of existing next-word prediction learning schemes: they induce factual hallucination and are inefficient for models to learn the implicit reasoning processes in raw texts. While there are many ways to acquire such thoughts of words, we explore the first step of acquiring ToW annotations through distilling from larger models. After continual pre-training with only 70K ToW annotations, we effectively improve models' reasoning performances by 7% to 9% on average and reduce model hallucination by up to 10%. At the same time, ToW is entirely agnostic to tasks and applications, introducing no additional biases on labels or semantics.
Published: 2024

3. Familiarity-aware Evidence Compression for Retrieval Augmented Generation

Author: Jung, Dongwon, Liu, Qin, Huang, Tenghao, Zhou, Ben, and Chen, Muhao
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: Retrieval Augmented Generation (RAG) improves large language models (LMs) by incorporating non-parametric knowledge through evidence retrieval from external sources. However, it often struggles to filter out inconsistent and irrelevant information that can distract the LM from its tasks. While compressing the retrieved evidence with a compression model aims to address this issue, the compressed evidence may still be unfamiliar to the target model used for downstream task, potentially failing to utilize the evidence effectively. We propose FaviComp (Familiarity-aware Evidence Compression), a novel training-free evidence compression technique that makes retrieved evidence more familiar to the target model, while seamlessly integrating parametric knowledge from the model. Specifically, FaviComp proactively lowers the perplexity of the compressed evidence with regard to the target model by combining token probabilities from both the compression model and the target model to generate context that is more familiar to the target model. This approach balances the integration of parametric and non-parametric knowledge, which is especially helpful in complex tasks where the retrieved evidence set may not contain all the necessary information. Experimental results demonstrate that FaviComp consistently outperforms existing baselines in multiple open-domain QA datasets, achieving high compression rates and showcasing the effective integration of both parametric and non-parametric knowledge.
Published: 2024

4. FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation

Author: Li, Bangzheng, Zhou, Ben, Fu, Xingyu, Wang, Fei, Roth, Dan, and Chen, Muhao
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Language models have shown impressive in-context-learning capabilities, which allow them to benefit from input prompts and perform better on downstream end tasks. Existing works investigate the mechanisms behind this observation, and propose label-agnostic prompt metrics that can better estimate end-task performances. One popular approach is using perplexity as a way to measure models' familiarity with the prompt. While showing consistent improvements on in-domain tasks, we found that familiarity metrics such as perplexity cannot accurately estimate performance in complicated situations such as task or domain transferring scenarios. In this work, we propose a revised measure called FamiCom, providing a more comprehensive measure for task-agnostic performance estimation. Specifically, FamiCom combines familiarity with \textit{complexity} -- the inherent difficulty of end tasks, which is an important factor missing from current metrics. Experiments show that FamiCom strongly correlates with end-task performances, producing a 0.85 Spearman's correlation, versus 0.43 of familiarity-only ones'. We further apply FamiCom to automatic prompt and demonstration selection, and outperform existing methods and baselines by more than 7.0% in accuracy.
Published: 2024

5. BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Author: Feng, Yu, Zhou, Ben, Lin, Weidong, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: Predictive models often need to work with incomplete information in real-world tasks. Consequently, they must provide reliable probability or confidence estimation, especially in large-scale decision making and planning tasks. Current large language models (LLM) are insufficient for such accurate estimations, but they can generate relevant factors that may affect the probabilities, produce coarse-grained probabilities when the information is more complete, and help determine which factors are relevant to specific downstream contexts. In this paper, we make use of these capabilities of LLMs to provide a significantly more accurate probabilistic estimation. We propose BIRD, a novel probabilistic inference framework that aligns a Bayesian network with LLM abductions and then estimates more accurate probabilities in a deduction step. We show BIRD provides reliable probability estimations that are 30\% better than those provided directly by LLM baselines. These estimates can further contribute to better and more trustworthy decision-making.
Published: 2024

6. Conceptual and Unbiased Reasoning in Language Models

Author: Zhou, Ben, Zhang, Hongming, Chen, Sihao, Yu, Dian, Wang, Hongwei, Peng, Baolin, Roth, Dan, and Yu, Dong
Subjects: Computer Science - Computation and Language
Abstract: Conceptual reasoning, the ability to reason in abstract and high-level perspectives, is key to generalization in human cognition. However, limited study has been done on large language models' capability to perform conceptual reasoning. In this work, we bridge this gap and propose a novel conceptualization framework that forces models to perform conceptual reasoning on abstract questions and generate solutions in a verifiable symbolic space. Using this framework as an analytical tool, we show that existing large language models fall short on conceptual reasoning, dropping 9% to 28% on various benchmarks compared to direct inference methods. We then discuss how models can improve since high-level abstract reasoning is key to unbiased and generalizable decision-making. We propose two techniques to add trustworthy induction signals by generating familiar questions with similar underlying reasoning paths and asking models to perform self-refinement. Experiments show that our proposed techniques improve models' conceptual reasoning performance by 8% to 11%, achieving a more robust reasoning system that relies less on inductive biases., Comment: Preprint under review
Published: 2024

7. Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking

Author: Xu, Nan, Wang, Fei, Zhou, Ben, Li, Bang Zheng, Xiao, Chaowei, and Chen, Muhao
Subjects: Computer Science - Computation and Language
Abstract: While large language models (LLMs) have demonstrated increasing power, they have also given rise to a wide range of harmful behaviors. As representatives, jailbreak attacks can provoke harmful or unethical responses from LLMs, even after safety alignment. In this paper, we investigate a novel category of jailbreak attacks specifically designed to target the cognitive structure and processes of LLMs. Specifically, we analyze the safety vulnerability of LLMs in the face of (1) multilingual cognitive overload, (2) veiled expression, and (3) effect-to-cause reasoning. Different from previous jailbreak attacks, our proposed cognitive overload is a black-box attack with no need for knowledge of model architecture or access to model weights. Experiments conducted on AdvBench and MasterKey reveal that various LLMs, including both popular open-source model Llama 2 and the proprietary model ChatGPT, can be compromised through cognitive overload. Motivated by cognitive psychology work on managing cognitive load, we further investigate defending cognitive overload attack from two perspectives. Empirical studies show that our cognitive overload from three perspectives can jailbreak all studied LLMs successfully, while existing defense strategies can hardly mitigate the caused malicious uses effectively.
Published: 2023

8. Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?

Author: Li, Bangzheng, Zhou, Ben, Wang, Fei, Fu, Xingyu, Roth, Dan, and Chen, Muhao
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Despite the recent advancement in large language models (LLMs) and their high performances across numerous benchmarks, recent research has unveiled that LLMs suffer from hallucinations and unfaithful reasoning. This work studies a specific type of hallucination induced by semantic associations. Specifically, we investigate to what extent LLMs take shortcuts from certain keyword/entity biases in the prompt instead of following the correct reasoning path. To quantify this phenomenon, we propose a novel probing method and benchmark called EureQA. We start from questions that LLMs will answer correctly with utmost certainty, and mask the important entity with evidence sentence recursively, asking models to find masked entities according to a chain of evidence before answering the question. During the construction of the evidence, we purposefully replace semantic clues (entities) that may lead to the correct answer with distractor clues (evidence) that will not directly lead to the correct answer but require a chain-like reasoning process. We evaluate if models can follow the correct reasoning chain instead of short-cutting through distractor clues. We find that existing LLMs lack the necessary capabilities to follow correct reasoning paths and resist the attempt of greedy shortcuts. We show that the distractor semantic associations often lead to model hallucination, which is strong evidence that questions the validity of current LLM reasoning., Comment: Work accepted by NAACL 2024
Published: 2023

9. Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations

Author: Chen, Sihao, Zhang, Hongming, Chen, Tong, Zhou, Ben, Yu, Wenhao, Yu, Dian, Peng, Baolin, Wang, Hongwei, Roth, Dan, and Yu, Dong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We introduce sub-sentence encoder, a contrastively-learned contextual embedding model for fine-grained semantic representation of text. In contrast to the standard practice with sentence embeddings, where the meaning of an entire sequence of text is encoded into a fixed-length vector, the sub-sentence encoder learns to produce distinct contextual embeddings corresponding to different atomic propositions, i.e. atomic units of meaning expressed within a text sequence. The sub-sentence embeddings are contrastively learned to recognize (inferred) semantic equivalence between propositions across different text sequences. Our experiments show the effectiveness of sub-sentence encoders in applications, such as retrieving supporting facts for fine-grained text attribution or recognizing the conditional semantic similarity between texts. In practice, we demonstrate that sub-sentence encoders keep the same level of inference cost and space complexity compared to sentence encoders.
Published: 2023

10. Building Interpretable and Reliable Open Information Retriever for New Domains Overnight

Author: Yu, Xiaodong, Zhou, Ben, and Roth, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval
Abstract: Information retrieval (IR) or knowledge retrieval, is a critical component for many down-stream tasks such as open-domain question answering (QA). It is also very challenging, as it requires succinctness, completeness, and correctness. In recent works, dense retrieval models have achieved state-of-the-art (SOTA) performance on in-domain IR and QA benchmarks by representing queries and knowledge passages with dense vectors and learning the lexical and semantic similarity. However, using single dense vectors and end-to-end supervision are not always optimal because queries may require attention to multiple aspects and event implicit knowledge. In this work, we propose an information retrieval pipeline that uses entity/event linking model and query decomposition model to focus more accurately on different information units of the query. We show that, while being more interpretable and reliable, our proposed pipeline significantly improves passage coverages and denotation accuracies across five IR and QA benchmarks. It will be the go-to system to use for applications that need to perform IR on a new domain without much dedicated effort, because of its superior interpretability and cross-domain performance., Comment: Submission of ACL 2023. Rejected
Published: 2023

11. Non-alcoholic fatty liver disease and gestational diabetes mellitus: a bidirectional two-sample mendelian randomization study

Author: Zhou, Ben-Gang, Xia, Jian-Lei, Jiang, Xin, Ding, Yan-Bing, and She, Qiang
Published: 2024
Full Text: View/download PDF

12. Dynamic Clue Bottlenecks: Towards Interpretable-by-Design Visual Question Answering

Author: Fu, Xingyu, Zhou, Ben, Chen, Sihao, Yatskar, Mark, and Roth, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advances in multimodal large language models (LLMs) have shown extreme effectiveness in visual question answering (VQA). However, the design nature of these end-to-end models prevents them from being interpretable to humans, undermining trust and applicability in critical domains. While post-hoc rationales offer certain insight into understanding model behavior, these explanations are not guaranteed to be faithful to the model. In this paper, we address these shortcomings by introducing an interpretable by design model that factors model decisions into intermediate human-legible explanations, and allows people to easily understand why a model fails or succeeds. We propose the Dynamic Clue Bottleneck Model ( (DCLUB), a method that is designed towards an inherently interpretable VQA system. DCLUB provides an explainable intermediate space before the VQA decision and is faithful from the beginning, while maintaining comparable performance to black-box systems. Given a question, DCLUB first returns a set of visual clues: natural language statements of visually salient evidence from the image, and then generates the output based solely on the visual clues. To supervise and evaluate the generation of VQA explanations within DCLUB, we collect a dataset of 1.7k reasoning-focused questions with visual clues. Evaluations show that our inherently interpretable system can improve 4.64% over a comparable black-box system in reasoning-focused questions while preserving 99.43% of performance on VQA-v2., Comment: Multimodal, Visual Question Answering, Vision and Language
Published: 2023

13. Generic Temporal Reasoning with Differential Analysis and Explanation

Author: Feng, Yu, Zhou, Ben, Wang, Haoyu, Jin, Helen, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: Temporal reasoning is the task of predicting temporal relations of event pairs. While temporal reasoning models can perform reasonably well on in-domain benchmarks, we have little idea of these systems' generalizability due to existing datasets' limitations. In this work, we introduce a novel task named TODAY that bridges this gap with temporal differential analysis, which as the name suggests, evaluates whether systems can correctly understand the effect of incremental changes. Specifically, TODAY introduces slight contextual changes for given event pairs, and systems are asked to tell how this subtle contextual change would affect relevant temporal relation distributions. To facilitate learning, TODAY also annotates human explanations. We show that existing models, including GPT-3.5, drop to random guessing on TODAY, suggesting that they heavily rely on spurious information rather than proper reasoning for temporal predictions. On the other hand, we show that TODAY's supervision style and explanation annotations can be used in joint learning, encouraging models to use more appropriate signals during training and thus outperform across several benchmarks. TODAY can also be used to train models to solicit incidental supervision from noisy sources such as GPT-3.5, thus moving us more toward the goal of generic temporal reasoning systems.
Published: 2022

14. Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts

Author: Zhou, Ben, Richardson, Kyle, Yu, Xiaodong, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: Explicit decomposition modeling, which involves breaking down complex tasks into more straightforward and often more interpretable sub-tasks, has long been a central theme in developing robust and interpretable NLU systems. However, despite the many datasets and resources built as part of this effort, the majority have small-scale annotations and limited scope, which is insufficient to solve general decomposition tasks. In this paper, we look at large-scale intermediate pre-training of decomposition-based transformers using distant supervision from comparable texts, particularly large-scale parallel news. We show that with such intermediate pre-training, developing robust decomposition-based models for a diverse range of tasks becomes more feasible. For example, on semantic parsing, our model, DecompT5, improves 20% to 30% on two datasets, Overnight and TORQUE, over the baseline language model. We further use DecompT5 to build a novel decomposition-based QA system named DecompEntail, improving over state-of-the-art models, including GPT-3, on both HotpotQA and StrategyQA by 8% and 4%, respectively., Comment: Accepted at EMNLP 2022
Published: 2022

15. Cross-Lingual Speaker Identification Using Distant Supervision

Author: Zhou, Ben, Yu, Dian, Yu, Dong, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: Speaker identification, determining which character said each utterance in literary text, benefits many downstream tasks. Most existing approaches use expert-defined rules or rule-based features to directly approach this task, but these approaches come with significant drawbacks, such as lack of contextual reasoning and poor cross-lingual generalization. In this work, we propose a speaker identification framework that addresses these issues. We first extract large-scale distant supervision signals in English via general-purpose tools and heuristics, and then apply these weakly-labeled instances with a focus on encouraging contextual reasoning to train a cross-lingual language model. We show that the resulting model outperforms previous state-of-the-art methods on two English speaker identification benchmarks by up to 9% in accuracy and 5% with only distant supervision, as well as two Chinese speaker identification datasets by up to 4.7%.
Published: 2022

16. There is a Time and Place for Reasoning Beyond the Image

Author: Fu, Xingyu, Zhou, Ben, Chandratreya, Ishaan Preetam, Vondrick, Carl, and Roth, Dan
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Images are often more significant than only the pixels to human eyes, as we can infer, associate, and reason with contextual information from other sources to establish a more complete picture. For example, in Figure 1, we can find a way to identify the news articles related to the picture through segment-wise understandings of the signs, the buildings, the crowds, and more. This reasoning could provide the time and place the image was taken, which will help us in subsequent tasks, such as automatic storyline construction, correction of image source in intended effect photographs, and upper-stream processing such as image clustering for certain location or time. In this work, we formulate this problem and introduce TARA: a dataset with 16k images with their associated news, time, and location, automatically extracted from New York Times, and an additional 61k examples as distant supervision from WIT. On top of the extractions, we present a crowdsourced subset in which we believe it is possible to find the images' spatio-temporal information for evaluation purpose. We show that there exists a $70\%$ gap between a state-of-the-art joint model and human performance, which is slightly filled by our proposed model that uses segment-wise reasoning, motivating higher-level vision-language joint models that can conduct open-ended reasoning with world knowledge. The data and code are publicly available at https://github.com/zeyofu/TARA., Comment: Article accepted to the ACL 2022 Main conference
Published: 2022

17. Gut microbiota-brain bile acid axis orchestrates aging-related neuroinflammation and behavior impairment in mice

Author: Ma, Junli, Li, Mingxiao, Bao, Yiyang, Huang, Wenjin, He, Xiaofang, Hong, Ying, Wei, Wenjing, Liu, Zekun, Gao, Xinxin, Yang, Yang, Cui, Zhengyu, Wang, Wantao, Wang, Jie, Zhu, Weize, Zheng, Ningning, Pan, Lingyun, Wang, Deheng, Ke, Zunji, Zhou, Ben, Sheng, Lili, and Li, Houkai
Published: 2024
Full Text: View/download PDF

18. Efficient transformation of levulinic acid/esters to γ-valerolactone via a durable catalyst with simply tunable acid-base sites

Author: Zhang, Liangqing, Zhou, Ben, Hong, Yonglin, Wu, Qiaomei, Qiu, Jiarong, Chen, Jianfeng, and Zeng, Xianhai
Published: 2024
Full Text: View/download PDF

19. One-pot cascade process for efficient upgrading of furfural to γ-valerolactone over adjustable Lewis-Brønsted bi-acidic catalyst

Author: Qiu, Jiarong, Liu, Yi, Zhang, Jingwen, Zhou, Ben, Yang, Qiyue, Zhang, Liangqing, Chen, Jianfeng, Wang, Bingshu, Lin, Lu, and Zeng, Xianhai
Published: 2024
Full Text: View/download PDF

20. Association between inflammatory bowel disease and risk of incident prostate cancer: a systematic review and meta-analysis of cohort studies

Author: Zhou, Ben‐Gang, Yu, Qi, Jiang, Xin, Mei, Yu-Zhou, Ding, Yan-Bing, and Wang, Mei
Published: 2023
Full Text: View/download PDF

21. Acute liver steatosis translationally controls the epigenetic regulator MIER1 to promote liver regeneration in a study with male mice

Author: Chen, Yanhao, Chen, Lanlan, Wu, Xiaoshan, Zhao, Yongxu, Wang, Yuchen, Jiang, Dacheng, Liu, Xiaojian, Zhou, Tingting, Li, Shuang, Wei, Yuda, Liu, Yan, Hu, Cheng, Zhou, Ben, Qin, Jun, Ying, Hao, and Ding, Qiurong
Published: 2023
Full Text: View/download PDF

22. Efficient catalytic transfer hydrogenation of furfural and other biomass-derived compounds over sustainable magnetic catalyst

Author: Qiu, Jiarong, Zhou, Ben, Yang, Qiyue, Liu, Yi, Zhang, Liangqing, Wang, Bingshu, Song, Shunming, Zhang, Jingwen, Huang, Suchang, Chen, Jianfeng, Lin, Lu, and Zeng, Xianhai
Published: 2024
Full Text: View/download PDF

23. Temporal Reasoning on Implicit Events from Distant Supervision

Author: Zhou, Ben, Richardson, Kyle, Ning, Qiang, Khot, Tushar, Sabharwal, Ashish, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which systems understand implicit events -- events that are not mentioned explicitly in natural language text but can be inferred from it. This introduces a new challenge in temporal reasoning research, where prior work has focused on explicitly mentioned events. Human readers can infer implicit events via commonsense reasoning, resulting in a more comprehensive understanding of the situation and, consequently, better reasoning about time. We find, however, that state-of-the-art models struggle when predicting temporal relationships between implicit and explicit events. To address this, we propose a neuro-symbolic temporal reasoning model, SYMTIME, which exploits distant supervision signals from large-scale text and uses temporal rules to combine start times and durations to infer end times. SYMTIME outperforms strong baseline systems on TRACIE by 5%, and by 11% in a zero prior knowledge training setting. Our approach also generalizes to other temporal reasoning tasks, as evidenced by a gain of 1%-9% on MATRES, an explicit event benchmark., Comment: Accepted at NAACL 2021
Published: 2020

24. Temporal Common Sense Acquisition with Minimal Supervision

Author: Zhou, Ben, Ning, Qiang, Khashabi, Daniel, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: Temporal common sense (e.g., duration and frequency of events) is crucial for understanding natural language. However, its acquisition is challenging, partly because such information is often not expressed explicitly in text, and human annotation on such concepts is costly. This work proposes a novel sequence modeling approach that exploits explicit and implicit mentions of temporal common sense, extracted from a large corpus, to build TACOLM, a temporal common sense language model. Our method is shown to give quality predictions of various dimensions of temporal common sense (on UDST and a newly collected dataset from RealNews). It also produces representations of events for relevant tasks such as duration comparison, parent-child relations, event coreference and temporal QA (on TimeBank, HiEVE and MCTACO) that are better than using the standard BERT. Thus, it will be an important component of temporal NLP., Comment: Accepted by ACL 2020
Published: 2020

25. Cross-lingual Entity Alignment with Incidental Supervision

Author: Chen, Muhao, Shi, Weijia, Zhou, Ben, and Roth, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval
Abstract: Much research effort has been put to multilingual knowledge graph (KG) embedding methods to address the entity alignment task, which seeks to match entities in different languagespecific KGs that refer to the same real-world object. Such methods are often hindered by the insufficiency of seed alignment provided between KGs. Therefore, we propose an incidentally supervised model, JEANS , which jointly represents multilingual KGs and text corpora in a shared embedding scheme, and seeks to improve entity alignment with incidental supervision signals from text. JEANS first deploys an entity grounding process to combine each KG with the monolingual text corpus. Then, two learning processes are conducted: (i) an embedding learning process to encode the KG and text of each language in one embedding space, and (ii) a selflearning based alignment learning process to iteratively induce the matching of entities and that of lexemes between embeddings. Experiments on benchmark datasets show that JEANS leads to promising improvement on entity alignment with incidental supervision, and significantly outperforms state-of-the-art methods that solely rely on internal information of KGs., Comment: EACL 2021
Published: 2020

26. Evaluating Models' Local Decision Boundaries via Contrast Sets

Author: Gardner, Matt, Artzi, Yoav, Basmova, Victoria, Berant, Jonathan, Bogin, Ben, Chen, Sihao, Dasigi, Pradeep, Dua, Dheeru, Elazar, Yanai, Gottumukkala, Ananth, Gupta, Nitish, Hajishirzi, Hanna, Ilharco, Gabriel, Khashabi, Daniel, Lin, Kevin, Liu, Jiangming, Liu, Nelson F., Mulcaire, Phoebe, Ning, Qiang, Singh, Sameer, Smith, Noah A., Subramanian, Sanjay, Tsarfaty, Reut, Wallace, Eric, Zhang, Ally, and Zhou, Ben
Subjects: Computer Science - Computation and Language
Abstract: Standard test sets for supervised learning evaluate in-distribution generalization. Unfortunately, when a dataset has systematic gaps (e.g., annotation artifacts), these evaluations are misleading: a model can learn simple decision rules that perform well on the test set but do not capture a dataset's intended capabilities. We propose a new annotation paradigm for NLP that helps to close systematic gaps in the test data. In particular, after a dataset is constructed, we recommend that the dataset authors manually perturb the test instances in small but meaningful ways that (typically) change the gold label, creating contrast sets. Contrast sets provide a local view of a model's decision boundary, which can be used to more accurately evaluate a model's true linguistic capabilities. We demonstrate the efficacy of contrast sets by creating them for 10 diverse NLP datasets (e.g., DROP reading comprehension, UD parsing, IMDb sentiment analysis). Although our contrast sets are not explicitly adversarial, model performance is significantly lower on them than on the original test sets---up to 25\% in some cases. We release our contrast sets as new evaluation benchmarks and encourage future dataset construction efforts to follow similar annotation processes.
Published: 2020

27. Novel waterborne polymeric crosslinkers containing blocked isocyanate moieties and active epoxy groups for electrophoretic coatings

Author: Zhou, Ben, Cao, Yu-Ting, Lu, Shuai-Wei, and Wei, Jun
Published: 2023
Full Text: View/download PDF

28. Selective transformation of biomass-derived substrates to 1,2-butanediol: A comprehensive review and new insights

Author: Zhang, Liangqing, Huang, Suchang, Qiu, Jiarong, Wang, Bingshu, Yan, BeiBei, Zhang, Jingwen, Zhou, Ben, Chen, Jianfeng, and Zeng, Xianhai
Published: 2023
Full Text: View/download PDF

29. 'Going on a vacation' takes longer than 'Going for a walk': A Study of Temporal Commonsense Understanding

Author: Zhou, Ben, Khashabi, Daniel, Ning, Qiang, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: Understanding time is crucial for understanding events expressed in natural language. Because people rarely say the obvious, it is often necessary to have commonsense knowledge about various temporal aspects of events, such as duration, frequency, and temporal order. However, this important problem has so far received limited attention. This paper systematically studies this temporal commonsense problem. Specifically, we define five classes of temporal commonsense, and use crowdsourcing to develop a new dataset, MCTACO, that serves as a test set for this task. We find that the best current methods used on MCTACO are still far behind human performance, by about 20%, and discuss several directions for improvement. We hope that the new dataset and our study here can foster more future research on this topic., Comment: EMNLP 2019 (short paper). arXiv admin note: text overlap with arXiv:1908.04926
Published: 2019

30. Zero-Shot Open Entity Typing as Type-Compatible Grounding

Author: Zhou, Ben, Khashabi, Daniel, Tsai, Chen-Tse, and Roth, Dan
Subjects: Computer Science - Computation and Language
Abstract: The problem of entity-typing has been studied predominantly in supervised learning fashion, mostly with task-specific annotations (for coarse types) and sometimes with distant supervision (for fine types). While such approaches have strong performance within datasets, they often lack the flexibility to transfer across text genres and to generalize to new type taxonomies. In this work we propose a zero-shot entity typing approach that requires no annotated data and can flexibly identify newly defined types. Given a type taxonomy defined as Boolean functions of FREEBASE "types", we ground a given mention to a set of type-compatible Wikipedia entries and then infer the target mention's types using an inference algorithm that makes use of the types of these entries. We evaluate our system on a broad range of datasets, including standard fine-grained and coarse-grained entity typing datasets, and also a dataset in the biological domain. Our system is shown to be competitive with state-of-the-art supervised NER systems and outperforms them on out-of-domain datasets. We also show that our system significantly outperforms other zero-shot fine typing systems., Comment: 16 pages, 5 figures, Accepted at EMNLP 2018
Published: 2019

31. CogCompTime: A Tool for Understanding Time in Natural Language Text

Author: Ning, Qiang, Zhou, Ben, Feng, Zhili, Peng, Haoruo, and Roth, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval
Abstract: Automatic extraction of temporal information in text is an important component of natural language understanding. It involves two basic tasks: (1) Understanding time expressions that are mentioned explicitly in text (e.g., February 27, 1998 or tomorrow), and (2) Understanding temporal information that is conveyed implicitly via relations. In this paper, we introduce CogCompTime, a system that has these two important functionalities. It incorporates the most recent progress, achieves state-of-the-art performance, and is publicly available.1 We believe that this demo will be useful for multiple time-aware applications and provide valuable insight for future research in temporal understanding., Comment: Demo paper appeared in EMNLP'18. 6 pages and 3 figures
Published: 2019

32. Triphenyl borate as an effective film-modification additive for regulating the solid electrolyte interphase formed on graphite and silicon based anode

Author: Lu, Hai, Zeng, Fubao, Ma, Yitian, Liu, Meng, Zhou, Ben, Zhang, Zhiyun, Liu, Huan, and Yuan, Yan
Published: 2023
Full Text: View/download PDF

33. Gut microbiota remodeling improves natural aging-related disorders through Akkermansia muciniphila and its derived acetic acid

Author: Ma, Junli, Liu, Zekun, Gao, Xinxin, Bao, Yiyang, Hong, Ying, He, Xiaofang, Zhu, Weize, Li, Yan, Huang, Wenjin, Zheng, Ningning, Sheng, Lili, Zhou, Ben, Chen, Hongzhuan, and Li, Houkai
Published: 2023
Full Text: View/download PDF

34. Comparison of Ganglion Cell Layer and Ganglion Cell/Inner Plexiform Layer Measures for Detection of Early Glaucoma

Author: Mahmoudinezhad, Golnoush, Mohammadzadeh, Vahid, Martinyan, Jack, Edalati, Kiumars, Zhou, Ben, Yalzadeh, Dariush, Amini, Navid, Caprioli, Joseph, and Nouri-Mahdavi, Kouros
Published: 2023
Full Text: View/download PDF

35. MRG15 aggravates non-alcoholic steatohepatitis progression by regulating the mitochondrial proteolytic degradation of TUFM

Author: Tian, Cheng, Min, Xuewen, Zhao, Yongxu, Wang, Yuchen, Wu, Xiaoshan, Liu, Situn, Dou, Wei, Zhou, Tingting, Liu, Yan, Luo, Rongkui, Li, Zhigang, Lui, Kathy O., Li, Yu, Zhou, Ben, and Ding, Qiurong
Published: 2022
Full Text: View/download PDF

36. Minimal [formula omitted]-symmetric periodic solutions of general Hamiltonian systems and delay differential equations

Author: Zhou, Ben-Xing, Liu, Chungen, Zhou, Zhan, and Zhang, Xiaofei
Published: 2022
Full Text: View/download PDF

37. Single-cell sequencing unveils key contributions of immune cell populations in cancer-associated adipose wasting

Author: Han, Jun, Wang, Yuchen, Qiu, Yan, Sun, Diya, Liu, Yan, Li, Zhigang, Zhou, Ben, Zhang, Haibing, Xiao, Yichuan, Wu, Guohao, and Ding, Qiurong
Published: 2022
Full Text: View/download PDF

38. Association of MASLD with the risk of extrahepatic cancers: A systematic review and meta‐analysis of 18 cohort studies.

Author: Zhou, Ben‐Gang, Jiang, Xin, She, Qiang, and Ding, Yan‐Bing
Subjects: *SMALL intestine cancer, *FATTY liver, *BILE ducts, *GENITALIA, *BILIARY tract
Abstract: Background: Numerous recent studies have explored the association between metabolic dysfunction‐associated steatotic liver disease (MASLD) and the risk of various extrahepatic cancers. However, the conclusions were inconclusive. The aim of this study was to clarify this relationship by conducting a robust meta‐analysis. Methods: Systematic searches were conducted on PubMed, Embase and Web of Science databases to identify relevant cohort studies published prior to February 2024. Hazard ratios (HRs) and their corresponding 95% confidence intervals (95% CIs) were combined using a random‐effects model in this meta‐analysis. Results: Eighteen cohort studies (approximately 16.7 million participants) were finally included in this meta‐analysis. MASLD was linked to a higher risk of extrahepatic cancers, such as gastric (n = 10, HR = 1.47, 95% CI: 1.07–2.01), colorectal (n = 13, HR = 1.33, 95% CI: 1.16–1.53), pancreatic (n = 8, HR = 1.41, 95% CI: 1.11–1.79), biliary tract (n = 5, HR = 1.27, 95% CI: 1.18–1.37), thyroid (n = 6, HR = 1.46, 95% CI: 1.02–2.09), urinary system (n = 10, HR = 1.45, 95% CI: 1.25–1.69), breast (n = 11, HR = 1.17, 95% CI: 1.08–1.26) and female genital organ cancers (n = 10, HR = 1.36, 95% CI: 1.11–1.66). However, there was no statistically significant association between MASLD and the risk of head and neck (n = 6, HR = 1.03, 95% CI: 99–1.07), oesophageal (n = 9, HR = 1.26, 95% CI: 0.86–1.86), lung (n = 9, HR = 1.01, 95% CI: 0.92–1.10), prostate (n = 9, HR = 1.06, 95% CI: 0.94–1.19) or small intestine cancer (n = 2, HR = 1.75, 95% CI: 1.00–3.06). Conclusions: This latest large‐scale meta‐analysis indicated that MASLD was associated with an increased risk of various extrahepatic cancers, such as gastric, colorectal, pancreatic, biliary duct, thyroid, urinary system, breast, skin and female genital cancers. Further research is needed to investigate the mechanisms underlying these associations. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

39. Minimal $P$-symmetric period problem of first-order autonomous Hamiltonian Systems

Author: Liu, Chungen and Zhou, Ben-Xing
Subjects: Mathematics - Dynamical Systems
Abstract: Let $P\in Sp(2n)$ satisfying $P^{k}=I_{2n}$, we consider the minimal $P$-symmetric period problem of the autonomous nonlinear Hamiltonian system \begin{equation*} \dot x(t) = JH^{\prime}(x(t)). \end{equation*} For some symplectic matrices $P$, we show that for any $\tau>0$ the above Hamiltonian system possesses a $k\tau$ periodic solution $x$ with $k\tau$ being its minimal $P$-symmetric period provided $H$ satisfies the Rabinowitz's conditions on the minimal period conjecture, together with that $H$ is convex and $H(Px)=H(x)$., Comment: 13 pages
Published: 2016

40. Homoclinic solutions of discrete nonlinear Schrödinger equations with unbounded potentials

Author: Zhou, Ben-Xing and Liu, Chungen
Published: 2022
Full Text: View/download PDF

41. Tryptophan metabolism and liver fibrosis

Author: Wu, Hui-Xing, primary, Long, Chang-Rui, additional, Zhang, Zhen-Hua, additional, Chen, Yu-Lian, additional, Wang, Yan-Yu, additional, Xiang, Shi-Jian, additional, and Zhou, Ben-Jie, additional
Published: 2024
Full Text: View/download PDF

42. Association between metabolic dysfunction-associated steatotic liver disease and risk of incident pancreatic cancer: a systematic review and meta-analysis of cohort studies

Author: Zhang, Yi, primary, Zhou, Ben-Gang, additional, Zhan, Ji-Dong, additional, and Du, Bin-Bin, additional
Published: 2024
Full Text: View/download PDF

43. Psychological Stress Classification Using EEG and ECG: A CNN Based Multimodal Fusion Model

Author: Zhou, Ben, primary, Wang, Lei, additional, and Jiang, Chenyu, additional
Published: 2024
Full Text: View/download PDF

44. Connected Order Ideals and P-Partitions

Author: Zhou, Ben P.
Subjects: Mathematics - Combinatorics
Abstract: Given a finite poset $P$, we associate a simple graph denoted by $G_P$ with all connected order ideals of $P$ as vertices, and two vertices are adjacent if and only if they have nonempty intersection and are incomparable with respect to set inclusion. We establish a bijection between the set of maximum independent sets of $G_P$ and the set of $P$-forests, introduced by F\'eray and Reiner in their study of the fundamental generating function $F_P(\textbf{x})$ associated with $P$-partitions. Based on this bijection, in the cases when $P$ is naturally labeled we show that $F_P(\textbf{x})$ can factorise, such that each factor is a summation of rational functions determined by maximum independent sets of a connected component of $G_P$. This approach enables us to give an alternative proof for F\'eray and Reiner's nice formula of $F_P(\textbf{x})$ for the case of $P$ being a naturally labeled forest with duplications. Another consequence of our result is a product formula to compute the number of linear extensions of $P$., Comment: 23 pages, 6 figures
Published: 2016

45. Nano‐LaMnO3 Modified Separator for Enhanced Redox Reaction Kinetics and Electrochemical Performance of Lithium‐Sulfur Batteries.

Author: Yi, Dawei, Chang, Linqing, Zhou, Ben, Ma, Yitian, Wu, Yuhao, Wang, Peipei, Hou, Zhaoqi, Du, Huiling, and Lu, Hai
Subjects: LITHIUM sulfur batteries, OXIDATION-reduction reaction, CHEMICAL kinetics, CARBON-black, POLYSULFIDES
Abstract: The "shuttle effect" and sluggish conversion reaction kinetics of polysulfides seriously hinder the practical application of Li−S batteries. In this study, nano‐sized LaMnO3 (N‐LMO) with typical perovskite structure combined with highly‐conductive carbon black (SP) was introduced on the commercial separator to fabricate a functional surface modification. It is found that the polar N‐LMO with abundant active sites offers strong chemisorption towards the soluble polysulfides meanwhile accelerates their redox conversion, and the conductive SP contributes to extra spots to reactivate the trapped sulfur species. Consequently, the synergistic effect of adsorption‐catalysis‐conduction built by the cooperative N‐LMO/SP modification greatly enhances the sulfur redox kinetics as well as suppresses the polysulfide shuttling not at the expense of weakening Li‐ion transport, which endows the Li−S cell with a high initial discharge capacity of 1143.6 mAh g−1 at 0.2 C and a capacity decay rate of only 0.072 % per cycle at 1 C over 500 cycles. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

46. Applications of Conley index theory on difference equations with non-resonance

Author: Zhou, Ben-Xing and Liu, Chungen
Published: 2020
Full Text: View/download PDF

47. Multi-scale convolutional attention and Riemannian geometry network for EEG-based motor imagery classification

Author: Zhou, Ben, primary, Wang, Lei, additional, Xu, Wenchang, additional, and Jiang, Chenyu, additional
Published: 2024
Full Text: View/download PDF

48. A study on the critical Kirchhoff problem in high-dimensional space

Author: Xie, Qilin and Zhou, Ben-Xing
Published: 2022
Full Text: View/download PDF

49. Saturated very long chain fatty acid configures glycosphingolipid for lysosome homeostasis in long-lived C. elegans

Author: Wang, Feng, Dai, Yuxi, Zhu, Xufeng, Chen, Qilong, Zhu, Huanhu, Zhou, Ben, Tang, Haiqing, and Pang, Shanshan
Published: 2021
Full Text: View/download PDF

50. Prevalence of mental health problems in frontline healthcare workers after the first outbreak of COVID-19 in China: a cross-sectional study

Author: Guo, Wen-Ping, Min, Qing, Gu, Wei-Wei, Yu, Liang, Xiao, Xiao, Yi, Wei-Bing, Li, Hong-Liang, Huang, Bei, Li, Jun-Li, Dai, Ya-Jun, Xia, Jian, Liu, Jie, Li, Bei, Zhou, Ben-Hong, Li, Minglun, Xu, Hong-Xi, Wang, Xuan-Bin, and Shi, Wen-Yuan
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

640 results on '"Zhou, Ben"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources