Author: "Yang, Jingfeng" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yang, Jingfeng"' showing total 468 results

1. Scaling Laws for Predicting Downstream Performance in LLMs

Author: Chen, Yangyi, Huang, Binxuan, Gao, Yifan, Wang, Zhengyang, Yang, Jingfeng, and Ji, Heng
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Precise estimation of downstream performance in large language models (LLMs) prior to training is essential for guiding their development process. Scaling laws analysis utilizes the statistics of a series of significantly smaller sampling language models (LMs) to predict the performance of the target LLM. For downstream performance prediction, the critical challenge lies in the emergent abilities in LLMs that occur beyond task-specific computational thresholds. In this work, we focus on the pre-training loss as a more computation-efficient metric for performance estimation. Our two-stage approach consists of first estimating a function that maps computational resources (e.g., FLOPs) to the pre-training Loss using a series of sampling models, followed by mapping the pre-training loss to downstream task Performance after the critical "emergent phase". In preliminary experiments, this FLP solution accurately predicts the performance of LLMs with 7B and 13B parameters using a series of sampling LMs up to 3B, achieving error margins of 5% and 10%, respectively, and significantly outperforming the FLOPs-to-Performance approach. This motivates FLP-M, a fundamental approach for performance prediction that addresses the practical need to integrate datasets from multiple sources during pre-training, specifically blending general corpora with code data to accurately represent the common necessity. FLP-M extends the power law analytical function to predict domain-specific pre-training loss based on FLOPs across data sources, and employs a two-layer neural network to model the non-linear relationship between multiple domain-specific loss and downstream performance. By utilizing a 3B LLM trained on a specific ratio and a series of smaller sampling LMs, FLP-M can effectively forecast the performance of 3B and 7B LLMs across various data mixtures for most benchmarks within 10% error margins.
Published: 2024

2. Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs

Author: Cheng, Kewei, Yang, Jingfeng, Jiang, Haoming, Wang, Zhengyang, Huang, Binxuan, Li, Ruirui, Li, Shiyang, Li, Zheng, Gao, Yifan, Li, Xian, Yin, Bing, and Sun, Yizhou
Subjects: Computer Science - Artificial Intelligence
Abstract: Reasoning encompasses two typical types: deductive reasoning and inductive reasoning. Despite extensive research into the reasoning capabilities of Large Language Models (LLMs), most studies have failed to rigorously differentiate between inductive and deductive reasoning, leading to a blending of the two. This raises an essential question: In LLM reasoning, which poses a greater challenge - deductive or inductive reasoning? While the deductive reasoning capabilities of LLMs (i.e., their capacity to follow instructions in reasoning tasks) have received considerable attention, their abilities in true inductive reasoning remain largely unexplored. To investigate the true inductive reasoning capabilities of LLMs, we propose a novel framework, SolverLearner. This framework enables LLMs to learn the underlying function (i.e., $y = f_w(x)$) that maps input data points $(x)$ to their corresponding output values $(y)$, using only in-context examples. By focusing on inductive reasoning and separating it from LLM-based deductive reasoning, we can isolate and investigate inductive reasoning of LLMs in its pure form via SolverLearner. Our observations reveal that LLMs demonstrate remarkable inductive reasoning capabilities through SolverLearner, achieving near-perfect performance with ACC of 1 in most cases. Surprisingly, despite their strong inductive reasoning abilities, LLMs tend to relatively lack deductive reasoning capabilities, particularly in tasks involving "counterfactual" reasoning.
Published: 2024

3. Segment Anything without Supervision

Author: Wang, XuDong, Yang, Jingfeng, and Darrell, Trevor
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The Segment Anything Model (SAM) requires labor-intensive data labeling. We present Unsupervised SAM (UnSAM) for promptable and automatic whole-image segmentation that does not require human annotations. UnSAM utilizes a divide-and-conquer strategy to "discover" the hierarchical structure of visual scenes. We first leverage top-down clustering methods to partition an unlabeled image into instance/semantic level segments. For all pixels within a segment, a bottom-up clustering method is employed to iteratively merge them into larger groups, thereby forming a hierarchical structure. These unsupervised multi-granular masks are then utilized to supervise model training. Evaluated across seven popular datasets, UnSAM achieves competitive results with the supervised counterpart SAM, and surpasses the previous state-of-the-art in unsupervised segmentation by 11% in terms of AR. Moreover, we show that supervised SAM can also benefit from our self-supervised labels. By integrating our unsupervised pseudo masks into SA-1B's ground-truth masks and training UnSAM with only 1% of SA-1B, a lightly semi-supervised UnSAM can often segment entities overlooked by supervised SAM, exceeding SAM's AR by over 6.7% and AP by 3.9% on SA-1B.
Comment: Code: https://github.com/frank-xwang/UnSAM
Published: 2024

4. Large Language Models in the Clinic: A Comprehensive Benchmark

Author: Liu, Fenglin, Li, Zheng, Zhou, Hongjian, Yin, Qingyu, Yang, Jingfeng, Tang, Xianfeng, Luo, Chen, Zeng, Ming, Jiang, Haoming, Gao, Yifan, Nigam, Priyanka, Nag, Sreyashi, Yin, Bing, Hua, Yining, Zhou, Xuan, Rohanian, Omid, Thakur, Anshul, Clifton, Lei, and Clifton, David A.
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. However, many clinical decisions involve answering open-ended questions without pre-set options. To better understand LLMs in the clinic, we construct a benchmark, ClinicBench. We first collect eleven existing datasets covering diverse clinical language generation, understanding, and reasoning tasks. Furthermore, we construct six novel datasets and clinical tasks that are complex but common in real-world practice, e.g., open-ended decision-making, long document processing, and emerging drug analysis. We conduct an extensive evaluation of twenty-two LLMs under both zero-shot and few-shot settings. Finally, we invite medical experts to evaluate the clinical usefulness of LLMs. The benchmark data is available at https://github.com/AI-in-Health/ClinicBench.
Comment: Accepted at EMNLP 2024 Main Conference
Published: 2024

5. Development of a Successive two-stage Radio Frequency Combined hot/cold air Drying of Squid Based on Drying Kinetics and Browning Index

Author: Zhang, Feilong, Zhang, Yajin, Yang, Jingfeng, Li, Feng, Kong, Fanbin, Tang, Juming, Shi, Hu, and Jiao, Yang
Published: 2024

6. MEMORYLLM: Towards Self-Updatable Large Language Models

Author: Wang, Yu, Gao, Yifan, Chen, Xiusi, Jiang, Haoming, Li, Shiyang, Yang, Jingfeng, Yin, Qingyu, Li, Zheng, Li, Xian, Yin, Bing, Shang, Jingbo, and McAuley, Julian
Subjects: Computer Science - Computation and Language
Abstract: Existing Large Language Models (LLMs) usually remain static after deployment, which might make it hard to inject new knowledge into the model. We aim to build models containing a considerable portion of self-updatable parameters, enabling the model to integrate new knowledge effectively and efficiently. To this end, we introduce MEMORYLLM, a model that comprises a transformer and a fixed-size memory pool within the latent space of the transformer. MEMORYLLM can self-update with text knowledge and memorize the knowledge injected earlier. Our evaluations demonstrate the ability of MEMORYLLM to effectively incorporate new knowledge, as evidenced by its performance on model editing benchmarks. Meanwhile, the model exhibits long-term information retention capacity, which is validated through our custom-designed evaluations and long-context benchmarks. MEMORYLLM also shows operational integrity without any sign of performance degradation even after nearly a million memory updates. Our code and model are open-sourced at https://github.com/wangyu-ustc/MemoryLLM.
Comment: 13 pages, 9 figures
Published: 2024

7. LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Author: Jin, Hongye, Han, Xiaotian, Yang, Jingfeng, Jiang, Zhimeng, Liu, Zirui, Chang, Chia-Yuan, Chen, Huiyuan, and Hu, Xia
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: It is well known that LLMs cannot generalize well to long contexts whose lengths are larger than the training sequence length. This poses challenges when employing LLMs for processing long input sequences during inference. In this work, we argue that LLMs themselves have inherent capabilities to handle long contexts without fine-tuning. To achieve this goal, we propose SelfExtend to extend the context window of LLMs by constructing bi-level attention information: the grouped attention and the neighbor attention. The grouped attention captures the dependencies among tokens that are far apart, while neighbor attention captures dependencies among adjacent tokens within a specified range. The two-level attentions are computed based on the original model's self-attention mechanism during inference. With minor code modification, our SelfExtend can effortlessly extend existing LLMs' context window without any fine-tuning. We conduct comprehensive experiments on multiple benchmarks and the results show that our SelfExtend can effectively extend existing LLMs' context window length. The code can be found at https://github.com/datamllab/LongLM.
Comment: ICML 2024 Spotlight
Published: 2024

8. Enhancing User Intent Capture in Session-Based Recommendation with Attribute Patterns

Author: Liu, Xin, Li, Zheng, Gao, Yifan, Yang, Jingfeng, Cao, Tianyu, Wang, Zhengyang, Yin, Bing, and Song, Yangqiu
Subjects: Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: The goal of session-based recommendation in E-commerce is to predict the next item that an anonymous user will purchase based on the browsing and purchase history. However, constructing global or local transition graphs to supplement session data can lead to noisy correlations and user intent vanishing. In this work, we propose the Frequent Attribute Pattern Augmented Transformer (FAPAT) that characterizes user intents by building attribute transition graphs and matching attribute patterns. Specifically, the frequent and compact attribute patterns serve as memory to augment session representations, followed by a gate and a transformer block to fuse the whole session information. Through extensive experiments on two public benchmarks and 100 million industrial data in three domains, we demonstrate that FAPAT consistently outperforms state-of-the-art methods by an average of 4.5% across various evaluation metrics (Hits, NDCG, MRR). Besides evaluating the next-item prediction, we estimate the models' capabilities to capture user intents via predicting items' attributes and period-item recommendations.
Comment: Accepted by NeurIPS 2023
Published: 2023

9. Ascorbic acid alleviates the autolysis of the sea cucumber Apostichopus japonicus via the activation of antioxidant enzymes to remove reactive oxygen species during live storage

Author: Cai, Han, Zhao, Jun, Wang, Lu, Wang, Yanjie, Zheng, Jie, Song, Shuang, and Yang, Jingfeng
Published: 2024

10. Alterations in feeding preference and gastric emptying of giant freshwater prawn (Macrobrachium rosenbergii) following administration of varying quantities of fermented soybean meal

Author: Cai, XingHui, Luo, Jingyi, Li, Xiang, Yang, JingFeng, Hua, XueMing, and Liu, Tao
Published: 2024

11. GrowLength: Accelerating LLMs Pretraining by Progressively Growing Training Length

Author: Jin, Hongye, Han, Xiaotian, Yang, Jingfeng, Jiang, Zhimeng, Chang, Chia-Yuan, and Hu, Xia
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: The evolving sophistication and intricacies of Large Language Models (LLMs) yield unprecedented advancements, yet they simultaneously demand considerable computational resources and incur significant costs. To alleviate these challenges, this paper introduces a novel, simple, and effective method named "GrowLength" to accelerate the pretraining process of LLMs. Our method progressively increases the training length throughout the pretraining phase, thereby mitigating computational costs and enhancing efficiency. For instance, it begins with a sequence length of 128 and progressively extends to 4096. This approach enables models to process a larger number of tokens within limited time frames, potentially boosting their performance. In other words, the efficiency gain is derived from training with shorter sequences optimizing the utilization of resources. Our extensive experiments with various state-of-the-art LLMs have revealed that models trained using our method not only converge more swiftly but also exhibit superior performance metrics compared to those trained with existing methods. Furthermore, our method for LLMs pretraining acceleration does not require any additional engineering efforts, making it a practical solution in the realm of LLMs.
Published: 2023

12. Situated Natural Language Explanations

Author: Zhu, Zining, Jiang, Haoming, Yang, Jingfeng, Nag, Sreyashi, Zhang, Chao, Huang, Jie, Gao, Yifan, Rudzicz, Frank, and Yin, Bing
Subjects: Computer Science - Computation and Language
Abstract: Natural language is among the most accessible tools for explaining decisions to humans, and large pretrained language models (PLMs) have demonstrated impressive abilities to generate coherent natural language explanations (NLE). The existing NLE research perspectives do not take the audience into account. An NLE can have high textual quality, but it might not accommodate audiences' needs and preferences. To address this limitation, we propose an alternative perspective, situated NLE. On the evaluation side, we set up automated evaluation scores. These scores describe the properties of NLEs in lexical, semantic, and pragmatic categories. On the generation side, we identify three prompt engineering techniques and assess their applicability to the situations. Situated NLE provides a perspective and facilitates further research on the generation and evaluation of explanations.
Published: 2023

13. CCGen: Explainable Complementary Concept Generation in E-Commerce

Author: Huang, Jie, Gao, Yifan, Li, Zheng, Yang, Jingfeng, Song, Yangqiu, Zhang, Chao, Zhu, Zining, Jiang, Haoming, Chang, Kevin Chen-Chuan, and Yin, Bing
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We propose and study Complementary Concept Generation (CCGen): given a concept of interest, e.g., "Digital Cameras", generating a list of complementary concepts, e.g., 1) Camera Lenses 2) Batteries 3) Camera Cases 4) Memory Cards 5) Battery Chargers. CCGen is beneficial for various applications like query suggestion and item recommendation, especially in the e-commerce domain. To solve CCGen, we propose to train language models to generate ranked lists of concepts with a two-step training strategy. We also teach the models to generate explanations by incorporating explanations distilled from large teacher models. Extensive experiments and analysis demonstrate that our model can generate high-quality concepts complementary to the input concept while producing explanations to justify the predictions.
Published: 2023

14. Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

Author: Yang, Jingfeng, Jin, Hongye, Tang, Ruixiang, Han, Xiaotian, Feng, Qizhang, Jiang, Haoming, Yin, Bing, and Hu, Xia
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: This paper presents a comprehensive and practical guide for practitioners and end-users working with Large Language Models (LLMs) in their downstream natural language processing (NLP) tasks. We provide discussions and insights into the usage of LLMs from the perspectives of models, data, and downstream tasks. Firstly, we offer an introduction and brief summary of current GPT- and BERT-style LLMs. Then, we discuss the influence of pre-training data, training data, and test data. Most importantly, we provide a detailed discussion about the use and non-use cases of large language models for various natural language processing tasks, such as knowledge-intensive tasks, traditional natural language understanding tasks, natural language generation tasks, emergent abilities, and considerations for specific tasks. We present various use cases and non-use cases to illustrate the practical applications and limitations of LLMs in real-world scenarios. We also try to understand the importance of data and the specific challenges associated with each NLP task. Furthermore, we explore the impact of spurious biases on LLMs and delve into other essential considerations, such as efficiency, cost, and latency, to ensure a comprehensive understanding of deploying LLMs in practice. This comprehensive guide aims to provide researchers and practitioners with valuable insights and best practices for working with LLMs, thereby enabling the successful implementation of these models in a wide range of NLP tasks. A curated list of practical guide resources of LLMs, regularly updated, can be found at https://github.com/Mooler0410/LLMsPracticalGuide.
Published: 2023

15. Mutually-paced Knowledge Distillation for Cross-lingual Temporal Knowledge Graph Reasoning

Author: Wang, Ruijie, Li, Zheng, Yang, Jingfeng, Cao, Tianyu, Zhang, Chao, Yin, Bing, and Abdelzaher, Tarek
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Social and Information Networks
Abstract: This paper investigates the cross-lingual temporal knowledge graph reasoning problem, which aims to facilitate reasoning on Temporal Knowledge Graphs (TKGs) in low-resource languages by transferring knowledge from TKGs in high-resource ones. The cross-lingual distillation ability across TKGs becomes increasingly crucial, in light of the unsatisfying performance of existing reasoning methods on those severely incomplete TKGs, especially in low-resource languages. However, it poses tremendous challenges in two aspects. First, the cross-lingual alignments, which serve as bridges for knowledge transfer, are usually too scarce to transfer sufficient knowledge between two TKGs. Second, temporal knowledge discrepancy of the aligned entities, especially when alignments are unreliable, can mislead the knowledge distillation process. We correspondingly propose a mutually-paced knowledge distillation model MP-KD, where a teacher network trained on a source TKG can guide the training of a student network on target TKGs with an alignment module. Concretely, to deal with the scarcity issue, MP-KD generates pseudo alignments between TKGs based on the temporal information extracted by our representation module. To maximize the efficacy of knowledge transfer and control the noise caused by the temporal knowledge discrepancy, we enhance MP-KD with a temporal cross-lingual attention mechanism to dynamically estimate the alignment strength. The two procedures are mutually paced along with model training. Extensive experiments on twelve cross-lingual TKG transfer tasks in the EventKG benchmark demonstrate the effectiveness of the proposed MP-KD method.
Comment: This paper is accepted by The Web Conference 2023
Published: 2023

16. Choice Fusion as Knowledge for Zero-Shot Dialogue State Tracking

Author: Su, Ruolin, Yang, Jingfeng, Wu, Ting-Wei, and Juang, Biing-Hwang
Subjects: Computer Science - Computation and Language
Abstract: With the pressing need to deploy dialogue systems in new domains at lower cost, zero-shot dialogue state tracking (DST), which tracks users' requirements in task-oriented dialogues without training on desired domains, is drawing increasing attention. Although prior works have leveraged question-answering (QA) data to reduce the need for in-domain training in DST, they fail to explicitly model knowledge transfer and fusion for tracking dialogue states. To address this issue, we propose CoFunDST, which is trained on domain-agnostic QA datasets and directly uses candidate choices of slot-values as knowledge for zero-shot dialogue-state generation, based on a T5 pre-trained language model. Specifically, CoFunDST selects highly-relevant choices to the reference context and fuses them to initialize the decoder to constrain the model outputs. Our experimental results show that our proposed model achieves higher joint goal accuracy than existing zero-shot DST approaches in most domains on MultiWOZ 2.1. Extensive analyses demonstrate the effectiveness of our proposed approach for improving zero-shot DST learning from QA.
Comment: Accepted by ICASSP 2023
Published: 2023

17. Multi-VALUE: A Framework for Cross-Dialectal English NLP

Author: Ziems, Caleb, Held, William, Yang, Jingfeng, Dhamala, Jwala, Gupta, Rahul, and Yang, Diyi
Subjects: Computer Science - Computation and Language
Abstract: Dialect differences caused by regional, social, and economic factors cause performance discrepancies for many groups of language technology users. Inclusive and equitable language technology must critically be dialect invariant, meaning that performance remains constant over dialectal shifts. Current systems often fall short of this ideal since they are designed and tested on a single dialect: Standard American English (SAE). We introduce a suite of resources for evaluating and achieving English dialect invariance. The resource is called Multi-VALUE, a controllable rule-based translation system spanning 50 English dialects and 189 unique linguistic features. Multi-VALUE maps SAE to synthetic forms of each dialect. First, we use this system to stress test question answering, machine translation, and semantic parsing. Stress tests reveal significant performance disparities for leading models on non-standard dialects. Second, we use this system as a data augmentation technique to improve the dialect robustness of existing systems. Finally, we partner with native speakers of Chicano and Indian English to release new gold-standard variants of the popular CoQA task. To execute the transformation code, run model checkpoints, and download both synthetic and gold-standard dialectal benchmark datasets, see http://value-nlp.org.
Comment: ACL 2023
Published: 2022

18. On the Security Vulnerabilities of Text-to-SQL Models

Author: Peng, Xutan, Zhang, Yipeng, Yang, Jingfeng, and Stevenson, Mark
Subjects: Computer Science - Computation and Language, Computer Science - Cryptography and Security, Computer Science - Databases, Computer Science - Machine Learning, Computer Science - Software Engineering
Abstract: Although it has been demonstrated that Natural Language Processing (NLP) algorithms are vulnerable to deliberate attacks, the question of whether such weaknesses can lead to software security threats is under-explored. To bridge this gap, we conducted vulnerability tests on Text-to-SQL systems that are commonly used to create natural language interfaces to databases. We showed that the Text-to-SQL modules within six commercial applications can be manipulated to produce malicious code, potentially leading to data breaches and Denial of Service attacks. This is the first demonstration that NLP models can be exploited as attack vectors in the wild. In addition, experiments using four open-source language models verified that straightforward backdoor attacks on Text-to-SQL systems achieve a 100% success rate without affecting their performance. The aim of this work is to draw the community's attention to potential software security issues associated with NLP algorithms and encourage exploration of methods to mitigate against them.
Comment: Best Paper Candidate at ISSRE 2023. Replaced "PLM" with "LLM" for better visibility
Published: 2022

19. SeqZero: Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models

Author: Yang, Jingfeng, Jiang, Haoming, Yin, Qingyu, Zhang, Danqing, Yin, Bing, and Yang, Diyi
Subjects: Computer Science - Computation and Language
Abstract: Recent research showed promising results on combining pretrained language models (LMs) with canonical utterance for few-shot semantic parsing. The canonical utterance is often lengthy and complex due to the compositional structure of formal languages. Learning to generate such canonical utterance requires a significant amount of data to reach high performance. Fine-tuning with only few-shot samples, the LMs can easily forget pretrained knowledge, overfit spurious biases, and suffer from compositionally out-of-distribution generalization errors. To tackle these issues, we propose a novel few-shot semantic parsing method -- SeqZero. SeqZero decomposes the problem into a sequence of sub-problems, which correspond to the sub-clauses of the formal language. Based on the decomposition, the LMs only need to generate short answers using prompts for predicting sub-clauses. Thus, SeqZero avoids generating a long canonical utterance at once. Moreover, SeqZero employs not only a few-shot model but also a zero-shot model to alleviate the overfitting. In particular, SeqZero brings out the merits from both models via ensemble equipped with our proposed constrained rescaling. SeqZero achieves SOTA performance of BART-based models on GeoQuery and EcommerceQuery, which are two few-shot datasets with compositional data split.
Comment: 12 pages, Findings of NAACL 2022
Published: 2022

20. SUBS: Subtree Substitution for Compositional Semantic Parsing

Author: Yang, Jingfeng, Zhang, Le, and Yang, Diyi
Subjects: Computer Science - Computation and Language
Abstract: Although sequence-to-sequence models often achieve good performance in semantic parsing for i.i.d. data, their performance is still inferior in compositional generalization. Several data augmentation methods have been proposed to alleviate this problem. However, prior work only leveraged superficial grammar or rules for data augmentation, which resulted in limited improvement. We propose to use subtree substitution for compositional data augmentation, where we consider subtrees with similar semantic functions as exchangeable. Our experiments showed that such augmented data led to significantly better performance on SCAN and GeoQuery, and reached new SOTA on the compositional split of GeoQuery.
Comment: 6 pages
Published: 2022

21. TableFormer: Robust Transformer Modeling for Table-Text Encoding

Author: Yang, Jingfeng, Gupta, Aditya, Upadhyay, Shyam, He, Luheng, Goel, Rahul, and Paul, Shachi
Subjects: Computer Science - Computation and Language
Abstract: Understanding tables is an important aspect of natural language understanding. Existing models for table understanding require linearization of the table structure, where row or column order is encoded as an unwanted bias. Such spurious biases make the model vulnerable to row and column order perturbations. Additionally, prior work has not thoroughly modeled the table structures or table-text alignments, hindering the table-text understanding ability. In this work, we propose a robust and structurally aware table-text encoding architecture TableFormer, where tabular structural biases are incorporated completely through learnable attention biases. TableFormer is (1) strictly invariant to row and column orders, and (2) could understand tables better due to its tabular inductive biases. Our evaluations showed that TableFormer outperforms strong baselines in all settings on SQA, WTQ and TabFact table reasoning datasets, and achieves state-of-the-art performance on SQA, especially when facing answer-invariant row and column order perturbations (6% improvement over the best baseline), because previous SOTA models' performance drops by 4% - 6% when facing such perturbations while TableFormer is not affected.
Comment: ACL 2022, 10 pages
Published: 2022

22. Preparation of antibacterial composite film based on arginine-modified chitosan and its application in the preservation of ready-to-eat sea cucumber

Author: Sun, Jinghe, Li, Yimeng, Yan, Tingting, and Yang, Jingfeng
Published: 2024

23. Lack of signal peptide in insect prophenoloxidase to avoid glycosylation to damage the zymogen activity

Author: Wu, Kai, Yang, Bing, Chen, Rongbing, Majeed, Rafia, Li, Baoling, Gong, Liyuan, Wei, Xuefei, Yang, Jingfeng, Tang, Yingyu, Wang, Aibin, Toufeeq, Shahzad, Shaik, Haq Abdul, Huang, Wuren, Guo, Xuan, and Ling, Erjun
Published: 2024

24. Oxygenated storage alleviates autolysis of the sea cucumber Apostichopus japonicus during transport

Author: Zhou, Yan, Zheng, Jie, Zhao, Jun, Li, Shuang, Xing, Jie, Ai, Chunqing, Yu, Chenxu, Yang, Sheng, and Yang, Jingfeng
Published: 2023

25. Degradation of low-molecular-weight fucoidans by human intestinal microbiota and their regulation effect on intestinal microbiota and metabolites during in vitro fermentation

Author: Sun, Xiaona, Yang, Yunning, Song, Chen, Ai, Chunqing, Yang, Jingfeng, and Song, Shuang
Published: 2024

26. An analysis combining proteomics and transcriptomics revealed a regulation target of sea cucumber autolysis

Author: Yan, Tingting, Sun, Jinghe, Zheng, Jie, and Yang, Jingfeng
Published: 2024

27. Optimization of customer service and driver dispatch areas for on-demand food delivery

Author: Yang, Jingfeng, Lau, Hoong Chuin, and Wang, Hai
Published: 2024

28. Optimization of Station-Skip in a Cyclic Express Subway Service

Author: Yang, Jingfeng, Wang, Hai, and Jin, Jiangang
Published: 2023

29. The protective effect of Enteromorpha prolifera polysaccharide on alcoholic liver injury in C57BL/6 mice

Author: Yan, Tingting, Zhang, Yuying, Lu, Hengyu, Zhao, Jun, Wen, Chengrong, Song, Shuang, Ai, Chunqing, and Yang, Jingfeng
Published: 2024

30. LR-CNN: Local-aware Region CNN for Vehicle Detection in Aerial Imagery

Author: Liao, Wentong, Chen, Xiang, Yang, Jingfeng, Roth, Stefan, Goesele, Michael, Yang, Michael Ying, and Rosenhahn, Bodo
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery. We enhance translation invariance to detect dense vehicles and address the boundary quantization issue amongst dense vehicles by aggregating the high-precision RoIs' features. Moreover, we resample high-level semantic pooled features, making them regain location information from the features of a shallower convolutional block. This strengthens the local feature invariance for the resampled features and enables detecting vehicles in an arbitrary orientation. The local feature invariance enhances the learning ability of the focal loss function, and the focal loss further helps to focus on the hard examples. Taken together, our method better addresses the challenges of aerial imagery. We evaluate our approach on several challenging datasets (VEDAI, DOTA), demonstrating a significant improvement over state-of-the-art methods. We demonstrate the good generalization ability of our approach on the DLR 3K dataset.
Comment: 8 pages
Published: 2020

31. Aflatoxin B1-induced early developmental hepatotoxicity in larvae zebrafish

Author: Feng, Chi, Bai, Hongxia, Chang, Xu, Wu, Zhixuan, Dong, Wu, Ma, Qianqian, and Yang, Jingfeng
Published: 2023
Full Text: View/download PDF

32. Reduced pigmentation and thyroid hormone disruption in zebrafish embryos caused by industrial sludge near Bohai Bay, China

Author: Dong, Wenjing, Yin, Xiaoyu, Qi, Chelimuge, Wei, Tingting, Wei, Lijia, Yang, Jingfeng, Mu, Jingli, Teraoka, Hiroki, and Dong, Wu
Published: 2023
Full Text: View/download PDF

33. A survey of cross-lingual features for zero-shot cross-lingual semantic parsing

Author: Yang, Jingfeng, Fancellu, Federico, and Webber, Bonnie
Subjects: Computer Science - Computation and Language
Abstract: The availability of corpora to train semantic parsers in English has led to significant advances in the field. Unfortunately, for languages other than English, annotation is scarce and so are developed parsers. We then ask: could a parser trained in English be applied to a language that it hasn't been trained on? To answer this question we explore zero-shot cross-lingual semantic parsing where we train an available coarse-to-fine semantic parser (Liu et al., 2018) using cross-lingual word embeddings and universal dependencies in English and test it on Italian, German and Dutch. Results on the Parallel Meaning Bank, a multilingual semantic graphbank, show that Universal Dependency features significantly boost performance when used in conjunction with other lexical features but modelling the UD structure directly when encoding the input does not.
Published: 2019

34. The therapeutic effect of Zhenbao pills on behavioral changes in zebrafish caused by aluminum chloride

Author: Chen, Hongsong, Li, Huilei, Yin, Xiaoyu, Liu, Yuanyuan, Zhang, Tengdan, Wu, Hui, Kang, Guiying, Yu, Yongli, Bai, Meirong, Bao, Liming, Yang, Jingfeng, and Dong, Wu
Published: 2023
Full Text: View/download PDF

35. Pseudoephedrine hydrochloride causes hyperactivity in zebrafish via modulation of the serotonin pathway

Author: Zhou, Yini, Li, Tonglaga, Zhou, Shangzi, Xu, Han, Yin, Xiaoyu, Chen, Hao, Ni, Xuan, Bai, Meirong, Ao, Wuliji, Yang, Jingfeng, Ahmed, R. G., Zhang, Xuefu, Bao, Shuyin, Yu, Jianhua, Kwok, Kevin W. H., and Dong, Wu
Published: 2022
Full Text: View/download PDF

36. Adaptive neural network control of an uncertain 2-DOF helicopter system with input backlash and output constraints

Author: Zhao, Zhijia, He, Weitian, Yang, Jingfeng, and Li, ZhiFu
Published: 2022
Full Text: View/download PDF

37. Risk-Aware Procurement Optimization in a Global Technology Supply Chain

Author: Chase, Jonathan, Yang, Jingfeng, Lau, Hoong Chuin, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, de Armas, Jesica, editor, Ramalhinho, Helena, editor, and Voß, Stefan, editor
Published: 2022
Full Text: View/download PDF

38. Inhibitory effects of fucoidan from Laminaria japonica against some pathogenic bacteria and SARS-CoV-2 depend on its large molecular weight

Author: Sun, Xiaona, Ai, Chunqing, Wen, Chengrong, Peng, Haoran, Yang, Jingfeng, Cui, Yuna, and Song, Shuang
Published: 2023
Full Text: View/download PDF

39. Corrigendum to “The protective effect of Enteromorpha prolifera polysaccharide on alcoholic liver injury in C57BL/6 mice” [Int. J. Biol. Macromol. 261 (2024) 129908]

Author: Yan, Tingting, Zhang, Yuying, Lu, Hengyu, Zhao, Jun, Wen, Chengrong, Song, Shuang, Ai, Chunqing, and Yang, Jingfeng
Published: 2024
Full Text: View/download PDF

40. Chinese Discourse Segmentation Using Bilingual Discourse Commonality

Author: Yang, Jingfeng and Li, Sujian
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Discourse segmentation aims to segment Elementary Discourse Units (EDUs) and is a fundamental task in discourse analysis. For Chinese, previous research identifies EDUs just through discriminating the functions of punctuation marks. In this paper, we argue that Chinese EDUs may not end at the punctuation positions and should follow the definition of EDU in RST-DT. With this definition, we conduct Chinese discourse segmentation with the help of English labeled data. Using discourse commonality between English and Chinese, we design an adversarial neural network framework to extract common language-independent features and language-specific features which are useful for discourse segmentation, when there is no or only a small scale of Chinese labeled data available. Experiments on discourse segmentation demonstrate that our models can leverage common features from bilingual data, and learn efficient Chinese-specific features from a small amount of Chinese labeled data, outperforming the baseline models.
Published: 2018

41. Toward Fast and Accurate Neural Discourse Segmentation

Author: Wang, Yizhong, Li, Sujian, and Yang, Jingfeng
Subjects: Computer Science - Computation and Language
Abstract: Discourse segmentation, which segments texts into Elementary Discourse Units, is a fundamental step in discourse analysis. Previous discourse segmenters rely on complicated hand-crafted features and are not practical in actual use. In this paper, we propose an end-to-end neural segmenter based on BiLSTM-CRF framework. To improve its accuracy, we address the problem of data insufficiency by transferring a word representation model that is trained on a large corpus. We also propose a restricted self-attention mechanism in order to capture useful information within a neighborhood. Experiments on the RST-DT corpus show that our model is significantly faster than previous methods, while achieving new state-of-the-art performance. Comment: 6 pages, camera-ready version of EMNLP 2018
Published: 2018

42. Tag-Enhanced Tree-Structured Neural Networks for Implicit Discourse Relation Classification

Author: Wang, Yizhong, Li, Sujian, Yang, Jingfeng, Sun, Xu, and Wang, Houfeng
Subjects: Computer Science - Computation and Language
Abstract: Identifying implicit discourse relations between text spans is a challenging task because it requires understanding the meaning of the text. To tackle this task, recent studies have tried several deep learning methods but few of them exploited the syntactic information. In this work, we explore the idea of incorporating syntactic parse tree into neural networks. Specifically, we employ the Tree-LSTM model and Tree-GRU model, which are based on the tree structure, to encode the arguments in a relation. Moreover, we further leverage the constituent tags to control the semantic composition process in these tree-structured neural networks. Experimental results show that our method achieves state-of-the-art performance on PDTB corpus. Comment: Accepted by IJCNLP 2017, 10 pages
Published: 2018

43. RPTD: Reliability-enhanced Privacy-preserving Truth Discovery for Mobile Crowdsensing

Author: Liu, Yuxian, Liu, Fagui, Wu, Hao-Tian, Yang, Jingfeng, Zheng, Kaihong, Xu, Lingling, Yan, Xingfu, and Hu, Jiankun
Published: 2022
Full Text: View/download PDF

44. Protective effect of curcumin on zebrafish liver under ethanol-induced oxidative stress

Author: Song, Lei, Li, Ming, Feng, Chi, Sa, Rigaiqiqige, Hu, Xiaodong, Wang, Jie, Yin, Xiaoyu, Qi, Chelimuge, Dong, Wu, and Yang, Jingfeng
Published: 2022
Full Text: View/download PDF

45. Prophenoloxidase-positive tubes derived from the hindguts may be the doorkeeper to detoxify the waste metabolites collected by Malpighian tubules in Lepidoptera insects

Author: Tang, Yingyu, Zhang, Ying, Zhang, Qiaoli, Chen, Rongbing, Gong, Liyuan, Wei, Xuefei, Yang, Jingfeng, Wu, Kai, Huang, Wuren, Li, Shirong, Toufeeq, Shahzad, Liu, Qiuning, and Ling, Erjun
Published: 2022
Full Text: View/download PDF

46. Developmental disorders caused by cefixime in the otic vesicles of zebrafish embryos or larvae

Author: Chen, Chaobao, Ni, Xuan, Yin, Xiaoyu, Chen, Hao, Zhou, Yini, Sun, Huiying, Qi, Chelimuge, Bu, Nini, Wang, Shuaiyu, Yu, Jianhua, Yang, Jingfeng, Ao, Wuliji, Zhao, Baoquan, and Dong, Wu
Published: 2022
Full Text: View/download PDF

47. Enteromorpha prolifera Polysaccharide Alleviates Acute Alcoholic Liver Injury in C57 BL/6 Mice through the Gut–Liver Axis and NF-κB Pathway.

Author: Yan, Tingting, Sun, Jinghe, Zhang, Yuying, Wen, Chengrong, and Yang, Jingfeng
Published: 2024
Full Text: View/download PDF

48. A Learning and Optimization Framework for Collaborative Urban Delivery Problems with Alliances

Author: Yang, Jingfeng, Lau, Hoong Chuin, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Mes, Martijn, editor, Lalla-Ruiz, Eduardo, editor, and Voß, Stefan, editor
Published: 2021
Full Text: View/download PDF

49. Pteryxin attenuates LPS-induced inflammatory responses and inhibits NLRP3 inflammasome activation in RAW264.7 cells

Author: Zhen, Dong, Xuan, Tian-qi, Hu, Boqin, Bai, Xue, Fu, Dan-ni, Wang, Yu, Wu, Yun, Yang, Jingfeng, and Ma, Qianqian
Published: 2022
Full Text: View/download PDF

50. Alterations in feeding preference and gastric emptying of giant freshwater prawn (Macrobrachium rosenbergii) following administration of varying quantities of fermented soybean meal

Author: Cai, XingHui, Luo, Jingyi, Li, Xiang, Yang, JingFeng, Hua, XueMing, and Liu, Tao
Published: 2024
Full Text: View/download PDF
