Author: "Askari, Arian" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Askari, Arian"' showing total 49 results

Start Over Author "Askari, Arian"

49 results on '"Askari, Arian"'

1. Generative Retrieval with Few-shot Indexing

Author: Askari, Arian, Meng, Chuan, Aliannejadi, Mohammad, Ren, Zhaochun, Kanoulas, Evangelos, and Verberne, Suzan
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning, H.3.3
Abstract: Existing generative retrieval (GR) approaches rely on training-based indexing, i.e., fine-tuning a model to memorise the associations between a query and the document identifier (docid) of a relevant document. Training-based indexing has three limitations: high training overhead, under-utilization of the pre-trained knowledge of large language models (LLMs), and challenges in adapting to a dynamic document corpus. To address the above issues, we propose a novel few-shot indexing-based GR framework (Few-Shot GR). It has a novel few-shot indexing process, where we prompt an LLM to generate docids for all documents in a corpus, ultimately creating a docid bank for the entire corpus. During retrieval, we feed a query to the same LLM and constrain it to generate a docid within the docid bank created during indexing, and then map the generated docid back to its corresponding document. Few-Shot GR relies solely on prompting an LLM without requiring any training, making it more efficient. Moreover, we devise few-shot indexing with one-to-many mapping to further enhance Few-Shot GR. Experiments show that Few-Shot GR achieves superior performance to state-of-the-art GR methods that require heavy training.
Published: 2024

2. MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL

Author: Askari, Arian, Poelitz, Christian, and Tang, Xinye
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Databases, Computer Science - Human-Computer Interaction
Abstract: Self-correction in text-to-SQL is the process of prompting large language model (LLM) to revise its previously incorrectly generated SQL, and commonly relies on manually crafted self-correction guidelines by human experts that are not only labor-intensive to produce but also limited by the human ability in identifying all potential error patterns in LLM responses. We introduce MAGIC, a novel multi-agent method that automates the creation of the self-correction guideline. MAGIC uses three specialized agents: a manager, a correction, and a feedback agent. These agents collaborate on the failures of an LLM-based method on the training set to iteratively generate and refine a self-correction guideline tailored to LLM mistakes, mirroring human processes but without human involvement. Our extensive experiments show that MAGIC's guideline outperforms expert human's created ones. We empirically find out that the guideline produced by MAGIC enhance the interpretability of the corrections made, providing insights in analyzing the reason behind the failures and successes of LLMs in self-correction. We make all agent interactions publicly available to the research community, to foster further research in this area, offering a synthetic dataset for future explorations into automatic self-correction guideline generation., Comment: 20 pages, 17 figures
Published: 2024

3. Ranked List Truncation for Large Language Model-based Re-Ranking

Author: Meng, Chuan, Arabzadeh, Negar, Askari, Arian, Aliannejadi, Mohammad, and de Rijke, Maarten
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning, H.3.3
Abstract: We study ranked list truncation (RLT) from a novel "retrieve-then-re-rank" perspective, where we optimize re-ranking by truncating the retrieved list (i.e., trim re-ranking candidates). RLT is crucial for re-ranking as it can improve re-ranking efficiency by sending variable-length candidate lists to a re-ranker on a per-query basis. It also has the potential to improve re-ranking effectiveness. Despite its importance, there is limited research into applying RLT methods to this new perspective. To address this research gap, we reproduce existing RLT methods in the context of re-ranking, especially newly emerged large language model (LLM)-based re-ranking. In particular, we examine to what extent established findings on RLT for retrieval are generalizable to the "retrieve-then-re-rank" setup from three perspectives: (i) assessing RLT methods in the context of LLM-based re-ranking with lexical first-stage retrieval, (ii) investigating the impact of different types of first-stage retrievers on RLT methods, and (iii) investigating the impact of different types of re-rankers on RLT methods. We perform experiments on the TREC 2019 and 2020 deep learning tracks, investigating 8 RLT methods for pipelines involving 3 retrievers and 2 re-rankers. We reach new insights into RLT methods in the context of re-ranking., Comment: Accepted for publication as a long paper at SIGIR 2024
Published: 2024
Full Text: View/download PDF

4. Query Performance Prediction using Relevance Judgments Generated by Large Language Models

Author: Meng, Chuan, Arabzadeh, Negar, Askari, Arian, Aliannejadi, Mohammad, and de Rijke, Maarten
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning, H.3.3
Abstract: Query performance prediction (QPP) aims to estimate the retrieval quality of a search system for a query without human relevance judgments. Previous QPP methods typically return a single scalar value and do not require the predicted values to approximate a specific information retrieval (IR) evaluation measure, leading to certain drawbacks: (i) a single scalar is insufficient to accurately represent different IR evaluation measures, especially when metrics do not highly correlate, and (ii) a single scalar limits the interpretability of QPP methods because solely using a scalar is insufficient to explain QPP results. To address these issues, we propose a QPP framework using automatically generated relevance judgments (QPP-GenRE), which decomposes QPP into independent subtasks of predicting the relevance of each item in a ranked list to a given query. This allows us to predict any IR evaluation measure using the generated relevance judgments as pseudo-labels. This also allows us to interpret predicted IR evaluation measures, and identify, track and rectify errors in generated relevance judgments to improve QPP quality. We predict an item's relevance by using open-source large language models (LLMs) to ensure scientific reproducibility. We face two main challenges: (i) excessive computational costs of judging an entire corpus for predicting a metric considering recall, and (ii) limited performance in prompting open-source LLMs in a zero-/few-shot manner. To solve the challenges, we devise an approximation strategy to predict an IR measure considering recall and propose to fine-tune open-source LLMs using human-labeled relevance judgments. Experiments on the TREC 2019-2022 deep learning tracks show that QPP-GenRE achieves state-of-the-art QPP quality for both lexical and neural rankers.
Published: 2024

5. CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems

Author: Abolghasemi, Amin, Ren, Zhaochun, Askari, Arian, Aliannejadi, Mohammad, de Rijke, Maarten, and Verberne, Suzan
Subjects: Computer Science - Computation and Language
Abstract: An important unexplored aspect in previous work on user satisfaction estimation for Task-Oriented Dialogue (TOD) systems is their evaluation in terms of robustness for the identification of user dissatisfaction: current benchmarks for user satisfaction estimation in TOD systems are highly skewed towards dialogues for which the user is satisfied. The effect of having a more balanced set of satisfaction labels on performance is unknown. However, balancing the data with more dissatisfactory dialogue samples requires further data collection and human annotation, which is costly and time-consuming. In this work, we leverage large language models (LLMs) and unlock their ability to generate satisfaction-aware counterfactual dialogues to augment the set of original dialogues of a test collection. We gather human annotations to ensure the reliability of the generated samples. We evaluate two open-source LLMs as user satisfaction estimators on our augmented collection against state-of-the-art fine-tuned models. Our experiments show that when used as few-shot user satisfaction estimators, open-source LLMs show higher robustness to the increase in the number of dissatisfaction labels in the test collection than the fine-tuned state-of-the-art models. Our results shed light on the need for data augmentation approaches for user satisfaction estimation in TOD systems. We release our aligned counterfactual dialogues, which are curated by human annotation, to facilitate further research on this topic.
Published: 2024

6. Measuring Bias in a Ranked List using Term-based Representations

Author: Abolghasemi, Amin, Azzopardi, Leif, Askari, Arian, de Rijke, Maarten, and Verberne, Suzan
Subjects: Computer Science - Computation and Language
Abstract: In most recent studies, gender bias in document ranking is evaluated with the NFaiRR metric, which measures bias in a ranked list based on an aggregation over the unbiasedness scores of each ranked document. This perspective in measuring the bias of a ranked list has a key limitation: individual documents of a ranked list might be biased while the ranked list as a whole balances the groups' representations. To address this issue, we propose a novel metric called TExFAIR (term exposure-based fairness), which is based on two new extensions to a generic fairness evaluation framework, attention-weighted ranking fairness (AWRF). TExFAIR assesses fairness based on the term-based representation of groups in a ranked list: (i) an explicit definition of associating documents to groups based on probabilistic term-level associations, and (ii) a rank-biased discounting factor (RBDF) for counting non-representative documents towards the measurement of the fairness of a ranked list. We assess TExFAIR on the task of measuring gender bias in passage ranking, and study the relationship between TExFAIR and NFaiRR. Our experiments show that there is no strong correlation between TExFAIR and NFaiRR, which indicates that TExFAIR measures a different dimension of fairness than NFaiRR. With TExFAIR, we extend the AWRF framework to allow for the evaluation of fairness in settings with term-based representations of groups in documents in a ranked list., Comment: Accepted at the 46th European Conference on Information Retrieval (ECIR 2024)
Published: 2024

7. Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking dialogs

Author: Askari, Arian, Petcu, Roxana, Meng, Chuan, Aliannejadi, Mohammad, Abolghasemi, Amin, Kanoulas, Evangelos, and Verberne, Suzan
Subjects: Computer Science - Computation and Language
Abstract: Identifying user intents in information-seeking dialogs is crucial for a system to meet user's information needs. Intent prediction (IP) is challenging and demands sufficient dialogs with human-labeled intents for training. However, manually annotating intents is resource-intensive. While large language models (LLMs) have been shown to be effective in generating synthetic data, there is no study on using LLMs to generate intent-aware information-seeking dialogs. In this paper, we focus on leveraging LLMs for zero-shot generation of large-scale, open-domain, and intent-aware information-seeking dialogs. We propose SOLID, which has novel self-seeding and multi-intent self-instructing schemes. The former improves the generation quality by using the LLM's own knowledge scope to initiate dialog generation; the latter prompts the LLM to generate utterances sequentially, and mitigates the need for manual prompt design by asking the LLM to autonomously adapt its prompt instruction when generating complex multi-intent utterances. Furthermore, we propose SOLID-RL, which is further trained to generate a dialog in one step on the data generated by SOLID. We propose a length-based quality estimation mechanism to assign varying weights to SOLID-generated dialogs based on their quality during the training process of SOLID-RL. We use SOLID and SOLID-RL to generate more than 300k intent-aware dialogs, surpassing the size of existing datasets. Experiments show that IP methods trained on dialogs generated by SOLID and SOLID-RL achieve better IP quality than ones trained on human-generated dialogs.
Published: 2024

8. Answer Retrieval in Legal Community Question Answering

Author: Askari, Arian, Yang, Zihui, Ren, Zhaochun, and Verberne, Suzan
Subjects: Computer Science - Information Retrieval
Abstract: The task of answer retrieval in the legal domain aims to help users to seek relevant legal advice from massive amounts of professional responses. Two main challenges hinder applying existing answer retrieval approaches in other domains to the legal domain: (1) a huge knowledge gap between lawyers and non-professionals; and (2) a mix of informal and formal content on legal QA websites. To tackle these challenges, we propose CE_FS, a novel cross-encoder (CE) re-ranker based on the fine-grained structured inputs. CE_FS uses additional structured information in the CQA data to improve the effectiveness of cross-encoder re-rankers. Furthermore, we propose LegalQA: a real-world benchmark dataset for evaluating answer retrieval in the legal domain. Experiments conducted on LegalQA show that our proposed method significantly outperforms strong cross-encoder re-rankers fine-tuned on MS MARCO. Our novel finding is that adding the question tags of each question besides the question description and title into the input of cross-encoder re-rankers structurally boosts the rankers' effectiveness. While we study our proposed method in the legal domain, we believe that our method can be applied in similar applications in other domains., Comment: accepted at ECIR 2024
Published: 2024

9. Enhancing Documents with Multidimensional Relevance Statements in Cross-encoder Re-ranking

Author: Upadhyay, Rishabh, Askari, Arian, Pasi, Gabriella, and Viviani, Marco
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: In this paper, we propose a novel approach to consider multiple dimensions of relevance beyond topicality in cross-encoder re-ranking. On the one hand, current multidimensional retrieval models often use na\"ive solutions at the re-ranking stage to aggregate multiple relevance scores into an overall one. On the other hand, cross-encoder re-rankers are effective in considering topicality but are not designed to straightforwardly account for other relevance dimensions. To overcome these issues, we envisage enhancing the candidate documents -- which are retrieved by a first-stage lexical retrieval model -- with "relevance statements" related to additional dimensions of relevance and then performing a re-ranking on them with cross-encoders. In particular, here we consider an additional relevance dimension beyond topicality, which is credibility. We test the effectiveness of our solution in the context of the Consumer Health Search task, considering publicly available datasets. Our results show that the proposed approach statistically outperforms both aggregation-based and cross-encoder re-rankers.
Published: 2023

10. Injecting the score of the first-stage retriever as text improves BERT-based re-rankers

Author: Askari, Arian, Abolghasemi, Amin, Pasi, Gabriella, Kraaij, Wessel, and Verberne, Suzan
Published: 2024
Full Text: View/download PDF

11. Generating Synthetic Documents for Cross-Encoder Re-Rankers: A Comparative Study of ChatGPT and Human Experts

Author: Askari, Arian, Aliannejadi, Mohammad, Kanoulas, Evangelos, and Verberne, Suzan
Subjects: Computer Science - Information Retrieval
Abstract: We investigate the usefulness of generative Large Language Models (LLMs) in generating training data for cross-encoder re-rankers in a novel direction: generating synthetic documents instead of synthetic queries. We introduce a new dataset, ChatGPT-RetrievalQA, and compare the effectiveness of models fine-tuned on LLM-generated and human-generated data. Data generated with generative LLMs can be used to augment training data, especially in domains with smaller amounts of labeled data. We build ChatGPT-RetrievalQA based on an existing dataset, human ChatGPT Comparison Corpus (HC3), consisting of public question collections with human responses and answers from ChatGPT. We fine-tune a range of cross-encoder re-rankers on either human-generated or ChatGPT-generated data. Our evaluation on MS MARCO DEV, TREC DL'19, and TREC DL'20 demonstrates that cross-encoder re-ranking models trained on ChatGPT responses are statistically significantly more effective zero-shot re-rankers than those trained on human responses. In a supervised setting, the human-trained re-rankers outperform the LLM-trained re-rankers. Our novel findings suggest that generative LLMs have high potential in generating training data for neural retrieval models. Further work is needed to determine the effect of factually wrong information in the generated responses and test our findings' generalizability with open-source LLMs. We release our data, code, and cross-encoders checkpoints for future work.
Published: 2023

12. Retrieval for Extremely Long Queries and Documents with RPRS: a Highly Efficient and Effective Transformer-based Re-Ranker

Author: Askari, Arian, Verberne, Suzan, Abolghasemi, Amin, Kraaij, Wessel, and Pasi, Gabriella
Subjects: Computer Science - Information Retrieval
Abstract: Retrieval with extremely long queries and documents is a well-known and challenging task in information retrieval and is commonly known as Query-by-Document (QBD) retrieval. Specifically designed Transformer models that can handle long input sequences have not shown high effectiveness in QBD tasks in previous work. We propose a Re-Ranker based on the novel Proportional Relevance Score (RPRS) to compute the relevance score between a query and the top-k candidate documents. Our extensive evaluation shows RPRS obtains significantly better results than the state-of-the-art models on five different datasets. Furthermore, RPRS is highly efficient since all documents can be pre-processed, embedded, and indexed before query time which gives our re-ranker the advantage of having a complexity of O(N) where N is the total number of sentences in the query and candidate documents. Furthermore, our method solves the problem of the low-resource training in QBD retrieval tasks as it does not need large amounts of training data, and has only three parameters with a limited range that can be optimized with a grid search even if a small amount of labeled data is available. Our detailed analysis shows that RPRS benefits from covering the full length of candidate documents and queries., Comment: Accepted at ACM Transactions on Information Systems (ACM TOIS journal)
Published: 2023

13. Injecting the BM25 Score as Text Improves BERT-Based Re-rankers

Author: Askari, Arian, Abolghasemi, Amin, Pasi, Gabriella, Kraaij, Wessel, and Verberne, Suzan
Subjects: Computer Science - Information Retrieval
Abstract: In this paper we propose a novel approach for combining first-stage lexical retrieval models and Transformer-based re-rankers: we inject the relevance score of the lexical model as a token in the middle of the input of the cross-encoder re-ranker. It was shown in prior work that interpolation between the relevance score of lexical and BERT-based re-rankers may not consistently result in higher effectiveness. Our idea is motivated by the finding that BERT models can capture numeric information. We compare several representations of the BM25 score and inject them as text in the input of four different cross-encoders. We additionally analyze the effect for different query types, and investigate the effectiveness of our method for capturing exact matching relevance. Evaluation on the MSMARCO Passage collection and the TREC DL collections shows that the proposed method significantly improves over all cross-encoder re-rankers as well as the common interpolation methods. We show that the improvement is consistent for all query types. We also find an improvement in exact matching capabilities over both BM25 and the cross-encoders. Our findings indicate that cross-encoder re-rankers can efficiently be improved without additional computational burden and extra steps in the pipeline by explicitly adding the output of the first-stage ranker to the model input, and this effect is robust for different models and query types., Comment: Accepted at ECIR 2023
Published: 2023

14. Detection of Monogenic Disorders Using Noninvasive Prenatal Screening

Author: Taheri, Mohammad, Harsij, Atefeh, Askari, Arian, Pourtavakoli, Ashkan, Rather, Riyaz Ahmad, editor, and Saha, Subhas Chandra, editor
Published: 2024
Full Text: View/download PDF

15. Beyond Topicality: Including Multidimensional Relevance in Cross-encoder Re-ranking : The Health Misinformation Case Study

Author: Upadhyay, Rishabh, Askari, Arian, Pasi, Gabriella, Viviani, Marco, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Goharian, Nazli, editor, Tonellotto, Nicola, editor, He, Yulan, editor, Lipani, Aldo, editor, McDonald, Graham, editor, Macdonald, Craig, editor, and Ounis, Iadh, editor
Published: 2024
Full Text: View/download PDF

16. On the Interpolation of Contextualized Term-based Ranking with BM25 for Query-by-Example Retrieval

Author: Abolghasemi, Amin, Askari, Arian, and Verberne, Suzan
Subjects: Computer Science - Information Retrieval
Abstract: Term-based ranking with pre-trained transformer-based language models has recently gained attention as they bring the contextualization power of transformer models into the highly efficient term-based retrieval. In this work, we examine the generalizability of two of these deep contextualized term-based models in the context of query-by-example (QBE) retrieval in which a seed document acts as the query to find relevant documents. In this setting -- where queries are much longer than common keyword queries -- BERT inference at query time is problematic as it involves quadratic complexity. We investigate TILDE and TILDEv2, both of which leverage BERT tokenizer as their query encoder. With this approach, there is no need for BERT inference at query time, and also the query can be of any length. Our extensive evaluation on the four QBE tasks of SciDocs benchmark shows that in a query-by-example retrieval setting TILDE and TILDEv2 are still less effective than a cross-encoder BERT ranker. However, we observe that BM25 could show a competitive ranking quality compared to TILDE and TILDEv2 which is in contrast to the findings about the relative performance of these three models on retrieval for short queries reported in prior work. This result raises the question about the use of contextualized term-based ranking models being beneficial in QBE setting. We follow-up on our findings by studying the score interpolation between the relevance score from TILDE (TILDEv2) and BM25. We conclude that these two contextualized term-based ranking models capture different relevance signals than BM25 and combining the different term-based rankers results in statistically significant improvements in QBE retrieval. Our work sheds light on the challenges of retrieval settings different from the common evaluation benchmarks., Comment: Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval
Published: 2022
Full Text: View/download PDF

17. Role of miR-424 in the carcinogenesis

Author: Ghafouri-Fard, Soudeh, Askari, Arian, Hussen, Bashdar Mahmud, Taheri, Mohammad, and Akbari Dilmaghani, Nader
Published: 2024
Full Text: View/download PDF

18. LeiBi@COLIEE 2022: Aggregating Tuned Lexical Models with a Cluster-driven BERT-based Model for Case Law Retrieval

Author: Askari, Arian, Peikos, Georgios, Pasi, Gabriella, and Verberne, Suzan
Subjects: Computer Science - Information Retrieval
Abstract: This paper summarizes our approaches submitted to the case law retrieval task in the Competition on Legal Information Extraction/Entailment (COLIEE) 2022. Our methodology consists of four steps; in detail, given a legal case as a query, we reformulate it by extracting various meaningful sentences or n-grams. Then, we utilize the pre-processed query case to retrieve an initial set of possible relevant legal cases, which we further re-rank. Lastly, we aggregate the relevance scores obtained by the first stage and the re-ranking models to improve retrieval effectiveness. In each step of our methodology, we explore various well-known and novel methods. In particular, to reformulate the query cases aiming to make them shorter, we extract unigrams using three different statistical methods: KLI, PLM, IDF-r, as well as models that leverage embeddings (e.g., KeyBERT). Moreover, we investigate if automatic summarization using Longformer-Encoder-Decoder (LED) can produce an effective query representation for this retrieval task. Furthermore, we propose a novel re-ranking cluster-driven approach, which leverages Sentence-BERT models that are pre-tuned on large amounts of data for embedding sentences from query and candidate documents. Finally, we employ a linear aggregation method to combine the relevance scores obtained by traditional IR models and neural-based models, aiming to incorporate the semantic understanding of neural models and the statistically measured topical relevance. We show that aggregating these relevance scores can improve the overall retrieval effectiveness., Comment: Accepted at the COLIEE Workshop in Proceedings of JURISIN 2022. Please cite the published version
Published: 2022

19. Expert Finding in Legal Community Question Answering

Author: Askari, Arian, Verberne, Suzan, and Pasi, Gabriella
Subjects: Computer Science - Information Retrieval
Abstract: Expert finding has been well-studied in community question answering (QA) systems in various domains. However, none of these studies addresses expert finding in the legal domain, where the goal is for citizens to find lawyers based on their expertise. In the legal domain, there is a large knowledge gap between the experts and the searchers, and the content on the legal QA websites consist of a combination formal and informal communication. In this paper, we propose methods for generating query-dependent textual profiles for lawyers covering several aspects including sentiment, comments, and recency. We combine query-dependent profiles with existing expert finding methods. Our experiments are conducted on a novel dataset gathered from an online legal QA service. We discovered that taking into account different lawyer profile aspects improves the best baseline model. We make our dataset publicly available for future work., Comment: Accepted at Proceedings of the 44th European Conference on Information Retrieval, ECIR 2022. Please cite the published version
Published: 2022

20. Expression analysis of necroptosis related genes and lncRNAs in patients with pituitary neuroendocrine tumors

Author: Askari, Arian, Darabi, Mohammad Reza, Eslami, Solat, Jamali, Elena, Sharifi, Guive, Ghafouri-Fard, Soudeh, and Dilmaghani, Nader Akbari
Published: 2024
Full Text: View/download PDF

21. DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization-based re-ranking for case law retrieval

Author: Althammer, Sophia, Askari, Arian, Verberne, Suzan, and Hanbury, Allan
Subjects: Computer Science - Information Retrieval
Abstract: In this paper, we present our approaches for the case law retrieval and the legal case entailment task in the Competition on Legal Information Extraction/Entailment (COLIEE) 2021. As first stage retrieval methods combined with neural re-ranking methods using contextualized language models like BERT achieved great performance improvements for information retrieval in the web and news domain, we evaluate these methods for the legal domain. A distinct characteristic of legal case retrieval is that the query case and case description in the corpus tend to be long documents and therefore exceed the input length of BERT. We address this challenge by combining lexical and dense retrieval methods on the paragraph-level of the cases for the first stage retrieval. Here we demonstrate that the retrieval on the paragraph-level outperforms the retrieval on the document-level. Furthermore the experiments suggest that dense retrieval methods outperform lexical retrieval. For re-ranking we address the problem of long documents by summarizing the cases and fine-tuning a BERT-based re-ranker with the summaries. Overall, our best results were obtained with a combination of BM25 and dense passage retrieval using domain-specific embeddings., Comment: Published in COLIEE 2021
Published: 2021

22. A bioinformatics-based approach and expression assay for identification of dysregulated genes in pituitary adenoma

Author: Jamali, Elena, Hashemnejad, Mohammad Amin, Askari, Arian, Shomali, Hanieh, Eslami, Solat, Akbari Dilmaghani, Nader, Sharifi, Guive, Bahranian, Arefe, and Ghafouri-Fard, Soudeh
Published: 2024
Full Text: View/download PDF

23. Non-coding RNA profile for natural killer cell activity

Author: Ghafouri-Fard, Soudeh, Askari, Arian, Zangooie, Alireza, Shoorei, Hamed, Pourmoshtagh, Hasan, and Taheri, Mohammad
Published: 2023
Full Text: View/download PDF

24. A review on the role of MYC-induced long non-coding RNA in human disorders

Author: Taheri, Mohammad, Askari, Arian, Hussen, Bashdar Mahmud, Eghbali, Ahmad, and Ghafouri-Fard, Soudeh
Published: 2023
Full Text: View/download PDF

25. A review on the role of LINC00472 in malignant and non-malignant disorders

Author: Ghafouri-Fard, Soudeh, Askari, Arian, Hussen, Bashdar Mahmud, Rasul, Mohammed Fatih, Taheri, Mohammad, and Ayatollahi, Seyed Abdulmajid
Published: 2023
Full Text: View/download PDF

26. Role of MAGI2-AS3 in malignant and non-malignant disorders

Author: Taheri, Mohammad, Askari, Arian, Hussen, Bashdar Mahmud, Ghafouri-Fard, Soudeh, and Rashnoo, Fariborz
Published: 2023
Full Text: View/download PDF

27. A review on the role of ADAMTS9-AS2 in different disorders

Author: Ghafouri-Fard, Soudeh, Askari, Arian, Hussen, Bashdar Mahmud, Baniahmad, Aria, Taheri, Mohammad, and Mokhtari, Majid
Published: 2023
Full Text: View/download PDF

28. Injecting the BM25 Score as Text Improves BERT-Based Re-rankers

Author: Askari, Arian, primary, Abolghasemi, Amin, additional, Pasi, Gabriella, additional, Kraaij, Wessel, additional, and Verberne, Suzan, additional
Published: 2023
Full Text: View/download PDF

29. A review on the role of LINC00152 in different disorders

Author: Ghafouri-Fard, Soudeh, Askari, Arian, Hussen, Bashdar Mahmud, Rasul, Mohammed Fatih, Taheri, Mohammad, and Kiani, Arda
Published: 2023
Full Text: View/download PDF

30. Query Understanding via Entity Attribute Identification

Author: Nobari, Arash Dargahi, Askari, Arian, Hasibi, Faegheh, and Neshati, Mahmood
Subjects: Computer Science - Information Retrieval
Abstract: Understanding searchers' queries is an essential component of semantic search systems. In many cases, search queries involve specific attributes of an entity in a knowledge base (KB), which can be further used to find query answers. In this study, we aim to move forward the understanding of queries by identifying their related entity attributes from a knowledge base. To this end, we introduce the task of entity attribute identification and propose two methods to address it: (i) a model based on Markov Random Field, and (ii) a learning to rank model. We develop a human annotated test collection and show that our proposed methods can bring significant improvements over the baseline methods., Comment: Proceedings of the 27th International Conference on Information and Knowledge Management (CIKM '18), 2018
Published: 2018
Full Text: View/download PDF

31. Retrieval for Extremely Long Queries and Documents with RPRS: A Highly Efficient and Effective Transformer-based Re-Ranker.

Author: Askari, Arian, Verberne, Suzan, Abolghasemi, Amin, Kraaij, Wessel, and Pasi, Gabriella
Abstract: The article focuses on improving Query-by-Document (QBD) retrieval, a challenging task in information retrieval, by proposing a Re-Ranker based on the Proportional Relevance Score (RPRS) to compute relevance between queries and candidate documents. RPRS significantly outperforms existing models on various datasets, offering high efficiency with a complexity of O(N) and addressing the low-resource training issue in QBD retrieval tasks.
Published: 2024
Full Text: View/download PDF

32. Expert Finding in Legal Community Question Answering

Author: Askari, Arian, primary, Verberne, Suzan, additional, and Pasi, Gabriella, additional
Published: 2022
Full Text: View/download PDF

33. Beyond Topicality: Including Multidimensional Relevance in Cross-encoder Re-ranking

Author: Goharian, N, Tonellotto, N, He, Y, Lipani, A, McDonald, G, Macdonald, C, Ounis, I, Upadhyay, R, Askari, A, Pasi, G, Viviani, M, Upadhyay, Rishabh, Askari, Arian, Pasi, Gabriella, Viviani, Marco, Goharian, N, Tonellotto, N, He, Y, Lipani, A, McDonald, G, Macdonald, C, Ounis, I, Upadhyay, R, Askari, A, Pasi, G, Viviani, M, Upadhyay, Rishabh, Askari, Arian, Pasi, Gabriella, and Viviani, Marco
Abstract: In this paper, we propose a novel approach to consider multiple dimensions of relevance in cross-encoder re-ranking. On the one hand, cross-encoders constitute an effective solution for re-ranking when considering a single relevance dimension such as topicality, but are not designed to straightforwardly account for additional relevance dimensions. On the other hand, the majority of re-ranking models accounting for multdimensional relevance are often based on the aggregation of multiple relevance scores at the re-ranking stage, leading to potential compensatory effects. To address these issues, in the proposed solution we enhance the candidate documents retrieved by a first-stage lexical retrieval model with suitable relevance statements related to distinct relevance dimensions, and then perform a re-ranking on them with cross-encoders. In this work we focus, in particular, on an extra dimension of relevance beyond topicality, namely, credibility, to address health misinformation in the Consumer Health Search task. Experimental evaluations are performed by considering publicly available datasets; our results show that the proposed approach statistically outperforms state-of-the-art aggregation-based and cross-encoder re-rankers.
Published: 2024

34. Retrieval for Extremely Long Queries and Documents with RPRS: a Highly Efficient and Effective Transformer-based Re-Ranker

Author: Askari, Arian, primary, Verberne, Suzan, additional, Abolghasemi, Amin, additional, Kraaij, Wessel, additional, and Pasi, Gabriella, additional
Published: 2023
Full Text: View/download PDF

35. Retrievability Bias Estimation Using Synthetically Generated Queries

Author: Abolghasemi, Amin, primary, Verberne, Suzan, additional, Askari, Arian, additional, and Azzopardi, Leif, additional
Published: 2023
Full Text: View/download PDF

36. Injecting the Score of the First-stage Retriever as Text Improves BERT-Based Re-rankers

Author: Askari, Arian, primary, Abolghasemi, Amin, additional, Pasi, Gabriella, additional, Kraaij, Wessel, additional, and Verberne, Suzan, additional
Published: 2023
Full Text: View/download PDF

37. Antioxidant therapy against TGF‐β/SMAD pathway involved in organ fibrosis.

Author: Ghafouri‐Fard, Soudeh, Askari, Arian, Shoorei, Hamed, Seify, Mohammad, Koohestanidehaghi, Yeganeh, Hussen, Bashdar Mahmud, Taheri, Mohammad, and Samsami, Majid
Subjects: FIBROSIS, PULMONARY fibrosis, TRANSFORMING growth factors-beta, RENAL fibrosis, HEART fibrosis, SCARS
Abstract: Fibrosis refers to excessive build‐up of scar tissue and extracellular matrix components in different organs. In recent years, it has been revealed that different cytokines and chemokines, especially Transforming growth factor beta (TGF‐β) is involved in the pathogenesis of fibrosis. It has been shown that TGF‐β is upregulated in fibrotic tissues, and contributes to fibrosis by mediating pathways that are related to matrix preservation and fibroblasts differentiation. There is no doubt that antioxidants protect against different inflammatory conditions by reversing the effects of nitrogen, oxygen and sulfur‐based reactive elements. Oxidative stress has a direct impact on chronic inflammation, and as results, prolonged inflammation ultimately results in fibrosis. Different types of antioxidants, in the forms of vitamins, natural compounds or synthetic ones, have been proven to be beneficial in the protection against fibrotic conditions both in vitro and in vivo. In this study, we reviewed the role of different compounds with antioxidant activity in induction or inhibition of TGF‐β/SMAD signalling pathway, with regard to different fibrotic conditions such as gastro‐intestinal fibrosis, cardiac fibrosis, pulmonary fibrosis, skin fibrosis, renal fibrosis and also some rare cases of fibrosis, both in animal models and cell lines. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. Sarcopenia and noncoding RNAs: A comprehensive review

Author: Ghafouri‐Fard, Soudeh, primary, Askari, Arian, additional, Mahmud Hussen, Bashdar, additional, Taheri, Mohammad, additional, and Kiani, Arda, additional
Published: 2023
Full Text: View/download PDF

39. Role of miR-424 in the carcinogenesis

Author: Ghafouri-Fard, Soudeh, primary, Askari, Arian, additional, Hussen, Bashdar Mahmud, additional, Taheri, Mohammad, additional, and Akbari Dilmaghani, Nader, additional
Published: 2023
Full Text: View/download PDF

40. A review on the role of NCK1 Antisense RNA 1 (NCK1-AS1) in diverse disorders

Author: Taheri, Mohammad, primary, Askari, Arian, additional, Behzad Moghadam, Kimia, additional, Hussen, Bashdar Mahmud, additional, Ghafouri-Fard, Soudeh, additional, and Kiani, Arda, additional
Published: 2023
Full Text: View/download PDF

41. A review on the role of ZEB1-AS1 in human disorders

Author: Ghafouri-Fard, Soudeh, primary, Askari, Arian, additional, Moghadam, Kimia Behzad, additional, Hussen, Bashdar Mahmud, additional, Taheri, Mohammad, additional, and Samadian, Mohammad, additional
Published: 2023
Full Text: View/download PDF

42. Expand, Highlight, Generate: RL-driven Document Generation for Passage Reranking

Author: Askari, Arian, primary, Aliannejadi, Mohammad, additional, Meng, Chuan, additional, Kanoulas, Evangelos, additional, and Verberne, Suzan, additional
Published: 2023
Full Text: View/download PDF

43. A review on the role of miR-671 in human disorders

Author: Ghafouri-Fard, Soudeh, primary, Askari, Arian, additional, Hussen, Bashdar Mahmud, additional, Rasul, Mohammed Fatih, additional, Hatamian, Sevak, additional, Taheri, Mohammad, additional, and Kiani, Arda, additional
Published: 2022
Full Text: View/download PDF

44. A long non-coding RNA with important roles in the carcinogenesis

Author: Ghafouri-Fard, Soudeh, primary, Askari, Arian, additional, Hussen, Bashdar Mahmud, additional, Taheri, Mohammad, additional, and Mokhtari, Majid, additional
Published: 2022
Full Text: View/download PDF

45. On the Interpolation of Contextualized Term-based Ranking with BM25 for Query-by-Example Retrieval

Author: Abolghasemi, Amin, primary, Askari, Arian, additional, and Verberne, Suzan, additional
Published: 2022
Full Text: View/download PDF

46. On Anonymous Commenting

Author: Askari, Arian, primary, Jalilvand, Asal, additional, and Neshati, Mahmood, additional
Published: 2019
Full Text: View/download PDF

47. Query Understanding via Entity Attribute Identification

Author: Dargahi Nobari, Arash, primary, Askari, Arian, additional, Hasibi, Faegheh, additional, and Neshati, Mahmood, additional
Published: 2018
Full Text: View/download PDF

48. A review on the role of ZEB1-AS1 in human disorders.

Author: Ghafouri-Fard S, Askari A, Behzad Moghadam K, Hussen BM, Taheri M, and Samadian M
Subjects: Humans, Cell Line, Tumor, Cell Movement genetics, Cell Proliferation genetics, Gene Expression Regulation, Neoplastic, Zinc Finger E-box-Binding Homeobox 1 genetics, MicroRNAs genetics, RNA, Long Noncoding genetics, RNA, Long Noncoding metabolism, Neoplasms genetics
Abstract: ZEB1 Antisense RNA 1 (ZEB1-AS1) is a type of RNA characterized as long non-coding RNA (lncRNA). This lncRNA has important regulatory roles on its related gene, Zinc Finger E-Box Binding Homeobox 1 (ZEB1). In addition, role of ZEB1-AS1 has been approved in diverse malignancies such as colorectal cancer, breast cancer, glioma, hepatocellular carcinoma and gastric cancer. ZEB1-AS1 serves as a sponge for a number of microRNAs, namely miR-577, miR-335-5p, miR-101, miR-505-3p, miR-455-3p, miR-205, miR-23a, miR-365a-3p, miR-302b, miR-299-3p, miR-133a-3p, miR-200a, miR-200c, miR-342-3p, miR-214, miR-149-3p and miR-1224-5p. In addition to malignant conditions, ZEB1-AS1 has functional role in non-malignant conditions like diabetic nephropathy, diabetic lung, arthrosclerosis, Chlamydia trachomatis infection, pulmonary fibrosis and ischemic stroke. This review outlines different molecular mechanisms of ZEB1-AS1 in a variety of disorders and highlights its importance in their pathogenesis., Competing Interests: Declaration of Competing Interest The authors declare they have no conflict of interest., (Copyright © 2023 The Authors. Published by Elsevier GmbH.. All rights reserved.)
Published: 2023
Full Text: View/download PDF

49. A review on the role of NCK1 Antisense RNA 1 (NCK1-AS1) in diverse disorders.

Author: Taheri M, Askari A, Behzad Moghadam K, Hussen BM, Ghafouri-Fard S, and Kiani A
Subjects: Male, Humans, RNA, Antisense genetics, Cell Proliferation genetics, Cell Line, Tumor, Gene Expression Regulation, Neoplastic, Carcinoma, Non-Small-Cell Lung genetics, Lung Neoplasms genetics, MicroRNAs genetics, Glioma pathology, RNA, Long Noncoding genetics
Abstract: NCK1 Antisense RNA 1 (NCK1-AS1), alternatively named as NCK1-DT, is a long non-coding RNA (lncRNA) with important roles in the carcinogenesis. Multiple studies verified its oncogenic role in different types of cancer, including gastric cancer, non-small cell lung cancer, glioma, prostate cancer and cervical cancer. NCK1-AS1 functions as a sponge for several microRNAs, including miR-137, miR-22-3p, miR-526b-5p, miR-512-5p, miR-138-2-3p and miR-6857. In this review we present an outline of NCK1-AS1 function in malignant conditions as well as atherosclerosis., Competing Interests: Declaration of Competing Interest The authors declare they have no conflict of interest., (Copyright © 2023 The Authors. Published by Elsevier GmbH.. All rights reserved.)
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

49 results on '"Askari, Arian"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources