Author: "Doostmohammadi, Ehsan" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Doostmohammadi, Ehsan"' showing total 27 results

Start Over Author "Doostmohammadi, Ehsan"

27 results on '"Doostmohammadi, Ehsan"'

1. How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?

Author: Doostmohammadi, Ehsan, Holmström, Oskar, and Kuhlmann, Marco
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Work on instruction-tuned Large Language Models (LLMs) has used automatic methods based on text overlap and LLM judgments as cost-effective alternatives to human evaluation. In this paper, we perform a meta-evaluation of such methods and assess their reliability across a broad range of tasks. In evaluating how well automatic methods align with human evaluations, correlation metrics are the most commonly employed method despite their inherent limitations when dealing with ties and different scales. To address these shortcomings, we use Pairwise Accuracy as an alternative to standard correlation measures. We observe that while automatic evaluation methods can approximate human ratings under specific conditions, their validity is highly context-dependent. Specifically, the simple ROUGE-L metric correlates very well with human ratings for short-answer English tasks but is unreliable in free-form generation tasks and cross-lingual scenarios. The effectiveness of the more advanced method of using GPT-4 as a judge diminishes significantly if reference answers are not included in the prompt, which is the scenario where this method has the potential to provide the most value compared to other metrics. Our findings enhance the understanding of how automatic methods should be applied and interpreted when developing and evaluating instruction-tuned LLMs.
Published: 2024

2. Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

Author: Doostmohammadi, Ehsan, Norlund, Tobias, Kuhlmann, Marco, and Johansson, Richard
Subjects: Computer Science - Computation and Language
Abstract: Augmenting language models with a retrieval mechanism has been shown to significantly improve their performance while keeping the number of parameters low. Retrieval-augmented models commonly rely on a semantic retrieval mechanism based on the similarity between dense representations of the query chunk and potential neighbors. In this paper, we study the state-of-the-art Retro model and observe that its performance gain is better explained by surface-level similarities, such as token overlap. Inspired by this, we replace the semantic retrieval in Retro with a surface-level method based on BM25, obtaining a significant reduction in perplexity. As full BM25 retrieval can be computationally costly for large datasets, we also apply it in a re-ranking scenario, gaining part of the perplexity reduction with minimal computational overhead.
Published: 2023

3. On the Generalization Ability of Retrieval-Enhanced Transformers

Author: Norlund, Tobias, Doostmohammadi, Ehsan, Johansson, Richard, and Kuhlmann, Marco
Subjects: Computer Science - Computation and Language
Abstract: Recent work on the Retrieval-Enhanced Transformer (RETRO) model has shown that off-loading memory from trainable weights to a retrieval database can significantly improve language modeling and match the performance of non-retrieval models that are an order of magnitude larger in size. It has been suggested that at least some of this performance gain is due to non-trivial generalization based on both model weights and retrieval. In this paper, we try to better understand the relative contributions of these two components. We find that the performance gains from retrieval largely originate from overlapping tokens between the database and the test data, suggesting less non-trivial generalization than previously assumed. More generally, our results point to the challenges of evaluating the generalization of retrieval-augmented language models such as RETRO, as even limited token overlap may significantly decrease test-time loss. We release our code and model at https://github.com/TobiasNorlund/retro
Published: 2023

4. SINA-BERT: A pre-trained Language Model for Analysis of Medical Texts in Persian

Author: Taghizadeh, Nasrin, Doostmohammadi, Ehsan, Seifossadat, Elham, Rabiee, Hamid R., and Tahaei, Maedeh S.
Subjects: Computer Science - Computation and Language
Abstract: We have released Sina-BERT, a language model pre-trained on BERT (Devlin et al., 2018) to address the lack of a high-quality Persian language model in the medical domain. SINA-BERT utilizes pre-training on a large-scale corpus of medical contents including formal and informal texts collected from a variety of online resources in order to improve the performance on health-care related tasks. We employ SINA-BERT to complete following representative tasks: categorization of medical questions, medical sentiment analysis, and medical question retrieval. For each task, we have developed Persian annotated data sets for training and evaluation and learnt a representation for the data of each task especially complex and long medical questions. With the same architecture being used across tasks, SINA-BERT outperforms BERT-based models that were previously made available in the Persian language.
Published: 2021

5. Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT

Author: Doostmohammadi, Ehsan, Nassajian, Minoo, and Rahimi, Adel
Subjects: Computer Science - Computation and Language
Abstract: Words are properly segmented in the Persian writing system; in practice, however, these writing rules are often neglected, resulting in single words being written disjointedly and multiple words written without any white spaces between them. This paper addresses the problems of word segmentation and zero-width non-joiner (ZWNJ) recognition in Persian, which we approach jointly as a sequence labeling problem. We achieved a macro-averaged F1-score of 92.40% on a carefully collected corpus of 500 sentences with a high level of difficulty.
Published: 2020

6. PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

Author: Doostmohammadi, Ehsan, Bokaei, Mohammad Hadi, and Sameti, Hossein
Subjects: Computer Science - Computation and Language
Abstract: Keyphrases provide an extremely dense summary of a text. Such information can be used in many Natural Language Processing tasks, such as information retrieval and text summarization. Since previous studies on Persian keyword or keyphrase extraction have not published their data, the field suffers from the lack of a human extracted keyphrase dataset. In this paper, we introduce PerKey, a corpus of 553k news articles from six Persian news websites and agencies with relatively high quality author extracted keyphrases, which is then filtered and cleaned to achieve higher quality keyphrases. The resulted data was put into human assessment to ensure the quality of the keyphrases. We also measured the performance of different supervised and unsupervised techniques, e.g. TFIDF, MultipartiteRank, KEA, etc. on the dataset using precision, recall, and F1-score.
Published: 2020
Full Text: View/download PDF

7. Persian Keyphrase Generation Using Sequence-to-Sequence Models

Author: Doostmohammadi, Ehsan, Bokaei, Mohammad Hadi, and Sameti, Hossein
Subjects: Computer Science - Computation and Language
Abstract: Keyphrases are a very short summary of an input text and provide the main subjects discussed in the text. Keyphrase extraction is a useful upstream task and can be used in various natural language processing problems, for example, text summarization and information retrieval, to name a few. However, not all the keyphrases are explicitly mentioned in the body of the text. In real-world examples there are always some topics that are discussed implicitly. Extracting such keyphrases requires a generative approach, which is adopted here. In this paper, we try to tackle the problem of keyphrase generation and extraction from news articles using deep sequence-to-sequence models. These models significantly outperform the conventional methods such as Topic Rank, KPMiner, and KEA in the task of keyphrase extraction.
Published: 2020
Full Text: View/download PDF

8. Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts

Author: Doostmohammadi, Ehsan and Nassajian, Minoo
Subjects: Computer Science - Computation and Language
Abstract: Identification of the languages written using cuneiform symbols is a difficult task due to the lack of resources and the problem of tokenization. The Cuneiform Language Identification task in VarDial 2019 addresses the problem of identifying seven languages and dialects written in cuneiform; Sumerian and six dialects of Akkadian language: Old Babylonian, Middle Babylonian Peripheral, Standard Babylonian, Neo-Babylonian, Late Babylonian, and Neo-Assyrian. This paper describes the approaches taken by SharifCL team to this problem in VarDial 2019. The best result belongs to an ensemble of Support Vector Machines and a naive Bayes classifier, both working on character-level features, with macro-averaged F1-score of 72.10%.
Published: 2020
Full Text: View/download PDF

9. Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification

Author: Doostmohammadi, Ehsan, Sameti, Hossein, and Saffar, Ali
Subjects: Computer Science - Computation and Language
Abstract: This paper presents the models submitted by Ghmerti team for subtasks A and B of the OffensEval shared task at SemEval 2019. OffensEval addresses the problem of identifying and categorizing offensive language in social media in three subtasks; whether or not a content is offensive (subtask A), whether it is targeted (subtask B) towards an individual, a group, or other entities (subtask C). The proposed approach includes character-level Convolutional Neural Network, word-level Recurrent Neural Network, and some preprocessing. The performance achieved by the proposed model for subtask A is 77.93% macro-averaged F1-score.
Published: 2020
Full Text: View/download PDF

10. Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

Author: Doostmohammadi, Ehsan, Nassajian, Minoo, and Rahimi, Adel
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Ezafe is a grammatical particle in some Iranian languages that links two words together. Regardless of the important information it conveys, it is almost always not indicated in Persian script, resulting in mistakes in reading complex sentences and errors in natural language processing tasks. In this paper, we experiment with different machine learning methods to achieve state-of-the-art results in the task of ezafe recognition. Transformer-based methods, BERT and XLMRoBERTa, achieve the best results, the latter achieving 2.68% F1-score more than the previous state-of-the-art. We, moreover, use ezafe information to improve Persian part-of-speech tagging results and show that such information will not be useful to transformer-based methods and explain why that might be the case.
Published: 2020

11. Growth of Chinese Medicine in Iran: Past, Present, and Prospects.

Author: Doostmohammadi, Ehsan
Subjects: SILK Road, MEDICAL communication, BELT & Road Initiative, CHINESE medicine, HERBAL medicine
Abstract: The international recognition of traditional Chinese medicine (TCM) has continuously increased, and that medical practice has gradually become incorporated into the medical systems of many nations. As an important country in the Belt and Road Initiative, Iran has enormous potential for cooperation with China in medical and health care. High-level officials of the two countries attach great importance to cooperation in both areas. Despite the recent rapid development of TCM in Iran, that medical practice still faces many problems; examples here are the lack of practitioners of TCM as well as the paucity of comprehensive cooperation among herbal medicine factories, publishing houses, traditional medicine colleges, and universities in the two countries. The present study collected and analyzed materials related to the practice of TCM in Iran; it made a deep examination of the current situation, problems, and development prospects regarding TCM in Iran with the aim of providing ideas and references to promote the international development of that form of medicine. Following an analysis of the development trends of TCM in Iran in recent years, it became evident that the prospects for TCM in that country are extensive, and the practice has excellent potential. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. Growth of Chinese Medicine in Iran: Past, Present, and Prospects

Author: Doostmohammadi, Ehsan, primary
Published: 2023
Full Text: View/download PDF

13. Making Instruction Finetuning Accessible to Non-English Languages : A Case Study on Swedish Models

Author: Holmström, Oskar, Doostmohammadi, Ehsan, Holmström, Oskar, and Doostmohammadi, Ehsan
Abstract: In recent years, instruction finetuning models have received increased attention due to their remarkable zero-shot and generalization capabilities. However, the widespread implementation of these models has been limited to the English language, largely due to the costs and challenges associated with creating instruction datasets. To overcome this, automatic instruction generation has been proposed as a resourceful alternative. We see this as an opportunity for the adoption of instruction finetuning for other languages. In this paper we explore the viability of instruction finetuning for Swedish. We translate a dataset of generated instructions from English to Swedish, using it to finetune both Swedish and non-Swedish models. Results indicate that the use of translated instructions significantly improves the models’ zero-shot performance, even on unseen data, while staying competitive with strong baselines ten times in size. We see this paper is a first step and a proof of concept that instruction finetuning for Swedish is within reach, through resourceful means, and that there exist several directions for further improvements.
Published: 2023

14. Making Instruction Finetuning Accessible to Non-English Languages: A Case Study on Swedish

Author: Holmström, Oskar and Doostmohammadi, Ehsan
Subjects: NoDaLiDa 2023
Published: 2023

15. Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

Author: Doostmohammadi, Ehsan, primary, Norlund, Tobias, additional, Kuhlmann, Marco, additional, and Johansson, Richard, additional
Published: 2023
Full Text: View/download PDF

16. On the Effects of Video Grounding on Language Models

Author: Doostmohammadi, Ehsan, Kuhlmann, Marco, Doostmohammadi, Ehsan, and Kuhlmann, Marco
Abstract: Transformer-based models trained on text and vision modalities try to improve the performance on multimodal downstream tasks or tackle the problem Transformer-based models trained on text and vision modalities try to improve the performance on multimodal downstream tasks or tackle the problem of lack of grounding, e.g., addressing issues like models’ insufficient commonsense knowledge. While it is more straightforward to evaluate the effects of such models on multimodal tasks, such as visual question answering or image captioning, it is not as well-understood how these tasks affect the model itself, and its internal linguistic representations. In this work, we experiment with language models grounded in videos and measure the models’ performance on predicting masked words chosen based on their imageability. The results show that the smaller model benefits from video grounding in predicting highly imageable words, while the results for the larger model seem harder to interpret.of lack of grounding, e.g., addressing issues like models’ insufficient commonsense knowledge. While it is more straightforward to evaluate the effects of such models on multimodal tasks, such as visual question answering or image captioning, it is not as well-understood how these tasks affect the model itself, and its internal linguistic representations. In this work, we experiment with language models grounded in videos and measure the models’ performance on predicting masked words chosen based on their imageability. The results show that the smaller model benefits from video grounding in predicting highly imageable words, while the results for the larger model seem harder to interpret.
Published: 2022

17. Persian Sentiment Analysis Without Training Data Using Cross-Lingual Word Embeddings

Author: Aliramezani, Mohammad, primary, Doostmohammadi, Ehsan, additional, Bokaei, Mohammad Hadi, additional, and Sameti, Hossien, additional
Published: 2020
Full Text: View/download PDF

18. Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT

Author: Doostmohammadi, Ehsan, primary, Nassajian, Minoo, additional, and Rahimi, Adel, additional
Published: 2020
Full Text: View/download PDF

19. Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

Author: Doostmohammadi, Ehsan, primary, Nassajian, Minoo, additional, and Rahimi, Adel, additional
Published: 2020
Full Text: View/download PDF

20. Persian Keyphrase Generation Using Sequence-to-Sequence Models

Author: Doostmohammadi, Ehsan, primary, Bokaei, Mohammad Hadi, additional, and Sameti, Hossein, additional
Published: 2019
Full Text: View/download PDF

21. Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification

Author: Doostmohammadi, Ehsan, primary, Sameti, Hossein, additional, and Saffar, Ali, additional
Published: 2019
Full Text: View/download PDF

22. Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts

Author: Doostmohammadi, Ehsan, primary and Nassajian, Minoo, additional
Published: 2019
Full Text: View/download PDF

23. PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

Author: Doostmohammadi, Ehsan, primary, Bokaei, Mohammad Hadi, additional, and Sameti, Hossein, additional
Published: 2018
Full Text: View/download PDF

24. A Witness of Cultural Exchange.

Author: Doostmohammadi, Ehsan
Subjects: *CULTURAL relations, *BELT & Road Initiative, *MEDICAL personnel, SILK Road
Abstract: The article discusses the historical and cultural exchanges between China and Iran, with a focus on traditional Chinese medicine (TCM) as an important carrier of cultural exchange. It highlights the diplomatic exchanges between the two countries during ancient times and the opening of the Silk Road. The article also mentions the influence of Chinese medicine in Iran, including the import and export of medicines and the translation of Chinese medical texts. It concludes by emphasizing the recognition and development of TCM in Iran in modern times. [Extracted from the article]
Published: 2024

25. Persian Word Embedding Evaluation Benchmarks

Author: Zahedi, Mohammad Sadegh, primary, Bokaei, Mohammad Hadi, additional, Shoeleh, Farzaneh, additional, Yadollahi, Mohammad Mehdi, additional, Doostmohammadi, Ehsan, additional, and Farhoodi, Mojgan, additional
Published: 2018
Full Text: View/download PDF

26. In Sync.

Author: Doostmohammadi, Ehsan
Subjects: *CULTURAL relations, *MODERN society, *CHINESE characters
Abstract: This article discusses the concept of "hehe culture" in China, which is characterized by harmony, unity, and cooperation. Chinese characters, being logographic, carry deep historical and cultural meanings, with the character for "harmony" having a significant influence on Chinese society. Hehe culture emphasizes the harmonious coexistence between humanity and nature, the balanced development of both, and the pursuit of peace. It also promotes tolerance, inclusiveness, and cultural exchange. The article suggests that studying and applying the wisdom of hehe culture can help address the challenges of modern society and create a better future. [Extracted from the article]
Published: 2023

27. Wisdom of the Sages.

Author: Doostmohammadi, Ehsan
Subjects: *WISDOM, *SAGE, *HEALING
Abstract: Confucius established the first private school in China in the state of Lu in the sixth century B.C., and, according to historical records, he had 3,000 disciples, of whom 72 became highly accomplished. After the founding of the Zhou Dynasty (1046-256 B.C.), Jiang was crowned as ruler of the vassal state of Qi, now part of Shandong. Both Confucius and Mozi lived at a time when multiple schools of thought flourished and contended for dominance. [Extracted from the article]
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

27 results on '"Doostmohammadi, Ehsan"'

1. How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?

2. Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

3. On the Generalization Ability of Retrieval-Enhanced Transformers

4. SINA-BERT: A pre-trained Language Model for Analysis of Medical Texts in Persian

5. Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT

6. PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

7. Persian Keyphrase Generation Using Sequence-to-Sequence Models

8. Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts

9. Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification

10. Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

11. Growth of Chinese Medicine in Iran: Past, Present, and Prospects.

12. Growth of Chinese Medicine in Iran: Past, Present, and Prospects

13. Making Instruction Finetuning Accessible to Non-English Languages : A Case Study on Swedish Models

14. Making Instruction Finetuning Accessible to Non-English Languages: A Case Study on Swedish

15. Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

16. On the Effects of Video Grounding on Language Models

17. Persian Sentiment Analysis Without Training Data Using Cross-Lingual Word Embeddings

18. Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT

19. Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

20. Persian Keyphrase Generation Using Sequence-to-Sequence Models

21. Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification

22. Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts

23. PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

24. A Witness of Cultural Exchange.

25. Persian Word Embedding Evaluation Benchmarks

26. In Sync.

27. Wisdom of the Sages.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

27 results on '"Doostmohammadi, Ehsan"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources