Author: "Goldman, Omer" - Searchworks@Jio Institute Digital Library Search Results

Search for author "Goldman, Omer": 41 results

1. Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Author: Goldman, Omer, Jacovi, Alon, Slobodkin, Aviv, Maimon, Aviya, Dagan, Ido, and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Improvements in language models' capabilities have pushed their applications towards longer contexts, making long-context evaluation and development an active research area. However, many disparate use-cases are grouped together under the umbrella term of "long-context", defined simply by the total length of the model's input, including, for example, Needle-in-a-Haystack tasks, book summarization, and information aggregation. Given their varied difficulty, in this position paper we argue that conflating different tasks by their context length is unproductive. As a community, we require a more precise vocabulary to understand what makes long-context tasks similar or different. We propose to unpack the taxonomy of long-context based on the properties that make them more difficult with longer contexts. We propose two orthogonal axes of difficulty: (I) Diffusion: How hard is it to find the necessary information in the context? (II) Scope: How much necessary information is there to find? We survey the literature on long-context, provide justification for this taxonomy as an informative descriptor, and situate the literature with respect to it. We conclude that the most difficult and interesting settings, whose necessary information is very long and highly diffused within the input, are severely under-explored. By using a descriptive vocabulary and discussing the relevant properties of difficulty in long-context, we can implement more informed research in this area. We call for a careful design of tasks and benchmarks with distinctly long context, taking into account the characteristics that make it qualitatively different from shorter context.
Comment: EMNLP 2024
Published: 2024

2. Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance

Author: Goldman, Omer, Caciularu, Avi, Eyal, Matan, Cao, Kris, Szpektor, Idan, and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Despite it being the cornerstone of BPE, the most common tokenization algorithm, the importance of compression in the tokenization process is still unclear. In this paper, we argue for the theoretical importance of compression, which can be viewed as 0-gram language modeling where equal probability is assigned to all tokens. We also demonstrate the empirical importance of compression for downstream success of pre-trained language models. We control the compression ability of several BPE tokenizers by varying the number of documents available during their training: from 1 million documents to a character-based tokenizer equivalent to no training data at all. We then pre-train English language models based on those tokenizers and fine-tune them over several tasks. We show that there is a correlation between tokenizers' compression and models' downstream performance, suggesting that compression is a reliable intrinsic indicator of tokenization quality. These correlations are more pronounced for generation tasks (over classification) or for smaller models (over large ones). We replicated a representative part of our experiments on Turkish and found similar results, confirming that our results hold for languages with typological characteristics dissimilar to English. We conclude that building better compressing tokenizers is a fruitful avenue for further research and for improving overall model performance.
Comment: EMNLP 2024, Findings
Published: 2024
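
To make the notion of tokenizer compression in the abstract above concrete, here is a minimal sketch that measures compression as characters per token over a small corpus. The metric definition, the function names (`compression_ratio`, `char_tokenize`, `whitespace_tokenize`), and the toy corpus are assumptions made for illustration; they are not taken from the paper, which works with trained BPE tokenizers.

```python
from typing import Callable, Iterable, List


def compression_ratio(texts: Iterable[str],
                      tokenize: Callable[[str], List[str]]) -> float:
    """Average number of characters covered by one token (higher = better compression)."""
    total_chars = 0
    total_tokens = 0
    for text in texts:
        total_chars += len(text)
        total_tokens += len(tokenize(text))
    if total_tokens == 0:
        raise ValueError("corpus produced no tokens")
    return total_chars / total_tokens


def char_tokenize(text: str) -> List[str]:
    # Character-level baseline: one token per character, i.e. no compression.
    return list(text)


def whitespace_tokenize(text: str) -> List[str]:
    # Hypothetical stand-in for a trained subword tokenizer.
    return text.split()


corpus = ["the cat sat", "the dog sat down"]
char_ratio = compression_ratio(corpus, char_tokenize)
word_ratio = compression_ratio(corpus, whitespace_tokenize)
print(f"character tokenizer:  {char_ratio:.2f} chars/token")
print(f"whitespace tokenizer: {word_ratio:.2f} chars/token")
```

A character-level tokenizer always scores exactly 1.0 under this definition, which makes it a natural lower bound when comparing tokenizers.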

3. Explicit Morphological Knowledge Improves Pre-training of Language Models for Hebrew

Author: Gueta, Eylon, Goldman, Omer, and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language
Abstract: Pre-trained language models (PLMs) have shown remarkable successes in acquiring a wide range of linguistic knowledge, relying solely on self-supervised training on text streams. Nevertheless, the effectiveness of this language-agnostic approach has been frequently questioned for its sub-optimal performance when applied to morphologically-rich languages (MRLs). We investigate the hypothesis that incorporating explicit morphological knowledge in the pre-training phase can improve the performance of PLMs for MRLs. We propose various morphologically driven tokenization methods enabling the model to leverage morphological cues beyond raw text. We pre-train multiple language models utilizing the different methods and evaluate them on Hebrew, a language with complex and highly ambiguous morphology. Our experiments show that morphologically driven tokenization demonstrates improved results compared to a standard language-agnostic tokenization, on a benchmark of both semantic and morphologic tasks. These findings suggest that incorporating morphological knowledge holds the potential for further improving PLMs for morphologically rich languages.
Published: 2023

4. Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces

Author: Levy, Tal, Goldman, Omer, and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The ability to identify and control different kinds of linguistic information encoded in vector representations of words has many use cases, especially for explainability and bias removal. This is usually done via a set of simple classification tasks, termed probes, to evaluate the information encoded in the embedding space. However, the involvement of a trainable classifier leads to entanglement between the probe's results and the classifier's nature. As a result, contemporary works on probing include tasks that do not involve training of auxiliary models. In this work we introduce the term indicator tasks for non-trainable tasks which are used to query embedding spaces for the existence of certain properties, and claim that this kind of task may point to a direction opposite to probes, and that this contradiction complicates the decision on whether a property exists in an embedding space. We demonstrate our claims with two test cases, one dealing with gender debiasing and another with the erasure of morphological information from embedding spaces. We show that the application of a suitable indicator provides a more accurate picture of the information captured and removed compared to probes. We thus conclude that indicator tasks should be implemented and taken into consideration when eliciting information from embedded representations.
Comment: Findings of EMNLP 2023
Published: 2023

5. The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models

Author: Slobodkin, Aviv, Goldman, Omer, Caciularu, Avi, Dagan, Ido, and Ravfogel, Shauli
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have been shown to possess impressive capabilities, while also raising crucial concerns about the faithfulness of their responses. A primary issue arising in this context is the management of (un)answerable queries by LLMs, which often results in hallucinatory behavior due to overconfidence. In this paper, we explore the behavior of LLMs when presented with (un)answerable queries. We ask: do models represent the fact that the question is (un)answerable when generating a hallucinatory answer? Our results show strong indications that such models encode the answerability of an input query, with the representation of the first decoded token often being a strong indicator. These findings shed new light on the spatial organization within the latent representations of LLMs, unveiling previously unexplored facets of these models. Moreover, they pave the way for the development of improved decoding techniques with better adherence to factual generation, particularly in scenarios where query (un)answerability is a concern.
Comment: EMNLP 2023
Published: 2023

6. Morphological Inflection with Phonological Features

Author: Guriel, David, Goldman, Omer, and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language
Abstract: Recent years have brought great advances in solving morphological tasks, mostly due to powerful neural models applied to various tasks such as (re)inflection and analysis. Yet, such morphological tasks cannot be considered solved, especially when little training data is available or when generalizing to previously unseen lemmas. This work explores effects on performance obtained through various ways in which morphological models get access to subcharacter phonological features that are the targets of morphological processes. We design two methods to achieve this goal: one that leaves models as is but manipulates the data to include features instead of characters, and another that manipulates models to take phonological features into account when building representations for phonemes. We elicit phonemic data from standard graphemic data using language-specific grammars for languages with shallow grapheme-to-phoneme mapping, and we experiment with two reinflection models over eight languages. Our results show that our methods yield comparable results to the grapheme-based baseline overall, with minor improvements in some of the languages. All in all, we conclude that patterns in character distributions are likely to allow models to infer the underlying phonological characteristics, even when phonemes are not explicitly represented.
Comment: ACL 2023 main conference; 8 pages, 1 figure
Published: 2023

7. Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks

Author: Jacovi, Alon, Caciularu, Avi, Goldman, Omer, and Goldberg, Yoav
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Data contamination has become prevalent and challenging with the rise of models pretrained on large automatically-crawled corpora. For closed models, the training data becomes a trade secret, and even for open models, it is not trivial to detect contamination. Strategies such as leaderboards with hidden answers, or using test data which is guaranteed to be unseen, are expensive and become fragile with time. Assuming that all relevant actors value clean test data and will cooperate to mitigate data contamination, what can be done? We propose three strategies that can make a difference: (1) Test data made public should be encrypted with a public key and licensed to disallow derivative distribution; (2) demand training exclusion controls from closed API holders, and protect your test data by refusing to evaluate without them; (3) avoid data which appears with its solution on the internet, and release the web-page context of internet-derived data along with the data. These strategies are practical and can be effective in preventing data contamination.
Comment: Accepted to EMNLP 2023
Published: 2023

8. UniMorph 4.0: Universal Morphology

Author: Batsuren, Khuyagbaatar, Goldman, Omer, Khalifa, Salam, Habash, Nizar, Kieraś, Witold, Bella, Gábor, Leonard, Brian, Nicolai, Garrett, Gorman, Kyle, Ate, Yustinus Ghanggo, Ryskina, Maria, Mielke, Sabrina J., Budianskaya, Elena, El-Khaissi, Charbel, Pimentel, Tiago, Gasser, Michael, Lane, William, Raj, Mohit, Coler, Matt, Samame, Jaime Rafael Montoya, Camaiteri, Delio Siticonatzi, Sagot, Benoît, Rojas, Esaú Zumaeta, Francis, Didier López, Oncevay, Arturo, Bautista, Juan López, Villegas, Gema Celeste Silva, Hennigen, Lucas Torroba, Ek, Adam, Guriel, David, Dirix, Peter, Bernardy, Jean-Philippe, Scherbakov, Andrey, Bayyr-ool, Aziyana, Anastasopoulos, Antonios, Zariquiey, Roberto, Sheifer, Karina, Ganieva, Sofya, Cruz, Hilaria, Karahóǧa, Ritván, Markantonatou, Stella, Pavlidis, George, Plugaryov, Matvey, Klyachko, Elena, Salehi, Ali, Angulo, Candy, Baxi, Jatayu, Krizhanovsky, Andrew, Krizhanovskaya, Natalia, Salesky, Elizabeth, Vania, Clara, Ivanova, Sardana, White, Jennifer, Maudslay, Rowan Hall, Valvoda, Josef, Zmigrod, Ran, Czarnowska, Paula, Nikkarinen, Irene, Salchak, Aelita, Bhatt, Brijesh, Straughn, Christopher, Liu, Zoey, Washington, Jonathan North, Pinter, Yuval, Ataman, Duygu, Wolinski, Marcin, Suhardijanto, Totok, Yablonskaya, Anna, Stoehr, Niklas, Dolatian, Hossep, Nuriah, Zahroh, Ratan, Shyam, Tyers, Francis M., Ponti, Edoardo M., Aiton, Grant, Arora, Aryaman, Hatcher, Richard J., Kumar, Ritesh, Young, Jeremiah, Rodionova, Daria, Yemelina, Anastasia, Andrushko, Taras, Marchenko, Igor, Mashkovtseva, Polina, Serova, Alexandra, Prud'hommeaux, Emily, Nepomniashchaya, Maria, Giunchiglia, Fausto, Chodroff, Eleanor, Hulden, Mans, Silfverberg, Miikka, McCarthy, Arya D., Yarowsky, David, Cotterell, Ryan, Tsarfaty, Reut, and Vylomova, Ekaterina
Subjects: Computer Science - Computation and Language
Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.
Comment: LREC 2022; The first two authors made equal contributions
Published: 2022

9. Morphological Reinflection with Multiple Arguments: An Extended Annotation schema and a Georgian Case Study

Author: Guriel, David, Goldman, Omer, and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language
Abstract: In recent years, a flurry of morphological datasets has emerged, most notably UniMorph, a multi-lingual repository of inflection tables. However, the flat structure of the current morphological annotation schema makes the treatment of some languages quirky, if not impossible, specifically in cases of polypersonal agreement, where verbs agree with multiple arguments using true affixes. In this paper, we propose to address this phenomenon by expanding the UniMorph annotation schema to a hierarchical feature structure that naturally accommodates complex argument marking. We apply this extended schema to one such language, Georgian, and provide a human-verified, accurate and balanced morphological dataset for Georgian verbs. The dataset has 4 times more tables and 6 times more verb forms compared to the existing UniMorph dataset, covering all possible variants of argument marking, demonstrating the adequacy of our proposed scheme. Experiments with a standard reinflection model show that generalization is easy when the data is split at the form level, but extremely hard when splitting along lemma lines. Expanding the other languages in UniMorph to this schema is expected to improve the coverage, consistency and interpretability of this benchmark.
Comment: ACL 2022
Published: 2022

10. Morphology Without Borders: Clause-Level Morphology

Author: Goldman, Omer and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language
Abstract: Morphological tasks use large multi-lingual datasets that organize words into inflection tables, which then serve as training and evaluation data for various tasks. However, a closer inspection of these data reveals profound cross-linguistic inconsistencies that arise from the lack of a clear linguistic and operational definition of what is a word, and that severely impair the universality of the derived tasks. To overcome this deficiency, we propose to view morphology as a clause-level phenomenon, rather than word-level. It is anchored in a fixed yet inclusive set of features that encapsulates all functions realized in a saturated clause. We deliver MightyMorph, a novel dataset for clause-level morphology covering 4 typologically-different languages: English, German, Turkish and Hebrew. We use this dataset to derive 3 clause-level morphological tasks: inflection, reinflection and analysis. Our experiments show that the clause-level tasks are substantially harder than the respective word-level tasks, while having comparable complexity across languages. Furthermore, redefining morphology to the clause-level provides a neat interface with contextualized language models (LMs) and allows assessing the morphological knowledge encoded in these models and their usability for morphological tasks. Taken together, this work opens up new horizons in the study of computational morphology, leaving ample space for studying neural morphology cross-linguistically.
Comment: To appear in TACL
Published: 2022

11. (Un)solving Morphological Inflection: Lemma Overlap Artificially Inflates Models' Performance

Author: Goldman, Omer, Guriel, David, and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language
Abstract: In the domain of Morphology, Inflection is a fundamental and important task that gained a lot of traction in recent years, mostly via SIGMORPHON's shared tasks. With average accuracy above 0.9 over the scores of all languages, the task is considered mostly solved using relatively generic neural seq2seq models, even with little data provided. In this work, we propose to re-evaluate morphological inflection models by employing harder train-test splits that will challenge the generalization capacity of the models. In particular, as opposed to the naïve split-by-form, we propose a split-by-lemma method to challenge the performance on existing benchmarks. Our experiments with the three top-ranked systems on SIGMORPHON's 2020 shared task show that the lemma-split presents an average drop of 30 percentage points in macro-average for the 90 languages included. The effect is most significant for low-resourced languages with a drop as high as 95 points, but even high-resourced languages lose about 10 points on average. Our results clearly show that generalizing inflection to unseen lemmas is far from being solved, presenting a simple yet effective means to promote more sophisticated models.
Comment: ACL 2022
Published: 2021
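
The difference between the two splitting strategies named in the abstract above can be sketched as follows. This is an illustrative sketch only: the function names, the `test_fraction` and `seed` parameters, and the toy English data with UniMorph-style tags are assumptions, not the authors' code or data.

```python
import random
from typing import List, Tuple

# (lemma, feature tag, inflected form); the tags are illustrative only.
Example = Tuple[str, str, str]


def split_by_form(examples: List[Example], test_fraction: float = 0.25,
                  seed: int = 0) -> Tuple[List[Example], List[Example]]:
    """Naive split: shuffle individual forms, so a lemma may appear on both sides."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def split_by_lemma(examples: List[Example], test_fraction: float = 0.25,
                   seed: int = 0) -> Tuple[List[Example], List[Example]]:
    """Harder split: hold out whole lemmas, so test lemmas are unseen in training."""
    lemmas = sorted({lemma for lemma, _, _ in examples})
    random.Random(seed).shuffle(lemmas)
    n_test = max(1, int(len(lemmas) * test_fraction))
    held_out = set(lemmas[:n_test])
    train = [ex for ex in examples if ex[0] not in held_out]
    test = [ex for ex in examples if ex[0] in held_out]
    return train, test


data: List[Example] = [
    (lemma, tag, lemma + suffix)
    for lemma in ("walk", "jump", "talk", "play")
    for tag, suffix in (("V;PST", "ed"), ("V;PRS;3;SG", "s"), ("V;V.PTCP;PRS", "ing"))
]

train, test = split_by_lemma(data)
train_lemmas = {ex[0] for ex in train}
test_lemmas = {ex[0] for ex in test}
print("lemma overlap under split_by_lemma:", train_lemmas & test_lemmas)
```

Under `split_by_lemma` the overlap set is empty by construction, whereas `split_by_form` gives no such guarantee, which is the source of the inflated scores the abstract describes.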

12. Minimal Supervision for Morphological Inflection

Author: Goldman, Omer and Tsarfaty, Reut
Subjects: Computer Science - Computation and Language
Abstract: Neural models for the various flavours of morphological inflection tasks have proven to be extremely accurate given ample labeled data -- data that may be slow and costly to obtain. In this work we aim to overcome this annotation bottleneck by bootstrapping labeled data from a seed as little as five labeled paradigms, accompanied by a large bulk of unlabeled text. Our approach exploits different kinds of regularities in morphological systems in a two-phased setup, where word tagging based on analogies is followed by word pairing based on distances. We experiment with the Paradigm Cell Filling Problem over eight typologically different languages, and find that, in languages with relatively simple morphology, orthographic regularities on their own allow inflection models to achieve respectable accuracy. Combined orthographic and semantic regularities alleviate difficulties with particularly complex morpho-phonological systems. Our results suggest that hand-crafting many tagged examples might be an unnecessary effort. However, more work is needed in order to address rarely used forms.
Comment: EMNLP 2021
Published: 2021

13. Weakly-supervised Semantic Parsing with Abstract Examples

Author: Goldman, Omer, Latcinnik, Veronica, Naveh, Udi, Globerson, Amir, and Berant, Jonathan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Training semantic parsers from weak supervision (denotations) rather than strong supervision (programs) complicates training in two ways. First, a large search space of potential programs needs to be explored at training time to find a correct program. Second, spurious programs that accidentally lead to a correct denotation add noise to training. In this work we propose that in closed worlds with clear semantic types, one can substantially alleviate these problems by utilizing an abstract representation, where tokens in both the language utterance and program are lifted to an abstract form. We show that these abstractions can be defined with a handful of lexical rules and that they result in sharing between different examples that alleviates the difficulties in training. To test our approach, we develop the first semantic parser for CNLVR, a challenging visual reasoning dataset, where the search space is large and overcoming spuriousness is critical, because denotations are either TRUE or FALSE, and thus random programs are likely to lead to a correct denotation. Our method substantially improves performance, and reaches 82.5% accuracy, a 14.7% absolute accuracy improvement compared to the best reported accuracy so far.
Comment: CNLVR, NLVR. Accepted to ACL 2018
Published: 2017

14. Targeting purine synthesis in ASS1-expressing tumors enhances the response to immune checkpoint inhibitors

Author: Keshet, Rom, Lee, Joo Sang, Adler, Lital, Iraqi, Muhammed, Ariav, Yarden, Lim, Lisha Qiu Jin, Lerner, Shaul, Rabinovich, Shiran, Oren, Roni, Katzir, Rotem, Weiss Tishler, Hila, Stettner, Noa, Goldman, Omer, Landesman, Hadas, Galai, Sivan, Kuperman, Yael, Kuznetsov, Yuri, Brandis, Alexander, Mehlman, Tevi, Malitsky, Sergey, Itkin, Maxim, Koehler, S. Eleonore, Zhao, Yongmei, Talsania, Keyur, Shen, Tsai-wei, Peled, Nir, Ulitsky, Igor, Porgador, Angel, Ruppin, Eytan, and Erez, Ayelet
Published: 2020

15. Supplementary Table S3 from Early Infiltration of Innate Immune Cells to the Liver Depletes HNF4α and Promotes Extrahepatic Carcinogenesis

Author: Goldman, Omer, primary, Adler, Lital N., primary, Hajaj, Emma, primary, Croese, Tommaso, primary, Darzi, Naama, primary, Galai, Sivan, primary, Tishler, Hila, primary, Ariav, Yarden, primary, Lavie, Dor, primary, Fellus-Alyagor, Liat, primary, Oren, Roni, primary, Kuznetsov, Yuri, primary, David, Eyal, primary, Jaschek, Rami, primary, Stossel, Chani, primary, Singer, Oded, primary, Malitsky, Sergey, primary, Barak, Renana, primary, Seger, Rony, primary, Erez, Neta, primary, Amit, Ido, primary, Tanay, Amos, primary, Saada, Ann, primary, Golan, Talia, primary, Rubinek, Tamar, primary, Sang Lee, Joo, primary, Ben-Shachar, Shay, primary, Wolf, Ido, primary, and Erez, Ayelet, primary
Published: 2023

16. Data from Early Infiltration of Innate Immune Cells to the Liver Depletes HNF4α and Promotes Extrahepatic Carcinogenesis

Author: Goldman, Omer, primary, Adler, Lital N., primary, Hajaj, Emma, primary, Croese, Tommaso, primary, Darzi, Naama, primary, Galai, Sivan, primary, Tishler, Hila, primary, Ariav, Yarden, primary, Lavie, Dor, primary, Fellus-Alyagor, Liat, primary, Oren, Roni, primary, Kuznetsov, Yuri, primary, David, Eyal, primary, Jaschek, Rami, primary, Stossel, Chani, primary, Singer, Oded, primary, Malitsky, Sergey, primary, Barak, Renana, primary, Seger, Rony, primary, Erez, Neta, primary, Amit, Ido, primary, Tanay, Amos, primary, Saada, Ann, primary, Golan, Talia, primary, Rubinek, Tamar, primary, Sang Lee, Joo, primary, Ben-Shachar, Shay, primary, Wolf, Ido, primary, and Erez, Ayelet, primary
Published: 2023

17. Supplementary Figure S2 from Early Infiltration of Innate Immune Cells to the Liver Depletes HNF4α and Promotes Extrahepatic Carcinogenesis

Author: Goldman, Omer, primary, Adler, Lital N., primary, Hajaj, Emma, primary, Croese, Tommaso, primary, Darzi, Naama, primary, Galai, Sivan, primary, Tishler, Hila, primary, Ariav, Yarden, primary, Lavie, Dor, primary, Fellus-Alyagor, Liat, primary, Oren, Roni, primary, Kuznetsov, Yuri, primary, David, Eyal, primary, Jaschek, Rami, primary, Stossel, Chani, primary, Singer, Oded, primary, Malitsky, Sergey, primary, Barak, Renana, primary, Seger, Rony, primary, Erez, Neta, primary, Amit, Ido, primary, Tanay, Amos, primary, Saada, Ann, primary, Golan, Talia, primary, Rubinek, Tamar, primary, Sang Lee, Joo, primary, Ben-Shachar, Shay, primary, Wolf, Ido, primary, and Erez, Ayelet, primary
Published: 2023

18. Supplementary Extended Data 1 from Early Infiltration of Innate Immune Cells to the Liver Depletes HNF4α and Promotes Extrahepatic Carcinogenesis

Author: Goldman, Omer, primary, Adler, Lital N., primary, Hajaj, Emma, primary, Croese, Tommaso, primary, Darzi, Naama, primary, Galai, Sivan, primary, Tishler, Hila, primary, Ariav, Yarden, primary, Lavie, Dor, primary, Fellus-Alyagor, Liat, primary, Oren, Roni, primary, Kuznetsov, Yuri, primary, David, Eyal, primary, Jaschek, Rami, primary, Stossel, Chani, primary, Singer, Oded, primary, Malitsky, Sergey, primary, Barak, Renana, primary, Seger, Rony, primary, Erez, Neta, primary, Amit, Ido, primary, Tanay, Amos, primary, Saada, Ann, primary, Golan, Talia, primary, Rubinek, Tamar, primary, Sang Lee, Joo, primary, Ben-Shachar, Shay, primary, Wolf, Ido, primary, and Erez, Ayelet, primary
Published: 2023

19. Genome-wide SWAp-Tag yeast libraries for proteome exploration

Author: Weill, Uri, Yofe, Ido, Sass, Ehud, Stynen, Bram, Davidi, Dan, Natarajan, Janani, Ben-Menachem, Reut, Avihou, Zohar, Goldman, Omer, Harpaz, Nofar, Chuartzman, Silvia, Kniazev, Kiril, Knoblach, Barbara, Laborenz, Janina, Boos, Felix, Kowarzyk, Jacqueline, Ben-Dor, Shifra, Zalckvar, Einat, Herrmann, Johannes M., Rachubinski, Richard A., Pines, Ophry, Rapaport, Doron, Michnick, Stephen W., Levy, Emmanuel D., and Schuldiner, Maya
Published: 2018

20. Data from Acid-Induced Downregulation of ASS1 Contributes to the Maintenance of Intracellular pH in Cancer

Author: Silberman, Alon, primary, Goldman, Omer, primary, Boukobza Assayag, Odeya, primary, Jacob, Adi, primary, Rabinovich, Shiran, primary, Adler, Lital, primary, Lee, Joo Sang, primary, Keshet, Rom, primary, Sarver, Alona, primary, Frug, Julia, primary, Stettner, Noa, primary, Galai, Sivan, primary, Persi, Erez, primary, Halpern, Keren Bahar, primary, Zaltsman-Amir, Yehudit, primary, Pode-Shakked, Ben, primary, Eilam, Raya, primary, Anikster, Yair, primary, Nagamani, Sandesh C.S., primary, Ulitsky, Igor, primary, Ruppin, Eytan, primary, and Erez, Ayelet, primary
Published: 2023

21. Supplementary Data from Acid-Induced Downregulation of ASS1 Contributes to the Maintenance of Intracellular pH in Cancer

Author: Silberman, Alon, primary, Goldman, Omer, primary, Boukobza Assayag, Odeya, primary, Jacob, Adi, primary, Rabinovich, Shiran, primary, Adler, Lital, primary, Lee, Joo Sang, primary, Keshet, Rom, primary, Sarver, Alona, primary, Frug, Julia, primary, Stettner, Noa, primary, Galai, Sivan, primary, Persi, Erez, primary, Halpern, Keren Bahar, primary, Zaltsman-Amir, Yehudit, primary, Pode-Shakked, Ben, primary, Eilam, Raya, primary, Anikster, Yair, primary, Nagamani, Sandesh C.S., primary, Ulitsky, Igor, primary, Ruppin, Eytan, primary, and Erez, Ayelet, primary
Published: 2023

22. Early Infiltration of Innate Immune Cells to the Liver Depletes HNF4α and Promotes Extrahepatic Carcinogenesis

Author: Goldman, Omer, primary, Adler, Lital N., additional, Hajaj, Emma, additional, Croese, Tommaso, additional, Darzi, Naama, additional, Galai, Sivan, additional, Tishler, Hila, additional, Ariav, Yarden, additional, Lavie, Dor, additional, Fellus-Alyagor, Liat, additional, Oren, Roni, additional, Kuznetsov, Yuri, additional, David, Eyal, additional, Jaschek, Rami, additional, Stossel, Chani, additional, Singer, Oded, additional, Malitsky, Sergey, additional, Barak, Renana, additional, Seger, Rony, additional, Erez, Neta, additional, Amit, Ido, additional, Tanay, Amos, additional, Saada, Ann, additional, Golan, Talia, additional, Rubinek, Tamar, additional, Sang Lee, Joo, additional, Ben-Shachar, Shay, additional, Wolf, Ido, additional, and Erez, Ayelet, additional
Published: 2023

23. Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks

Author: Jacovi, Alon, primary, Caciularu, Avi, additional, Goldman, Omer, additional, and Goldberg, Yoav, additional
Published: 2023

24. Morphological Inflection with Phonological Features

Author: Guriel, David, primary, Goldman, Omer, additional, and Tsarfaty, Reut, additional
Published: 2023

25. Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces

Author: Levy, Tal, primary, Goldman, Omer, additional, and Tsarfaty, Reut, additional
Published: 2023

26. Findings of the 1st Shared Task on Multi-lingual Multi-task Information Retrieval at MRL 2023

Author: Tinner, Francesco, primary, Adelani, David Ifeoluwa, additional, Emezue, Chris, additional, Hajili, Mammad, additional, Goldman, Omer, additional, Adilazuarda, Muhammad Farid, additional, Al Kautsar, Muhammad Dehan, additional, Mirsaidova, Aziza, additional, Kural, Müge, additional, Massey, Dylan, additional, Chukwuneke, Chiamaka, additional, Mbonu, Chinedu, additional, Oloyede, Damilola Oluwaseun, additional, Olaleye, Kayode, additional, Atala, Jonathan, additional, Ajibade, Benjamin A., additional, Bassi, Saksham, additional, Aralikatte, Rahul, additional, Kim, Najoung, additional, and Ataman, Duygu, additional
Published: 2023

27. SIGMORPHON–UniMorph 2023 Shared Task 0: Typologically Diverse Morphological Inflection

Author: Goldman, Omer, primary, Batsuren, Khuyagbaatar, additional, Khalifa, Salam, additional, Arora, Aryaman, additional, Nicolai, Garrett, additional, Tsarfaty, Reut, additional, and Vylomova, Ekaterina, additional
Published: 2023

28. The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models

Author: Slobodkin, Aviv, primary, Goldman, Omer, additional, Caciularu, Avi, additional, Dagan, Ido, additional, and Ravfogel, Shauli, additional
Published: 2023

29. Author Correction: Genome-wide SWAp-Tag yeast libraries for proteome exploration

Author: Weill, Uri, Yofe, Ido, Sass, Ehud, Stynen, Bram, Davidi, Dan, Natarajan, Janani, Ben-Menachem, Reut, Avihou, Zohar, Goldman, Omer, Harpaz, Nofar, Chuartzman, Silvia, Kniazev, Kiril, Knoblach, Barbara, Laborenz, Janina, Boos, Felix, Kowarzyk, Jacqueline, Ben-Dor, Shifra, Zalckvar, Einat, Herrmann, Johannes M., Rachubinski, Richard A., Pines, Ophry, Rapaport, Doron, Michnick, Stephen W., Levy, Emmanuel D., and Schuldiner, Maya
Published: 2019
Full Text: View/download PDF

30. SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection

Author: Kodner, Jordan, Khalifa, Salam, Batsuren, Khuyagbaatar, Dolatian, Hossep, Cotterell, Ryan, Akkuş, Faruk, Anastasopoulos, Antonios, Andrushko, Taras, Arora, Aryaman, Atanalov, Nona, Bella, Gábor, Budianskaya, Elena, Ghanggo Ate, Yustinus, Goldman, Omer, Guriel, David, Guriel, Simon, Guriel-Agiashvili, Silvia, Kieraś, Witold, Krizhanovsky, Andrew, Krizhanovsky, Natalia, Marchenko, Igor, Markowska, Magdalena, Mashkovtseva, Polina, Nepommiashchaya, Maria, Rodionova, Daria, Sheifer, Karina, Serova, Alexandra, Yemelina, Anastasia, Young, Jeremiah, Vylomova, Ekaterina, Nicolai, Garrett, and Chodroff, Eleanor
Abstract: The 2022 SIGMORPHON–UniMorph shared task on large scale morphological inflection generation included a wide range of typologically diverse languages: 33 languages from 11 top-level language families: Arabic (Modern Standard), Assamese, Braj, Chukchi, Eastern Armenian, Evenki, Georgian, Gothic, Gujarati, Hebrew, Hungarian, Itelmen, Karelian, Kazakh, Ket, Khalkha Mongolian, Kholosi, Korean, Lamahalot, Low German, Ludic, Magahi, Middle Low German, Old English, Old High German, Old Norse, Polish, Pomak, Slovak, Turkish, Upper Sorbian, Veps, and Xibe. We emphasize generalization along different dimensions this year by evaluating test items with unseen lemmas and unseen features separately under small and large training conditions. Across the five submitted systems and two baselines, the prediction of inflections with unseen features proved challenging, with average performance decreased substantially from last year. This was true even for languages for which the forms were in principle predictable, which suggests that further work is needed in designing systems that capture the various types of generalization required for the world’s languages. Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, ISBN: 978-1-955917-82-7
Published: 2022

31. Morphological Reinflection with Multiple Arguments: An Extended Annotation schema and a Georgian Case Study

Author: Guriel, David, primary, Goldman, Omer, additional, and Tsarfaty, Reut, additional
Published: 2022
Full Text: View/download PDF

32. Morphology Without Borders: Clause-Level Morphology

Author: Goldman, Omer, primary and Tsarfaty, Reut, additional
Published: 2022
Full Text: View/download PDF

33. (Un)solving Morphological Inflection: Lemma Overlap Artificially Inflates Models’ Performance

Author: Goldman, Omer, primary, Guriel, David, additional, and Tsarfaty, Reut, additional
Published: 2022
Full Text: View/download PDF

34. The MRL 2022 Shared Task on Multilingual Clause-level Morphology

Author: Goldman, Omer, primary, Tinner, Francesco, additional, Gonen, Hila, additional, Muller, Benjamin, additional, Basmov, Victoria, additional, Kirimi, Shadrack, additional, Nishimwe, Lydia, additional, Sagot, Benoît, additional, Seddah, Djamé, additional, Tsarfaty, Reut, additional, and Ataman, Duygu, additional
Published: 2022
Full Text: View/download PDF

35. Well-Defined Morphology is Sentence-Level Morphology

Author: Goldman, Omer, primary and Tsarfaty, Reut, additional
Published: 2021
Full Text: View/download PDF

36. Minimal Supervision for Morphological Inflection

Author: Goldman, Omer, primary and Tsarfaty, Reut, additional
Published: 2021
Full Text: View/download PDF

37. SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages

Author: Pimentel, Tiago, primary, Ryskina, Maria, additional, Mielke, Sabrina J., additional, Wu, Shijie, additional, Chodroff, Eleanor, additional, Leonard, Brian, additional, Nicolai, Garrett, additional, Ghanggo Ate, Yustinus, additional, Khalifa, Salam, additional, Habash, Nizar, additional, El-Khaissi, Charbel, additional, Goldman, Omer, additional, Gasser, Michael, additional, Lane, William, additional, Coler, Matt, additional, Oncevay, Arturo, additional, Montoya Samame, Jaime Rafael, additional, Silva Villegas, Gema Celeste, additional, Ek, Adam, additional, Bernardy, Jean-Philippe, additional, Shcherbakov, Andrey, additional, Bayyr-ool, Aziyana, additional, Sheifer, Karina, additional, Ganieva, Sofya, additional, Plugaryov, Matvey, additional, Klyachko, Elena, additional, Salehi, Ali, additional, Krizhanovsky, Andrew, additional, Krizhanovsky, Natalia, additional, Vania, Clara, additional, Ivanova, Sardana, additional, Salchak, Aelita, additional, Straughn, Christopher, additional, Liu, Zoey, additional, Washington, Jonathan North, additional, Ataman, Duygu, additional, Kieraś, Witold, additional, Woliński, Marcin, additional, Suhardijanto, Totok, additional, Stoehr, Niklas, additional, Nuriah, Zahroh, additional, Ratan, Shyam, additional, Tyers, Francis M., additional, Ponti, Edoardo M., additional, Aiton, Grant, additional, Hatcher, Richard J., additional, Prud'hommeaux, Emily, additional, Kumar, Ritesh, additional, Hulden, Mans, additional, Barta, Botond, additional, Lakatos, Dorina, additional, Szolnok, Gábor, additional, Ács, Judit, additional, Raj, Mohit, additional, Yarowsky, David, additional, Cotterell, Ryan, additional, Ambridge, Ben, additional, and Vylomova, Ekaterina, additional
Published: 2021
Full Text: View/download PDF

38. Acid-Induced Downregulation of ASS1 Contributes to the Maintenance of Intracellular pH in Cancer

Author: Silberman, Alon, primary, Goldman, Omer, additional, Boukobza Assayag, Odeya, additional, Jacob, Adi, additional, Rabinovich, Shiran, additional, Adler, Lital, additional, Lee, Joo Sang, additional, Keshet, Rom, additional, Sarver, Alona, additional, Frug, Julia, additional, Stettner, Noa, additional, Galai, Sivan, additional, Persi, Erez, additional, Halpern, Keren Bahar, additional, Zaltsman-Amir, Yehudit, additional, Pode-Shakked, Ben, additional, Eilam, Raya, additional, Anikster, Yair, additional, Nagamani, Sandesh C.S., additional, Ulitsky, Igor, additional, Ruppin, Eytan, additional, and Erez, Ayelet, additional
Published: 2019
Full Text: View/download PDF

39. Weakly Supervised Semantic Parsing with Abstract Examples

Author: Goldman, Omer, primary, Latcinnik, Veronica, additional, Nave, Ehud, additional, Globerson, Amir, additional, and Berant, Jonathan, additional
Published: 2018
Full Text: View/download PDF

40. One library to make them all: streamlining the creation of yeast libraries via a SWAp-Tag strategy

Author: Yofe, Ido, primary, Weill, Uri, additional, Meurer, Matthias, additional, Chuartzman, Silvia, additional, Zalckvar, Einat, additional, Goldman, Omer, additional, Ben-Dor, Shifra, additional, Schütze, Conny, additional, Wiedemann, Nils, additional, Knop, Michael, additional, Khmelinskii, Anton, additional, and Schuldiner, Maya, additional
Published: 2016
Full Text: View/download PDF

41. A letter from Israel

Author: Goldman, Omer
Subjects: Militarism -- Personal narratives -- Social aspects, Israel-Arab conflicts -- Social aspects -- Personal narratives, International relations, Social aspects, Personal narratives
Abstract: My name is Omer Goldman. I am 19 years old. I am one of the Shministim. I need your help. I first went to prison on Sept. 23 and served [...]
Published: 2009
