Author: "Yanai A" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yanai A"' showing total 28,171 results

Start Over Author "Yanai A"

28,171 results on '"Yanai A"'

1. GRADE: Quantifying Sample Diversity in Text-to-Image Models

Author: Rassin, Royi, Slobodkin, Aviv, Ravfogel, Shauli, Elazar, Yanai, and Goldberg, Yoav
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-to-image (T2I) models are remarkable at generating realistic images based on textual descriptions. However, textual prompts are inherently underspecified: they do not specify all possible attributes of the required image. This raises two key questions: Do T2I models generate diverse outputs on underspecified prompts? How can we automatically measure diversity? We propose GRADE: Granular Attribute Diversity Evaluation, an automatic method for quantifying sample diversity. GRADE leverages the world knowledge embedded in large language models and visual question-answering systems to identify relevant concept-specific axes of diversity (e.g., ``shape'' and ``color'' for the concept ``cookie''). It then estimates frequency distributions of concepts and their attributes and quantifies diversity using (normalized) entropy. GRADE achieves over 90% human agreement while exhibiting weak correlation to commonly used diversity metrics. We use GRADE to measure the overall diversity of 12 T2I models using 400 concept-attribute pairs, revealing that all models display limited variation. Further, we find that these models often exhibit default behaviors, a phenomenon where the model consistently generates concepts with the same attributes (e.g., 98% of the cookies are round). Finally, we demonstrate that a key reason for low diversity is due to underspecified captions in training data. Our work proposes a modern, semantically-driven approach to measure sample diversity and highlights the stunning homogeneity in outputs by T2I models., Comment: For project page and code see https://royira.github.io/GRADE
Published: 2024

2. Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

Author: Miranda, Lester James V., Wang, Yizhong, Elazar, Yanai, Kumar, Sachin, Pyatkin, Valentina, Brahman, Faeze, Smith, Noah A., Hajishirzi, Hannaneh, and Dasigi, Pradeep
Subjects: Computer Science - Computation and Language
Abstract: Learning from human feedback has enabled the alignment of language models (LMs) with human preferences. However, directly collecting human preferences can be expensive, time-consuming, and can have high variance. An appealing alternative is to distill preferences from LMs as a source of synthetic annotations as they are more consistent, cheaper, and scale better than human annotation; however, they are also prone to biases and errors. In this work, we introduce a routing framework that combines inputs from humans and LMs to achieve better annotation quality, while reducing the total cost of human annotation. The crux of our approach is to identify preference instances that will benefit from human annotations. We formulate this as an optimization problem: given a preference dataset and an evaluation metric, we train a performance prediction model to predict a reward model's performance on an arbitrary combination of human and LM annotations and employ a routing strategy that selects a combination that maximizes predicted performance. We train the performance prediction model on MultiPref, a new preference dataset with 10K instances paired with human and LM labels. We show that the selected hybrid mixture of LM and direct human preferences using our routing framework achieves better reward model performance compared to using either one exclusively. We simulate selective human preference collection on three other datasets and show that our method generalizes well to all three. We analyze features from the routing model to identify characteristics of instances that can benefit from human feedback, e.g., prompts with a moderate safety concern or moderate intent complexity. We release the dataset, annotation platform, and source code used in this study to foster more efficient and accurate preference collection in the future., Comment: Code in https://github.com/allenai/hybrid-preferences, MultiPref dataset in https://huggingface.co/datasets/allenai/multipref, Updated related work
Published: 2024

3. How Many Van Goghs Does It Take to Van Gogh? Finding the Imitation Threshold

Author: Verma, Sahil, Rassin, Royi, Das, Arnav, Bhatt, Gantavya, Seshadri, Preethi, Shah, Chirag, Bilmes, Jeff, Hajishirzi, Hannaneh, and Elazar, Yanai
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-to-image models are trained using large datasets collected by scraping image-text pairs from the internet. These datasets often include private, copyrighted, and licensed material. Training models on such datasets enables them to generate images with such content, which might violate copyright laws and individual privacy. This phenomenon is termed imitation -- generation of images with content that has recognizable similarity to its training images. In this work we study the relationship between a concept's frequency in the training dataset and the ability of a model to imitate it. We seek to determine the point at which a model was trained on enough instances to imitate a concept -- the imitation threshold. We posit this question as a new problem: Finding the Imitation Threshold (FIT) and propose an efficient approach that estimates the imitation threshold without incurring the colossal cost of training multiple models from scratch. We experiment with two domains -- human faces and art styles -- for which we create four datasets, and evaluate three text-to-image models which were trained on two pretraining datasets. Our results reveal that the imitation threshold of these models is in the range of 200-600 images, depending on the domain and the model. The imitation threshold can provide an empirical basis for copyright violation claims and acts as a guiding principle for text-to-image model developers that aim to comply with copyright and privacy laws. We release the code and data at \url{https://github.com/vsahil/MIMETIC-2.git} and the project's website is hosted at \url{https://how-many-van-goghs-does-it-take.github.io}., Comment: Accepted at ATTRIB, RegML, and SafeGenAI workshops at NeurIPS 2024 and NLLP Workshop 2024
Published: 2024

4. A novel machine learning method to detect double-$\Lambda$ hypernuclear events in nuclear emulsions

Author: He, Yan, Drozd, Vasyl, Ekawa, Hiroyuki, Escrig, Samuel, Gao, Yiming, Kasagi, Ayumi, Liu, Enqiang, Muneem, Abdul, Nakagawa, Manami, Nakazawa, Kazuma, Rappold, Christophe, Saito, Nami, Saito, Takehiko R., Sugimoto, Shohei, Taki, Masato, Tanaka, Yoshiki K., Wang, He, Yanai, Ayari, Yoshida, Junya, and Zhang, Hongfei
Subjects: High Energy Physics - Experiment
Abstract: A novel method was developed to detect double-$\Lambda$ hypernuclear events in nuclear emulsions using machine learning techniques. The object detection model, the Mask R-CNN, was trained using images generated by Monte Carlo simulations, image processing, and image-style transformation based on generative adversarial networks. Despite being exclusively trained on $\prescript{6\ }{\Lambda\Lambda}{\rm{He}}$ events, the model achieved a detection efficiency of 93.8$\%$ for $\prescript{6\ }{\Lambda\Lambda}{\rm{He}}$ and 82.0$\%$ for $\prescript{5\ }{\Lambda\Lambda}{\rm{H}}$ events in the produced images. In addition, the model demonstrated its ability to detect the $\prescript{6\ }{\Lambda\Lambda}{\rm{He}}$ event named the Nagara event, which is the only uniquely identified double-$\Lambda$ hypernuclear event reported to date. It also exhibited a proper segmentation of the event topology. Furthermore, after analyzing 0.2$\%$ of the entire emulsion data from the J-PARC E07 experiment utilizing the developed approach, six new candidates for double-$\Lambda$ hypernuclear events were detected, suggesting that more than 2000 double-strangeness hypernuclear events were recorded in the entire dataset. This method is sufficiently effective for mining more latent double-$\Lambda$ hypernuclear events recorded in nuclear emulsion sheets by reducing the time required for manual visual inspection by a factor of five hundred.
Published: 2024

5. Data Contamination Report from the 2024 CONDA Shared Task

Author: Sainz, Oscar, García-Ferrero, Iker, Jacovi, Alon, Campos, Jon Ander, Elazar, Yanai, Agirre, Eneko, Goldberg, Yoav, Chen, Wei-Lin, Chim, Jenny, Choshen, Leshem, D'Amico-Wong, Luca, Dell, Melissa, Fan, Run-Ze, Golchin, Shahriar, Li, Yucheng, Liu, Pengfei, Pahwa, Bhavish, Prabhu, Ameya, Sharma, Suryansh, Silcock, Emily, Solonko, Kateryna, Stap, David, Surdeanu, Mihai, Tseng, Yu-Min, Udandarao, Vishaal, Wang, Zengzhi, Xu, Ruijie, and Yang, Jinglin
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: The 1st Workshop on Data Contamination (CONDA 2024) focuses on all relevant aspects of data contamination in natural language processing, where data contamination is understood as situations where evaluation data is included in pre-training corpora used to train large scale models, compromising evaluation results. The workshop fostered a shared task to collect evidence on data contamination in current available datasets and models. The goal of the shared task and associated database is to assist the community in understanding the extent of the problem and to assist researchers in avoiding reporting evaluation results on known contaminated resources. The shared task provides a structured, centralized public database for the collection of contamination evidence, open to contributions from the community via GitHub pool requests. This first compilation paper is based on 566 reported entries over 91 contaminated sources from a total of 23 contributors. The details of the individual contamination events are available in the platform. The platform continues to be online, open to contributions from the community., Comment: https://huggingface.co/spaces/CONDA-Workshop/Data-Contamination-Database
Published: 2024

6. Decoding BACnet Packets: A Large Language Model Approach for Packet Interpretation

Author: Sharma, Rashi, Okada, Hiroyuki, Oba, Tatsumi, Subramanian, Karthikk, Yanai, Naoto, and Pranata, Sugiri
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence
Abstract: The Industrial Control System (ICS) environment encompasses a wide range of intricate communication protocols, posing substantial challenges for Security Operations Center (SOC) analysts tasked with monitoring, interpreting, and addressing network activities and security incidents. Conventional monitoring tools and techniques often struggle to provide a clear understanding of the nature and intent of ICS-specific communications. To enhance comprehension, we propose a software solution powered by a Large Language Model (LLM). This solution currently focused on BACnet protocol, processes a packet file data and extracts context by using a mapping database, and contemporary context retrieval methods for Retrieval Augmented Generation (RAG). The processed packet information, combined with the extracted context, serves as input to the LLM, which generates a concise packet file summary for the user. The software delivers a clear, coherent, and easily understandable summary of network activities, enabling SOC analysts to better assess the current state of the control system., Comment: 12 pages
Published: 2024

7. Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

Author: Wang, Xinyi, Antoniades, Antonis, Elazar, Yanai, Amayuelas, Alfonso, Albalak, Alon, Zhang, Kexun, and Wang, William Yang
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The impressive capabilities of large language models (LLMs) have sparked debate over whether these models genuinely generalize to unseen tasks or predominantly rely on memorizing vast amounts of pretraining data. To explore this issue, we introduce an extended concept of memorization, distributional memorization, which measures the correlation between the LLM output probabilities and the pretraining data frequency. To effectively capture task-specific pretraining data frequency, we propose a novel task-gram language model, which is built by counting the co-occurrence of semantically related $n$-gram pairs from task inputs and outputs in the pretraining corpus. Using the Pythia models trained on the Pile dataset, we evaluate three distinct tasks: machine translation, factual question answering, and reasoning. Our findings reveal varying levels of memorization, with the strongest effect observed in factual question answering. Furthermore, while model performance improves across all tasks as LLM size increases, only factual question answering shows an increase in memorization, whereas machine translation and reasoning tasks exhibit greater generalization, producing more novel outputs. This study demonstrates that memorization plays a larger role in simpler, knowledge-intensive tasks, while generalization is the key for harder, reasoning-based tasks, providing a scalable method for analyzing large pretraining corpora in greater depth., Comment: full 10-page version
Published: 2024

8. Detection and Measurement of Syntactic Templates in Generated Text

Author: Shaib, Chantal, Elazar, Yanai, Li, Junyi Jessy, and Wallace, Byron C.
Subjects: Computer Science - Computation and Language
Abstract: Recent work on evaluating the diversity of text generated by LLMs has focused on word-level features. Here we offer an analysis of syntactic features to characterize general repetition in models, beyond frequent n-grams. Specifically, we define syntactic templates and show that models tend to produce templated text in downstream tasks at a higher rate than what is found in human-reference texts. We find that most (76%) templates in model-generated text can be found in pre-training data (compared to only 35% of human-authored text), and are not overwritten during fine-tuning processes such as RLHF. This connection to the pre-training data allows us to analyze syntactic templates in models where we do not have the pre-training data. We also find that templates as features are able to differentiate between models, tasks, and domains, and are useful for qualitatively evaluating common model constructions. Finally, we demonstrate the use of templates as a useful tool for analyzing style memorization of training data in LLMs., Comment: EMNLP 2024
Published: 2024

9. Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG

Author: Merrill, William, Smith, Noah A., and Elazar, Yanai
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: How novel are texts generated by language models (LMs) relative to their training corpora? In this work, we investigate the extent to which modern LMs generate $n$-grams from their training data, evaluating both (i) the probability LMs assign to complete training $n$-grams and (ii) $n$-novelty, the proportion of $n$-grams generated by an LM that did not appear in the training data (for arbitrarily large $n$). To enable arbitrary-length $n$-gram search over a corpus in constant time w.r.t. corpus size, we develop Rusty-DAWG, a novel search tool inspired by indexing of genomic data. We compare the novelty of LM-generated text to human-written text and explore factors that affect generation novelty, focusing on the Pythia models. We find that, for $n > 4$, LM-generated text is less novel than human-written text, though it is more novel for smaller $n$. Larger LMs and more constrained decoding strategies both decrease novelty. Finally, we show that LMs complete $n$-grams with lower loss if they are more frequent in the training data. Overall, our results reveal factors influencing the novelty of LM-generated text, and we release Rusty-DAWG to facilitate further pretraining data research., Comment: To appear at EMNLP 2024
Published: 2024

10. Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation

Author: Iluz, Bar, Elazar, Yanai, Yehudai, Asaf, and Stanovsky, Gabriel
Subjects: Computer Science - Computation and Language
Abstract: Most works on gender bias focus on intrinsic bias -- removing traces of information about a protected group from the model's internal representation. However, these works are often disconnected from the impact of such debiasing on downstream applications, which is the main motivation for debiasing in the first place. In this work, we systematically test how methods for intrinsic debiasing affect neural machine translation models, by measuring the extrinsic bias of such systems under different design choices. We highlight three challenges and mismatches between the debiasing techniques and their end-goal usage, including the choice of embeddings to debias, the mismatch between words and sub-word tokens debiasing, and the effect on different target languages. We find that these considerations have a significant impact on downstream performance and the success of debiasing.
Published: 2024

11. Study of light hypernuclei in Europe: The hypertriton and nnΛ puzzles

Author: Rappold C., Achenbach P., Alibrahim Alfaki H., Amjad F., Armstrong M., Behr K.-H., Benlliure J., Brencic Z., Dickel T., Drozd V., Dubey S., Ekawa H., Escrig S., Feijoo-Fontan M., Fujioka H., Gao Y., Geissel H., Goldenbaum F., Graña Gonzalez A., Haettner E., Harakeh M. N., He Y., Heggen H., Hornung C., Hubbard N., Itahashi K., Iwasaki M., Kalantar-Nayestanaki N., Kasagi A., Kavatsyuk M., Kazantseva E., Khreptak A., Kindler B., Knoebel R., Kollmus H., Kostyleva D., Kraft-Bermuth S., Kurz N., Liu E., Lommel B., Metag V., Minami S., Morrissey D.J., Moskal P., Mukha I., Muneem A., Nakagawa M., Nakazawa K., Nociforo C., Ong H.J., Pietri S., Pochodzalla J., Purushothaman S., Rocco E., Rodríguez-Sánchez J.L., Roy P., Ruber R., Saito T.R., Schadmand S., Scheidenberger C., Schwarz P., Sekiya R., Serdyuk V., Skurzok M., Streicher B., Suzuki K., Szczepanczyk B., Tanaka Y. K., Tang X., Tortorelli N., Vencelj M., Wang H., Weber T., Weick H., Will M., Wimmer K., Yamamoto A., Yanai A., Yoshida J., and Zhao J.
Subjects: Physics, QC1-999
Abstract: The current understanding of light hypernuclei, which are sub-atomic nuclei with strangeness, is being challenged and studied in detail by several European research groups and collaborations. In recent years, studies of hypernuclei using high-energy heavy ion beams have reported unexpected results on the three-body hypernuclear state 3ΛH, named the hypertriton. For some time, reports of a shorter lifetime and larger binding energy than what was previously accepted have created a puzzling situation for its theoretical description; this is known as the "hypertriton puzzle". With the inclusion of the most recent experimental measurements, the current status of the hypertriton puzzle is evolving. Additionally, the possible neutral bound state of a Λ hyperon with two neutrons, nnΛ, has raised questions about our understanding of the formation of light hypernuclei either in bound or resonance states. These results have initiated several ongoing experimental programs all over the world to study these three-body hypernuclear states precisely. We are studying these light hypernuclear states by employing heavy ion beams at 2AGeV on a fixed carbon target with the WASA detector system and the Fragment Separator (FRS) at GSI. The WASA-FRS experimental campaign was performed during the first quarter of 2022, and this paper presents a short overview of the campaign and how it seeks to tackle the hypertriton and nnΛ puzzles. Data analysis is ongoing, and several preliminary results will be reported.
Published: 2023
Full Text: View/download PDF

12. Tectal glioma: clinical, radiological, and pathological features, and the importance of molecular analysis

Author: Imoto, Ryoji, Otani, Yoshihiro, Fujii, Kentaro, Ishida, Joji, Hirano, Shuichiro, Kemmotsu, Naoya, Suruga, Yasuki, Mizuta, Ryo, Kegoya, Yasuhito, Inoue, Yohei, Umeda, Tsuyoshi, Hokama, Madoka, Washio, Kana, Yanai, Hiroyuki, Tanaka, Shota, Satomi, Kaishi, Ichimura, Koichi, and Date, Isao
Published: 2024
Full Text: View/download PDF

13. DNA methylation drives hematopoietic stem cell aging phenotypes after proliferative stress

Author: Yanai, Hagai, McNeely, Taylor, Ayyar, Saipriya, Leone, Michael, Zong, Le, Park, Bongsoo, and Beerman, Isabel
Published: 2024
Full Text: View/download PDF

14. Effectiveness and safety of rituximab in severely relapsed antineutrophil cytoplasmic antibody–associated vasculitis: a retrospective analysis of a Japanese multicentre cohort from the J-CANVAS

Author: Kidoguchi, Genki, Yoshida, Yusuke, Watanabe, Hirofumi, Sugimoto, Tomohiro, Mokuda, Sho, Kida, Takashi, Yajima, Nobuyuki, Omura, Satoshi, Nakagomi, Daiki, Abe, Yoshiyuki, Kadoya, Masatoshi, Takizawa, Naoho, Nomura, Atsushi, Kukida, Yuji, Kondo, Naoya, Yamano, Yasuhiko, Yanagida, Takuya, Endo, Koji, Matsui, Kiyoshi, Takeuchi, Tohru, Ichinose, Kunihiro, Kato, Masaru, Yanai, Ryo, Matsuo, Yusuke, Shimojima, Yasuhiro, Nishioka, Ryo, Okazaki, Ryota, Takata, Tomoaki, Ito, Takafumi, Moriyama, Mayuko, Takatani, Ayuko, Miyawaki, Yoshia, Ito-Ihara, Toshiko, Kawaguchi, Takashi, Kawahito, Yutaka, and Hirata, Shintaro
Published: 2024
Full Text: View/download PDF

15. Association between social jetlag and weight and fat reduction in dieting

Author: Minabe, Kai, Shimura, Akiyoshi, Sugiura, Ko, Hino, Hiroko, Akatsuka, Yusaku, Seto, Takeshi, Yanai, Miho, Masuya, Jiro, Tamada, Yu, and Inoue, Takeshi
Published: 2024
Full Text: View/download PDF

16. Clinical epidemiology of the endoscopic, laparoscopic, and surgical resection of malignant gastric tumors in Japan, 2014–2021: a retrospective study using open data from a national claims database

Author: Sako, Akahito, Yada, Tomoyuki, Fujiya, Keiichi, Nakashima, Ryo, Yoshimura, Kensuke, Yanai, Hidekatsu, and Uemura, Naomi
Published: 2024
Full Text: View/download PDF

17. Comparison of surgical outcomes between robot-assisted and conventional laparoscopic nerve-sparing modified radical hysterectomy for deep endometriosis

Author: Kanno, Kiyoshi, Yanai, Shiori, Masuda, Sayaka, Ochi, Yoshifumi, Sawada, Mari, Sakate, Shintaro, and Andou, Masaaki
Published: 2024
Full Text: View/download PDF

18. Nitrogen and Phosphorus Addition Affect Soil Respiration in Northern Hardwood Forests

Author: Mann, T. A., Yanai, R. D., Fahey, T. J., and Reinmann, A. B.
Published: 2024
Full Text: View/download PDF

19. A mathematical simulation model to determine the optimal endoscopic screening strategy for detection of H. pylori-naïve gastric neoplasms

Author: Ishibashi, Fumiaki, Okusa, Kosuke, Tokai, Yoshitaka, Hirasawa, Toshiaki, Kawakami, Tomohiro, Mochida, Kentaro, Yanai, Yuka, Yokoi, Chizu, Hayashi, Yuko, Ozawa, Shun-ichiro, Uraushihara, Koji, Minato, Yohei, Nakanishi, Hiroyuki, Ueyama, Hiroya, Kataoka, Mikinori, Toyama, Yuzo, Mizokami, Yuji, and Suzuki, Sho
Published: 2024
Full Text: View/download PDF

20. Unique approach for precise determination of binding energies of hypernuclei with nuclear emulsion and machine learning

Author: Nakagawa Manami, Kasagi Ayumi, Liu Enqiang, Ekawa Hiroyuki, Yoshida Junya, Dou Wenbo, He Yan, Muneem Abdul, Nakazawa Kazuma, Rappold Christophe, Saito Nami, Saito Takehiko R., Sugimoto Shohei, Taki Masato, Tanaka Yoshiki K., Wang He, Gao Yiming, Yanai Ayari, and Yoshimoto Masahiro
Subjects: Physics, QC1-999
Abstract: Hypertriton is the lightest hypernucleus and a benchmark in hypernuclear physics. However, it has recently been suggested that its lifetime and binding energy values may differ from the established values. To solve this puzzle, it is necessary to measure both values with a higher precision. For the precise measurement of the binding energy, we are aiming at developing a novel technique to measure the hypertriton binding energy with unprecedented accuracy by combining nuclear emulsion data and machine learning techniques. The analysis will be based on the J-PARC E07 nuclear emulsion data. Furthermore, a machine-learning model is being developed to identify other single and double-strangeness hypernuclei.
Published: 2022
Full Text: View/download PDF

21. A Survey on Data Selection for Language Models

Author: Albalak, Alon, Elazar, Yanai, Xie, Sang Michael, Longpre, Shayne, Lambert, Nathan, Wang, Xinyi, Muennighoff, Niklas, Hou, Bairu, Pan, Liangming, Jeong, Haewon, Raffel, Colin, Chang, Shiyu, Hashimoto, Tatsunori, and Wang, William Yang
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available data may not be optimal (or feasible), as the quality of available text data can vary. Filtering out data can also decrease the carbon footprint and financial costs of training models by reducing the amount of training required. Data selection methods aim to determine which candidate data points to include in the training dataset and how to appropriately sample from the selected data points. The promise of improved data selection methods has caused the volume of research in the area to rapidly expand. However, because deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive, few organizations have the resources for extensive data selection research. Consequently, knowledge of effective data selection practices has become concentrated within a few organizations, many of which do not openly share their findings and methodologies. To narrow this gap in knowledge, we present a comprehensive review of existing literature on data selection methods and related research areas, providing a taxonomy of existing approaches. By describing the current landscape of research, this work aims to accelerate progress in data selection by establishing an entry point for new and established researchers. Additionally, throughout this review we draw attention to noticeable holes in the literature and conclude the paper by proposing promising avenues for future research., Comment: Paper list available at https://github.com/alon-albalak/data-selection-survey
Published: 2024

22. Calibrating Large Language Models with Sample Consistency

Author: Lyu, Qing, Shridhar, Kumar, Malaviya, Chaitanya, Zhang, Li, Elazar, Yanai, Tandon, Niket, Apidianaki, Marianna, Sachan, Mrinmaya, and Callison-Burch, Chris
Subjects: Computer Science - Computation and Language
Abstract: Accurately gauging the confidence level of Large Language Models' (LLMs) predictions is pivotal for their reliable application. However, LLMs are often uncalibrated inherently and elude conventional calibration techniques due to their proprietary nature and massive scale. In this work, we explore the potential of deriving confidence from the distribution of multiple randomly sampled model generations, via three measures of consistency. We perform an extensive evaluation across various open and closed-source models on nine reasoning datasets. Results show that consistency-based calibration methods outperform existing post-hoc approaches. Meanwhile, we find that factors such as intermediate explanations, model scaling, and larger sample sizes enhance calibration, while instruction-tuning makes calibration more difficult. Moreover, confidence scores obtained from consistency have the potential to enhance model performance. Finally, we offer practical guidance on choosing suitable consistency metrics for calibration, tailored to the characteristics of various LMs.
Published: 2024

23. OLMo: Accelerating the Science of Language Models

Author: Groeneveld, Dirk, Beltagy, Iz, Walsh, Pete, Bhagia, Akshita, Kinney, Rodney, Tafjord, Oyvind, Jha, Ananya Harsh, Ivison, Hamish, Magnusson, Ian, Wang, Yizhong, Arora, Shane, Atkinson, David, Authur, Russell, Chandu, Khyathi Raghavi, Cohan, Arman, Dumas, Jennifer, Elazar, Yanai, Gu, Yuling, Hessel, Jack, Khot, Tushar, Merrill, William, Morrison, Jacob, Muennighoff, Niklas, Naik, Aakanksha, Nam, Crystal, Peters, Matthew E., Pyatkin, Valentina, Ravichander, Abhilasha, Schwenk, Dustin, Shah, Saurabh, Smith, Will, Strubell, Emma, Subramani, Nishant, Wortsman, Mitchell, Dasigi, Pradeep, Lambert, Nathan, Richardson, Kyle, Zettlemoyer, Luke, Dodge, Jesse, Lo, Kyle, Soldaini, Luca, Smith, Noah A., and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language
Abstract: Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important details of their training data, architectures, and development undisclosed. Given the importance of these details in scientifically studying these models, including their biases and potential risks, we believe it is essential for the research community to have access to powerful, truly open LMs. To this end, we have built OLMo, a competitive, truly Open Language Model, to enable the scientific study of language models. Unlike most prior efforts that have only released model weights and inference code, we release OLMo alongside open training data and training and evaluation code. We hope this release will empower the open research community and inspire a new wave of innovation.
Published: 2024

24. Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Author: Soldaini, Luca, Kinney, Rodney, Bhagia, Akshita, Schwenk, Dustin, Atkinson, David, Authur, Russell, Bogin, Ben, Chandu, Khyathi, Dumas, Jennifer, Elazar, Yanai, Hofmann, Valentin, Jha, Ananya Harsh, Kumar, Sachin, Lucy, Li, Lyu, Xinxi, Lambert, Nathan, Magnusson, Ian, Morrison, Jacob, Muennighoff, Niklas, Naik, Aakanksha, Nam, Crystal, Peters, Matthew E., Ravichander, Abhilasha, Richardson, Kyle, Shen, Zejiang, Strubell, Emma, Subramani, Nishant, Tafjord, Oyvind, Walsh, Pete, Zettlemoyer, Luke, Smith, Noah A., Hajishirzi, Hannaneh, Beltagy, Iz, Groeneveld, Dirk, Dodge, Jesse, and Lo, Kyle
Subjects: Computer Science - Computation and Language
Abstract: Information about pretraining corpora used to train the current best-performing language models is seldom discussed: commercial models rarely detail their data, and even open models are often released without accompanying training data or recipes to reproduce them. As a result, it is challenging to conduct and advance scientific research on language modeling, such as understanding how training data impacts model capabilities and limitations. To facilitate scientific research on language model pretraining, we curate and release Dolma, a three-trillion-token English corpus, built from a diverse mixture of web content, scientific papers, code, public-domain books, social media, and encyclopedic materials. We extensively document Dolma, including its design principles, details about its construction, and a summary of its contents. We present analyses and experimental results on intermediate states of Dolma to share what we have learned about important data curation practices. Finally, we open-source our data curation toolkit to enable reproduction of our work as well as support further research in large-scale data curation., Comment: Accepted at ACL 2024; Dataset: https://hf.co/datasets/allenai/dolma; Code: https://github.com/allenai/dolma
Published: 2024

25. Paloma: A Benchmark for Evaluating Language Model Fit

Author: Magnusson, Ian, Bhagia, Akshita, Hofmann, Valentin, Soldaini, Luca, Jha, Ananya Harsh, Tafjord, Oyvind, Schwenk, Dustin, Walsh, Evan Pete, Elazar, Yanai, Lo, Kyle, Groeneveld, Dirk, Beltagy, Iz, Hajishirzi, Hannaneh, Smith, Noah A., Richardson, Kyle, and Dodge, Jesse
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Language models (LMs) commonly report perplexity on monolithic data held out from training. Implicitly or explicitly, this data is composed of domains$\unicode{x2013}$varying distributions of language. Rather than assuming perplexity on one distribution extrapolates to others, Perplexity Analysis for Language Model Assessment (Paloma), measures LM fit to 585 text domains, ranging from nytimes.com to r/depression on Reddit. We invite submissions to our benchmark and organize results by comparability based on compliance with guidelines such as removal of benchmark contamination from pretraining. Submissions can also record parameter and training token count to make comparisons of Pareto efficiency for performance as a function of these measures of cost. We populate our benchmark with results from 6 baselines pretrained on popular corpora. In case studies, we demonstrate analyses that are possible with Paloma, such as finding that pretraining without data beyond Common Crawl leads to inconsistent fit to many domains., Comment: Project Page: https://paloma.allen.ai/
Published: 2023

26. A case of protein-losing gastroenteropathy due to Sjögren’s syndrome detected by foggy vision with refractive error of the intraocular lens

Author: Watanabe, Naho, Kaneko, Rena, Kishi, Monami, Yanai, Ryo, Ikehara, Takashi, Nagai, Hidenari, and Matsuda, Takahisa
Published: 2024
Full Text: View/download PDF

27. A case of undifferentiated pleomorphic rectal sarcoma occurring after radiation exposure

Author: Eizuka, Makoto, Toya, Yosuke, Yamada, Shun, Oizumi, Tomofumi, Yanai, Shunichi, Kudara, Norihiko, Yanagawa, Naoki, Sugai, Tamotsu, and Matsumoto, Takayuki
Published: 2024
Full Text: View/download PDF

28. A diagnostic approach of various urethral diseases using multimodal imaging findings: comprehensive overview

Author: Kimura, Koichiro, Yamamoto, Tatsuya, Tsuchiya, Junichi, Yoshida, Soichiro, Yanai, Shuichi, Onishi, Iichiro, Fujii, Yasuhisa, and Tateishi, Ukihide
Published: 2024
Full Text: View/download PDF

29. Do Backdoors Assist Membership Inference Attacks?

Author: Goto, Yumeki, Ashizawa, Nami, Shibahara, Toshiki, Yanai, Naoto, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Duan, Haixin, editor, Debbabi, Mourad, editor, de Carné de Carnavalet, Xavier, editor, Luo, Xiapu, editor, Du, Xiaojiang, editor, and Au, Man Ho Allen, editor
Published: 2025
Full Text: View/download PDF

30. An Authentication Algorithm for Sets of Spatial Data Objects

Author: Li, Wenhao, Wang, Chengliang, Hu, Xiaobing, Zhou, Hongwen, Zeng, Hang, Wang, Yanai, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Duan, Haixin, editor, Debbabi, Mourad, editor, de Carné de Carnavalet, Xavier, editor, Luo, Xiapu, editor, Du, Xiaojiang, editor, and Au, Man Ho Allen, editor
Published: 2025
Full Text: View/download PDF

31. Association between comorbidities associated with diabetes and higher-level functional status in older patients with type 2 diabetes mellitus: a cross sectional study

Author: Shoji, Takuro, Kogure, Kenta, Toda, Nagisa, Hakoshima, Mariko, Katsuyama, Hisayuki, Yanai, Hidekatsu, Tokunaga, Satoshi, Tateoka, Korin, Tsuji, Taishi, and Okura, Tomohiro
Published: 2024
Full Text: View/download PDF

32. Modeling Group Discourse with Epistemic Network Analysis: Unpacking Connections, Perspectives, and Individual Contributions

Author: Goldfarb Cohen, Shai, Yanai, Johnatan Verissimo, and Dishon, Gideon
Published: 2024
Full Text: View/download PDF

33. Cellular adaptation to cancer therapy along a resistance continuum

Author: França, Gustavo S., Baron, Maayan, King, Benjamin R., Bossowski, Jozef P., Bjornberg, Alicia, Pour, Maayan, Rao, Anjali, Patel, Ayushi S., Misirlioglu, Selim, Barkley, Dalia, Tang, Kwan Ho, Dolgalev, Igor, Liberman, Deborah A., Avital, Gal, Kuperwaser, Felicia, Chiodin, Marta, Levine, Douglas A., Papagiannakopoulos, Thales, Marusyk, Andriy, Lionnet, Timothée, and Yanai, Itai
Published: 2024
Full Text: View/download PDF

34. Gauged compact Q-balls and Q-shells in a multi-component $CP^N$ model

Author: Klimas, P., Kubaski, L. C., Sawado, N., and Yanai, S.
Subjects: High Energy Physics - Theory, Mathematical Physics
Abstract: We study a multicomponent $CP^N$ model's scalar electrodynamics. The model contains Q-balls/shells, which are non-topological compact solitons with time dependency $e^{i\omega t}$. Two coupled $CP^N$ models can decouple locally if one of their $CP^N$ fields takes the vacuum value. Because of the compacton nature of solutions, Q-shells can shelter another compact Q-ball or Q-shell within their hollow region. Even if compactons do not overlap, they can interact through the electromagnetic field. We investigate how the size of multi-compacton formations is affected by electric charge. We are interested in structures with non-zero or zero total net charge., Comment: 23 pages, 8 figures
Published: 2023

35. Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals

Author: Elazar, Yanai, Paranjape, Bhargavi, Peng, Hao, Wiegreffe, Sarah, Raghavi, Khyathi, Srikumar, Vivek, Singh, Sameer, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: The inevitable appearance of spurious correlations in training datasets hurts the generalization of NLP models on unseen data. Previous work has found that datasets with paired inputs are prone to correlations between a specific part of the input (e.g., the hypothesis in NLI) and the label; consequently, models trained only on those outperform chance. Are these correlations picked up by models trained on the full input data? To address this question, we propose a new evaluation method, Counterfactual Attentiveness Test (CAT). CAT uses counterfactuals by replacing part of the input with its counterpart from a different example (subject to some restrictions), expecting an attentive model to change its prediction. Using CAT, we systematically investigate established supervised and in-context learning models on ten datasets spanning four tasks: natural language inference, reading comprehension, paraphrase detection, and visual & language reasoning. CAT reveals that reliance on such correlations is mainly data-dependent. Surprisingly, we find that GPT3 becomes less attentive with an increased number of demonstrations, while its accuracy on the test data improves. Our results demonstrate that augmenting training or demonstration data with counterfactuals is effective in improving models' attentiveness. We show that models' attentiveness measured by CAT reveals different conclusions from solely measuring correlations in data., Comment: Findings of EMNLP 2024
Published: 2023

36. What's In My Big Data?

Author: Elazar, Yanai, Bhagia, Akshita, Magnusson, Ian, Ravichander, Abhilasha, Schwenk, Dustin, Suhr, Alane, Walsh, Pete, Groeneveld, Dirk, Soldaini, Luca, Singh, Sameer, Hajishirzi, Hanna, Smith, Noah A., and Dodge, Jesse
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large text corpora are the backbone of language models. However, we have a limited understanding of the content of these corpora, including general statistics, quality, social factors, and inclusion of evaluation data (contamination). In this work, we propose What's In My Big Data? (WIMBD), a platform and a set of sixteen analyses that allow us to reveal and compare the contents of large text corpora. WIMBD builds on two basic capabilities -- count and search -- at scale, which allows us to analyze more than 35 terabytes on a standard compute node. We apply WIMBD to ten different corpora used to train popular language models, including C4, The Pile, and RedPajama. Our analysis uncovers several surprising and previously undocumented findings about these corpora, including the high prevalence of duplicate, synthetic, and low-quality content, personally identifiable information, toxic language, and benchmark contamination. For instance, we find that about 50% of the documents in RedPajama and LAION-2B-en are duplicates. In addition, several datasets used for benchmarking models trained on such corpora are contaminated with respect to important benchmarks, including the Winograd Schema Challenge and parts of GLUE and SuperGLUE. We open-source WIMBD's code and artifacts to provide a standard set of evaluations for new text-based corpora and to encourage more analyses and transparency around them., Comment: Published at ICLR 2024 spotlight
Published: 2023

37. A search for $\mu^+\to e^+\gamma$ with the first dataset of the MEG II experiment

Author: MEG II collaboration, Afanaciev, K., Baldini, A. M., Ban, S., Baranov, V., Benmansour, H., Biasotti, M., Boca, G., Cattaneo, P. W., Cavoto, G., Cei, F., Chiappini, M., Chiarello, G., Corvaglia, A., Cuna, F., Maso, G. Dal, De Bari, A., De Gerone, M., Barusso, L. Ferrari, Francesconi, M., Galli, L., Gallucci, G., Gatti, F., Gerritzen, L., Grancagnolo, F., Grandoni, E. G., Grassi, M., Grigoriev, D. N., Hildebrandt, M., Ieki, K., Ignatov, F., Ikeda, F., Iwamoto, T., Karpov, S., Kettle, P. -R., Khomutov, N., Kobayashi, S., Kolesnikov, A., Kravchuk, N., Krylov, V., Kuchinskiy, N., Kyle, W., Libeiro, T., Malyshev, V., Matsushita, A., Meucci, M., Mihara, S., Molzon, W., Mori, Toshinori, Nakao, M., Nicolò, D., Nishiguchi, H., Ochi, A., Ogawa, S., Onda, R., Ootani, W., Oya, A., Palo, D., Panareo, M., Papa, A., Pettinacci, V., Popov, A., Renga, F., Ritt, S., Rossella, M., Rozhdestvensky, A., Schwendimann, P., Shimada, K., Signorelli, G., Takahashi, M., Tassielli, G. F., Toyoda, K., Uchiyama, Y., Usami, M., Venturini, A., Vitali, B., Voena, C., Yamamoto, K., Yanai, K., Yonemoto, T., Yoshida, K., and Yudin, Yu. V.
Subjects: High Energy Physics - Experiment
Abstract: The MEG II experiment, based at the Paul Scherrer Institut in Switzerland, reports the result of a search for the decay $\mu^+\to e^+\gamma$ from data taken in the first physics run in 2021. No excess of events over the expected background is observed, yielding an upper limit on the branching ratio of B($\mu^+\to e^+\gamma$) < $7.5 \times 10^{-13}$ (90% C.L.). The combination of this result and the limit obtained by MEG gives B($\mu^+\to e^+\gamma$) < $3.1 \times 10^{-13}$ (90% C.L.), which is the most stringent limit to date. A ten-fold larger sample of data is being collected during the years 2022-2023, and data-taking will continue in the coming years., Comment: 10 pages, 6 figures. To be published in EPJC
Published: 2023

38. Operation and performance of MEG II detector

Author: MEG II Collaboration, Afanaciev, K., Baldini, A. M., Ban, S., Baranov, V., Benmansour, H., Biasotti, M., Boca, G., Cattaneo, P. W., Cavoto, G., Cei, F., Chiappini, M., Chiarello, G., Corvaglia, A., Cuna, F., Maso, G. Dal, De Bari, A., De Gerone, M., Barusso, L. Ferrari, Francesconi, M., Galli, L., Gallucci, G., Gatti, F., Gerritzen, L., Grancagnolo, F., Grandoni, E. G., Grassi, M., Grigoriev, D. N., Hildebrandt, M., Ieki, K., Ignatov, F., Ikeda, F., Iwamoto, T., Karpov, S., Kettle, P. -R., Khomutov, N., Kobayashi, S., Kolesnikov, A., Kravchuk, N., Krylov, V., Kuchinskiy, N., Kyle, W., Libeiro, T., Malyshev, V., Matsushita, A., Meucci, M., Mihara, S., Molzon, W., Mori, Toshinori, Morsani, F., Nakao, M., Nicolò, D., Nishiguchi, H., Ochi, A., Ogawa, S., Onda, R., Ootani, W., Oya, A., Palo, D., Panareo, M., Papa, A., Pettinacci, V., Popov, A., Raffaelli, F., Renga, F., Ritt, S., Rossella, M., Rozhdestvensky, A., Schwendimann, P., Shimada, K., Signorelli, G., Stoykov, A., Takahashi, M., Tassielli, G. F., Toyoda, K., Uchiyama, Y., Usami, M., Venturini, A., Vitali, B., Voena, C., Yamamoto, K., Yanai, K., Yonemoto, T., Yoshida, K., and Yudin, Yu. V.
Subjects: Physics - Instrumentation and Detectors, High Energy Physics - Experiment
Abstract: The MEG II experiment, located at the Paul Scherrer Institut (PSI) in Switzerland, is the successor to the MEG experiment, which completed data taking in 2013. MEG II started fully operational data taking in 2021, with the goal of improving the sensitivity of the mu+ -> e+ gamma decay down to 6e-14 almost an order of magnitude better than the current limit. In this paper, we describe the operation and performance of the experiment and give a new estimate of its sensitivity versus data acquisition time., Comment: 42 pages, 55 figures. Submitted to EPJC
Published: 2023

39. Grasping Psychological Evidence: Integrating Evidentiary Practices in Psychology Instruction

Author: Gideon Dishon, Sarit Barzilai, and Johnatan Verissimo Yanai
Abstract: The spread of misinformation has underscored the importance of cultivating citizens' competency to critically evaluate popular accounts of scientific evidence. Extending the prevailing emphasis on evidence in the natural sciences, we argue for fostering students' understanding of psychological evidence and its communication in the media. In this study, we illustrate how this goal can be advanced in undergraduate psychology instruction by actively engaging students in evidence evaluation and design. We employed the Grasp of Evidence framework to document students' evidence evaluation ideals and processes and how these changed over a course in which students engaged in a series of collaborative evidence evaluation and design tasks. Prior to instruction, students exhibited a mechanistic understanding of scientific methods, coupled with substantial reliance on personal experience. Following instruction, students demonstrated three key shifts in grasp of evidence: a shift in perception of the sources of psychological knowledge, a shift in views of scientific objectivity, and a shift in definitions of psychological phenomena. Analysis of students' collaborative discourse highlighted three design principles that supported increasingly complex understandings of psychological evidence: collaborative critique and redesign of flawed studies, engagement with diverse sources of popular evidence, and confronting elusive conceptual constructs.
Published: 2024
Full Text: View/download PDF

40. Absolute value measurement of ion-scale turbulence by two-dimensional phase contrast imaging in Large Helical Device

Author: Kinoshita, T., Tanaka, K., Sakai, H., Yanai, R., Nunami, M., and Michael, C. A.
Subjects: Physics - Plasma Physics, Physics - Applied Physics
Abstract: Absolute value measurements of turbulence amplitude in magnetically confined high-temperature plasmas can effectively explain turbulence-driven transport characteristics and their role in plasma confinements. Two-dimensional phase contrast imaging (2D-PCI) is a technique to evaluate the space-time spectrum of ion-scale electron density fluctuation. However, absolute value measurement of turbulence amplitude has not been conducted owing to the nonlinearity of the detector. In this study, the absolute measurement method proposed in the previous study is applied to turbulence measurement results in the large helical device. As a result, the localized turbulence amplitude at $n_e=1.5\times 10^{19}$m$^{-3}$ is approximately $3.5\times 10^{15}$m$^{-3}$, which is 0.02\% of the electron density. In addition, the evaluated poloidal wavenumber spectrum is almost consistent, within a certain error range, the spectrum being calculated using a nonlinear gyrokinetic simulation. This result is the first to the best of our knowledge to quantitatively evaluate turbulence amplitudes measured by 2D-PCI and compare with simulations.
Published: 2023

41. The Bias Amplification Paradox in Text-to-Image Generation

Author: Seshadri, Preethi, Singh, Sameer, and Elazar, Yanai
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computers and Society
Abstract: Bias amplification is a phenomenon in which models exacerbate biases or stereotypes present in the training data. In this paper, we study bias amplification in the text-to-image domain using Stable Diffusion by comparing gender ratios in training vs. generated images. We find that the model appears to amplify gender-occupation biases found in the training data (LAION) considerably. However, we discover that amplification can be largely attributed to discrepancies between training captions and model prompts. For example, an inherent difference is that captions from the training data often contain explicit gender information while our prompts do not, which leads to a distribution shift and consequently inflates bias measures. Once we account for distributional differences between texts used for training and generation when evaluating amplification, we observe that amplification decreases drastically. Our findings illustrate the challenges of comparing biases in models and their training data, and highlight confounding factors that impact analyses.
Published: 2023

42. Focusing on what to decode and what to train: Efficient Training with HOI Split Decoders and Specific Target Guided DeNoising

Author: Chen, Junwen, Wang, Yingcheng, and Yanai, Keiji
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent one-stage transformer-based methods achieve notable gains in the Human-object Interaction Detection (HOI) task by leveraging the detection of DETR. However, the current methods redirect the detection target of the object decoder, and the box target is not explicitly separated from the query embeddings, which leads to long and hard training. Furthermore, matching the predicted HOI instances with the ground-truth is more challenging than object detection, simply adapting training strategies from the object detection makes the training more difficult. To clear the ambiguity between human and object detection and share the prediction burden, we propose a novel one-stage framework (SOV), which consists of a subject decoder, an object decoder, and a verb decoder. Moreover, we propose a novel Specific Target Guided (STG) DeNoising training strategy, which leverages learnable object and verb label embeddings to guide the training and accelerate the training convergence. In addition, for the inference part, the label-specific information is directly fed into the decoders by initializing the query embeddings from the learnable label embeddings. Without additional features or prior language knowledge, our method (SOV-STG) achieves higher accuracy than the state-of-the-art method in one-third of training epochs. The code is available at this https://github.com/cjw2021/SOV-STG.
Published: 2023

43. Estimating the Causal Effect of Early ArXiving on Paper Acceptance

Author: Elazar, Yanai, Zhang, Jiayao, Wadden, David, Zhang, Bo, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: What is the effect of releasing a preprint of a paper before it is submitted for peer review? No randomized controlled trial has been conducted, so we turn to observational data to answer this question. We use data from the ICLR conference (2018--2022) and apply methods from causal inference to estimate the effect of arXiving a paper before the reviewing period (early arXiving) on its acceptance to the conference. Adjusting for confounders such as topic, authors, and quality, we may estimate the causal effect. However, since quality is a challenging construct to estimate, we use the negative outcome control method, using paper citation count as a control variable to debias the quality confounding effect. Our results suggest that early arXiving may have a small effect on a paper's chances of acceptance. However, this effect (when existing) does not differ significantly across different groups of authors, as grouped by author citation count and institute rank. This suggests that early arXiving does not provide an advantage to any particular group., Comment: Published at CLeaR 2024
Published: 2023

44. JABBERWOCK: A Tool for WebAssembly Dataset Generation and Its Application to Malicious Website Detection

Author: Komiya, Chika, Yanai, Naoto, Yamashita, Kyosuke, and Okamura, Shingo
Subjects: Computer Science - Cryptography and Security, Computer Science - Machine Learning
Abstract: Machine learning is often used for malicious website detection, but an approach incorporating WebAssembly as a feature has not been explored due to a limited number of samples, to the best of our knowledge. In this paper, we propose JABBERWOCK (JAvascript-Based Binary EncodeR by WebAssembly Optimization paCKer), a tool to generate WebAssembly datasets in a pseudo fashion via JavaScript. Loosely speaking, JABBERWOCK automatically gathers JavaScript code in the real world, convert them into WebAssembly, and then outputs vectors of the WebAssembly as samples for malicious website detection. We also conduct experimental evaluations of JABBERWOCK in terms of the processing time for dataset generation, comparison of the generated samples with actual WebAssembly samples gathered from the Internet, and an application for malicious website detection. Regarding the processing time, we show that JABBERWOCK can construct a dataset in 4.5 seconds per sample for any number of samples. Next, comparing 10,000 samples output by JABBERWOCK with 168 gathered WebAssembly samples, we believe that the generated samples by JABBERWOCK are similar to those in the real world. We then show that JABBERWOCK can provide malicious website detection with 99\% F1-score because JABBERWOCK makes a gap between benign and malicious samples as the reason for the above high score. We also confirm that JABBERWOCK can be combined with an existing malicious website detection tool to improve F1-scores. JABBERWOCK is publicly available via GitHub (https://github.com/c-chocolate/Jabberwock)., Comment: Accepted in DCDS 2023 (co-located in DSN 2023)
Published: 2023

45. An Empirical Study of Impact of Solidity Compiler Updates on Vulnerabilities in Ethereum Smart Contracts

Author: Kado, Chihiro, Yanai, Naoto, Cruz, Jason Paul, Yamashita, Kyosuke, and Okamura, Shingo
Subjects: Computer Science - Cryptography and Security, Computer Science - Software Engineering
Abstract: Vulnerabilities of Ethereum smart contracts often cause serious financial damage. Whereas the Solidity compiler has been updated to prevent vulnerabilities, its effectiveness has not been revealed so far, to the best of our knowledge. In this paper, we shed light on the impact of compiler versions of vulnerabilities of Ethereum smart contracts. To this end, we collected 503,572 contracts with Solidity source codes in the Ethereum blockchain and then analyzed their vulnerabilities. For three vulnerabilities with high severity, i.e., Locked Money, Using tx.origin, and Unchecked Call, we show that their appearance rates are decreased by virtue of major updates of the Solidity compiler. We then found the following four key insights. First, after the release of version 0.6, the appearance rate for Locked Money has decreased. Second, regardless of compiler updates, the appearance rate for Using tx.origin is significantly low. Third, although the appearance rate for Unchecked Call has decreased in version 0.8, it still remains high due to various factors, including code clones. Fourth, through analysis of code clones, our promising results show that the appearance rate for Unchecked Call can be further decreased by removing the code clones.
Published: 2023

46. Backdoor Attacks Leveraging Latent Representation in Competitive Learning

Author: Iwahana, Kazuki, Yanai, Naoto, Fujiwara, Toru, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Katsikas, Sokratis, editor, Abie, Habtamu, editor, Ranise, Silvio, editor, Verderame, Luca, editor, Cambiaso, Enrico, editor, Ugarelli, Rita, editor, Praça, Isabel, editor, Li, Wenjuan, editor, Meng, Weizhi, editor, Furnell, Steven, editor, Katt, Basel, editor, Pirbhulal, Sandeep, editor, Shukla, Ankur, editor, Ianni, Michele, editor, Dalla Preda, Mila, editor, Choo, Kim-Kwang Raymond, editor, Pupo Correia, Miguel, editor, Abhishta, Abhishta, editor, Sileno, Giovanni, editor, Alishahi, Mina, editor, Kalutarage, Harsha, editor, and Yanai, Naoto, editor
Published: 2024
Full Text: View/download PDF

47. Rectum necrosis in a patient with severe COVID19 infection after CAR-T therapy: a case report

Author: Saeki, Kiyoshi, Nakagama, Hidenobu, Tanaka, Yuichi, Goto, Yoshitaka, Kaneshiro, Kazuhisa, Kono, Hiroshi, Yanai, Kosuke, Yamamoto, Hirofumi, Yoneda, Reiko, Shimakawa, Takashi, and Ueki, Takashi
Published: 2024
Full Text: View/download PDF

48. Modulation of triplet quantum coherence by guest-induced structural changes in a flexible metal-organic framework

Author: Yamauchi, Akio, Fujiwara, Saiya, Kimizuka, Nobuo, Asada, Mizue, Fujiwara, Motoyasu, Nakamura, Toshikazu, Pirillo, Jenny, Hijikata, Yuh, and Yanai, Nobuhiro
Published: 2024
Full Text: View/download PDF

49. Baseline genetic abnormalities and effectiveness of osimertinib treatment in patients with chemotherapy-naïve EGFR-mutated NSCLC based on performance status

Author: Taniguchi, Yoshihiko, Tamiya, Akihiro, Osuga, Mitsuo, Harada, Daijiro, Isa, Shun-ichi, Nakamura, Keiichi, Mizumori, Yasuyuki, Shinohara, Tsutomu, Yanai, Hidetoshi, Nakatomi, Katsumi, Oki, Masahide, Mori, Masahide, Kuwako, Tomohito, Yamazaki, Koji, Tamura, Atsuhisa, Ando, Masahiko, and Koh, Yasuhiro
Published: 2024
Full Text: View/download PDF

50. Lymphangiogenesis in the liver of biliary atresia

Author: Kosaka, Seitaro, Muraji, Toshihiro, Ohtani, Haruo, Harumatsu, Toshio, Shimizu, Sakika, Toma, Miki, Yanai, Toshihiro, and Ieiri, Satoshi
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

28,171 results on '"Yanai A"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources