Author: "Aizawa A" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Aizawa A"' showing total 48,033 results

Start Over Author "Aizawa A"

48,033 results on '"Aizawa A"'

1. The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection

Author: Horych, Tomas, Mandl, Christoph, Ruas, Terry, Greiner-Petter, Andre, Gipp, Bela, Aizawa, Akiko, and Spinde, Timo
Subjects: Computer Science - Computation and Language
Abstract: High annotation costs from hiring or crowdsourcing complicate the creation of large, high-quality datasets needed for training reliable text classifiers. Recent research suggests using Large Language Models (LLMs) to automate the annotation process, reducing these costs while maintaining data quality. LLMs have shown promising results in annotating downstream tasks like hate speech detection and political framing. Building on the success in these areas, this study investigates whether LLMs are viable for annotating the complex task of media bias detection and whether a downstream media bias classifier can be trained on such data. We create annolexical, the first large-scale dataset for media bias classification with over 48000 synthetically annotated examples. Our classifier, fine-tuned on this dataset, surpasses all of the annotator LLMs by 5-9 percent in Matthews Correlation Coefficient (MCC) and performs close to or outperforms the model trained on human-labeled data when evaluated on two media bias benchmark datasets (BABE and BASIL). This study demonstrates how our approach significantly reduces the cost of dataset creation in the media bias domain and, by extension, the development of classifiers, while our subsequent behavioral stress-testing reveals some of its current limitations and trade-offs.
Published: 2024

2. Self-Compositional Data Augmentation for Scientific Keyphrase Generation

Author: Houbre, Mael, Boudin, Florian, Daille, Beatrice, and Aizawa, Akiko
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval
Abstract: State-of-the-art models for keyphrase generation require large amounts of training data to achieve good performance. However, obtaining keyphrase-labeled documents can be challenging and costly. To address this issue, we present a self-compositional data augmentation method. More specifically, we measure the relatedness of training documents based on their shared keyphrases, and combine similar documents to generate synthetic samples. The advantage of our method lies in its ability to create additional training samples that keep domain coherence, without relying on external data or resources. Our results on multiple datasets spanning three different domains, demonstrate that our method consistently improves keyphrase generation. A qualitative analysis of the generated keyphrases for the Computer Science domain confirms this improvement towards their representativity property., Comment: Accepted to JCDL 2024. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive version was published in the proceedings of the 2024 ACM/IEEE Joint Conference on Digital Libraries (JCDL 24) https://doi.org/10.1145/3677389.3702504
Published: 2024
Full Text: View/download PDF

3. JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation

Author: Onohara, Shota, Miyai, Atsuyuki, Imajuku, Yuki, Egashira, Kazuki, Baek, Jeonghun, Yue, Xiang, Neubig, Graham, and Aizawa, Kiyoharu
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Accelerating research on Large Multimodal Models (LMMs) in non-English languages is crucial for enhancing user experiences across broader populations. In this paper, we introduce JMMMU (Japanese MMMU), the first large-scale Japanese benchmark designed to evaluate LMMs on expert-level tasks based on the Japanese cultural context. To facilitate comprehensive culture-aware evaluation, JMMMU features two complementary subsets: (i) culture-agnostic (CA) subset, where the culture-independent subjects (e.g., Math) are selected and translated into Japanese, enabling one-to-one comparison with its English counterpart MMMU; and (ii) culture-specific (CS) subset, comprising newly crafted subjects that reflect Japanese cultural context. Using the CA subset, we observe performance drop in many LMMs when evaluated in Japanese, which is purely attributable to language variation. Using the CS subset, we reveal their inadequate Japanese cultural understanding. Further, by combining both subsets, we identify that some LMMs perform well on the CA subset but not on the CS subset, exposing a shallow understanding of the Japanese language that lacks depth in cultural understanding. We hope this work will not only help advance LMM performance in Japanese but also serve as a guideline to create high-standard, culturally diverse benchmarks for multilingual LMM development. The project page is https://mmmu-japanese-benchmark.github.io/JMMMU/., Comment: Project page: https://mmmu-japanese-benchmark.github.io/JMMMU/
Published: 2024

4. Universal weight systems from a minimal $\mathbb{Z}_2^2$-graded Lie algebra

Author: Aizawa, N. and Kimura, Daichi
Subjects: Mathematics - Geometric Topology
Abstract: Color Lie algebras, which were introduced by Ree, are a graded extension of Lie (super)algebras by an abelian group. We show that the color Lie algebras can be used to construct universal weight systems for knot invariants of of Vassiliev and Kontsevich. As a simple example, we take $\mathbb{Z}_2 \times \mathbb{Z}_2$ as the grading group and consider the four-dimensional color Lie algebra called $A1_{\epsilon}$. The weight system constructed from $A1_{\epsilon}$ is studied in some detail and some relations between the weights, such as the recurrence relation for chord diagrams, are derived. These relations show that the weight system from $A1_{\epsilon}$ is a hybrid of those from $sl(2)$ and $gl(1|1)$., Comment: 28 pages, many figures
Published: 2024

5. JASMINE image simulator for high-precision astrometry and photometry

Author: Kamizuka, Takafumi, Kawahara, Hajime, Ohsawa, Ryou, Kataza, Hirokazu, Kawata, Daisuke, Yamada, Yoshiyuki, Hirano, Teruyuki, Miyakawa, Kohei, Aizawa, Masataka, Omiya, Masashi, Yano, Taihei, Kano, Ryouhei, Wada, Takehiko, Löffler, Wolfgang, Biermann, Michael, Ramos, Pau, Isobe, Naoki, Usui, Fumihiko, Hattori, Kohei, Yoshioka, Satoshi, Tatekawa, Takayuki, Izumiura, Hideyuki, Fukui, Akihiko, Miyoshi, Makoto, Tatsumi, Daisuke, and Gouda, Naoteru
Subjects: Astrophysics - Instrumentation and Methods for Astrophysics
Abstract: JASMINE is a Japanese planned space mission that aims to reveal the formation history of our Galaxy and discover habitable exoEarths. For these objectives, the JASMINE satellite performs high-precision astrometric observations of the Galactic bulge and high-precision transit monitoring of M-dwarfs in the near-infrared (1.0-1.6 microns in wavelength). For feasibility studies, we develop an image simulation software named JASMINE-imagesim, which produces realistic observation images. This software takes into account various factors such as the optical point spread function (PSF), telescope jitter caused by the satellite's attitude control error (ACE), detector flat patterns, exposure timing differences between detector pixels, and various noise factors. As an example, we report a simulation for the feasibility study of astrometric observations using JASMINE-imagesim. The simulation confirms that the required position measurement accuracy of 4 mas for a single exposure of 12.5-mag objects is achievable if the telescope pointing jitter uniformly dilutes the PSF across all stars in the field of view. On the other hand, the simulation also demonstrates that the combination of realistic pointing jitter and exposure timing differences in the detector can significantly degrade accuracy and prevent achieving the requirement. This means that certain countermeasures against this issue must be developed. This result implies that this kind of simulation is important for mission planning and advanced developments to realize more realistic simulations help us to identify critical issues and also devise effective solutions., Comment: 13 pages, 7 figures, 1 table
Published: 2024
Full Text: View/download PDF

6. The Impact of Loanwords on the English-Japanese Version of Vocabulary Size Test

Author: Ayako Aizawa
Abstract: The Vocabulary Size Test (VST) measures English learners' decontextualised receptive vocabulary knowledge of written English and has nine bilingual versions with multiple-choice options written in other languages. This study used the English-Japanese version of the VST to investigate the extent to which loanword items were answered correctly by Japanese first language (L1) university students compared to non-loanword items, and whether it was easier to answer these loanword items when the correct answer option was written in loanwords rather than Japanese-words. Paired t-tests showed a significant difference in correct response rates between the loanword and non-loanword items, and the loanword options and Japanese-word options, with a large effect size. The results suggest the relative ease of learning English loanwords compared to non-loanwords for L1 Japanese users, and the need to consider the use of loanwords in vocabulary tests to measure test-takers' vocabulary size more accurately.
Published: 2024

7. FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe Generation

Author: Imajuku, Yuki, Yamakata, Yoko, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: Research on food image understanding using recipe data has been a long-standing focus due to the diversity and complexity of the data. Moreover, food is inextricably linked to people's lives, making it a vital research area for practical applications such as dietary management. Recent advancements in Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities, not only in their vast knowledge but also in their ability to handle languages naturally. While English is predominantly used, they can also support multiple languages including Japanese. This suggests that MLLMs are expected to significantly improve performance in food image understanding tasks. We fine-tuned open MLLMs LLaVA-1.5 and Phi-3 Vision on a Japanese recipe dataset and benchmarked their performance against the closed model GPT-4o. We then evaluated the content of generated recipes, including ingredients and cooking procedures, using 5,000 evaluation samples that comprehensively cover Japanese food culture. Our evaluation demonstrates that the open models trained on recipe data outperform GPT-4o, the current state-of-the-art model, in ingredient generation. Our model achieved F1 score of 0.531, surpassing GPT-4o's F1 score of 0.481, indicating a higher level of accuracy. Furthermore, our model exhibited comparable performance to GPT-4o in generating cooking procedure text., Comment: 14 pages, 5 figures
Published: 2024

8. JMedBench: A Benchmark for Evaluating Japanese Biomedical Large Language Models

Author: Jiang, Junfeng, Huang, Jiahao, and Aizawa, Akiko
Subjects: Computer Science - Computation and Language
Abstract: Recent developments in Japanese large language models (LLMs) primarily focus on general domains, with fewer advancements in Japanese biomedical LLMs. One obstacle is the absence of a comprehensive, large-scale benchmark for comparison. Furthermore, the resources for evaluating Japanese biomedical LLMs are insufficient. To advance this field, we propose a new benchmark including eight LLMs across four categories and 20 Japanese biomedical datasets across five tasks. Experimental results indicate that: (1) LLMs with a better understanding of Japanese and richer biomedical knowledge achieve better performance in Japanese biomedical tasks, (2) LLMs that are not mainly designed for Japanese biomedical domains can still perform unexpectedly well, and (3) there is still much room for improving the existing LLMs in certain Japanese biomedical tasks. Moreover, we offer insights that could further enhance development in this field. Our evaluation tools tailored to our benchmark as well as the datasets are publicly available in https://huggingface.co/datasets/Coldog2333/JMedBench to facilitate future research.
Published: 2024

9. Unsupervised Domain Adaptation for Keyphrase Generation using Citation Contexts

Author: Boudin, Florian and Aizawa, Akiko
Subjects: Computer Science - Computation and Language
Abstract: Adapting keyphrase generation models to new domains typically involves few-shot fine-tuning with in-domain labeled data. However, annotating documents with keyphrases is often prohibitively expensive and impractical, requiring expert annotators. This paper presents silk, an unsupervised method designed to address this issue by extracting silver-standard keyphrases from citation contexts to create synthetic labeled data for domain adaptation. Extensive experiments across three distinct domains demonstrate that our method yields high-quality synthetic samples, resulting in significant and consistent improvements in in-domain performance over strong baselines., Comment: Accepted at EMNLP 2024 Findings
Published: 2024

10. Affine extensions of $\mathbb{Z}_2^2$-graded $osp(1|2)$ and Virasoro algebra

Author: Aizawa, N. and Segar, J.
Subjects: Mathematical Physics
Abstract: It is known that there are two inequivalent $\mathbb{Z}_2^2$-graded $osp(1|2)$ Lie superalgebras. Their affine extensions are investigated and it is shown that one of them admits two central elements, one is non-graded and the other is $(1,1)$-graded. The affine $\mathbb{Z}_2^2$-$osp(1|2)$ algebras are used by the Sugawara construction to study possible $\mathbb{Z}_2^2$-graded extensions of the Virasoro algebra. We obtain a $\mathbb{Z}_2^2$-graded Virasoro algebra with a non-trivially graded central element. Throughout the investigation, invariant bilinear forms on $\mathbb{Z}_2^2$-graded superalgebras play a crucial role, so a theory of invariant bilinear forms is also developed., Comment: 21 pages, no figures, contribution to GROUP33/35 held in Cotonou, Benin, July 15 - 19, 2024
Published: 2024

11. Training-Free Sketch-Guided Diffusion with Latent Optimization

Author: Ding, Sandra Zhang, Mao, Jiafeng, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Based on recent advanced diffusion models, Text-to-image (T2I) generation models have demonstrated their capabilities in generating diverse and high-quality images. However, leveraging their potential for real-world content creation, particularly in providing users with precise control over the image generation result, poses a significant challenge. In this paper, we propose an innovative training-free pipeline that extends existing text-to-image generation models to incorporate a sketch as an additional condition. To generate new images with a layout and structure closely resembling the input sketch, we find that these core features of a sketch can be tracked with the cross-attention maps of diffusion models. We introduce latent optimization, a method that refines the noisy latent at each intermediate step of the generation process using cross-attention maps to ensure that the generated images closely adhere to the desired structure outlined in the reference sketch. Through latent optimization, our method enhances the fidelity and accuracy of image generation, offering users greater control and customization options in content creation.
Published: 2024

12. Investigating the Perception of Facial Anonymization Techniques in 360{\deg} Videos

Author: Wöhler, Leslie, Ikehata, Satoshi, and Aizawa, Kiyoharu
Subjects: Computer Science - Human-Computer Interaction
Abstract: In this work, we investigate facial anonymization techniques in 360{\deg} videos and assess their influence on the perceived realism, anonymization effect, and presence of participants. In comparison to traditional footage, 360{\deg} videos can convey engaging, immersive experiences that accurately represent the atmosphere of real-world locations. As the entire environment is captured simultaneously, it is necessary to anonymize the faces of bystanders in recordings of public spaces. Since this alters the video content, the perceived realism and immersion could be reduced. To understand these effects, we compare non-anonymized and anonymized 360{\deg} videos using blurring, black boxes, and face-swapping shown either on a regular screen or in a head-mounted display (HMD). Our results indicate significant differences in the perception of the anonymization techniques. We find that face-swapping is most realistic and least disruptive, however, participants raised concerns regarding the effectiveness of the anonymization. Furthermore, we observe that presence is affected by facial anonymization in HMD condition. Overall, the results underscore the need for facial anonymization techniques that balance both photo-realism and a sense of privacy.
Published: 2024

13. Probing electron trapping by current collapse in GaN/AlGaN FETs utilizing quantum transport characteristics

Author: Abe, Takaya, Shinozaki, Motoya, Matsumura, Kazuma, Aizawa, Takumi, Kumasaka, Takeshi, Ito, Norikazu, Tanaka, Taketoshi, Nakahara, Ken, and Otsuka, Tomohiro
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: GaN is expected to be a key material for next-generation electronics due to its interesting properties. However, the current collapse poses a challenge to the application of GaN FETs to electronic devices. In this study, we investigate the formation of quantum dots in GaN FETs under the current collapse. By comparing the Coulomb diamond between standard measurements and those under current collapse, we find that the gate capacitance is significantly decreased by the current collapse. This suggests that the current collapse changes the distribution of trapped electrons at the device surface, which is reported in the previous study by operando X-ray spectroscopy. Also, we show external control of quantum dot formation, previously challenging in an FET structure, by using current collapse., Comment: 11 pages, 4 figures
Published: 2024

14. Multi-dimensional optimisation of the scanning strategy for the LiteBIRD space mission

Author: Takase, Y., Vacher, L., Ishino, H., Patanchon, G., Montier, L., Stever, S. L., Ishizaka, K., Nagano, Y., Wang, W., Aumont, J., Aizawa, K., Anand, A., Baccigalupi, C., Ballardini, M., Banday, A. J., Barreiro, R. B., Bartolo, N., Basak, S., Bersanelli, M., Bortolami, M., Brinckmann, T., Calabrese, E., Campeti, P., Carinos, E., Carones, A., Casas, F. J., Cheung, K., Clermont, L., Columbro, F., Coppolecchia, A., Cuttaia, F., de Bernardis, P., de Haan, T., de la Hoz, E., Della Torre, S., Diego-Palazuelos, P., D'Alessandro, G., Eriksen, H. K., Errard, J., Finelli, F., Fuskeland, U., Galloni, G., Galloway, M., Gervasi, M., Ghigna, T., Giardiello, S., Gimeno-Amo, C., Gjerløw, E., González, R. González, Gruppuso, A., Hazumi, M., Henrot-Versillé, S., Hergt, L. T., Ikuma, K., Kohri, K., Lamagna, L., Lattanzi, M., Leloup, C., Lembo, M., Levrier, F., Lonappan, A. I., López-Caniego, M., Luzzi, G., Maffei, B., Martínez-González, E., Masi, S., Matarrese, S., Matsuda, F. T., Matsumura, T., Micheli, S., Migliaccio, M., Monelli, M., Morgante, G., Mot, B., Nagata, R., Namikawa, T., Novelli, A., Odagiri, K., Oguri, S., Omae, R., Pagano, L., Paoletti, D., Piacentini, F., Pinchera, M., Polenta, G., Porcelli, L., Raffuzzi, N., Remazeilles, M., Ritacco, A., Ruiz-Granda, M., Sakurai, Y., Scott, D., Sekimoto, Y., Shiraishi, M., Signorelli, G., Sullivan, R. M., Takakura, H., Terenzi, L., Tomasi, M., Tristram, M., van Tent, B., Vielva, P., Wehus, I. K., Westbrook, B., Weymann-Despres, G., Wollack, E. J., Zannoni, M., and Zhou, Y.
Subjects: Astrophysics - Instrumentation and Methods for Astrophysics, Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: Large angular scale surveys in the absence of atmosphere are essential for measuring the primordial $B$-mode power spectrum of the Cosmic Microwave Background (CMB). Since this proposed measurement is about three to four orders of magnitude fainter than the temperature anisotropies of the CMB, in-flight calibration of the instruments and active suppression of systematic effects are crucial. We investigate the effect of changing the parameters of the scanning strategy on the in-flight calibration effectiveness, the suppression of the systematic effects themselves, and the ability to distinguish systematic effects by null-tests. Next-generation missions such as LiteBIRD, modulated by a Half-Wave Plate (HWP), will be able to observe polarisation using a single detector, eliminating the need to combine several detectors to measure polarisation, as done in many previous experiments and hence avoiding the consequent systematic effects. While the HWP is expected to suppress many systematic effects, some of them will remain. We use an analytical approach to comprehensively address the mitigation of these systematic effects and identify the characteristics of scanning strategies that are the most effective for implementing a variety of calibration strategies in the multi-dimensional space of common spacecraft scan parameters. We also present Falcons, a fast spacecraft scanning simulator that we developed to investigate this scanning parameter space.
Published: 2024

15. An Encoding--Searching Separation Perspective on Bi-Encoder Neural Search

Author: Tran, Hung-Nghiep, Aizawa, Akiko, and Takasu, Atsuhiro
Subjects: Computer Science - Machine Learning, Computer Science - Information Retrieval
Abstract: This paper reviews, analyzes, and proposes a new perspective on the bi-encoder architecture for neural search. While the bi-encoder architecture is widely used due to its simplicity and scalability at test time, it has some notable issues such as low performance on seen datasets and weak zero-shot performance on new datasets. In this paper, we analyze these issues and summarize two main critiques: the encoding information bottleneck problem and limitations of the basic assumption of embedding search. We then construct a thought experiment to logically analyze the encoding and searching operations and challenge the basic assumption of embedding search. Building on these observations, we propose a new perspective on the bi-encoder architecture called the \textit{encoding--searching separation} perspective, which conceptually and practically separates the encoding and searching operations. This new perspective is applied to explain the root cause of the identified issues and discuss ways to mitigate the problems. Finally, we discuss the implications of the ideas underlying the new perspective, the design surface that it exposes and the potential research directions arising from it.
Published: 2024

16. Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey

Author: Miyai, Atsuyuki, Yang, Jingkang, Zhang, Jingyang, Ming, Yifei, Lin, Yueqian, Yu, Qing, Irie, Go, Joty, Shafiq, Li, Yixuan, Li, Hai, Liu, Ziwei, Yamasaki, Toshihiko, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Detecting out-of-distribution (OOD) samples is crucial for ensuring the safety of machine learning systems and has shaped the field of OOD detection. Meanwhile, several other problems are closely related to OOD detection, including anomaly detection (AD), novelty detection (ND), open set recognition (OSR), and outlier detection (OD). To unify these problems, a generalized OOD detection framework was proposed, taxonomically categorizing these five problems. However, Vision Language Models (VLMs) such as CLIP have significantly changed the paradigm and blurred the boundaries between these fields, again confusing researchers. In this survey, we first present a generalized OOD detection v2, encapsulating the evolution of AD, ND, OSR, OOD detection, and OD in the VLM era. Our framework reveals that, with some field inactivity and integration, the demanding challenges have become OOD detection and AD. In addition, we also highlight the significant shift in the definition, problem settings, and benchmarks; we thus feature a comprehensive review of the methodology for OOD detection, including the discussion over other related tasks to clarify their relationship to OOD detection. Finally, we explore the advancements in the emerging Large Vision Language Model (LVLM) era, such as GPT-4V. We conclude this survey with open challenges and future directions., Comment: survey paper. We welcome questions, issues, and paper requests via https://github.com/AtsuMiyai/Awesome-OOD-VLM
Published: 2024

17. MangaUB: A Manga Understanding Benchmark for Large Multimodal Models

Author: Ikuta, Hikaru, Wöhler, Leslie, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: Manga is a popular medium that combines stylized drawings and text to convey stories. As manga panels differ from natural images, computational systems traditionally had to be designed specifically for manga. Recently, the adaptive nature of modern large multimodal models (LMMs) shows possibilities for more general approaches. To provide an analysis of the current capability of LMMs for manga understanding tasks and identifying areas for their improvement, we design and evaluate MangaUB, a novel manga understanding benchmark for LMMs. MangaUB is designed to assess the recognition and understanding of content shown in a single panel as well as conveyed across multiple panels, allowing for a fine-grained analysis of a model's various capabilities required for manga understanding. Our results show strong performance on the recognition of image content, while understanding the emotion and information conveyed across multiple panels is still challenging, highlighting future work towards LMMs for manga understanding., Comment: This work has been submitted to the IEEE for possible publication
Published: 2024

18. LiteBIRD Science Goals and Forecasts. Mapping the Hot Gas in the Universe

Author: Remazeilles, M., Douspis, M., Rubiño-Martín, J. A., Banday, A. J., Chluba, J., de Bernardis, P., De Petris, M., Hernández-Monteagudo, C., Luzzi, G., Macias-Perez, J., Masi, S., Namikawa, T., Salvati, L., Tanimura, H., Aizawa, K., Anand, A., Aumont, J., Baccigalupi, C., Ballardini, M., Barreiro, R. B., Bartolo, N., Basak, S., Bersanelli, M., Blinov, D., Bortolami, M., Brinckmann, T., Calabrese, E., Campeti, P., Carinos, E., Carones, A., Casas, F. J., Cheung, K., Clermont, L., Columbro, F., Coppolecchia, A., Cuttaia, F., de Haan, T., de la Hoz, E., Della Torre, S., Diego-Palazuelos, P., D'Alessandro, G., Eriksen, H. K., Finelli, F., Fuskeland, U., Galloni, G., Galloway, M., Gervasi, M., Génova-Santos, R. T., Ghigna, T., Giardiello, S., Gimeno-Amo, C., Gjerløw, E., González, R. González, Gruppuso, A., Hazumi, M., Henrot-Versillé, S., Hergt, L. T., Herranz, D., Kohri, K., Komatsu, E., Lamagna, L., Lattanzi, M., Leloup, C., Levrier, F., Lonappan, A. I., López-Caniego, M., Maffei, B., Martínez-González, E., Matarrese, S., Matsumura, T., Micheli, S., Migliaccio, M., Monelli, M., Montier, L., Morgante, G., Nagano, Y., Nagata, R., Novelli, A., Omae, R., Pagano, L., Paoletti, D., Pavlidou, V., Piacentini, F., Pinchera, M., Polenta, G., Porcelli, L., Ritacco, A., Ruiz-Granda, M., Sakurai, Y., Scott, D., Shiraishi, M., Stever, S. L., Sullivan, R. M., Takase, Y., Tassis, K., Terenzi, L., Tomasi, M., Tristram, M., Vacher, L., van Tent, B., Vielva, P., Wehus, I. K., Westbrook, B., Weymann-Despres, G., Wollack, E. J., Zannoni, M., and Zhou, Y.
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: We assess the capabilities of the LiteBIRD mission to map the hot gas distribution in the Universe through the thermal Sunyaev-Zeldovich (SZ) effect. Our analysis relies on comprehensive simulations incorporating various sources of Galactic and extragalactic foreground emission, while accounting for specific instrumental characteristics of LiteBIRD, such as detector sensitivities, frequency-dependent beam convolution, inhomogeneous sky scanning, and $1/f$ noise. We implement a tailored component-separation pipeline to map the thermal SZ Compton $y$-parameter over 98% of the sky. Despite lower angular resolution for galaxy cluster science, LiteBIRD provides full-sky coverage and, compared to the Planck satellite, enhanced sensitivity, as well as more frequency bands to enable the construction of an all-sky $y$-map, with reduced foreground contamination at large and intermediate angular scales. By combining LiteBIRD and Planck channels in the component-separation pipeline, we obtain an optimal $y$-map that leverages the advantages of both experiments, with the higher angular resolution of the Planck channels enabling the recovery of compact clusters beyond the LiteBIRD beam limitations, and the numerous sensitive LiteBIRD channels further mitigating foregrounds. The added value of LiteBIRD is highlighted through the examination of maps, power spectra, and one-point statistics of the various sky components. After component separation, the $1/f$ noise from LiteBIRD is effectively mitigated below the thermal SZ signal at all multipoles. Cosmological constraints on $S_8=\sigma_8\left(\Omega_{\rm m}/0.3\right)^{0.5}$ obtained from the LiteBIRD-Planck combined $y$-map power spectrum exhibits a 15% reduction in uncertainty compared to constraints from Planck alone. This improvement can be attributed to the increased portion of uncontaminated sky available in the LiteBIRD-Planck combined $y$-map., Comment: 38 pages, 13 figures, abstract shortened. Updated to match version accepted by JCAP
Published: 2024

19. LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Author: LLM-jp, Aizawa, Akiko, Aramaki, Eiji, Chen, Bowen, Cheng, Fei, Deguchi, Hiroyuki, Enomoto, Rintaro, Fujii, Kazuki, Fukumoto, Kensuke, Fukushima, Takuya, Han, Namgi, Harada, Yuto, Hashimoto, Chikara, Hiraoka, Tatsuya, Hisada, Shohei, Hosokawa, Sosuke, Jie, Lu, Kamata, Keisuke, Kanazawa, Teruhito, Kanezashi, Hiroki, Kataoka, Hiroshi, Katsumata, Satoru, Kawahara, Daisuke, Kawano, Seiya, Keyaki, Atsushi, Kiryu, Keisuke, Kiyomaru, Hirokazu, Kodama, Takashi, Kubo, Takahiro, Kuga, Yohei, Kumon, Ryoma, Kurita, Shuhei, Kurohashi, Sadao, Li, Conglong, Maekawa, Taiki, Matsuda, Hiroshi, Miyao, Yusuke, Mizuki, Kentaro, Mizuki, Sakae, Murawaki, Yugo, Nakamura, Ryo, Nakamura, Taishi, Nakayama, Kouta, Nakazato, Tomoka, Niitsuma, Takuro, Nishitoba, Jiro, Oda, Yusuke, Ogawa, Hayato, Okamoto, Takumi, Okazaki, Naoaki, Oseki, Yohei, Ozaki, Shintaro, Ryu, Koki, Rzepka, Rafal, Sakaguchi, Keisuke, Sasaki, Shota, Sekine, Satoshi, Suda, Kohei, Sugawara, Saku, Sugiura, Issa, Sugiyama, Hiroaki, Suzuki, Hisami, Suzuki, Jun, Suzumura, Toyotaro, Tachibana, Kensuke, Takagi, Yu, Takami, Kyosuke, Takeda, Koichi, Takeshita, Masashi, Tanaka, Masahiro, Taura, Kenjiro, Tolmachev, Arseny, Ueda, Nobuhiro, Wan, Zhen, Yada, Shuntaro, Yahata, Sakiko, Yamamoto, Yuya, Yamauchi, Yusuke, Yanaka, Hitomi, Yokota, Rio, and Yoshino, Koichiro
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its activities, and technical reports on the LLMs developed by LLM-jp. For the latest activities, visit https://llm-jp.nii.ac.jp/en/.
Published: 2024

20. Classification of Carotid Plaque with Jellyfish Sign Through Convolutional and Recurrent Neural Networks Utilizing Plaque Surface Edges

Author: Yoshidomi, Takeshi, Kume, Shinji, Aizawa, Hiroaki, and Furui, Akira
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: In carotid arteries, plaque can develop as localized elevated lesions. The Jellyfish sign, marked by fluctuating plaque surfaces with blood flow pulsation, is a dynamic characteristic of these plaques that has recently attracted attention. Detecting this sign is vital, as it is often associated with cerebral infarction. This paper proposes an ultrasound video-based classification method for the Jellyfish sign, using deep neural networks. The proposed method first preprocesses carotid ultrasound videos to separate the movement of the vascular wall from plaque movements. These preprocessed videos are then combined with plaque surface information and fed into a deep learning model comprising convolutional and recurrent neural networks, enabling the efficient classification of the Jellyfish sign. The proposed method was verified using ultrasound video images from 200 patients. Ablation studies demonstrated the effectiveness of each component of the proposed method., Comment: 4 pages, 3 figures, accepted at IEEE EMBC 2024
Published: 2024

21. Revealing asymmetry on midplane of proto-planetary disc through modelling of axisymmetric emission: methodology

Author: Aizawa, Masataka, Muto, Takayuki, and Momose, Munetake
Subjects: Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Solar and Stellar Astrophysics
Abstract: This study proposes an analytical framework for deriving the surface brightness profile and geometry of a geometrically-thin axisymmetric disc from interferometric observation of continuum emission. Such precise modelling facilitates the exploration of faint non-axisymmetric structures, such as spirals and circumplanetary discs. As a demonstration, we simulate interferometric observations of geometrically-thin axisymmetric discs. The proposed method can reasonably recover the injected axisymmetric structures, whereas Gaussian fitting of the same data yielded larger errors in disc orientation estimation. To further test the applicability of the method, it was applied to the mock data for $m=1,2$ spirals and a point source, which are embedded in a bright axisymmetric structure. The injected non-axisymmetric structures were reasonably recovered except for the innermost parts, and the disc geometric parameter estimations were better than Gasussian fitting. The method was then applied to the real data of Elias 20 and AS 209, and it adequately subtracted the axisymmetric component, notably in Elias 20, where substantial residuals remained without our method. We also applied our method to continuum data of PDS 70 to demonstrate the effectiveness of the method. We successfully recovered emission from PDS 70 c consistently with previous studies, and also tentatively discovered new substructures. The current formulation can be applied to any data for disc continuum emission, and aids in the search of spirals and circumplanetary discs, whose detection is still limited., Comment: 31 pages, 23 figures, accepted for publication in MNRAS
Published: 2024

22. Integrable $\mathbb{Z}_2^2$-graded Extensions of the Liouville and Sinh-Gordon Theories

Author: Aizawa, Naruhiko, Ito, Ren, Kuznetsova, Zhanna, Tanaka, Toshiya, and Toppan, Francesco
Subjects: Mathematical Physics, High Energy Physics - Theory, Nonlinear Sciences - Exactly Solvable and Integrable Systems
Abstract: In this paper we present a general framework to construct integrable $\mathbb{Z}_2^2$-graded extensions of classical, two-dimensional Toda and conformal affine Toda theories. The scheme is applied to define the extended Liouville and Sinh-Gordon models; they are based on $\mathbb{Z}_2^2$-graded color Lie algebras and their fields satisfy a parabosonic statististics. The mathematical tools here introduced are the $\mathbb{Z}_2^2$-graded covariant extensions of the Lax pair formalism and of the Polyakov's soldering procedure. The $\mathbb{Z}_2^2$-graded Sinh-Gordon model is derived from an affine $\mathbb{Z}_2^2$-graded color Lie algebra, mimicking a procedure originally introduced by Babelon-Bonora to derive the ordinary Sinh-Gordon model. The color Lie algebras under considerations are: the $6$-generator $\mathbb{Z}_2^2$-graded $sl_2$, the $\mathbb{Z}_2^2$-graded affine ${\widehat{sl_2}}$ algebra with two central extensions, the $\mathbb{Z}_2^2$-graded Virasoro algebra obtained from a Hamiltonian reduction., Comment: 25 pages
Published: 2024

23. MoreHopQA: More Than Multi-hop Reasoning

Author: Schnitzler, Julian, Ho, Xanh, Huang, Jiahao, Boudin, Florian, Sugawara, Saku, and Aizawa, Akiko
Subjects: Computer Science - Computation and Language
Abstract: Most existing multi-hop datasets are extractive answer datasets, where the answers to the questions can be extracted directly from the provided context. This often leads models to use heuristics or shortcuts instead of performing true multi-hop reasoning. In this paper, we propose a new multi-hop dataset, MoreHopQA, which shifts from extractive to generative answers. Our dataset is created by utilizing three existing multi-hop datasets: HotpotQA, 2WikiMultihopQA, and MuSiQue. Instead of relying solely on factual reasoning, we enhance the existing multi-hop questions by adding another layer of questioning that involves one, two, or all three of the following types of reasoning: commonsense, arithmetic, and symbolic. Our dataset is created through a semi-automated process, resulting in a dataset with 1,118 samples that have undergone human verification. We then use our dataset to evaluate five different large language models: Mistral 7B, Gemma 7B, Llama 3 (8B and 70B), and GPT-4. We also design various cases to analyze the reasoning steps in the question-answering process. Our results show that models perform well on initial multi-hop questions but struggle with our extended questions, indicating that our dataset is more challenging than previous ones. Our analysis of question decomposition reveals that although models can correctly answer questions, only a portion - 38.7% for GPT-4 and 33.4% for Llama3-70B - achieve perfect reasoning, where all corresponding sub-questions are answered correctly. Evaluation code and data are available at https://github.com/Alab-NII/morehopqa, Comment: 8 pages, 5 figures. First three authors contributed equally
Published: 2024

24. The LiteBIRD mission to explore cosmic inflation

Author: Ghigna, T., Adler, A., Aizawa, K., Akamatsu, H., Akizawa, R., Allys, E., Anand, A., Aumont, J., Austermann, J., Azzoni, S., Baccigalupi, C., Ballardini, M., Banday, A. J., Barreiro, R. B., Bartolo, N., Basak, S., Basyrov, A., Beckman, S., Bersanelli, M., Bortolami, M., Bouchet, F., Brinckmann, T., Campeti, P., Carinos, E., Carones, A., Casas, F. J., Cheung, K., Chinone, Y., Clermont, L., Columbro, F., Coppolecchia, A., Curtis, D., de Bernardis, P., de Haan, T., de la Hoz, E., De Petris, M., Della Torre, S., Monache, G. Delle, Di Giorgi, E., Dickinson, C., Diego-Palazuelos, P., García, J. J. Díaz, Dobbs, M., Dotani, T., D'Alessandro, G., Eriksen, H. K., Errard, J., Essinger-Hileman, T., Farias, N., Ferreira, E., Franceschet, C., Fuskeland, U., Galloni, G., Galloway, M., Ganga, K., Gerbino, M., Gervasi, M., Génova-Santos, R. T., Giardiello, S., Gimeno-Amo, C., Gjerløw, E., González, R. González, Grandsire, L., Gruppuso, A., Halverson, N. W., Hargrave, P., Harper, S. E., Hazumi, M., Henrot-Versillé, S., Hergt, L. T., Herranz, D., Hivon, E., Hlozek, R. A., Hoang, T. D., Hubmayr, J., Ichiki, K., Ikuma, K., Ishino, H., Jaehnig, G., Jost, B., Kohri, K., Konishi, K., Lamagna, L., Lattanzi, M., Leloup, C., Levrier, F., Lonappan, A. I., Luzzi, G., Macias-Perez, J., Maffei, B., Marchitelli, E., Martínez-González, E., Masi, S., Matarrese, S., Matsumura, T., Micheli, S., Migliaccio, M., Monelli, M., Montier, L., Morgante, G., Mousset, L., Nagano, Y., Nagata, R., Natoli, P., Novelli, A., Noviello, F., Obata, I., Occhiuzzi, A., Odagiri, K., Omae, R., Pagano, L., Paiella, A., Paoletti, D., Pascual-Cisneros, G., Patanchon, G., Pavlidou, V., Piacentini, F., Piat, M., Piccirilli, G., Pinchera, M., Pisano, G., Porcelli, L., Raffuzzi, N., Raum, C., Remazeilles, M., Ritacco, A., Rubino-Martin, J., Ruiz-Granda, M., Sakurai, Y., Savini, G., Scott, D., Sekimoto, Y., Shiraishi, M., Signorelli, G., Stever, S. L., Sullivan, R. M., Suzuki, A., Takaku, R., Takakura, H., Takakura, S., Tartari, Y. Takase. A., Tassis, K., Thompson, K. L., Tomasi, M., Tristram, M., Tucker, C., Vacher, L., van Tent, B., Vielva, P., Watanuki, K., Wehus, I. K., Westbrook, B., Weymann-Despres, G., Winter, B., Wollack, E. J., Zacchei, A., Zannoni, M., Zhou, Y., and Collaboration, the LiteBIRD
Subjects: Astrophysics - Instrumentation and Methods for Astrophysics, Astrophysics - Cosmology and Nongalactic Astrophysics, Physics - Instrumentation and Detectors
Abstract: LiteBIRD, the next-generation cosmic microwave background (CMB) experiment, aims for a launch in Japan's fiscal year 2032, marking a major advancement in the exploration of primordial cosmology and fundamental physics. Orbiting the Sun-Earth Lagrangian point L2, this JAXA-led strategic L-class mission will conduct a comprehensive mapping of the CMB polarization across the entire sky. During its 3-year mission, LiteBIRD will employ three telescopes within 15 unique frequency bands (ranging from 34 through 448 GHz), targeting a sensitivity of 2.2\,$\mu$K-arcmin and a resolution of 0.5$^\circ$ at 100\,GHz. Its primary goal is to measure the tensor-to-scalar ratio $r$ with an uncertainty $\delta r = 0.001$, including systematic errors and margin. If $r \geq 0.01$, LiteBIRD expects to achieve a $>5\sigma$ detection in the $\ell=$2-10 and $\ell=$11-200 ranges separately, providing crucial insight into the early Universe. We describe LiteBIRD's scientific objectives, the application of systems engineering to mission requirements, the anticipated scientific impact, and the operations and scanning strategies vital to minimizing systematic effects. We will also highlight LiteBIRD's synergies with concurrent CMB projects., Comment: 23 pages, 9 figures, 1 table, SPIE Astronomical Telescopes + Instrumentation 2024
Published: 2024

25. Privacy Protection and Video Manipulation in Immersive Media

Author: Wöhler, Leslie, Ikehata, Satoshi, and Aizawa, Kiyoharu
Subjects: Computer Science - Human-Computer Interaction
Abstract: In comparison to traditional footage, 360{\deg} videos can convey engaging, immersive experiences and even be utilized to create interactive virtual environments. Like regular recordings, these videos need to consider the privacy of recorded people and could be targets for video manipulations. However, due to their properties like enhanced presence, the effects on users might differ from traditional, non-immersive content. Therefore, we are interested in how changes of real-world footage like adding privacy protection or applying video manipulations could mitigate or introduce harm in the resulting immersive media., Comment: This is an accepted position statement of CHI 2024 Workshop (Novel Approaches for Understanding and Mitigating Emerging New Harms in Immersive and Embodied Virtual Spaces: A Workshop at CHI 2024)
Published: 2024

26. Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion

Author: Li, Yingxuan, Hinami, Ryota, Aizawa, Kiyoharu, and Matsui, Yusuke
Subjects: Computer Science - Multimedia, Computer Science - Computer Vision and Pattern Recognition
Abstract: Recognizing characters and predicting speakers of dialogue are critical for comic processing tasks, such as voice generation or translation. However, because characters vary by comic title, supervised learning approaches like training character classifiers which require specific annotations for each comic title are infeasible. This motivates us to propose a novel zero-shot approach, allowing machines to identify characters and predict speaker names based solely on unannotated comic images. In spite of their importance in real-world applications, these task have largely remained unexplored due to challenges in story comprehension and multimodal integration. Recent large language models (LLMs) have shown great capability for text understanding and reasoning, while their application to multimodal content analysis is still an open problem. To address this problem, we propose an iterative multimodal framework, the first to employ multimodal information for both character identification and speaker prediction tasks. Our experiments demonstrate the effectiveness of the proposed framework, establishing a robust baseline for these tasks. Furthermore, since our method requires no training data or annotations, it can be used as-is on any comic series., Comment: Accepted to ACM Multimedia 2024. Project page: https://liyingxuan1012.github.io/zeroshot-speaker-prediction ; Github repo: https://github.com/liyingxuan1012/zeroshot-speaker-prediction
Published: 2024

27. Microwave dependent quantum transport characteristics in GaN/AlGaN FETs

Author: Shinozaki, Motoya, Abe, Takaya, Matsumura, Kazuma, Aizawa, Takumi, Kumasaka, Takashi, and Otsuka, Tomohiro
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Defects in semiconductors, traditionally seen as detrimental to electronic device performance, have emerged as potential assets in quantum technologies due to their unique quantum properties. This study investigates the interaction between defects and quantum electron transport in GaN/AlGaN field-effect transistors, highlighting the observation of Fano resonances at low temperatures. We observe the resonance spectra and their dependence on gate voltage and magnetic fields. To explain the observed behavior, we construct the possible scenario as a Fano interferometer with finite width. Our findings reveal the potential of semiconductor defects to contribute to the development of quantum information processing, providing their role to key components in next-generation quantum devices., Comment: 14 pages, 6 figures
Published: 2024
Full Text: View/download PDF

28. Frequencies of warm debris disks based on point source catalogs of Spitzer, WISE, and Gaia

Author: Mizuki, Toshiyuki, Momose, Munetake, Aizawa, Masataka, and Kobayashi, Hiroshi
Subjects: Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Solar and Stellar Astrophysics
Abstract: More than a thousand warm debris disks have been detected as infrared excess at mid-infrared wavelengths, and their frequencies have been obtained for various spectral types of stars. However, the dependence of the frequencies on spectral type is still debated because the number of stars with significant and detectable infrared excess is limited. Herein, we present the largest systematic search for infrared excess using data from Gaia, WISE, and Spitzer. We identified 373, 485, and 255-reliable infrared excesses in the mid-infrared archival data at wavelengths of 12, 22, and 24 $\mu$m for WISE/$W3$, $W4$, and Spitzer/MIPS ch1, respectively. Although we confirmed that more massive stars tend to show higher frequencies of debris disks, these disk frequencies are relatively flat for both low- and intermediate-mass stars, with a jump at 7000 K for all three wavelengths. Assuming that bright, warm debris disks have lifetimes of a few to several hundred million years, the disk frequency can be understood as the ratio between the timescale and the upper limits of the sample ages. We also found that intermediate-mass stars with infrared excess tend to be bluer and fainter along the evolutionary track than those without, implying that massive stars hosting debris disks are relatively young, with an isochronal age of approximately 500 Myr. These tendencies are reasonably explained by a standard scenario in which debris disks are likely to be produced by collisions of planetesimals in early stages of stellar evolution, such as the Late Heavy Bombardment., Comment: Accepted for publication in AJ. 27 pages, 19 figures, 5 tables
Published: 2024

29. Eigenpruning: an Interpretability-Inspired PEFT Method

Author: Vergara-Browne, Tomás, Soto, Álvaro, and Aizawa, Akiko
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: We introduce eigenpruning, a method that removes singular values from weight matrices in an LLM to improve its performance in a particular task. This method is inspired by interpretability methods designed to automatically find subnetworks of a model which solve a specific task. In our tests, the pruned model outperforms the original model by a large margin, while only requiring minimal computation to prune the weight matrices. In the case of a small synthetic task in integer multiplication, the Phi-2 model can improve its accuracy in the test set from 13.75% to 97.50%. Interestingly, these results seem to indicate the existence of a computation path that can solve the task very effectively, but it was not being used by the original model. Finally, we publicly release our implementation., Comment: Extended abstract accepted to LatinX at NAACL 2024
Published: 2024

30. Destructive spondyloarthropathy of the lumbar spine in patients on long-term haemodialysis: a computed tomography-based study

Author: Yabe, Yutaka, Ishikawa, Keisuke, Kurosawa, Daisuke, Murakami, Eiichi, and Aizawa, Toshimi
Published: 2024
Full Text: View/download PDF

31. Safety and pharmacokinetics of vepdegestrant in Japanese patients with ER+ advanced breast cancer: a phase 1 study

Author: Iwata, Hiroji, Naito, Yoichi, Hattori, Masaya, Yoshimura, Akiyo, Yonemori, Kan, Aizawa, Mana, Mori, Yuko, Yoshimitsu, Junichiro, Umeyama, Yoshiko, and Mukohara, Toru
Published: 2024
Full Text: View/download PDF

32. Nationwide database study of postoperative sequelae and in-hospital mortality in super-elderly hip fracture patients

Author: Mori, Yu, Tarasawa, Kunio, Tanaka, Hidetatsu, Mori, Naoko, Fushimi, Kiyohide, Aizawa, Toshimi, and Fujimori, Kenji
Published: 2024
Full Text: View/download PDF

33. Landscape of homologous recombination deficiency in gastric cancer and clinical implications for first-line chemotherapy

Author: Ichikawa, Hiroshi, Aizawa, Masaki, Kano, Yosuke, Hanyu, Takaaki, Muneoka, Yusuke, Hiroi, Sou, Ueki, Hiroto, Moro, Kazuki, Hirose, Yuki, Miura, Kohei, Shimada, Yoshifumi, Sakata, Jun, Yabusaki, Hiroshi, Nakagawa, Satoru, Kawasaki, Takashi, Okuda, Shujiro, and Wakai, Toshifumi
Published: 2024
Full Text: View/download PDF

34. Characteristics of successful termination of atrial fibrillation by atrial antitachycardia pacing in patients with cardiac implantable electronic devices

Author: Aizawa, Yoshiyasu, Komura, Satoru, Kawakami, Emiko, Watanabe, Shonosuke, Tanaka, Kazuki, Kadowaki, Hiromu, and Takagi, Atsushi
Published: 2024
Full Text: View/download PDF

35. Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange

Author: Satpute, Ankit, Giessing, Noah, Greiner-Petter, Andre, Schubotz, Moritz, Teschke, Olaf, Aizawa, Akiko, and Gipp, Bela
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval
Abstract: Large Language Models (LLMs) have demonstrated exceptional capabilities in various natural language tasks, often achieving performances that surpass those of humans. Despite these advancements, the domain of mathematics presents a distinctive challenge, primarily due to its specialized structure and the precision it demands. In this study, we adopted a two-step approach for investigating the proficiency of LLMs in answering mathematical questions. First, we employ the most effective LLMs, as identified by their performance on math question-answer benchmarks, to generate answers to 78 questions from the Math Stack Exchange (MSE). Second, a case analysis is conducted on the LLM that showed the highest performance, focusing on the quality and accuracy of its answers through manual evaluation. We found that GPT-4 performs best (nDCG of 0.48 and P@10 of 0.37) amongst existing LLMs fine-tuned for answering mathematics questions and outperforms the current best approach on ArqMATH3 Task1, considering P@10. Our Case analysis indicates that while the GPT-4 can generate relevant responses in certain instances, it does not consistently answer all questions accurately. This paper explores the current limitations of LLMs in navigating complex mathematical problem-solving. Through case analysis, we shed light on the gaps in LLM capabilities within mathematics, thereby setting the stage for future research and advancements in AI-driven mathematical reasoning. We make our code and findings publicly available for research: \url{https://github.com/gipplab/LLM-Investig-MathStackExchange}, Comment: Accepted for publication at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) July 14--18, 2024, Washington D.C.,USA
Published: 2024

36. Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Author: Miyai, Atsuyuki, Yang, Jingkang, Zhang, Jingyang, Ming, Yifei, Yu, Qing, Irie, Go, Li, Yixuan, Li, Hai, Liu, Ziwei, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: This paper introduces a novel and significant challenge for Vision Language Models (VLMs), termed Unsolvable Problem Detection (UPD). UPD examines the VLM's ability to withhold answers when faced with unsolvable problems in the context of Visual Question Answering (VQA) tasks. UPD encompasses three distinct settings: Absent Answer Detection (AAD), Incompatible Answer Set Detection (IASD), and Incompatible Visual Question Detection (IVQD). To deeply investigate the UPD problem, extensive experiments indicate that most VLMs, including GPT-4V and LLaVA-Next-34B, struggle with our benchmarks to varying extents, highlighting significant room for the improvements. To address UPD, we explore both training-free and training-based solutions, offering new insights into their effectiveness and limitations. We hope our insights, together with future efforts within the proposed UPD settings, will enhance the broader understanding and development of more practical and reliable VLMs., Comment: Code: https://github.com/AtsuMiyai/UPD
Published: 2024

37. TWOLAR: a TWO-step LLM-Augmented distillation method for passage Reranking

Author: Baldelli, Davide, Jiang, Junfeng, Aizawa, Akiko, and Torroni, Paolo
Subjects: Computer Science - Information Retrieval
Abstract: In this paper, we present TWOLAR: a two-stage pipeline for passage reranking based on the distillation of knowledge from Large Language Models (LLM). TWOLAR introduces a new scoring strategy and a distillation process consisting in the creation of a novel and diverse training dataset. The dataset consists of 20K queries, each associated with a set of documents retrieved via four distinct retrieval methods to ensure diversity, and then reranked by exploiting the zero-shot reranking capabilities of an LLM. Our ablation studies demonstrate the contribution of each new component we introduced. Our experimental results show that TWOLAR significantly enhances the document reranking ability of the underlying model, matching and in some cases even outperforming state-of-the-art models with three orders of magnitude more parameters on the TREC-DL test sets and the zero-shot evaluation benchmark BEIR. To facilitate future work we release our data set, finetuned models, and code.
Published: 2024

38. Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes

Author: Otonari, Takashi, Ikehata, Satoshi, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advancements in the study of Neural Radiance Fields (NeRF) for dynamic scenes often involve explicit modeling of scene dynamics. However, this approach faces challenges in modeling scene dynamics in urban environments, where moving objects of various categories and scales are present. In such settings, it becomes crucial to effectively eliminate moving objects to accurately reconstruct static backgrounds. Our research introduces an innovative method, termed here as Entity-NeRF, which combines the strengths of knowledge-based and statistical strategies. This approach utilizes entity-wise statistics, leveraging entity segmentation and stationary entity classification through thing/stuff segmentation. To assess our methodology, we created an urban scene dataset masked with moving objects. Our comprehensive experiments demonstrate that Entity-NeRF notably outperforms existing techniques in removing moving objects and reconstructing static urban backgrounds, both quantitatively and qualitatively., Comment: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Project website: https://otonari726.github.io/entitynerf/
Published: 2024

39. Dynamics of quantum cellular automata electron transition in triple quantum dots

Author: Aizawa, Takumi, Shinozaki, Motoya, Fujiwara, Yoshihiro, Kumasaka, Takeshi, Izumida, Wataru, Ludwig, Arne, Wieck, Andreas D., and Otsuka, Tomohiro
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: The quantum cellular automata (QCA) effect is a transition in which multiple electron move coordinately by Coulomb interactions and observed in multiple quantum dots. This effect will be useful for realizing and improving quantum cellular automata and information transfer using multiple electron transfer. In this paper, we investigate the real-time dynamics of the QCA charge transitions in a triple quantum dot by using fast charge-state readout realized by rf reflectometry. We observe real-time charge transitions and analyze the tunneling rate comparing with the first-order tunneling processes. We also measure the gate voltage dependence of the QCA transition and show that it can be controlled by the voltage., Comment: 11pages, 4 figures
Published: 2024

40. SKT5SciSumm -- Revisiting Extractive-Generative Approach for Multi-Document Scientific Summarization

Author: To, Huy Quoc, Liu, Ming, Huang, Guangyan, Tran, Hung-Nghiep, Greiner-Petter, Andr'e, Beierle, Felix, and Aizawa, Akiko
Subjects: Computer Science - Computation and Language
Abstract: Summarization for scientific text has shown significant benefits both for the research community and human society. Given the fact that the nature of scientific text is distinctive and the input of the multi-document summarization task is substantially long, the task requires sufficient embedding generation and text truncation without losing important information. To tackle these issues, in this paper, we propose SKT5SciSumm - a hybrid framework for multi-document scientific summarization (MDSS). We leverage the Sentence-Transformer version of Scientific Paper Embeddings using Citation-Informed Transformers (SPECTER) to encode and represent textual sentences, allowing for efficient extractive summarization using k-means clustering. We employ the T5 family of models to generate abstractive summaries using extracted sentences. SKT5SciSumm achieves state-of-the-art performance on the Multi-XScience dataset. Through extensive experiments and evaluation, we showcase the benefits of our model by using less complicated models to achieve remarkable results, thereby highlighting its potential in advancing the field of multi-document summarization for scientific text.
Published: 2024

41. MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions

Author: Horych, Tomáš, Wessel, Martin, Wahle, Jan Philip, Ruas, Terry, Waßmuth, Jerome, Greiner-Petter, André, Aizawa, Akiko, Gipp, Bela, and Spinde, Timo
Subjects: Computer Science - Computers and Society, Computer Science - Computation and Language
Abstract: Media bias detection poses a complex, multifaceted problem traditionally tackled using single-task models and small in-domain datasets, consequently lacking generalizability. To address this, we introduce MAGPIE, the first large-scale multi-task pre-training approach explicitly tailored for media bias detection. To enable pre-training at scale, we present Large Bias Mixture (LBM), a compilation of 59 bias-related tasks. MAGPIE outperforms previous approaches in media bias detection on the Bias Annotation By Experts (BABE) dataset, with a relative improvement of 3.3% F1-score. MAGPIE also performs better than previous models on 5 out of 8 tasks in the Media Bias Identification Benchmark (MBIB). Using a RoBERTa encoder, MAGPIE needs only 15% of finetuning steps compared to single-task approaches. Our evaluation shows, for instance, that tasks like sentiment and emotionality boost all learning, all tasks enhance fake news detection, and scaling tasks leads to the best results. MAGPIE confirms that MTL is a promising approach for addressing media bias detection, enhancing the accuracy and efficiency of existing models. Furthermore, LBM is the first available resource collection focused on media bias MTL.
Published: 2024

42. A Survey of Pre-trained Language Models for Processing Scientific Text

Author: Ho, Xanh, Nguyen, Anh Khoa Duong, Dao, An Tuan, Jiang, Junfeng, Chida, Yuki, Sugimoto, Kaito, To, Huy Quoc, Boudin, Florian, and Aizawa, Akiko
Subjects: Computer Science - Computation and Language
Abstract: The number of Language Models (LMs) dedicated to processing scientific text is on the rise. Keeping pace with the rapid growth of scientific LMs (SciLMs) has become a daunting task for researchers. To date, no comprehensive surveys on SciLMs have been undertaken, leaving this issue unaddressed. Given the constant stream of new SciLMs, appraising the state-of-the-art and how they compare to each other remain largely unknown. This work fills that gap and provides a comprehensive review of SciLMs, including an extensive analysis of their effectiveness across different domains, tasks and datasets, and a discussion on the challenges that lie ahead., Comment: Resources are available at https://github.com/Alab-NII/Awesome-SciLM
Published: 2024

43. Taxonomy of Mathematical Plagiarism

Author: Satpute, Ankit, Greiner-Petter, Andre, Gießing, Noah, Beckenbach, Isabel, Schubotz, Moritz, Teschke, Olaf, Aizawa, Akiko, and Gipp, Bela
Subjects: Computer Science - Information Retrieval
Abstract: Plagiarism is a pressing concern, even more so with the availability of large language models. Existing plagiarism detection systems reliably find copied and moderately reworded text but fail for idea plagiarism, especially in mathematical science, which heavily uses formal mathematical notation. We make two contributions. First, we establish a taxonomy of mathematical content reuse by annotating potentially plagiarised 122 scientific document pairs. Second, we analyze the best-performing approaches to detect plagiarism and mathematical content similarity on the newly established taxonomy. We found that the best-performing methods for plagiarism and math content similarity achieve an overall detection score (PlagDet) of 0.06 and 0.16, respectively. The best-performing methods failed to detect most cases from all seven newly established math similarity types. Outlined contributions will benefit research in plagiarism detection systems, recommender systems, question-answering systems, and search engines. We make our experiment's code and annotated dataset available to the community: https://github.com/gipplab/Taxonomy-of-Mathematical-Plagiarism, Comment: 46th European Conference on Information Retrieval (ECIR)
Published: 2024
Full Text: View/download PDF

44. $\mathcal{N}=2$ Double graded supersymmetric quantum mechanics via dimensional reduction

Author: Aizawa, N., Ito, Ren, and Tanaka, Toshiya
Subjects: Mathematical Physics, High Energy Physics - Theory
Abstract: We present a novel $\mathcal{N} = 2 $ $\mathbb{Z}_2^2$-graded supersymmetric quantum mechanics ($\mathbb{Z}_2^2$-SQM) which has different features from those introduced so far. It is a two-dimensional (two-particle) system and is the first example of the quantum mechanical realization of an eight-dimensional irrep of the $\mathcal{N}=2$ $\mathbb{Z}_2^2$-supersymmetry algebra. The $\mathbb{Z}_2^2$-SQM is obtained by quantizing the one-dimensional classical system derived by dimensional reduction from the two-dimensional $\mathbb{Z}_2^2$-supersymmetric Lagrangian of $\mathcal{N}=1$, which we constructed in our previous work. The ground states of the $\mathbb{Z}_2^2$-SQM are also investigated., Comment: 16 pages, no figure
Published: 2024
Full Text: View/download PDF

45. Incorporating Spatial Locality Into Self-attention for Training Vision Transformer on Small-Scale Datasets

Author: Igaue, Yuki, Kurita, Takio, Aizawa, Hiroaki, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
Published: 2025
Full Text: View/download PDF

46. The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

Author: Mao, Jiafeng, Wang, Xueting, Aizawa, Kiyoharu, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

47. Cut-off values of Geriatric Nutritional Risk Index for cardiovascular events in Japanese patients with acute myocardial infarction

Author: Ito, Satoshi, Inoue, Yasunori, Nagoshi, Tomohisa, Aizawa, Takatoku, Kashiwagi, Yusuke, Morimoto, Satoshi, Ogawa, Kazuo, Minai, Kosuke, Ogawa, Takayuki, and Yoshimura, Michihiro
Published: 2024
Full Text: View/download PDF

48. Efficacy and safety of atrial fibrillation ablation in patients with aged 80 years or older

Author: Yodogawa, Kenji, Iwasaki, Yu-ki, Ito, Nobuaki, Arai, Toshiki, Hachisuka, Masato, Fujimoto, Yuhi, Hagiwara, Kanako, Murata, Hiroshige, Aizawa, Yoshiyasu, Shimizu, Wataru, and Asai, Kuniya
Published: 2024
Full Text: View/download PDF

49. Accumulated melanin in molds provides wavelength-dependent UV tolerance

Author: Onoda, Yushi, Nagahashi, Miharu, Yamashita, Michiyo, Fukushima, Shiho, Aizawa, Toshihiko, Yamauchi, Shigeharu, Fujikawa, Yasuo, Tanaka, Tomotake, Kadomura-Ishikawa, Yasuko, Ishida, Kai, Uebanso, Takashi, Mawatari, Kazuaki, Blatchley, III, Ernest R., and Takahashi, Akira
Published: 2024
Full Text: View/download PDF

50. Surgery on admission and following day reduces hip fracture complications: a Japanese DPC study

Author: Mori, Yu, Tarasawa, Kunio, Tanaka, Hidetatsu, Mori, Naoko, Fushimi, Kiyohide, Fujimori, Kenji, and Aizawa, Toshimi
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

48,033 results on '"Aizawa A"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources