Author: "Tanaka-Ishii, Kumiko" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Tanaka-Ishii, Kumiko"' showing total 380 results

Start Over Author "Tanaka-Ishii, Kumiko"

380 results on '"Tanaka-Ishii, Kumiko"'

1. Evaluating Computational Language Models with Scaling Properties of Natural Language

Author: Takahashi, Shuntaro and Tanaka-Ishii, Kumiko
Subjects: Computational linguistics. Natural language processing, P98-98.5
Abstract: In this article, we evaluate computational models of natural language with respect to the universal statistical behaviors of natural language. Statistical mechanical analyses have revealed that natural language text is characterized by scaling properties, which quantify the global structure in the vocabulary population and the long memory of a text. We study whether five scaling properties (given by Zipf’s law, Heaps’ law, Ebeling’s method, Taylor’s law, and long-range correlation analysis) can serve for evaluation of computational models. Specifically, we test n-gram language models, a probabilistic context-free grammar, language models based on Simon/Pitman-Yor processes, neural language models, and generative adversarial networks for text generation. Our analysis reveals that language models based on recurrent neural networks with a gating mechanism (i.e., long short-term memory; a gated recurrent unit; and quasi-recurrent neural networks) are the only computational models that can reproduce the long memory behavior of natural language. Furthermore, through comparison with recently proposed model-based evaluation methods, we find that the exponent of Taylor’s law is a good indicator of model quality.
Published: 2019
Full Text: View/download PDF

2. Bottleneck-Minimal Indexing for Generative Document Retrieval

Author: Du, Xin, Xiu, Lixin, and Tanaka-Ishii, Kumiko
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We apply an information-theoretic perspective to reconsider generative document retrieval (GDR), in which a document $x \in X$ is indexed by $t \in T$, and a neural autoregressive model is trained to map queries $Q$ to $T$. GDR can be considered to involve information transmission from documents $X$ to queries $Q$, with the requirement to transmit more bits via the indexes $T$. By applying Shannon's rate-distortion theory, the optimality of indexing can be analyzed in terms of the mutual information, and the design of the indexes $T$ can then be regarded as a {\em bottleneck} in GDR. After reformulating GDR from this perspective, we empirically quantify the bottleneck underlying GDR. Finally, using the NQ320K and MARCO datasets, we evaluate our proposed bottleneck-minimal indexing method in comparison with various previous indexing methods, and we show that it outperforms those methods., Comment: Accepted for ICML 2024
Published: 2024

3. Correlation Dimension of Natural Language in a Statistical Manifold

Author: Du, Xin and Tanaka-Ishii, Kumiko
Subjects: Computer Science - Computation and Language, Condensed Matter - Statistical Mechanics, Computer Science - Artificial Intelligence
Abstract: The correlation dimension of natural language is measured by applying the Grassberger-Procaccia algorithm to high-dimensional sequences produced by a large-scale language model. This method, previously studied only in a Euclidean space, is reformulated in a statistical manifold via the Fisher-Rao distance. Language exhibits a multifractal, with global self-similarity and a universal dimension around 6.5, which is smaller than those of simple discrete random sequences and larger than that of a Barab\'asi-Albert process. Long memory is the key to producing self-similarity. Our method is applicable to any probabilistic model of real-world discrete sequences, and we show an application to music data., Comment: Published at Physical Review Research
Published: 2024
Full Text: View/download PDF

4. Co-Training Realized Volatility Prediction Model with Neural Distributional Transformation

Author: Du, Xin, Moriyama, Kai, and Tanaka-Ishii, Kumiko
Subjects: Computer Science - Computational Engineering, Finance, and Science, Quantitative Finance - Statistical Finance
Abstract: This paper shows a novel machine learning model for realized volatility (RV) prediction using a normalizing flow, an invertible neural network. Since RV is known to be skewed and have a fat tail, previous methods transform RV into values that follow a latent distribution with an explicit shape and then apply a prediction model. However, knowing that shape is non-trivial, and the transformation result influences the prediction model. This paper proposes to jointly train the transformation and the prediction model. The training process follows a maximum-likelihood objective function that is derived from the assumption that the prediction residuals on the transformed RV time series are homogeneously Gaussian. The objective function is further approximated using an expectation-maximum algorithm. On a dataset of 100 stocks, our method significantly outperforms other methods using analytical or naive neural-network transformations., Comment: Accepted at ICAIF'23
Published: 2023
Full Text: View/download PDF

5. Strahler Number of Natural Language Sentences in Comparison with Random Trees

Author: Tanaka-Ishii, Kumiko and Tanaka, Akira
Subjects: Computer Science - Computation and Language, Physics - Data Analysis, Statistics and Probability
Abstract: The Strahler number was originally proposed to characterize the complexity of river bifurcation and has found various applications. This article proposes computation of the Strahler number's upper and lower limits for natural language sentence tree structures. Through empirical measurements across grammatically annotated data, the Strahler number of natural language sentences is shown to be almost 3 or 4, similarly to the case of river bifurcation as reported by Strahler (1957). From the theory behind the number, we show that it is one kind of lower limit on the amount of memory required to process sentences. We consider the Strahler number to provide reasoning that explains reports showing that the number of required memory areas to process sentences is 3 to 4 for parsing (Schuler et al., 2010), and reports indicating a psychological "magical number" of 3 to 5 (Cowan, 2001). An analytical and empirical analysis shows that the Strahler number is not constant but grows logarithmically; therefore, the Strahler number of sentences derives from the range of sentence lengths. Furthermore, the Strahler number is not different for random trees, which could suggest that its origin is not specific to natural language., Comment: 34 pages, 12 figures, 11 tables
Published: 2023

6. A Comparison of Two Fluctuation Analyses for Natural Language Clustering Phenomena: Taylor and Ebeling & Neiman Methods

Author: Tanaka-Ishii, Kumiko and Takahashi, Shuntaro
Subjects: Computer Science - Computation and Language, Statistics - Applications
Abstract: This article considers the fluctuation analysis methods of Taylor and Ebeling & Neiman. While both have been applied to various phenomena in the statistical mechanics domain, their similarities and differences have not been clarified. After considering their analytical aspects, this article presents a large-scale application of these methods to text. It is found that both methods can distinguish real text from independently and identically distributed (i.i.d.) sequences. Furthermore, it is found that the Taylor exponents acquired from words can roughly distinguish text categories; this is also the case for Ebeling and Neiman exponents, but to a lesser extent. Additionally, both methods show some possibility of capturing script kinds.
Published: 2020
Full Text: View/download PDF

7. Extraction of Templates from Phrases Using Sequence Binary Decision Diagrams

Author: Hirano, Daiki, Tanaka-Ishii, Kumiko, and Finch, Andrew
Subjects: Computer Science - Computation and Language
Abstract: The extraction of templates such as ``regard X as Y'' from a set of related phrases requires the identification of their internal structures. This paper presents an unsupervised approach for extracting templates on-the-fly from only tagged text by using a novel relaxed variant of the Sequence Binary Decision Diagram (SeqBDD). A SeqBDD can compress a set of sequences into a graphical structure equivalent to a minimal DFA, but more compact and better suited to the task of template extraction. The main contribution of this paper is a relaxed form of the SeqBDD construction algorithm that enables it to form general representations from a small amount of data. The process of compression of shared structures in the text during Relaxed SeqBDD construction, naturally induces the templates we wish to extract. Experiments show that the method is capable of high-quality extraction on tasks based on verb+preposition templates from corpora and phrasal templates from short messages from social media.
Published: 2020
Full Text: View/download PDF

8. Correction to: Statistical Universals of Language

Author: Tanaka-Ishii, Kumiko, primary
Published: 2023
Full Text: View/download PDF

9. Data

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

10. Mathematical Details

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

11. Language Models

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

12. Glossary and Notations

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

13. Conclusion

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

14. Theories Behind Zipf’s Law

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

15. Mathematical Generative Models

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

16. Word Meaning and Value

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

17. Grammatical Structure and Long Memory

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

18. Size and Frequency

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

19. Articulation of Elements

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

20. Fluctuation

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

21. Long-Range Correlation

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

22. Returns

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

23. Bias in Rank-Frequency Relation

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

24. Related Statistical Universals

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

25. Relation Between Rank and Frequency

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

26. Introduction

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

27. Language as a Complex System

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

28. Universals

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

29. Word Familiarity and Frequency

Author: Tanaka-Ishii, Kumiko and Terada, Hiroshi
Subjects: Computer Science - Computation and Language
Abstract: Word frequency is assumed to correlate with word familiarity, but the strength of this correlation has not been thoroughly investigated. In this paper, we report on our analysis of the correlation between a word familiarity rating list obtained through a psycholinguistic experiment and the log-frequency obtained from various corpora of different kinds and sizes (up to the terabyte scale) for English and Japanese. Major findings are threefold: First, for a given corpus, familiarity is necessary for a word to achieve high frequency, but familiar words are not necessarily frequent. Second, correlation increases with the corpus data size. Third, a corpus of spoken language correlates better than one of written language. These findings suggest that cognitive familiarity ratings are correlated to frequency, but more highly to that of spoken rather than written language., Comment: 17 pages, 8 figures, Published in Studia Linguistica in 2011. Available also from Wiley Online Library
Published: 2018
Full Text: View/download PDF

30. Assessing Language Models with Scaling Properties

Author: Takahashi, Shuntaro and Tanaka-Ishii, Kumiko
Subjects: Computer Science - Computation and Language
Abstract: Language models have primarily been evaluated with perplexity. While perplexity quantifies the most comprehensible prediction performance, it does not provide qualitative information on the success or failure of models. Another approach for evaluating language models is thus proposed, using the scaling properties of natural language. Five such tests are considered, with the first two accounting for the vocabulary population and the other three for the long memory of natural language. The following models were evaluated with these tests: n-grams, probabilistic context-free grammar (PCFG), Simon and Pitman-Yor (PY) processes, hierarchical PY, and neural language models. Only the neural language models exhibit the long memory properties of natural language, but to a limited degree. The effectiveness of every test of these models is also discussed., Comment: 14 pages, 16 figures
Published: 2018

31. Taylor's law for Human Linguistic Sequences

Author: Kobayashi, Tatsuru and Tanaka-Ishii, Kumiko
Subjects: Computer Science - Computation and Language
Abstract: Taylor's law describes the fluctuation characteristics underlying a system in which the variance of an event within a time span grows by a power law with respect to the mean. Although Taylor's law has been applied in many natural and social systems, its application for language has been scarce. This article describes a new quantification of Taylor's law in natural language and reports an analysis of over 1100 texts across 14 languages. The Taylor exponents of written natural language texts were found to exhibit almost the same value. The exponent was also compared for other language-related data, such as the child-directed speech, music, and programming language code. The results show how the Taylor exponent serves to quantify the fundamental structural complexity underlying linguistic time series. The article also shows the applicability of these findings in evaluating language models., Comment: 11 pages, 16 figures, Accepted as ACL 2018 long paper
Published: 2018

32. Stock portfolio selection balancing variance and tail risk via stock vector representation acquired from price data and texts

Author: Du, Xin and Tanaka-Ishii, Kumiko
Published: 2022
Full Text: View/download PDF

33. Long-Range Correlation Underlying Childhood Language and Generative Models

Author: Tanaka-Ishii, Kumiko
Subjects: Computer Science - Computation and Language, Physics - Physics and Society
Abstract: Long-range correlation, a property of time series exhibiting long-term memory, is mainly studied in the statistical physics domain and has been reported to exist in natural language. Using a state-of-the-art method for such analysis, long-range correlation is first shown to occur in long CHILDES data sets. To understand why, Bayesian generative models of language, originally proposed in the cognitive scientific domain, are investigated. Among representative models, the Simon model was found to exhibit surprisingly good long-range correlation, but not the Pitman-Yor model. Since the Simon model is known not to correctly reflect the vocabulary growth of natural language, a simple new model is devised as a conjunct of the Simon and Pitman-Yor models, such that long-range correlation holds with a correct vocabulary growth rate. The investigation overall suggests that uniform sampling is one cause of long-range correlation and could thus have a relation with actual linguistic processes.
Published: 2017

34. Do Neural Nets Learn Statistical Laws behind Natural Language?

Author: Takahashi, Shuntaro and Tanaka-Ishii, Kumiko
Subjects: Computer Science - Computation and Language
Abstract: The performance of deep learning in natural language processing has been spectacular, but the reasons for this success remain unclear because of the inherent complexity of deep learning. This paper provides empirical evidence of its effectiveness and of a limitation of neural networks for language engineering. Precisely, we demonstrate that a neural language model based on long short-term memory (LSTM) effectively reproduces Zipf's law and Heaps' law, two representative statistical properties underlying natural language. We discuss the quality of reproducibility and the emergence of Zipf's law and Heaps' law as training progresses. We also point out that the neural language model has a limitation in reproducing long-range correlation, another statistical property of natural language. This understanding could provide a direction for improving the architectures of neural networks., Comment: 21 pages, 11 figures
Published: 2017
Full Text: View/download PDF

35. Machine Versus Structure of Language via Statistical Universals

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2019
Full Text: View/download PDF

36. Correction to: Statistical Universals of Language

Author: Tanaka-Ishii, Kumiko, primary
Published: 2022
Full Text: View/download PDF

37. Correction to: Glossary and Notations

Author: Tanaka-Ishii, Kumiko, primary
Published: 2022
Full Text: View/download PDF

38. Correction to: Statistical Universals of Language

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

39. Acknowledgments

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

40. Modeling financial time-series with generative adversarial networks

Author: Takahashi, Shuntaro, Chen, Yu, and Tanaka-Ishii, Kumiko
Published: 2019
Full Text: View/download PDF

41. Statistical Universals of Language

Author: Tanaka-Ishii, Kumiko, primary
Published: 2021
Full Text: View/download PDF

42. Correction to: Glossary and Notations

Author: Tanaka-Ishii, Kumiko, Danesi, Marcel, Series Editor, Kauffman, Louis H., Editorial Board Member, Martinovic, Dragana, Editorial Board Member, Neuman, Yair, Editorial Board Member, Núñez, Rafael, Editorial Board Member, Sfard, Anna, Editorial Board Member, Tall, David, Editorial Board Member, Tanaka-Ishii, Kumiko, Editorial Board Member, and Vinner, Shlomo, Editorial Board Member
Published: 2021
Full Text: View/download PDF

43. Co-Training Realized Volatility Prediction Model with Neural Distributional Transformation

Author: Du, Xin, primary, Moriyama, Kai, additional, and Tanaka-Ishii, Kumiko, additional
Published: 2023
Full Text: View/download PDF

44. Modeling Momentum Spillover with Economic Links Discovered from Financial Documents

Author: Chung, Andy, primary and Tanaka-Ishii, Kumiko, additional
Published: 2023
Full Text: View/download PDF

45. Predictability of Post-Earnings Announcement Drift with Textual and Contextual Factors of Earnings Calls

Author: Chung, Andy, primary and Tanaka-Ishii, Kumiko, additional
Published: 2023
Full Text: View/download PDF

46. Machine Versus Structure of Language via Statistical Universals

Author: Tanaka-Ishii, Kumiko, primary
Published: 2019
Full Text: View/download PDF

47. Consonants as Skeleton of Language: Statistical Evidences Through Text Production

Author: Tanaka-Ishii, Kumiko, Ide, Nancy, Series editor, Gala, Núria, editor, Rapp, Reinhard, editor, and Bel-Enguix, Gemma, editor
Published: 2015
Full Text: View/download PDF

48. Semiotics of Computing: Filling the Gap Between Humanity and Mechanical Inhumanity

Author: Tanaka-Ishii, Kumiko and Trifonas, Peter Pericles, editor
Published: 2015
Full Text: View/download PDF

49. Statistical Mechanics of Strahler Number via Random and Natural Language Sentences

Author: Tanaka-Ishii, Kumiko and Tanaka, Akira
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Physics - Data Analysis, Statistics and Probability, FOS: Physical sciences, Computation and Language (cs.CL), Data Analysis, Statistics and Probability (physics.data-an)
Abstract: The Strahler number was originally proposed to characterize the complexity of river bifurcation and has found various applications. This article proposes computation of the Strahler number's upper and lower limits for natural language sentence tree structures, which are available in a large dataset allowing for statistical mechanics analysis. Through empirical measurements across grammatically annotated data, the Strahler number of natural language sentences is shown to be almost always 3 or 4, similar to the case of river bifurcation as reported by Strahler (1957) and Horton (1945). From the theory behind the number, we show that it is the lower limit of the amount of memory required to process sentences under a particular model. A mathematical analysis of random trees provides a further conjecture on the nature of the Strahler number, revealing that it is not a constant but grows logarithmically. This finding uncovers the statistical basics behind the Strahler number as a characteristic of a general tree structure target.
Published: 2023
Full Text: View/download PDF

50. Semiotics of Void and Information Representation

Author: Tanaka-Ishii, Kumiko, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, and Marcus, Aaron, editor
Published: 2013
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

380 results on '"Tanaka-Ishii, Kumiko"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources