Back to Search Start Over

Dimensionally-consistent equation discovery through probabilistic attribute grammars.

Authors :
Brence, Jure
Džeroski, Sašo
Todorovski, Ljupčo
Source :
Information Sciences. Jun2023, Vol. 632, p742-756. 15p.
Publication Year :
2023

Abstract

Equation discovery, also known as symbolic regression, is a machine learning task of inducing closed-form equations from data and background knowledge. The latter takes various forms. Domain-specific knowledge can constrain the space of candidate equations to those that make sense in the scientific or engineering domain of use. Cross-domain knowledge, on the other hand, imposes general rules for model acceptability, such as parsimony, understandability, or consistency of the equations with the dimensional units of the variables. In this paper, we propose using attribute grammars to ensure the induced equations' dimensional consistency. Attribute grammars are flexible enough to combine cross-domain knowledge on dimensional consistency with domain-specific knowledge expressed as a probabilistic context-free grammar. At the same time, we show that attribute grammars can be efficiently transformed into probabilistic context-free grammars for equation discovery with existing algorithms. Finally, we provide empirical evidence that attribute grammars ensuring dimensional consistency of equations can significantly improve the performance of equation discovery on the standard set of a hundred Feynman benchmarks. • Attribute grammars elegantly express dimensionally consistent expressions. • Efficient combining of dimensional analysis with other background knowledge. • Generate equations by transforming into probabilistic context-free grammar. • Discovered 60% more benchmark equations by accounting for dimensional consistency. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00200255
Volume :
632
Database :
Academic Search Index
Journal :
Information Sciences
Publication Type :
Periodical
Accession number :
162758415
Full Text :
https://doi.org/10.1016/j.ins.2023.03.073