Mathematical models allow researchers to understand, analyze, and predict the behavior of systems of physical, biological, and technological interest, and are required for many techniques from dynamical systems and control theory to be used. Unfortunately, it is often impossible to derive mathematical models from first principles, and in such cases system identification is a powerful tool which can be used to deduce models from observed data. Many existing system identification techniques require pre-specification of a dictionary of possible terms in a mathematical model, limiting their ability to give models with the nonlinearities which can arise for biological and other complex systems. We present a methodology which overcomes this limitation by dynamically generating the terms in a model with the necessary complexity and nonlinearity to accurately describe a system’s dynamics. This uses a multilayered, operation-based symbolic regression approach, with the capacity to learn combinations and compositions of operations by training artificial neural networks. Our approach provides a powerful alternative to genetic programming strategies for symbolic regression, and can exploit many of the attractive features of artificial neural networks such as a straightforward learning strategy.
Published on 01/07/24
Accepted on 01/07/24
Submitted on 01/07/24
Volume Data Science, Machine Learning and Artificial Intelligence, 2024
DOI: 10.23967/wccm.2024.129
Licence: CC BY-NC-SA license
Are you one of the authors of this document?