The Limit Order Book Recreation Model (LOBRM): An Extended Analysis

Authors :: Shi, Zijian
Cartlidge, John P
Dong, Yuxiao
Kourtellis, Nicolas
Hammer, Barbara
Lozano, Jose A.
Source :: Shi, Z & Cartlidge, J P 2021, The Limit Order Book Recreation Model (LOBRM): An Extended Analysis. in Y Dong, N Kourtellis, B Hammer & J A Lozano (eds), European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), Applied Data Science Track: Applied Data Science Track-European Conference, ECML PKDD 2021, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12978 LNAI, Springer, Cham, pp. 204-220, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2021), 13/09/21. https://doi.org/10.1007/978-3-030-86514-6_13
Publication Year :: 2021
Publisher :: Springer, Cham, 2021.
Abstract: The limit order book (LOB) depicts the fine-grained demand and supply relationship for financial assets and is widely used in market microstructure studies. Nevertheless, the limited availability and high cost of LOB data restrict its wider application. The LOB recreation model (LOBRM) was recently proposed to bridge this gap by synthesizing the LOB from trades and quotes (TAQ) data. However, in the original LOBRM study, there were two limitations: (1) experiments were conducted on a relatively small dataset containing only one day of LOB data; and (2) the training and testing were performed in a non-chronological fashion, which essentially re-frames the task as interpolation and potentially introduces lookahead bias. In this study, we extend the research on LOBRM and further validate its use in real-world application scenarios. We first advance the workflow of LOBRM by (1) adding a time-weighted z-score standardization for the LOB and (2) substituting the ordinary differential equation kernel with an exponential decay kernel to lower computational complexity. Experiments are conducted on the extended LOBSTER dataset in a chronological fashion, as it would be used in a real-world application. We find that (1) LOBRM with decay kernel is superior to traditional non-linear models, and module ensembling is effective; (2) prediction accuracy is negatively related to the volatility of order volumes resting in the LOB; (3) the proposed sparse encoding method for TAQ exhibits good generalization ability and can facilitate manifold tasks; and (4) the influence of stochastic drift on prediction accuracy can be alleviated by increasing historical samples.
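
Illustrative sketch (not taken from the paper): the abstract names a time-weighted z-score standardization and chronological evaluation but does not give formulas. The Python below shows one plausible reading, assuming each LOB observation is weighted by how long it persists before the next update, and that the statistics are fitted on the earlier (training) period only so that no later data leaks into the test period. The function names, the `end_time` parameter, and the toy volumes are all hypothetical.

```python
import numpy as np


def time_weighted_stats(values, timestamps, end_time):
    """Mean and standard deviation weighted by how long each value persists.

    Assumption: observation i is valid from timestamps[i] until the next
    timestamp, and the last observation is valid until end_time.
    """
    values = np.asarray(values, dtype=float)
    timestamps = np.asarray(timestamps, dtype=float)
    durations = np.diff(np.append(timestamps, end_time))
    if np.any(durations < 0) or durations.sum() <= 0:
        raise ValueError("timestamps must be non-decreasing and span a positive interval")
    mean = np.average(values, weights=durations)
    variance = np.average((values - mean) ** 2, weights=durations)
    return mean, np.sqrt(variance)


def standardize(values, mean, std):
    """Apply a z-score using previously fitted statistics."""
    return (np.asarray(values, dtype=float) - mean) / std


# Toy order volumes at one price level, in chronological order (made-up numbers).
train_volumes = [10.0, 20.0, 30.0]
train_times = [0.0, 1.0, 4.0]   # seconds; the training period ends at t = 5
test_volumes = [20.0, 40.0]     # later observations, never used for fitting

# Fit on the earlier period only, then reuse the statistics on the later period.
mean, std = time_weighted_stats(train_volumes, train_times, end_time=5.0)
train_z = standardize(train_volumes, mean, std)
test_z = standardize(test_volumes, mean, std)
```

In this toy case the value 20 persists for three of the five seconds, so it dominates the weighted mean. An unweighted z-score would treat all three observations equally regardless of how long each state lasted.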

Subjects :: Financial machine learning
Limit Order Book
Time series prediction

Details

Language :: English
Database :: OpenAIRE
Accession number :: edsair.od......2642..5daf9f8507fe8af33be11ed8fa0ed8f7
Full Text :: https://doi.org/10.1007/978-3-030-86514-6_13