Back to Search
Start Over
On Detecting and Removing Superficial Redundancy in Vector Databases.
- Source :
- Mathematical Problems in Engineering; 5/24/2018, p1-14, 14p
- Publication Year :
- 2018
-
Abstract
- A mathematical model is proposed in order to obtain an automatized tool to remove any unnecessary data, to compute the level of the redundancy, and to recover the original and filtered database, at any time of the process, in a vector database. This type of database can be modeled as an oriented directed graph. Thus, the database is characterized by an adjacency matrix. Therefore, a record is no longer a row but a matrix. Then, the problem of cleaning redundancies is addressed from a theoretical point of view. Superficial redundancy is measured and filtered by using the 1-norm of a matrix. Algorithms are presented by Python and MapReduce, and a case study of a real cybersecurity database is performed. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 1024123X
- Database :
- Complementary Index
- Journal :
- Mathematical Problems in Engineering
- Publication Type :
- Academic Journal
- Accession number :
- 129762285
- Full Text :
- https://doi.org/10.1155/2018/3702808