51. Binary Multidimensional Scaling for Hashing
- Author
-
Zhouchen Lin and Yameng Huang
- Subjects
Theoretical computer science ,Computer science ,Nearest neighbor search ,Hash function ,02 engineering and technology ,010501 environmental sciences ,Linear hashing ,Rolling hash ,01 natural sciences ,K-independent hashing ,Locality-sensitive hashing ,Open addressing ,0202 electrical engineering, electronic engineering, information engineering ,Consistent hashing ,Computer Science::Databases ,0105 earth and related environmental sciences ,Universal hashing ,Dynamic perfect hashing ,2-choice hashing ,Computer Graphics and Computer-Aided Design ,Hash table ,Hopscotch hashing ,Cuckoo hashing ,SUHA ,Locality preserving hashing ,020201 artificial intelligence & image processing ,Feature hashing ,Extendible hashing ,Perfect hash function ,Software ,Double hashing - Abstract
Hashing is a useful technique for fast nearest neighbor search due to its low storage cost and fast query speed. Unsupervised hashing aims at learning binary hash codes for the original features so that the pairwise distances can be best preserved. While several works have targeted on this task, the results are not satisfactory mainly due to the over-simplified model. In this paper, we propose a unified and concise unsupervised hashing framework, called binary multidimensional scaling , which is able to learn the hash code for distance preservation in both batch and online mode. In the batch mode, unlike most existing hashing methods, we do not need to simplify the model by predefining the form of hash map. Instead, we learn the binary codes directly based on the pairwise distances among the normalized original features by alternating minimization. This enables a stronger expressive power of the hash map. In the online mode, we consider the holistic distance relationship between current query example and those we have already learned, rather than only focusing on current data chunk. It is useful when the data come in a streaming fashion. Empirical results show that while being efficient for training, our algorithm outperforms state-of-the-art methods by a large margin in terms of distance preservation, which is practical for real-world applications.
- Published
- 2018
- Full Text
- View/download PDF