Descriptor: "Hamming space" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Hamming space"' showing total 553 results

Start Over Descriptor "Hamming space"

553 results on '"Hamming space"'

151. Binary Set Embedding for Cross-Modal Retrieval

Author: Mengyang Yu, Li Liu, and Ling Shao
Subjects: Computer Networks and Communications, Computer science, business.industry, Hash function, Feature extraction, Pattern recognition, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Computer Science Applications, Image (mathematics), Artificial Intelligence, Feature (computer vision), 0202 electrical engineering, electronic engineering, information engineering, Embedding, 020201 artificial intelligence & image processing, Binary code, Visual Word, Artificial intelligence, Hamming space, business, Orthogonalization, Software, 0105 earth and related environmental sciences, Semantic gap
Abstract: Cross-modal retrieval is such a challenging topic that traditional global representations would fail to bridge the semantic gap between images and texts to a satisfactory level. Using local features from images and words from documents directly can be more robust for the scenario with large intraclass variations and small interclass discrepancies. In this paper, we propose a novel unsupervised binary coding algorithm called binary set embedding (BSE) to obtain meaningful hash codes for local features from the image domain and words from text domain. Understanding image features with the word vectors learned from the human language instead of the provided documents from data sets, BSE can map samples into a common Hamming space effectively and efficiently where each sample is represented by the sets of local feature descriptors from image and text domains. In particular, BSE explores relationship among local features in both feature level and image (text) level, which can balance the sensitivity of each other. Furthermore, a recursive orthogonalization procedure is applied to reduce the redundancy of codes. Extensive experiments demonstrate the superior performance of BSE compared with state-of-the-art cross-modal hashing methods using either image or text queries.
Published: 2017

152. The isometry groups of the hamming spaces of periodic sequences.

Author: Oliynyk, B. and Sushchanskiĭ, V.
Subjects: *ISOMETRICS (Mathematics), *MATHEMATICAL sequences, *GROUP theory, *SUBSPACES (Mathematics), *LIMIT theorems, *GEOMETRICAL constructions, *LATTICE theory
Abstract: We consider the Hamming space of periodic (0, 1)-sequences and a continual family of its subspaces defined as direct limits of finite Hamming spaces. These subspaces form a complete lattice under inclusion which is isomorphic to the lattice of supernatural numbers. We explicitly describe the isometry groups of these spaces. This involves certain constructions similar to the hyperoctahedral groups but accounting for additional structures on the underlying sets. [ABSTRACT FROM AUTHOR]
Published: 2013
Full Text: View/download PDF

153. A Fast Approximate Nearest Neighbor Search Algorithm in the Hamming Space.

Author: Malek Esmaeili, Mani, Ward, Rabab Kreidieh, and Fatourechi, Mehrdad
Subjects: *NEAREST neighbor analysis (Statistics), *SEARCH algorithms, *APPROXIMATION theory, *HASHING, *HUMAN fingerprints, *ACCURACY, *ERROR analysis in mathematics
Abstract: A fast approximate nearest neighbor search algorithm for the (binary) Hamming space is proposed. The proposed Error Weighted Hashing (EWH) algorithm is up to 20 times faster than the popular locality sensitive hashing (LSH) algorithm and works well even for large nearest neighbor distances where LSH fails. EWH significantly reduces the number of candidate nearest neighbors by weighing them based on the difference between their hash vectors. EWH can be used for multimedia retrieval and copy detection systems that are based on binary fingerprinting. On a fingerprint database with more than 1,000 videos, for a specific detection accuracy, we demonstrate that EWH is more than 10 times faster than LSH. For the same retrieval time, we show that EWH has a significantly better detection accuracy with a 15 times lower error rate. [ABSTRACT FROM PUBLISHER]
Published: 2012
Full Text: View/download PDF

154. BINARY NONTILES.

Author: Coppersmith, Don and Miller, Victor S.
Subjects: *CODING theory, *HASHING, *EQUATIONS, *TILING (Mathematics), *TESSELLATIONS (Mathematics), *LINEAR programming
Abstract: A subset Due to image rights restrictions, multiple line equation(s) cannot be graphically displayed. is a tile if Due to image rights restrictions, multiple line equation(s) cannot be graphically displayed. can be covered by disjoint translates of V. In other words, V is a tile if and only if there is a subset Due to image rights restrictions, multiple line equation(s) cannot be graphically displayed. such that Due to image rights restrictions, multiple line equation(s) cannot be graphically displayed. uniquely (i.e., v + a = v' + a' implies that v = v' and a = a', where v, V' and a, a ε A). In some problems in coding theory and hashing we are given a putative tile V and wish to know whether or not it is a tile. In this paper we give two computational criteria for certifying that V is not a tile. The first involves the impossibility of a bin-packing problem, and the second involves the infeasibility of a linear program. We apply both criteria to a list of putative tiles given by Gordon, Miller, and Ostapenko [IEEE Trans. Inform. Theory, 56 (2010), pp. 984-991] in the context of hashing to find close matches, to show that none of them are, in fact, tiles. [ABSTRACT FROM AUTHOR]
Published: 2012
Full Text: View/download PDF

155. Visualization of quasi-median networks

Author: Schwarz, Konrad and Dür, Arne
Subjects: *DATA visualization, *TOPOLOGICAL spaces, *GRAPH theory, *PATHS & cycles in graph theory, *COMBINATORICS, *MATHEMATICAL analysis
Abstract: Abstract: An algorithm for automatic drawing of quasi-median networks is proposed that uses Eulerian cycles in the non-strong-compatibility graph to lay out the network in the plane. [Copyright &y& Elsevier]
Published: 2011
Full Text: View/download PDF

156. On binary linear r-identifying codes.

Author: Ranto, Sanna
Subjects: INVARIANT subspaces, VECTOR algebra, INTERSECTION theory, LINEAR statistical models, ELECTRONIC information resource searching, RADIUS (Geometry), COMBINATORIAL designs & configurations, CIPHERS
Abstract: subspace C of the binary Hamming space F of length n is called a linear r-identifying code if for all vectors of F the intersections of C and closed r-radius neighbourhoods are nonempty and different. In this paper, we give lower bounds for such linear codes. For radius r = 2, we give some general constructions. We give many (optimal) constructions which were found by a computer search. New constructions improve some previously known upper bounds for r-identifying codes in the case where linearity is not assumed. [ABSTRACT FROM AUTHOR]
Published: 2011
Full Text: View/download PDF

157. Bounds for discrepancies in the Hamming space

Author: Maxim Skriganov and Alexander Barg
Subjects: Statistics and Probability, Discrete mathematics, Numerical Analysis, Polynomial, Control and Optimization, Algebra and Number Theory, Applied Mathematics, General Mathematics, 010102 general mathematics, 010103 numerical & computational mathematics, Radius, Space (mathematics), 01 natural sciences, Euclidean geometry, SPHERES, Mathematics::Differential Geometry, Ball (mathematics), 0101 mathematics, Hamming space, Mathematics
Abstract: We derive bounds for the ball L p -discrepancies in the Hamming space for 0 p ∞ and p = ∞ . Sharp estimates of discrepancies have been obtained for many spaces such as the Euclidean spheres and more general compact Riemannian manifolds. In the present paper, we show that the behavior of discrepancies in the Hamming space differs fundamentally because the volume of the ball in this space depends on its radius exponentially while such a dependence for the Riemannian manifolds is polynomial.
Published: 2021

158. ComBI: Compressed Binary Search Tree for Approximate k-NN Searches in Hamming Space

Author: Debarka Sengupta, Prashant Gupta, Aashi Jindal, and Jayadeva
Subjects: Information Systems and Management, Computer science, Hash function, Hash table, Computer Science Applications, Management Information Systems, k-nearest neighbors algorithm, Tree (data structure), Data point, Binary search tree, Search algorithm, Hamming space, Algorithm, Information Systems
Abstract: The space-partitioning based hashing techniques are widely used to represent high-dimensional data points as bit-codes. Although Binary Search Trees (BSTs) can be used for storing bit-codes, their size grows exponentially with code length. In practice, such a tree turns out to be highly sparse, increasing the miss-rate of nearest neighbor searches. We present Compressed BST of Inverted hash tables (ComBI), a geometrically motivated compression technique for BSTs. ComBI enables fast and approximate nearest neighbor searches without a significant memory footprint over BSTs. We show that approximate search in ComBI can perform competitively to an exact search algorithm in retrieving the nearest neighbors in a Hamming space. On a database containing ∼80M samples, ComBI yields an average precision of 0.90, at ∼4X - ∼296X improvements in run-time across different code lengths when compared to MIH, a widely used exact search method. On a database consisting of 1B samples, this value of precision (0.90) is reached at ∼4X - ∼19X improvements in run-time.
Published: 2021

159. Upper bounds for binary identifying codes

Author: Exoo, Geoffrey, Junnila, Ville, Laihonen, Tero, and Ranto, Sanna
Subjects: *BINARY control systems, *CARDINAL numbers, *CODING theory
Abstract: Abstract: A nonempty set of words in a binary Hamming space is called an r-identifying code if for every word the set of codewords within distance r from it is unique and nonempty. The smallest possible cardinality of an r-identifying code is denoted by . In this paper, we consider questions closely related to the open problem whether is true. For example, we show results like , which improve previously known bounds. We also consider codes which identify sets of words of size at most ℓ. The smallest cardinality of such a code is denoted by . We prove that is true for all when and . We also obtain a result where when . This bound is related to the conjecture . Moreover, we give constructions for the best known 1-identifying codes of certain lengths. [Copyright &y& Elsevier]
Published: 2009
Full Text: View/download PDF

160. Quasi-median hulls in Hamming space are Steiner hulls

Author: Bandelt, Hans-Jürgen and Röhl, Arne
Subjects: *MATHEMATICAL sequences, *DISTANCE geometry, *TREE graphs, *MATHEMATICAL analysis, *ITERATIVE methods (Mathematics), *NUMERICAL analysis
Abstract: Abstract: A Hamming space consists of all sequences of length over an alphabet and is endowed with the Hamming distance. In particular, any set of aligned DNA sequences of fixed length constitutes a subspace of a Hamming space with respect to mismatch distance. The quasi-median operation returns for any three sequences the sequence which in each coordinate attains either the majority coordinate from or else (in the case of a tie) the coordinate of the first entry, ; for a subset of the iterative application of this operation stabilizes in its quasi-median hull. We show that for every finite tree interconnecting a given subset of there exists a shortest realization within for which all interior nodes belong to the quasi-median hull of . Hence the quasi-median hull serves as a Steiner hull for the Steiner problem in Hamming space. [Copyright &y& Elsevier]
Published: 2009
Full Text: View/download PDF

161. On Locating-Dominating Codes in Binary Hamming Spaces

Author: Iiro Honkala, Tero Laihonen, and Sanna Ranto
Subjects: locating-dominating codes, hamming space, identifying codes, [info.info-dm] computer science [cs]/discrete mathematics [cs.dm], Mathematics, QA1-939
Abstract: Locating faulty processors in a multiprocessor system gives the motivation for locating-dominating codes. We consider these codes in binary hypercubes and generalize the concept for the situation in which we want to locate more than one malfunctioning processor.
Published: 2004
Full Text: View/download PDF

162. New bounds on binary identifying codes

Author: Exoo, Geoffrey, Laihonen, Tero, and Ranto, Sanna
Subjects: *MULTIPROCESSORS, *ELECTRONIC data processing, *MULTIPROGRAMMING (Electronic computers), *DETECTORS
Abstract: Abstract: The original motivation for identifying codes comes from fault diagnosis in multiprocessor systems. Currently, the subject forms a topic of its own with several possible applications, for example, to sensor networks. In this paper, we concentrate on identification in binary Hamming spaces. We give a new lower bound on the cardinality of -identifying codes when . Moreover, by a computational method, we show that . It is also shown, using a non-constructive approach, that there exist asymptotically good -identifying codes for fixed . In order to construct -identifying codes, we prove that a direct sum of codes that are -identifying is an -identifying code for . [Copyright &y& Elsevier]
Published: 2008
Full Text: View/download PDF

163. The edge-diametric theorem in Hamming spaces

Author: Bey, Christian
Subjects: *INTERSECTION graph theory, *GRAPH theory, *HOMOLOGY theory, *INTERSECTION homology theory
Abstract: Abstract: The maximum number of edges spanned by a subset of given diameter in a Hamming space with alphabet size at least three is determined. The binary case was solved earlier by Ahlswede and Khachatrian [A diametric theorem for edges, J. Combin. Theory Ser. A 92(1) (2000) 1–16]. [Copyright &y& Elsevier]
Published: 2008
Full Text: View/download PDF

164. Scalable Mammogram Retrieval Using Composite Anchor Graph Hashing With Iterative Quantization

Author: Jingjing Liu, Shaoting Zhang, Cheng Deng, Wei Liu, Dimitris N. Metaxas, and Yuanjie Zheng
Subjects: Computer science, Quantization (signal processing), Hash function, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, computer.software_genre, 030218 nuclear medicine & medical imaging, Visualization, 03 medical and health sciences, 0302 clinical medicine, Scalability, 0202 electrical engineering, electronic engineering, information engineering, Media Technology, Graph (abstract data type), 020201 artificial intelligence & image processing, Binary code, Data mining, Electrical and Electronic Engineering, Hamming space, Image retrieval, computer
Abstract: Content-based image retrieval (CBIR) shows great significance in clinical decision-making, which explores the visual content of medical images rather than keywords, tags, or descriptions. It provides doctors an image-guided approach to explore relevant cases that could offer doctors instructive reference. Mammogram screening has been known to be widely used in the early stage diagnosis of breast cancer and could reduce its morbidity and mortality. In this paper, we aim to develop a scalable CBIR method for a large repository of mammogram. To this end, we extend the original Anchor Graph Hashing (AGH) and propose a new unsupervised hashing algorithm, named as composite AGH with iterative quantization (C-AGH-ITQ), which compresses mammographic regions of interest (ROIs) into compact binary codes and enables real-time searching in Hamming space. Multimodal features and different distance metrics are integrated, performing upon a composite Anchor Graph. To improve the effectiveness of the hash code, quantization error is further iteratively minimized by introducing an orthogonal rotation matrix. We evaluate the presented C-AGH-ITQ algorithm on a data set of 11 533 mammographic ROIs obtained from the Digital Database for Screening Mammography. Our method obtains more than 84% retrieval precision and 93% classification accuracy (using $k$ NN prediction), which demonstrates that hash codes produced by C-AGH-ITQ well capture the visual similarities between mammographic images. In addition, since C-AGH-ITQ ensures linear complexity of the training procedure and constant time for query, our system is readily applicable to large-scale mammogram databases and has the potential to provide abundant clinical cases as reference.
Published: 2017

165. Toward Optimal Manifold Hashing via Discrete Locally Linear Embedding

Author: Liujuan Cao, Hong Liu, Di Liu, Rongrong Ji, Yongjian Wu, and Feiyue Huang
Subjects: Discrete mathematics, Hash function, Hamming distance, 02 engineering and technology, 010501 environmental sciences, Linear hashing, 01 natural sciences, Computer Graphics and Computer-Aided Design, Linear code, K-independent hashing, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Feature hashing, Hamming space, Algorithm, Hamming code, Software, 0105 earth and related environmental sciences, Mathematics
Abstract: Binary code learning, also known as hashing, has received increasing attention in large-scale visual search. By transforming high-dimensional features to binary codes, the original Euclidean distance is approximated via Hamming distance. More recently, it is advocated that it is the manifold distance, rather than the Euclidean distance, that should be preserved in the Hamming space. However, it retains as an open problem to directly preserve the manifold structure by hashing. In particular, it first needs to build the local linear embedding in the original feature space, and then quantize such embedding to binary codes. Such a two-step coding is problematic and less optimized. Besides, the off-line learning is extremely time and memory consuming, which needs to calculate the similarity matrix of the original data. In this paper, we propose a novel hashing algorithm, termed discrete locality linear embedding hashing (DLLH), which well addresses the above challenges. The DLLH directly reconstructs the manifold structure in the Hamming space, which learns optimal hash codes to maintain the local linear relationship of data points. To learn discrete locally linear embedding codes, we further propose a discrete optimization algorithm with an iterative parameters updating scheme. Moreover, an anchor-based acceleration scheme, termed Anchor-DLLH , is further introduced, which approximates the large similarity matrix by the product of two low-rank matrices. Experimental results on three widely used benchmark data sets, i.e., CIFAR10, NUS-WIDE, and YouTube Face, have shown superior performance of the proposed DLLH over the state-of-the-art approaches.
Published: 2017

166. Low Distortion Embedding of the Hamming Space into a Sphere with Quadrance Metric and k-means Clustering of Nominal-continuous Data

Author: Aleksander Denisiuk and Michal Grabowski
Subjects: Discrete mathematics, Algebra and Number Theory, Hamming bound, k-means clustering, Hamming distance, 02 engineering and technology, Theoretical Computer Science, Combinatorics, Computational Theory and Mathematics, Hamming graph, Low distortion, 020204 information systems, Metric (mathematics), 0202 electrical engineering, electronic engineering, information engineering, Embedding, 020201 artificial intelligence & image processing, Hamming space, Information Systems, Mathematics
Published: 2017

167. Triple-Bit Quantization with Asymmetric Distance for Image Content Security

Author: Hongtao Xie, Chenggang Yan, and Degang Xu
Subjects: Theoretical computer science, business.industry, Nearest neighbor search, Binary number, Image content, 020207 software engineering, 02 engineering and technology, Computer Science Applications, Hardware and Architecture, 0202 electrical engineering, electronic engineering, information engineering, Embedding, 020201 artificial intelligence & image processing, Binary code, The Internet, Computer Vision and Pattern Recognition, Artificial intelligence, business, Hamming space, Quantization (image processing), Algorithm, Software, Mathematics
Abstract: With the rapid growth of the number of images on the Internet, it has become more necessary to ensure the content security of images. The key problem is retrieving relevant images from the large database. Binary embedding is an effective way to improve the efficiency of calculating similarities for image content security as binary code is storage efficient and fast to compute. It tries to convert real-valued signatures into binary codes while preserving similarity of the original data, and most binary embedding methods quantize each projected dimension to one bit (presented as 0/1). As a consequence, it greatly decreases the discriminability of original signatures. In this paper, we first propose a novel triple-bit quantization strategy to solve the problem by assigning 3-bit to each dimension. Then, asymmetric distance algorithm is applied to re-rank candidates obtained from Hamming space for the final nearest neighbors. For simplicity, we call the framework triple-bit quantization with asymmetric distance (TBAD). The inherence of TBAD is combining the best of binary codes and real-valued signatures to get nearest neighbors quickly and concisely. Moreover, TBAD is applicable to a wide variety of embedding techniques. Experimental comparisons on BIGANN set show that the proposed method can achieve remarkable improvement in query accuracy compared to original binary embedding methods.
Published: 2017

168. SOMH: A self-organizing map based topology preserving hashing method

Author: Xin-Shun Xu, Xiao-Long Liang, Guan-Qun Yang, Yuliang Shi, Shanqing Guo, and Xiao-Lin Wang
Subjects: Self-organizing map, Optimization problem, Computer science, Cognitive Neuroscience, Hash function, 02 engineering and technology, 010501 environmental sciences, Topology, 01 natural sciences, Computer Science Applications, Locality-sensitive hashing, k-nearest neighbors algorithm, Artificial Intelligence, Locality preserving hashing, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Product topology, Hamming space, 0105 earth and related environmental sciences
Abstract: Hashing based approximate nearest neighbor (ANN) search techniques have attracted considerable attention in media search community because of its good potential for low storage cost and fast query speed. Many hashing based ANN search methods have been proposed; but, most of them just consider to keep the similarity relationship of data points during mapping instead of topology of data. It is well known that Self-Organizing Map can keep topology structure while conducting mapping task. Motivated by this, in this paper, we propose a Self-Organizing Map based hashing framework–SOMH, which cannot only keep similarity relationship, but also preserve topology of data. Specifically, in SOMH, Self-Organizing Map is introduced to map data points into hamming space. In addition, in order to make it work well on short and long binary codes, we propose a relaxed version of SOMH and a product space SOMH, respectively. For the optimization problem of the relaxed SOMH, we also present an iterative solution. Moreover, we further propose an extended version of SOMH, which can work well on multimodal data search task, i.e., cross-modal search. To test the performance of these proposed algorithms, we conduct experiments on three data sets–SIFT1M, GIST1M and Wiki (a multimodal dataset). Experimental results show that SOMH can outperform or is comparable to several state-of-the-arts.
Published: 2017

169. Partial Hash Update via Hamming Subspace Learning

Author: Furong Peng, Ivor W. Tsang, Chuancai Liu, and Chao Ma
Subjects: Theoretical computer science, Computer science, Hamming bound, Hash function, Hamming distance, 02 engineering and technology, Computer Graphics and Computer-Aided Design, Linear code, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Artificial Intelligence & Image Processing, 020201 artificial intelligence & image processing, Binary code, Hamming(7,4), Forward error correction, Hamming weight, Hamming space, Hamming code, Software
Abstract: © 2017 IEEE. Hashing technique has become an effective method for information retrieval due to the fast calculation of the Hamming distance. However, with the continuous growth of data coming from the Internet, the online update of hashing on the massive social data becomes very time-consuming. To alleviate this issue, in this paper, we propose a novel updating technique for hashing methods, namely Hamming Subspace Learning (HSL). The motivation of HSL is to generate a low-dimensional Hamming subspace from a high-dimensional Hamming space by selecting representative hash functions. Through HSL, we aim to improve the speed of updating binary codes for all samples. We present a method for Hamming subspace learning based on greedy selection strategy and the Distribution Preserving Hamming Subspace learning (DHSL) algorithm by designing a novel loss function. The experimental results demonstrate that the HSL is effective to improve the speed of online updating and the performance of hashing algorithm.
Published: 2017

170. Kernelized product quantization

Author: Zhou Jianshe, Yongdong Zhang, Yichao Zhang, Shi Jinsheng, and Liu Jie
Subjects: Theoretical computer science, Similarity (geometry), Computer science, Cognitive Neuroscience, Quantization (signal processing), Nearest neighbor search, Feature vector, Hash function, Hamming distance, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Computer Science Applications, Artificial Intelligence, Kernel (statistics), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Binary code, Hamming space, Quantization (image processing), Algorithm, 0105 earth and related environmental sciences
Abstract: There has been increasing interest in learning compact binary codes for large-scale image data representation and retrieval. In most existing hashing-based methods, high-dimensional vectors are hashed into Hamming space, and the similarity between two vectors is approximated by the Hamming distance between their binary codes. Although hashing-based binary codes generation methods were widely used, Product Quantization (PQ) has been shown to be more accurate than various hashing-based methods, largely due to its lower quantization distortions and more precise distance computation. However, it is still a challenging problem to generalize PQ to accommodate arbitrary kernels. In this paper, we demonstrate how to employ arbitrary kernel functions in a PQ scheme. First, we propose a Kernelized PQ (KPQ) method based on composite kernels, which serves as a basic framework by making the decomposition of implicit feature space possible. Furthermore, we propose a Kernelized Optimized PQ (KOPQ) method to generalize Optimized Product Quantization (OPQ) to an arbitrary implicit feature space. Finally, we propose a Supervised KPQ (SKPQ) to improve the performance of semantic neighbor search. Both methods are variations of KPQ with the incorporation of their corresponding core techniques, KPCA and KCCA respectively, to the basic KPQ framework. Experiments involving three notable datasets show that KPQ, KOPQ and SKPQ can outperform the state-of-the-art methods for a similarity search in feature space or semantic search.
Published: 2017

171. Large-scale image retrieval with supervised sparse hashing

Author: Xiao Tan, Yan Xu, Yuan Wang, Lianli Gao, Fumin Shen, and Xing Xu
Subjects: Clustering high-dimensional data, Universal hashing, Computer science, Cognitive Neuroscience, Dynamic perfect hashing, Hash function, 02 engineering and technology, 010501 environmental sciences, Overfitting, computer.software_genre, 01 natural sciences, Projection (linear algebra), Computer Science Applications, Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Binary code, Data mining, Hamming space, Image retrieval, Algorithm, computer, 0105 earth and related environmental sciences
Abstract: In recent years, learning based hashing becomes an attractive technique in large-scale image retrieval due to its low storage and computation cost. Hashing methods map each high-dimensional vector onto a low-dimensional hamming space by projection operators. However, when processing high dimensional data retrieval, many existing methods including hashing cost a majority of time on projection operators. In this paper, we solve this problem by implementing a sparsity regularizer. On one hand, due to the sparse property of the projection matrix, our method effectively lower both the storage and computation cost. On the other hand, we reduce the effective number of parameters involved in the learned projection matrix according to sparsity regularizer, which helps avoid overfitting problem. Without relaxing binary constraints, an iterative scheme jointly optimizing the objective function directly was given, which helps to obtain effective and efficient binary codes. We evaluate our method on three databases and compare it with some state-of-the-art hashing methods. Experimental results demonstrate that our method outperforms the comparison approaches.
Published: 2017

172. Hashing With Pairwise Correlation Learning and Reconstruction

Author: Ning Li, Xiao-Jiao Mao, and Yu-Bin Yang
Subjects: Theoretical computer science, Universal hashing, Computer science, Dynamic perfect hashing, Hash function, 02 engineering and technology, 010501 environmental sciences, 2-choice hashing, 01 natural sciences, Hash table, Computer Science Applications, Locality-sensitive hashing, K-independent hashing, Signal Processing, Locality preserving hashing, 0202 electrical engineering, electronic engineering, information engineering, Media Technology, 020201 artificial intelligence & image processing, Feature hashing, Electrical and Electronic Engineering, Hamming space, Double hashing, 0105 earth and related environmental sciences
Abstract: Existing hashing methods normally define certain specific forms of hash functions, after which an objective function can be formulated to optimize the loss on training set to learn the parameters. However, in this way, the hash function will be tightly coupled with the generated objective in most cases. Moreover, since the objectives are generally formulated with binary quantization, most of them are nonconvex, which makes the optimization difficult and consequently decreases the similarity preserving performance of hashing. To solve this problem, we propose a novel pairwise correlation preserving framework to learn compact binary codes for hashing. First, we project each data into a metric space and represent it as a vector encoding the underlying local and global structure by pairwise correlation learning. Afterwards, pairwise correlation reconstruction (PCR), is further proposed to preserve the correlations of data between the metric space and the hamming space to learn binary codes. The PCR model is convex. Moreover, no specific hash functions are needed to be predefined and the steps of correlation learning and reconstruction are independent. The above characteristics make the optimization of PCR easily and efficiently, and thus leads to better preservation of data similarity in hamming space.
Published: 2017

173. Interactive Exploration for Continuously Expanding Neuron Databases

Author: Zhongyu Li, Dimitris N. Metaxas, Aidong Lu, and Shaoting Zhang
Subjects: Similarity (geometry), Databases, Factual, Computer science, Information Storage and Retrieval, Relevance feedback, 02 engineering and technology, Similarity measure, computer.software_genre, General Biochemistry, Genetics and Molecular Biology, Pattern Recognition, Automated, Domain (software engineering), 03 medical and health sciences, 0302 clinical medicine, Data retrieval, Artificial Intelligence, Image Processing, Computer-Assisted, 0202 electrical engineering, electronic engineering, information engineering, medicine, Humans, Hamming space, Molecular Biology, Neurons, Database, medicine.anatomical_structure, 020201 artificial intelligence & image processing, Binary code, Neuron, computer, 030217 neurology & neurosurgery
Abstract: This paper proposes a novel framework to help biologists explore and analyze neurons based on retrieval of data from neuron morphological databases. In recent years, the continuously expanding neuron databases provide a rich source of information to associate neuronal morphologies with their functional properties. We design a coarse-to-fine framework for efficient and effective data retrieval from large-scale neuron databases. In the coarse-level, for efficiency in large-scale, we employ a binary coding method to compress morphological features into binary codes of tens of bits. Short binary codes allow for real-time similarity searching in Hamming space. Because the neuron databases are continuously expanding, it is inefficient to re-train the binary coding model from scratch when adding new neurons. To solve this problem, we extend binary coding with online updating schemes, which only considers the newly added neurons and update the model on-the-fly, without accessing the whole neuron databases. In the fine-grained level, we introduce domain experts/users in the framework, which can give relevance feedback for the binary coding based retrieval results. This interactive strategy can improve the retrieval performance through re-ranking the above coarse results, where we design a new similarity measure and take the feedback into account. Our framework is validated on more than 17,000 neuron cells, showing promising retrieval accuracy and efficiency. Moreover, we demonstrate its use case in assisting biologists to identify and explore unknown neurons.
Published: 2017

174. Nonexistence of a few binary orthogonal arrays

Author: Peter Boyvalenkov, Maya Stoyanova, and Tanya Marinova
Subjects: Discrete mathematics, Applied Mathematics, Binary number, 020206 networking & telecommunications, 02 engineering and technology, Kravchuk polynomials, 01 natural sciences, Reduction (complexity), 010104 statistics & probability, Cardinality, 0202 electrical engineering, electronic engineering, information engineering, Discrete Mathematics and Combinatorics, Point (geometry), Uniqueness, 0101 mathematics, Orthogonal array, Hamming space, Mathematics
Abstract: We develop and apply combinatorial algorithms for investigation of the feasible distance distributions of binary orthogonal arrays with respect to a point of the ambient binary Hamming space utilizing constraints imposed from the relations between the distance distributions of connected arrays. This turns out to be strong enough and we prove the nonexistence of binary orthogonal arrays of parameters (length, cardinality, strength) = ( 9 , 96 , 4 ) , ( 10 , 192 , 5 ) , ( 10 , 112 , 4 ) , ( 11 , 224 , 5 ) , ( 11 , 112 , 4 ) and ( 12 , 224 , 5 ) , resolving the first cases where the existence was undecided so far. For the existing arrays our approach allows substantial reduction of the number of feasible distance distributions which could be helpful for classification results (uniqueness, for example).
Published: 2017

175. A Novel DBSCAN Based on Binary Local Sensitive Hashing and Binary-KNN Representation

Author: Qing He, Qin Wei, Hai Xia Gu, and Xu Wang
Subjects: DBSCAN, Article Subject, General Computer Science, Computer science, business.industry, Hash function, Binary number, Pattern recognition, 02 engineering and technology, lcsh:QA75.5-76.95, Robustness (computer science), 020204 information systems, Binary data, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, lcsh:Electronic computers. Computer science, Artificial intelligence, Hamming space, business, Cluster analysis, Cluster expansion
Abstract: We revisit the classic DBSCAN algorithm by proposing a series of strategies to improve its robustness to various densities and its efficiency. Unlike the original DBSCAN, we first use the binary local sensitive hashing (LSH) which enables faster region query for the k neighbors of a data point. The binary data representation method based on k neighborhood is then proposed to map the dataset into the Hamming space for faster cluster expansion. We define a core point based on binary influence space to enhance the robustness to various densities. Also, we propose a seed point selection method, which is based on influence space and k neighborhood similarity, to select some seed points instead of all the neighborhood during cluster expansion. Consequently, the number of region queries can be decreased. The experimental results show that the improved algorithm can greatly improve the clustering speed under the premise of ensuring better algorithm clustering accuracy, especially for large-scale datasets.
Published: 2017

176. On the variance of average distance of subsets in the Hamming space

Author: Fu, Fang-Wei, Ling, San, and Xing, Chaoping
Subjects: *CODING theory, *DIGITAL electronics, *MACHINE theory, *COMPUTER programming
Abstract: Abstract: Let V be a finite set with q distinct elements. For a subset C of , denote the variance of the average Hamming distance of C. Let and denote the minimum and maximum variance of the average Hamming distance of subsets of with cardinality M, respectively. In this paper, we study and for general q. Using methods from coding theory, we derive upper and lower bounds on , which generalize and unify the bounds for the case . These bounds enable us to determine the exact value for and in several cases. [Copyright &y& Elsevier]
Published: 2005
Full Text: View/download PDF

177. On Locating-Dominating Codes in Binary Hamming Spaces.

Author: Honkala, Iiro, Laihonen, Tero, and Ranto, Sanna
Subjects: *MULTIPROCESSORS, *COMPUTERS, *MICROPROCESSORS, *CIPHERS, *HYPERCUBES
Abstract: Locating faulty processors in a multiprocessor system gives the motivation for locating-dominating codes. We consider these codes in binary hypercubes and generalize the concept for the situation in which we want to locate more than one malfunctioning processor. [ABSTRACT FROM AUTHOR]
Published: 2004

178. Bounds on the number of hidden neurons in three-layer binary neural networks

Author: Zhang, Zhaozhi, Ma, Xiaomin, and Yang, Yixian
Subjects: *ARTIFICIAL neural networks, *INTEGER programming, *BOOLEAN algebra, *ARTIFICIAL intelligence
Abstract: This paper investigates an important problem concerning the complexity of three-layer binary neural networks (BNNs) with one hidden layer. The neuron in the studied BNNs employs a hard limiter activation function with only integer weights and an integer threshold. The studies are focused on implementations of arbitrary Boolean functions which map from {0,1}n into {0,1}. A deterministic algorithm called set covering algorithm (SCA) is proposed for the construction of a three-layer BNN to implement an arbitrary Boolean function. The SCA is based on a unit sphere covering (USC) of the Hamming space (HS) which is chosen in advance. It is proved that for the implementation of an arbitrary Boolean function of n-variables (n≥3) by using SCA, ⌊3L/2⌋ hidden neurons are necessary and sufficient, where L is the number of unit spheres contained in the chosen USC of the n-dimensional HS. It is shown that by using SCA, the number of hidden neurons required is much less than that by using a two-parallel hyperplane method. In order to indicate the potential ability of three-layer BNNs, a lower bound on the required number of hidden neurons which is derived by using the method of estimating the Vapnik-Chervonenkis (VC) dimension is also given. [Copyright &y& Elsevier]
Published: 2003
Full Text: View/download PDF

179. Families of optimal codes for strong identification

Author: Laihonen, Tero and Ranto, Sanna
Subjects: *BINARY number system, *MULTIPROCESSORS
Abstract: Codes for strong identification are considered. The motivation for these codes comes from locating faulty processors in a multiprocessor system. Constructions and lower bounds on these codes are given. In particular, we provide two infinite families of optimal strongly identifying codes, which can locate up to two malfunctioning processors in a binary hypercube. [Copyright &y& Elsevier]
Published: 2002
Full Text: View/download PDF

180. On Identifying Codes in Binary Hamming Spaces

Author: Honkala, Iiro and Lobstein, Antoine
Subjects: *CODING theory, *BINARY number system, *LINEAR algebra
Abstract: A binary code C⊆{0,1}n is called r-identifying, if the sets Br(x)∩C, where Br(x) is the set of all vectors within the Hamming distance r from x, are all nonempty and no two are the same. Denote by Mr(n) the minimum possible cardinality of a binary r-identifying code in {0,1}n. We prove that if ρ∈[0,1) is a constant, then limn→∞n−1 log2M⌊ρn⌋(n)=1−H(ρ), where H(x)=−x log2x−(1−x) log2(1−x). We also prove that the problem whether or not a given binary linear code is r-identifying is Π2-complete. [Copyright &y& Elsevier]
Published: 2002
Full Text: View/download PDF

181. Locally linear spatial pyramid hash for large-scale image search

Author: Hangzai Luo, Jianping Fan, Jinye Peng, and Wanqing Zhao
Subjects: Computer Networks and Communications, Computer science, Nearest neighbor search, Hash function, 02 engineering and technology, Rolling hash, K-independent hashing, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Media Technology, Pyramid (image processing), Hamming space, Universal hashing, business.industry, Dynamic perfect hashing, Pattern recognition, Hash table, Hardware and Architecture, Computer Science::Computer Vision and Pattern Recognition, 020201 artificial intelligence & image processing, Feature hashing, Artificial intelligence, business, Perfect hash function, Software, Double hashing
Abstract: Hash-based methods can achieve a fast similarity search by representing high-dimensional data with compact binary codes. However, the spatial structure in row images was always lost in most previous methods. In this paper, a novel Locally Linear Spatial Pyramid Hash(LLSPH) algorithm is developed for the task of fast image retrieval. Unlike the conventional approach, the spatial extent of image features is exploited in our method. The spatial pyramid structure is used both to construct binary hash codes and to increase the discriminability of the description. To generate interpretable binary codes, the proposed LLSPH method captures the spatial characteristics of the original SPM and generates a low-dimensional sparse representation using multi-dictionaries Locality-constrained Linear Coding(MD_LLC). LLSPH then converts the low-dimensional data into Hamming space by the TF-IDF binarization rule. Our experimental results show that our LLSPH method can outperform several state-of-the-art hashing algorithms on the Caltech256 and ImageNet-500 datasets.
Published: 2016

182. On Codes Identifying Sets of Vertices in Hamming Spaces.

Author: Honkala, Iiro, Laihonen, Tero, and Ranto, Sanna
Abstract: A code $$C \subseteq F_2^n $$ is called ( t, ≤2)-identifying if for all the words x, y( x ≠ y) and $$z$$ the sets ( B t( x) ⋃ B t( y)) ⋂ C and $$B_t (z)\; \cap \;C$$ are nonempty and different. Constructions of such codes and a lower bound on the cardinality of these codes are given. The lower bound is shown to be sharp in some cases. We also discuss a more general notion of $$(t,\mathcal{F})$$ -identifying codes and introduce weakly identifying codes. [ABSTRACT FROM AUTHOR]
Published: 2001
Full Text: View/download PDF

183. Learning Angular Reconstruction based Binary Descriptor for Face Recognition

Author: Yunxiao Zu and Jing Chen
Subjects: business.industry, Computer science, Cosine similarity, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Pattern recognition, Facial recognition system, Euclidean distance, ComputingMethodologies_PATTERNRECOGNITION, Discriminative model, Face (geometry), Binary code, Artificial intelligence, Hamming space, business, Feature learning
Abstract: In this work, a simple angular reconstruction based binary descriptor (ARBD) strategy is proposed for face recognition. Compared with previous learning-based hashing methods such as compact binary face descriptor (CBFD) and sparse projection matrix binary descriptor (SPMBD) which measure data similarity with the Euclidean distance, our ARBD focuses on using the cosine similarity to generate binary codes. Specifically, the angular reconstruction term is added to the objective function to minimize the reconstruction error. Furthermore, an efficient algorithm based on the augmented Lagrange method (ALM) is designed to explicitly address the discrete constraint in the Hamming space. We also perform pooling on the learned binary codes with the unsupervised clustering manner to improve their discriminative ability. The results on two face datasets (i.e., CAS-PEAL-R1 and PaSC) demonstrate the superior performance of our ARBD over other existing face recognition methods.
Published: 2019

184. Learning to Hash for Efficient Search Over Incomplete Knowledge Graphs

Author: Meng Wang, Sen Wang, Yang Chen, Guilin Qi, Haomin Shen, Yinlin Jiang, and Lina Yao
Subjects: Euclidean distance, Theoretical computer science, Computer science, 020204 information systems, Hash function, 0202 electrical engineering, electronic engineering, information engineering, Embedding, 020201 artificial intelligence & image processing, Hamming distance, Binary code, 02 engineering and technology, Hamming space
Abstract: Knowledge graph (KG) embedding techniques represent entities and relations as low-dimensional, continuous vectors, and thus enables machine learning models to be easily adapted to KG completion and querying tasks. However, learned dense vectors are inefficient for large-scale similarity computations. Learning-to-hash is to learn compact binary codes from high-dimensional input data and provides a promising way to accelerate efficiency by measuring Hamming distance instead of Euclidean distance or dot-product. Unfortunately, most of learning-to-hash methods cannot be directly applied to KG structure encoding. In this paper, we introduce a novel framework for encoding incomplete KGs and graph queries in Hamming space. To preserve KG structure information from embeddings to hash codes and address the ill-posed gradient issue in optimization, we utilize a continuation method with convergence guarantees to jointly encode queries and KG entities with geometric operations. The hashed embedding of a query can be utilized to discover target answers from incomplete KGs whilst the efficiency has been greatly improved.We compared our model with state-of-the-art methods on real-world KGs. Experimental results show that our framework not only significantly speeds up the searching process, but also provides good results for unanswerable queries caused by incomplete information.
Published: 2019

185. Fast Semantic Preserving Hashing for Large-Scale Cross-Modal Retrieval

Author: Nannan Wang, Xin Liu, Yiu-ming Cheung, Zhikai Hu, Xingzhi Wang, and Shu-Juan Peng
Subjects: Class (computer programming), Theoretical computer science, Scale (ratio), Computer science, Hash function, 02 engineering and technology, Semantic data model, 01 natural sciences, Modal, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Orthonormal basis, 010306 general physics, Hamming space
Abstract: Most Cross-modal hashing methods do not sufficiently exploit the discrimination power of semantic information when learning hash codes, while often involving time-consuming training procedures for large-scale dataset. To tackle these issues, we first formulate the learning of similarity-preserving hash codes in terms of orthogonally rotating the semantic data to hamming space, and then propose a novel Fast Semantic Preserving Hashing (FSePH) approach to large-scale cross-modal retrieval. Specifically, FSePH introduces an orthonormal basis to regress the targeted hash codes of training examples to their corresponding reasonably relaxed class labels, featuring significantly reducing the quantization error. Meanwhile, an effective optimization algorithm is derived for modality-specific projection function learning and an efficient closed-form solution for hash code learning, which are computationally tractable. Extensive experiments have shown that the proposed FSePH approach runs sufficiently fast, and also significantly improves the retrieval performances over the state-of-the-arts.
Published: 2019

186. Separated Variational Hashing Networks for Cross-Modal Retrieval

Author: Peng Hu, Xu Wang, Liangli Zhen, and Dezhong Peng
Subjects: Modality (human–computer interaction), Theoretical computer science, Computer science, Nearest neighbor search, Hash function, 0202 electrical engineering, electronic engineering, information engineering, Code (cryptography), Binary number, 020206 networking & telecommunications, 020201 artificial intelligence & image processing, Binary code, 02 engineering and technology, Hamming space
Abstract: Cross-modal hashing, due to its low storage cost and high query speed, has been successfully used for similarity search in multimedia retrieval applications. It projects high-dimensional data into a shared isomorphic Hamming space with similar binary codes for semantically-similar data. In some applications, all modalities may not be obtained or trained simultaneously for some reasons, such as privacy, secret, storage limitation, and computational resource limitation. However, most existing cross-modal hashing methods need all modalities to jointly learn the common Hamming space, thus hindering them from handling these problems. In this paper, we propose a novel approach called Separated Variational Hashing Networks (SVHNs) to overcome the above challenge. Firstly, it adopts a label network (LabNet) to exploit available and nonspecific label annotations to learn a latent common Hamming space by projecting each semantic label into a common binary representation. Then, each modality-specific network can separately map the samples of the corresponding modality into their binary semantic codes learned by LabNet. We achieve it by conducting variational inference to match the aggregated posterior of the hashing code of LabNet with an arbitrary prior distribution. The effectiveness and efficiency of our SVHNs are verified by extensive experiments carried out on four widely-used multimedia databases, in comparison with 11 state-of-the-art approaches.
Published: 2019

187. Accelerate Learning of Deep Hashing With Gradient Attention

Author: Long-Kai Huang, Sinno Pan, and Jianda Chen
Subjects: Theoretical computer science, Artificial neural network, Computer science, business.industry, Deep learning, Hash function, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), 020201 artificial intelligence & image processing, Binary code, Artificial intelligence, Hamming space, business, Image retrieval, 0105 earth and related environmental sciences
Abstract: Recent years have witnessed the success of learning to hash in fast large-scale image retrieval. As deep learning has shown its superior performance on many computer vision applications, recent designs of learning-based hashing models have been moving from shallow ones to deep architectures. However, based on our analysis, we find that gradient descent based algorithms used in deep hashing models would potentially cause hash codes of a pair of training instances to be updated towards the directions of each other simultaneously during optimization. In the worst case, the paired hash codes switch their directions after update, and consequently, their corresponding distance in the Hamming space remain unchanged. This makes the overall learning process highly inefficient. To address this issue, we propose a new deep hashing model integrated with a novel gradient attention mechanism. Extensive experimental results on three benchmark datasets show that our proposed algorithm is able to accelerate the learning process and obtain competitive retrieval performance compared with state-of-the-art deep hashing models.
Published: 2019

188. Local Feature Hashing with Graph Regularized Binary Auto-encoder for Face Recognition

Author: Jing Chen and Yunxiao Zu
Subjects: 021110 strategic, defence & security studies, Computer science, business.industry, Quantization (signal processing), Feature extraction, Hash function, 0211 other engineering and technologies, Pattern recognition, 02 engineering and technology, Facial recognition system, Autoencoder, Graph, 0202 electrical engineering, electronic engineering, information engineering, Graph (abstract data type), 020201 artificial intelligence & image processing, Binary code, Feature hashing, Artificial intelligence, Laplacian matrix, Hamming space, business
Abstract: In recent years, learning-based hashing has gained increasing research interest in face recognition due to its encouraging storage and computational efficiency. However, learning high-quality binary codes that yield appealing recognition performance is still a challenge. In addition, since the neighborhood structure is important for discrimination, it should be taken into account to capture useful nearest neighbors. In this paper, we propose a novel and efficient graph-based hashing model, called local feature hashing with graph regularized binary auto-encoder (LFH-GRBA), to learn feature representations for face recognition. It considers the semantic neighborhood of the face data and seeks to reconstruct a face image from the learned binary codes. Specifically, the graph Laplacian is incorporated into the binary auto-encoder as a regularizer to exploit the semantical neighborhood information of the face data. To make such a model computational efficiency, a tractable alternating optimization approach is proposed to solve the objective function, yielding high-quality binary codes to well capture the neighborhood structure and provide high discrimination. Moreover, the discrete cyclic coordinate descent (DCC) method is adopted to directly learn binary codes in the Hamming space to eliminate the accumulated quantization errors. Extensive experimental results on three widely used datasets including FERET, LFW and PaSC datasets demonstrate that the proposed LFH-GRBA model outperforms most state-of-the-art face representation methods.
Published: 2019

189. Maximum-Margin Hamming Hashing

Author: Yue Cao, Kang Rong, Jianmin Wang, Philip S. Yu, and Mingsheng Long
Subjects: business.industry, Computer science, Computation, Hash function, Binary number, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, TheoryofComputation_ANALYSISOFALGORITHMSANDPROBLEMCOMPLEXITY, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Binary code, Artificial intelligence, Hamming space, business, Algorithm, Hamming code, 0105 earth and related environmental sciences
Abstract: Deep hashing enables computation and memory efficient image search through end-to-end learning of feature representations and binary codes. While linear scan over binary hash codes is more efficient than over the high-dimensional representations, its linear-time complexity is still unacceptable for very large databases. Hamming space retrieval enables constant-time search through hash lookups, where for each query, there is a Hamming ball centered at the query and the data points within the ball are returned as relevant. Since inside the Hamming ball implies retrievable while outside irretrievable, it is crucial to explicitly characterize the Hamming ball. The main idea of this work is to directly embody the Hamming radius into the loss functions, leading to Maximum-Margin Hamming Hashing (MMHH), a new model specifically optimized for Hamming space retrieval. We introduce a max-margin t-distribution loss, where the t-distribution concentrates more similar data points to be within the Hamming ball, and the margin characterizes the Hamming radius such that less penalization is applied to similar data points within the Hamming ball. The loss function also introduces robustness to data noise, where the similarity supervision may be inaccurate in practical problems. The model is trained end-to-end using a new semi-batch optimization algorithm tailored to extremely imbalanced data. Our method yields state-of-the-art results on four datasets and shows superior performance on noisy data.
Published: 2019

190. Обратимые вычисления: обзор проблемы и новые результаты (отказоустойчивость и криптография)

Author: Gurov, S.I., Zhukov, A.E., Zakablukov, D.V., and Kormakov, G.V.
Subjects: hamming space, information protection, fault-tolerant circuits, обратимая логика, защита информации, обратимые схемы с «уборкой мусора», reversible logic elements, хэммингово пространство, reversible circuits with "garbage collection", reversible logic, обратимые логические элементы, отказоустойчивые схемы
Abstract: В работе рассмотрены основные положения обратимости как новой парадигмы развития вычислительной техники. Первые разделы носят обзорный характер. Показана неизбежность т. н. теплового проклятия при сохранении традиционной парадигмы создания средств ВТ. Изложены основы обратимой логики, рассмотрены основные обратимые логические элементы и модели обратимых вычислений, в т. ч. обратимые клеточные автоматы. Кратко рассмотрены обратимые языки программирования. Во второй части затронуты основные вопросы логического синтеза схем из обратимых элементов и физическая реализация обратимой схемотехники. Кратко описана проблематика синтеза отказоустойчивых схем в парадигме обратимой схемотехники. Предлагается техника синтеза сбоеустойчивых обратимых элементов в хэмминговом пространстве и описываются некоторые такие схемы. Далее рассматривается проблематика применения схем из обратимых логических элементов в криптографии. Описывается предлагаемая общая схема создания обратимых схем с уборкой мусора , предназначенных для криптографических применений., The paper considers the main provisions of reversibility as a new paradigm for the development of computer technology. The first sections are of an overview nature. The inevitability of the so-called heat curse while maintaining the traditional paradigm of creating means of computer engineering. The fundamentals of reversible logic are presented, the main reversible logic elements and models of reversible computations, including reversible cellular automata, are considered. Reversible programming languages are briefly reviewed. The second part addresses the basic issues of the logical synthesis of circuits from reversible elements and the physical implementation of reversible circuitry. The synthesis of fault-tolerant circuits in the paradigm of reversible circuitry is briefly described. A technique for synthesizing fault-tolerant reversible elements in a hamming space is proposed and some such schemes are described. Next, the problems of using circuits of reversible logic elements in cryptography are considered. The proposed general scheme for creating reversible schemes with garbage collection intended for cryptographic applications is described., №3 (2020)
Published: 2019
Full Text: View/download PDF

191. Hash Code Indexing in Cross-Modal Retrieval

Author: Chih-Yi Chiu and Sarawut Markchit
Subjects: Computer science, Search engine indexing, Hash function, Hamming distance, 02 engineering and technology, Inverted index, 01 natural sciences, 0103 physical sciences, Data_FILES, 0202 electrical engineering, electronic engineering, information engineering, Code (cryptography), Benchmark (computing), 020201 artificial intelligence & image processing, Binary code, 010306 general physics, Hamming space, Algorithm
Abstract: Cross-modal hashing, which searches nearest neighbors across different modalities in the Hamming space, has become a popular technique to overcome the storage and computation barrier in multimedia retrieval recently. Although dozens of cross-modal hashing algorithms are proposed to yield compact binary code representation, applying exhaustive search in a large-scale dataset is impractical for the real-time purpose, and the Hamming distance computation suffers inaccurate results. In this paper, we propose a novel index scheme over binary hash codes in cross-modal retrieval. The proposed indexing scheme exploits a few binary bits of the hash code as the index code. Based on the index code representation, we construct an inverted index structure to accelerate the retrieval efficiency and train a neural network to improve the indexing accuracy. Experiments are performed on two benchmark datasets for retrieval across image and text modalities, where hash codes are generated by three cross-modal hashing methods. Results show the proposed method effectively boosts the performance over the benchmark datasets and hash methods.
Published: 2019

192. Supervised Generative Adversarial Cross-Modal Hashing by Transferring Pairwise Similarities for Venue Discovery

Author: Himanshu Aggarwal, Rajiv Ratn Shah, Feida Zhu, and Suhua Tang
Subjects: Modalities, business.industry, Computer science, Feature extraction, Hash function, 02 engineering and technology, Construct (python library), 010501 environmental sciences, Machine learning, computer.software_genre, Semantics, 01 natural sciences, Data modeling, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Pairwise comparison, Artificial intelligence, Hamming space, business, computer, 0105 earth and related environmental sciences
Abstract: Venue discovery using real-world multimedia data has not been investigated thoroughly. We are referring to business and travel locations as venues in this study and aim to improve the efficiency of venue discovery by hashing. Most existing supervised cross-modal hashing methods map data in different modalities to Hamming space, where the semantic information is exploited to supervise data of different modalities during the training stage. However, previous works neglect pairwise similarity between data in different modalities, which lead to degraded performance of hashing function learning. To address this issue, we propose a supervised Generative Adversarial Cross-modal Hashing method by Transferring Pairwise Similarities (SGACH-TPS). This work has three significant contributions: i) we propose a model for making efficient venue discovery, ii) the supervised generative adversarial network can construct a hash function to map multimodal data to a common hamming space. iii) a simple transfer training strategy for the adversarial network is suggested to supervise data in different modalities where the pairwise similarity is transferred to the fine-tuning stage of training. Evaluation on the new WikiVenue dataset confirms the superiority of the proposed method.
Published: 2019

193. Hardware-Accelerated Similarity Search with Multi-Index Hashing

Author: Felipe M. G. França, Victor C. Ferreira, Alexandre S. Nery, Brunno F. Goldstein, Leandro Santiago de Araújo, Leandro A. J. Marzulo, and Sandip Kundu
Subjects: 010302 applied physics, Speedup, Computer science, Nearest neighbor search, Hash function, Hamming distance, 02 engineering and technology, Parallel computing, 01 natural sciences, Hash table, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Binary code, Hamming space, Field-programmable gate array
Abstract: Similarity search plays a major role in a large variety of database applications where high-dimensional data, usually derived from images and videos, are mapped onto indexed structures. Its essence relies on a k-nearest-neighbors (kNN) algorithm to find similar contents given a specific input data. An efficient way to store and speedup such searches is to apply kNN in Hamming space by encoding the data as binary and splitting it into multiple hash tables. This paper proposes an efficient Hardware-Accelerated Similarity Search co-processor architecture using Multi-Index Hashing as storage structure. The accelerator is specified in Verilog hardware description language and implemented in a Xilinx low-cost Zynq FPGA. Performance, circuit-area and power consumption results are presented, with the accelerator being up to 13x and 18x faster than the corresponding C/C++ code on the ARM host processor when running the SIFT and GIST datasets, respectively, while also requiring less power.
Published: 2019

194. Central Similarity Quantization for Efficient Image and Video Retrieval

Author: Jiashi Feng, Li Yuan, Xiaopeng Zhang, Tao Wang, Zequn Jie, Wei Liu, and Francis Eh Tay
Subjects: FOS: Computer and information sciences, Computer science, business.industry, Computer Vision and Pattern Recognition (cs.CV), Quantization (signal processing), Hash function, Computer Science - Computer Vision and Pattern Recognition, Cryptography, Hamming distance, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, Hamming space, business, Algorithm, Image retrieval, Hadamard matrix, 0105 earth and related environmental sciences
Abstract: Existing data-dependent hashing methods usually learn hash functions from pairwise or triplet data relationships, which only capture the data similarity locally, and often suffer from low learning efficiency and low collision rate. In this work, we propose a new \emph{global} similarity metric, termed as \emph{central similarity}, with which the hash codes of similar data pairs are encouraged to approach a common center and those for dissimilar pairs to converge to different centers, to improve hash learning efficiency and retrieval accuracy. We principally formulate the computation of the proposed central similarity metric by introducing a new concept, i.e., \emph{hash center} that refers to a set of data points scattered in the Hamming space with a sufficient mutual distance between each other. We then provide an efficient method to construct well separated hash centers by leveraging the Hadamard matrix and Bernoulli distributions. Finally, we propose the Central Similarity Quantization (CSQ) that optimizes the central similarity between data points w.r.t.\ their hash centers instead of optimizing the local similarity. CSQ is generic and applicable to both image and video hashing scenarios. Extensive experiments on large-scale image and video retrieval tasks demonstrate that CSQ can generate cohesive hash codes for similar data pairs and dispersed hash codes for dissimilar pairs, achieving a noticeable boost in retrieval performance, i.e. 3\%-20\% in mAP over the previous state-of-the-arts. The code is at: \url{https://github.com/yuanli2333/Hadamard-Matrix-for-hashing}, CVPR2020, Codes: https://github.com/yuanli2333/Hadamard-Matrix-for-hashing
Published: 2019

195. An Effective Network Intrusion Detection Framework Based on Learning to Hash

Author: Heng Qi, Yuan Cao, Junxiao Wang, and Wenrui Zhou
Subjects: Computer science, Hash function, Detector, Anomaly detection, Data mining, Network intrusion detection, computer.software_genre, Hamming space, computer, Classifier (UML), k-nearest neighbors algorithm
Abstract: Nowadays, the network intrusion detection has been an important issue in IoT. Although machine learning based methods seem to be promising in traditional network intrusion detection, these methods can hardly meet some demands of IoT. For example, unknown classes of flows are produced frequently in IoT, leading to classifiers training repeatly. To address this issue, we proposed a network intrusion detection framework based on learning to hash in this paper, which can reduce computation overhead significantly while avoiding frequent training of classifiers. The proposed framework consists of a hashing encoding module and an anomaly detection module with optimized k-NN classifier based on data distribution ratio. Moreover, the multi-index hashing is applied for fast and accurate search in Hamming space. Experimental results show that the proposed framework can detect various attacks and outperform the traditional intrusion detector.
Published: 2019

196. Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval

Author: Wen Gu, Bo Li, Xiaoyan Gu, Jingzi Gu, Zhi Xiong, and Weiping Wang
Subjects: Theoretical computer science, Modal, Empirical research, Discriminative model, Computer science, Quantization (signal processing), Hash function, Binary code, Hamming space, Feature learning
Abstract: Cross-modal hashing has attracted considerable attention for large-scale multimodal retrieval task. A majority of hashing methods have been proposed for cross-modal retrieval. However, these methods inadequately focus on feature learning process and cannot fully preserve higher-ranking correlation of various item pairs as well as the multi-label semantics of each item, so that the quality of binary codes may be downgraded. To tackle these problems, in this paper, we propose a novel deep cross-modal hashing method, called Adversary Guided Asymmetric Hashing (AGAH). Specifically, it employs an adversarial learning guided multi-label attention module to enhance the feature learning part which can learn discriminative feature representations and keep the cross-modal invariability. Furthermore, in order to generate hash codes which can fully preserve the multi-label semantics of all items, we propose an asymmetric hashing method which utilizes a multi-label binary code map that can equip the hash codes with multi-label semantic information. In addition, to ensure higher-ranking correlation of all similar item pairs than those of dissimilar ones, we adopt a new triplet-margin constraint and a cosine quantization technique for Hamming space similarity preservation. Extensive empirical studies show that AGAH outperforms several state-of-the-art methods for cross-modal retrieval.
Published: 2019

197. Collective Reconstructive Embeddings for Cross-Modal Hashing

Author: Richang Hong, Mengqiu Hu, Heng Tao Shen, Fumin Shen, Yang Yang, and Ning Xie
Subjects: Similarity (geometry), Theoretical computer science, Computer science, Nearest neighbor search, Hash function, Cosine similarity, 02 engineering and technology, Computer Graphics and Computer-Aided Design, Euclidean distance, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial Intelligence & Image Processing, Hamming space, Software
Abstract: © 1992-2012 IEEE. In this paper, we study the problem of cross-modal retrieval by hashing-based approximate nearest neighbor search techniques. Most existing cross-modal hashing works mainly address the issue of multi-modal integration complexity using the same mapping and similarity calculation for data from different media types. Nonetheless, this may cause information loss during the mapping process due to overlooking the specifics of each individual modality. In this paper, we propose a simple yet effective cross-modal hashing approach, termed collective reconstructive embeddings (CRE), which can simultaneously solve the heterogeneity and integration complexity of multi-modal data. To address the heterogeneity challenge, we propose to process heterogeneous types of data using different modality-specific models. Specifically, we model textual data with cosine similarity-based reconstructive embedding to alleviate the data sparsity to the greatest extent, while for image data, we utilize the Euclidean distance to characterize the relationships of the projected hash codes. Meanwhile, we unify the projections of text and image to the Hamming space into a common reconstructive embedding through rigid mathematical reformulation, which not only reduces the optimization complexity significantly but also facilitates the inter-modal similarity preservation among different modalities. We further incorporate the code balance and uncorrelation criteria into the problem and devise an efficient iterative algorithm for optimization. Comprehensive experiments on four widely used multimodal benchmarks show that the proposed CRE can achieve a superior performance compared with the state of the art on several challenging cross-modal tasks.
Published: 2019

198. K-Nearest Neighbors Hashing

Author: Jian Cheng, Peisong Wang, and Xiangyu He
Subjects: Clustering high-dimensional data, business.industry, Computer science, Nearest neighbor search, Hash function, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Linear subspace, k-nearest neighbors algorithm, ComputingMethodologies_PATTERNRECOGNITION, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Binary code, Artificial intelligence, Space partitioning, Hamming space, business, Algorithm, 0105 earth and related environmental sciences, Sign (mathematics)
Abstract: Hashing based approximate nearest neighbor search embeds high dimensional data to compact binary codes, which enables efficient similarity search and storage. However, the non-isometry sign() function makes it hard to project the nearest neighbors in continuous data space into the closest codewords in discrete Hamming space. In this work, we revisit the sign() function from the perspective of space partitioning. In specific, we bridge the gap between k-nearest neighbors and binary hashing codes with Shannon entropy. We further propose a novel K-Nearest Neighbors Hashing (KNNH) method to learn binary representations from KNN within the subspaces generated by sign(). Theoretical and experimental results show that the KNN relation is of central importance to neighbor preserving embeddings, and the proposed method outperforms the state-of-the-arts on benchmark datasets.
Published: 2019

199. Discrete Semantic Alignment Hashing for Cross-Media Retrieval

Author: Xiangwei Kong, Tao Yao, Qi Tian, and Haiyan Fu
Subjects: Modality (human–computer interaction), Information retrieval, Computer science, Hash function, 02 engineering and technology, 010501 environmental sciences, Semantics, 01 natural sciences, Computer Science Applications, Human-Computer Interaction, Feature (linguistics), Control and Systems Engineering, 0202 electrical engineering, electronic engineering, information engineering, Collaborative filtering, 020201 artificial intelligence & image processing, Electrical and Electronic Engineering, Hamming space, Software, 0105 earth and related environmental sciences, Information Systems, Semantic gap
Abstract: Cross-media hashing, which maps data from different modalities to a low-dimensional sharing Hamming space, has attracted considerable attention due to the rapid increase of multimodal data, for example, images and texts. Recent cross-media hashing works mainly aim at learning compact hash codes to preserve the class label-based or feature-based similarities among samples. However, these methods ignore the unbalanced semantic gaps between different modalities and high-level semantic concepts, which generally results in less effective hash functions and unsatisfying retrieval performance. Specifically, the key words of texts contain semantic meanings, while the low-level features of images lack of semantic meanings. That means the semantic gap in image modality is larger than that in text modality. In this paper, we propose a simple yet effective hashing method for cross-media retrieval to address this problem, dubbed discrete semantic alignment hashing (DSAH). First, DSAH formulates to exploit collaborative filtering to mine the relations between class labels and hash codes, which can reduce memory consumption and computational cost compared to pairwise similarity. Then, the attribute of image modality is employed to align the semantic information with text modality. Finally, to further improve the quality of hash codes, we propose a discrete optimization algorithm to learn discrete hash codes directly, and each bit has a closed-form solution. Extensive experiments on multiple public databases show that our model can seamlessly incorporate attributes and achieve promising performance.
Published: 2019

200. Supervised Online Hashing via Hadamard Codebook Learning

Author: Hong Liu, Rongrong Ji, Yongjian Wu, and Mingbao Lin
Subjects: FOS: Computer and information sciences, Theoretical computer science, Computer science, Computer Vision and Pattern Recognition (cs.CV), Supervised learning, Hash function, Codebook, Computer Science - Computer Vision and Pattern Recognition, 02 engineering and technology, Multimedia (cs.MM), Semantic similarity, Robustness (computer science), Hadamard transform, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Binary code, Hamming space, Computer Science::Databases, Computer Science - Multimedia
Abstract: In recent years, binary code learning, a.k.a hashing, has received extensive attention in large-scale multimedia retrieval. It aims to encode high-dimensional data points to binary codes, hence the original high-dimensional metric space can be efficiently approximated via Hamming space. However, most existing hashing methods adopted offline batch learning, which is not suitable to handle incremental datasets with streaming data or new instances. In contrast, the robustness of the existing online hashing remains as an open problem, while the embedding of supervised/semantic information hardly boosts the performance of the online hashing, mainly due to the defect of unknown category numbers in supervised learning. In this paper, we proposed an online hashing scheme, termed Hadamard Codebook based Online Hashing (HCOH), which aims to solve the above problems towards robust and supervised online hashing. In particular, we first assign an appropriate high-dimensional binary codes to each class label, which is generated randomly by Hadamard codes to each class label, which is generated randomly by Hadamard codes. Subsequently, LSH is adopted to reduce the length of such Hadamard codes in accordance with the hash bits, which can adapt the predefined binary codes online, and theoretically guarantee the semantic similarity. Finally, we consider the setting of stochastic data acquisition, which facilitates our method to efficiently learn the corresponding hashing functions via stochastic gradient descend (SGD) online. Notably, the proposed HCOH can be embedded with supervised labels and it not limited to a predefined category number. Extensive experiments on three widely-used benchmarks demonstrate the merits of the proposed scheme over the state-of-the-art methods. The code is available at https://github.com/lmbxmu/mycode/tree/master/2018ACMMM_HCOH.
Published: 2019

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

553 results on '"Hamming space"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources