Cardoso, &. 1, 2 , Sacramento, J. 1, 2 & Wichert, A. . 1, 2
1 Universidade Técnica de Lisboa
2 INESC-ID Lisboa
In this work we investigate the sparsification method for efficient information retrieval using the associative memory.
A unary representation of a number h would require binary vector of length d=log2 h+1. However if we represent the number h unary we are required h position. A binary number of length d is represented by a unary number of 2^d positions which is exponential in the size of input.
A binary vector x of dimension t is split into f distinct sub vectors of dimension p=dim(t/f). The binary sub vectors of dimension p=dim(t/f) are represented as unary vectors of dimension 2^p.
The resulting binary vector is composed out of the unary vectors and has the dimension f * 2^p.
Such a binary sparse vector could correspond to set of c sensors at different positions. At a position one and only one sensor is activated. For f positions and c sensors a state would be represented by binary vector of the dimension f*c with f ones.
An example of such unary coding is the MTC model of the visual system [1], it is a less complex description of the pattern recognition capabilities of the Neocognitron. Each layer represents a set of features. A binary vector in which the positions represent different features at different positions on the image describes it. The input image is tiled with a squared mask M of size j * j in which a corresponding category of a feature is determined. Each feature is determined through the use of the elements in each squared mask. The categories can be learned by a simple clustering algorithm such as K-Means [1] that represents the distribution of the data.
[1] Cardoso, A, Wichert A.: Neocognitron and the Map Transformation Cascade, Neural Networks, 23 (1): 74-88, 2010