Abstract | In this paper we present a new model of semantic features that, unlike previously presented methods, does not rely on the presence of a labeled training data base, as the creation of the feature extraction function is done in an unsupervised manner. We test these features on an unsupervised classification (clustering) task, and show that they outperform primitive (low-level) features, and that have performance comparable to that of supervised semantic features, which are much more expensive to determine relying on the presence of a labeled training set to train the feature extraction function.
|