Selection

Original idea of selection was to sacrifice accuracy for speed. This means, if we start with the SPARQL query, given a huge dataset (RDF graph), it might take a while to get the answer back. The role of subsetting would be to, for example, partition the RDF graph into smaller subsets, and then, at query time, the process would be faster as we will execute queries against seleted subsets - not the whole huge graph. We can look at this as an indexing process where offline you create an index and then you use that index to perform various searches later.

The other example would be doing some kind of 'ranking' of all statements (RDF triples) which are included in the huge RDF graph. Then if we set the threshold, we can still call this subsetting. So, if I can simplify the selection/subsetting it would be like this:

(this happens at runtime, offline we can do all various calculations which take a lot of time such as building vector space, index...etc.) input: SPARQL query output: sub-graph which is a correct result (set of RDF triples) (output should be the same with or without using selection/subsetting method)

Using ML for selection

As we are working with RDF, we need to create these 'artificial' documents, which in our case we call RDF Molecules. These are lexicalisations of RDF, which we try to process using some method and see if we can get something useful. For example, could do some kind of clustering, where each cluster would be a set of RDF molecules (or maybe a set of triples relevant for the given query). At runtime (when someone sends the SPARQL query, we could look at only most relevant cluster, not the whole RDF graph). And 'the most relevant cluster' is the one with the highest score.

Active learning

Experimental design

Evaluation

The simplest way would be comparing results with and without selection method. The results should be the same i.e. selection method must take care of not loosing relevant data.

Mutual collaborations between WP2 and WP3 from WP3’s Viewpoint

A. What WP2 can contribute to WP3

B. What WP3 can contribute to WP2

LarkcProject/WP2/integration/ml (last edited 2010-01-14 15:29:02 by ?VolkerTresp)