Version 43 (modified by nixdell, 9 years ago) (diff)

VIQUEN: A Visual Query Engine for RDF

VIQUEN is a graphical tool for semantic query construction, execution and visualization that is based on the IML data flow graph transformation language for manipulating RDF data. VIQUEN enables the formulation of queries using a set of graphical query components and GUI-based editing actions. The formulated queries are automatically compiled into the IML query language, before being executed over local or online RDF data sets. The RDF data set resulting from the IML query is then visualized as a graph.

For background information on IML, RDF, SPARQL and vSPARQL, please refer to the IML documentation, which may be found here.

Implementation

VIQUEN has been implemented as a platform-independent Java application. The GUI components of the system have been built using the Java Swing toolkit, and both the query builder environment and the visualization environment use the JGraph visualization library. After the queries have been compiled into IML, they are executed over online RDF data sets using the Java AMF connection protocol, which connects to the Query Manager server. The RDF data sets returned by the queries are parsed using the Jena framework for building semantic web applications.

Query-building Environment

[Figure: screenshot_qb_1.png, the query-building environment]

The query-building environment is used to graphically formulate semantic queries. The user interface, shown above, may be divided into four main parts:

  1. The toolbar and system menus
  2. The operation library palettes
  3. The main query-building workspace
  4. The query-building workspace outline

1. The toolbar and system menus

The toolbar and system menus provide single-click options for managing the workspace: saving and loading queries; copying, pasting or deleting query operations; compiling queries into IML; and changing the look-and-feel of the application. Several layout options are provided that automatically arrange the flow of the query operations in a space-efficient manner. These may be accessed through the Diagram menu (Diagram -> Layout).

Several toolbar buttons have specialized query-building functionality. These include:
The data sources button: used to add, remove or edit the data sources and namespaces specified in the query.
The compile query button: used to automatically compile the query into IML and open the query execution environment.
The visualization button: used to open the visualization environment in a separate window so that local RDF files can be visualized.

2. The operation library palettes

The operation library palettes contain icons which represent query operations that may be added to the workspace. The query operations have been divided into five different palettes, with similar operations being grouped together. Operations are added to the main query-building workspace by dragging and dropping the appropriate icon from the relevant palette. Each palette additionally contains an Edge icon for adding directed edges to the data flow workspace.

The Extract palette contains shortcuts for the 5 Extract operations: Extract Edges, Extract Tree, Extract Reachable, Extract Path and Extract Recursive.
The Delete palette contains shortcuts for the 4 Delete operations: Delete Edges, Delete Node, Delete Property and Delete Tree.
The Replace palette contains shortcuts for the 7 Replace operations: Replace Edge Subject, Replace Edge Object, Replace Edge Property, Replace Edge Literal, Replace Node, Replace Property and Replace Literal.
The Where palette contains shortcuts for the 4 Where operations: Match Statements, Union Statements, Filter Statements and Optional Statements.
The Basic palette contains shortcuts for the remainder of the query operations: Start, Input, Output, Add Edges and Union Graphs.
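
The grouping of operations into palettes can be summarized as a simple lookup table. The following is only an illustrative sketch: the class `PaletteCatalog` and its methods are hypothetical names chosen for this example and are not taken from the VIQUEN source. The operation names are copied from the list above.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative lookup table for the palettes listed above.
// The class and method names are hypothetical, not part of VIQUEN.
class PaletteCatalog {

    // Palette name -> operation names, in the order given in the text.
    static Map<String, List<String>> palettes() {
        Map<String, List<String>> p = new LinkedHashMap<>();
        p.put("Extract", List.of("Extract Edges", "Extract Tree",
                "Extract Reachable", "Extract Path", "Extract Recursive"));
        p.put("Delete", List.of("Delete Edges", "Delete Node",
                "Delete Property", "Delete Tree"));
        p.put("Replace", List.of("Replace Edge Subject", "Replace Edge Object",
                "Replace Edge Property", "Replace Edge Literal",
                "Replace Node", "Replace Property", "Replace Literal"));
        p.put("Where", List.of("Match Statements", "Union Statements",
                "Filter Statements", "Optional Statements"));
        p.put("Basic", List.of("Start", "Input", "Output",
                "Add Edges", "Union Graphs"));
        return p;
    }

    // Returns the palette that lists the operation, or null if none does.
    static String paletteOf(String operation) {
        for (Map.Entry<String, List<String>> e : palettes().entrySet()) {
            if (e.getValue().contains(operation)) {
                return e.getKey();
            }
        }
        return null;
    }

    public static void main(String[] args) {
        palettes().forEach((name, ops) ->
                System.out.println(name + ": " + ops.size() + " operations"));
    }
}
```

The Edge icon is left out of the table because it appears in every palette and adds an edge rather than a query operation.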

3. The main query-building workspace

The main query-building workspace has been designed to take advantage of the data flow graph transformation style of IML. Each high-level query operation is represented in its own visual node. Nodes of the same type are the same color for easy identification. The visual nodes are then chained together, using directed edges, to compose the entire query. Each node contains maximize (+) and minimize (-) buttons that are used to expand and collapse the node. The bottom right corner of the node may be clicked and dragged to resize the node.

A query must begin with a Start node, which indicates the point from which the system will start to compile the query. By positioning the Start node appropriately, different chunks of the query may be executed individually before combining them into a larger query. After the Start node, the query is defined by adding one or more subquery blocks to the workspace.

Each subquery block begins with an Input node, which defines the data sources to be used as input to the query. Clicking on the "Manage data sources" button will bring up a list of available data sources which may be selected for inclusion in the query.

A subquery block must end with an Output node, which specifies the output graph for the block. This output graph may easily be added to the list of potential input data sources by clicking on the "Add to data sources" button and specifying a name for the output graph.

Between the Input node and the Output node, a number of different query operations may be added. These are described in detail below.
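
The structural rules above (a Start node first, followed by one or more blocks that each run from an Input node to an Output node) can be sketched as a small check. This is a simplified illustration, not VIQUEN's compiler: it assumes the query is a single linear chain of operation names, whereas a real workspace is a graph, and the class `SubqueryBlockCheck` is a hypothetical name.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical checker for the Start / Input ... Output rules described above.
// Assumes a linear chain of operation names; real VIQUEN queries are graphs.
class SubqueryBlockCheck {

    // Returns the problems found; an empty list means the chain follows the rules.
    static List<String> validate(List<String> chain) {
        List<String> problems = new ArrayList<>();
        boolean startsWithStart = !chain.isEmpty() && chain.get(0).equals("Start");
        if (!startsWithStart) {
            problems.add("query must begin with a Start node");
        }
        boolean inBlock = false; // true between an Input and its Output
        int blocks = 0;          // completed Input ... Output blocks
        for (int i = startsWithStart ? 1 : 0; i < chain.size(); i++) {
            String op = chain.get(i);
            if (op.equals("Input")) {
                if (inBlock) {
                    problems.add("Input at position " + i + " opens a block before the previous one ended");
                }
                inBlock = true;
            } else if (op.equals("Output")) {
                if (inBlock) {
                    blocks++;
                } else {
                    problems.add("Output at position " + i + " has no matching Input");
                }
                inBlock = false;
            } else if (!inBlock) {
                problems.add(op + " at position " + i + " is outside any Input ... Output block");
            }
        }
        if (inBlock) {
            problems.add("last subquery block does not end with an Output node");
        } else if (blocks == 0) {
            problems.add("query needs at least one subquery block");
        }
        return problems;
    }

    public static void main(String[] args) {
        System.out.println(validate(List.of("Start", "Input", "Extract Tree", "Output")));
        System.out.println(validate(List.of("Start", "Input", "Delete Node")));
    }
}
```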

4. The query-building workspace outline

An outline of the main query-building workspace is provided at the bottom left-hand side of the screen, with a dark blue rectangle indicating the fraction of the workspace currently being viewed. The query workspace may be navigated by clicking and dragging on this rectangle.

Execution Environment

[Figure: screenshot_ex_1.png, the query execution environment]

The query execution environment, shown above, consists of three components: the query component, the results component and a simple menu bar. The query component displays the generated query in IML. Clicking on the "execute query" button causes the query to be executed, and the results are displayed in raw RDF/XML format in the results component. The system also provides an alert indicating the number of RDF triples returned by the query.

Both the generated query and the resulting RDF/XML may be saved to a local file using the "save query" and "save results" buttons. Clicking on the "visualize results" button will open VIQUEN’s visualization environment.

Visualization Environment

[Figure: screenshot_rv_1.2.png, the visualization environment]

The visualization environment, shown above, facilitates exploration and manipulation of an RDF graph. The user interface may be divided into five main components:

  1. The toolbar and system menus
  2. The tree and list views
  3. The main visualization workspace
  4. The visualization workspace pop-up menu
  5. The visualization workspace outline

1. The toolbar and system menus

The visualization environment has been designed in a fashion consistent with the query-building environment, and utilizes similar layouts, menus and toolbars. As in the query-building environment, several automatic graph layout options are available from the menu (Diagram -> Layout). Additionally, VIQUEN visualizations can be loaded from and saved to disk using the same file format as that for saving visual queries.

The load RDF button allows locally saved RDF files to be loaded and visualized.

2. The tree and list views

The tree and list views of the RDF data set are located in the upper left-hand side of the workspace. The tree view depicts the RDF using a tree structure showing the graph of nodes. The list view provides an alphabetized list of the nodes. Clicking on a node in either view makes the node available for viewing and manipulation in the main visualization workspace. If the node is currently visible, the system selects it and scrolls to it. If the node is not currently visible, the system makes it visible, along with its children and parent nodes.
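
The click behaviour described above can be sketched as follows. This is a minimal model under stated assumptions, not VIQUEN's implementation: RDF statements are reduced to plain strings, "children" are taken to be the objects of a node's outgoing statements and "parents" the subjects of its incoming statements, and scrolling is not modelled. The names `RevealSketch`, `Triple` and `onNodeClicked` are hypothetical.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical model of clicking a node in the tree or list view.
class RevealSketch {

    // One RDF statement reduced to plain strings (illustrative helper type).
    static final class Triple {
        final String subject, property, object;
        Triple(String subject, String property, String object) {
            this.subject = subject;
            this.property = property;
            this.object = object;
        }
    }

    final List<Triple> triples;
    final Set<String> visible = new HashSet<>(); // nodes currently shown
    String selected;                             // null until a node is selected

    RevealSketch(List<Triple> triples) {
        this.triples = triples;
    }

    void onNodeClicked(String node) {
        if (visible.contains(node)) {
            // Already on screen: select it (scrolling is a GUI concern, not modelled).
            selected = node;
            return;
        }
        // Not on screen: show the node together with its children and parents.
        visible.add(node);
        for (Triple t : triples) {
            if (t.subject.equals(node)) {
                visible.add(t.object);  // child of the clicked node
            }
            if (t.object.equals(node)) {
                visible.add(t.subject); // parent of the clicked node
            }
        }
    }

    public static void main(String[] args) {
        RevealSketch view = new RevealSketch(List.of(
                new Triple("a", "p", "b"),
                new Triple("b", "q", "c")));
        view.onNodeClicked("b"); // first click reveals b, its parent a and its child c
        view.onNodeClicked("b"); // second click selects b
        System.out.println(view.visible + " selected=" + view.selected);
    }
}
```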

3. The main visualization workspace

The main visualization workspace depicts the RDF visually as a graph consisting of nodes connected by edges. The nodes represent the subjects and objects of the RDF triples, while the edges represent the properties. Since queries may return a large number of RDF triples, VIQUEN does not attempt to display the entire results graph on the screen at one time, as this would make the graph difficult to understand and navigate. Instead, VIQUEN identifies one or more likely root nodes from which to start the visualization. The most appropriate of these root nodes may then be chosen using the tree or list view of the graph.
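
The text does not say how likely root nodes are chosen. One plausible heuristic, shown here purely as an assumption, is to treat every node that occurs as a subject but never as an object as a root candidate, falling back to all subjects when the graph is entirely cyclic. The class `RootCandidates` and its `Triple` helper are hypothetical names.

```java
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical root-finding heuristic; VIQUEN's actual rule is not documented here.
class RootCandidates {

    // One RDF statement reduced to plain strings (illustrative helper type).
    static final class Triple {
        final String subject, property, object;
        Triple(String subject, String property, String object) {
            this.subject = subject;
            this.property = property;
            this.object = object;
        }
    }

    // Returns nodes with outgoing edges and no incoming edges, in sorted order.
    static Set<String> find(List<Triple> triples) {
        Set<String> subjects = new TreeSet<>();
        Set<String> objects = new TreeSet<>();
        for (Triple t : triples) {
            subjects.add(t.subject);
            objects.add(t.object);
        }
        Set<String> roots = new TreeSet<>(subjects);
        roots.removeAll(objects);
        // If every subject is also an object (a cyclic graph), fall back to all subjects.
        return roots.isEmpty() ? subjects : roots;
    }

    public static void main(String[] args) {
        System.out.println(find(List.of(
                new Triple("a", "p", "b"),
                new Triple("b", "q", "c"),
                new Triple("e", "p", "c"))));
    }
}
```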

Properties in the visualization workspace are displayed as directed edges, starting at the subject of the RDF triple and ending at the object, with the property name as the edge label. Nodes are displayed as blue rectangles labeled with the name of the node. A node may be selected and moved by clicking and dragging it in the workspace. Positioning the mouse pointer over a node’s information icon displays the full name of the node and its total number of incoming and outgoing edges. Clicking on the show children button makes all of the node's child nodes visible. Note that this button is only displayed in nodes that have children.

4. The visualization workspace pop-up menu

Additional functionality for further visualization and exploration of the RDF is available in a pop-up menu, which is accessed by right-clicking in the main visualization workspace. In addition to the basic cut, copy, paste, delete and undo actions, the menu contains three submenus: select, group and show/hide. The select submenu has options to select all of the nodes, none of the nodes, the children of a particular node or the entire subtree rooted at a particular node. The group submenu has options that allow a number of nodes to be grouped together and then collapsed into a single representative group node. The group may then be expanded and collapsed as a single unit, or opened in a separate visualization workspace for more detailed manipulation. The show/hide submenu provides a variety of choices for manipulating currently visible nodes: show or hide the child nodes, the parent nodes or the subtree rooted at a node. There are also options to show or hide the entire graph, or just the selected portion of the graph.
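
Selecting the children of a node or the subtree rooted at a node amounts to following outgoing edges. The sketch below illustrates the idea and is not VIQUEN's code: it assumes children are the objects of a node's outgoing statements, and it keeps a visited set because an RDF graph may contain cycles. The names `SubtreeSelection`, `Triple`, `children` and `subtree` are hypothetical.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helpers for the "select children" and "select subtree" actions.
class SubtreeSelection {

    // One RDF statement reduced to plain strings (illustrative helper type).
    static final class Triple {
        final String subject, property, object;
        Triple(String subject, String property, String object) {
            this.subject = subject;
            this.property = property;
            this.object = object;
        }
    }

    // Direct children: objects of the statements whose subject is the node.
    static Set<String> children(List<Triple> triples, String node) {
        Set<String> result = new LinkedHashSet<>();
        for (Triple t : triples) {
            if (t.subject.equals(node)) {
                result.add(t.object);
            }
        }
        return result;
    }

    // Every node reachable from root via outgoing edges, including root itself.
    static Set<String> subtree(List<Triple> triples, String root) {
        Set<String> seen = new LinkedHashSet<>();
        Deque<String> queue = new ArrayDeque<>();
        seen.add(root);
        queue.add(root);
        while (!queue.isEmpty()) {
            String current = queue.remove();
            for (String child : children(triples, current)) {
                // add() returns false for nodes already visited, which stops cycles.
                if (seen.add(child)) {
                    queue.add(child);
                }
            }
        }
        return seen;
    }

    public static void main(String[] args) {
        List<Triple> triples = List.of(
                new Triple("a", "p", "b"),
                new Triple("b", "q", "c"),
                new Triple("c", "r", "a"));
        System.out.println(subtree(triples, "a"));
    }
}
```

The same reachable set could also drive the show/hide subtree actions, since they operate on the subtree rooted at a node.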

5. The visualization workspace outline

An outline of the main visualization workspace is provided at the bottom left-hand side of the screen, with a dark blue rectangle indicating the fraction of the workspace currently being viewed. The visualization workspace may be navigated by clicking and dragging on this rectangle.
