Installation and Configuration of the Virtuoso Faceted Browser

Prerequisites

Requires Virtuoso 6.0 TP1 or higher, either Open Source or Commercial Edition.

Pre Installation

Note: This step is not required for Virtuoso Release 6.1 and later.

If you have an existing Virtuoso 6.0 installation, and your Quad Store has more than 10,000 triples, please run these commands using the Virtuoso isql program before installing the Faceted Browser VAD.

Note: This step may take some time depending on how many triples are already in your Quad Store.

VAD Package Installation

  1. Install the Virtuoso Faceted Browser VAD package called fct from the available packages listed in the Install Packages section of the System Admin -> Packages tab of the Virtuoso Conductor. As an alternative you can download the VAD from our public download store.



  2. The HTML interface of the Faceted Browser Engine is exposed at: http://<cname>/fct, where "cname" is the <hostname>:<port-number> your Virtuoso instance is running on.



  3. The Faceted Browser Engine exposes a REST API at the endpoint: http://<cname>/fct/service.

Post Installation

  1. Build Full Text Indexes by running the following commands using the Virtuoso isql program:

    RDF_OBJ_FT_RULE_ADD (null, null, 'All'); VT_INC_INDEX_DB_DBA_RDF_OBJ ();

  2. Run the following procedure using the Virtuoso isql program to periodically populate label lookup tables and activate the Label text box of the Entity Label Lookup tab:

    urilbl_ac_init_db()

  3. Run the following procedure using the Virtuoso isql program to calculate the IRI ranks. Note: This should be re-run periodically as the data grows to re-rank the IRIs.

    s_rank()

  4. Fetch some Network Resources data to load some RDF triples in the quad store. This can easily be done by first installing the Virtuoso Sponger Middleware VAD and then loading the following URL in a Web browser, which will use the built-in Virtuoso description.vsp page to provides a hypertext description of RDF Linked Data. You may substitute any live Web page URL for "http://news.cnet.com/" but it is best if you select one known to have plenty of data.

    http://<cname>/about/html/http://news.cnet.com/







  5. Use the Faceted Browser Search and Find User Interface to search for information on "CNET":



  6. Results of the following form should be returned for the Network resource being fetched:



  7. Click Types link shown at the right vertical Navigation:
  8. Results of the classes/properties should be returned:



  9. To get Type description, click "Describe" link for a given type, for ex. "Person".
  10. A list of attributes and values should be presented for the given resource. Note that automatically is generated QRCode image for the described entity.



  11. Return back to the Attributes list from above by going to the "Facets" tab.
  12. To exclude a type, un-tick the checkbox associated with the type:



  13. Click the Type URI link:
  14. Results of excluding the Type(s) should be shown:



  15. The Faceted Browser Web service endpoint can also be queried to obtain the same results.
  16. Click "New search" from the Entity Relations Navigation and go to "Entity Label Lookup" tab:



  17. In the Label auto-complete text box of the Entity Label Lookup tab, enter the name of an rdfs label to be Described:



  18. Select a URI from the list of available Labels to obtain a description of the URI:







  19. Click "Facets" and go to "Entity URI Lookup" tab:



  20. In the URI auto-complete text box of the Entity URI Lookup tab enter the name URI to be Described:



  21. Select a URI from the list of available Labels to obtain a description of the URI:






  22. If data is loaded into the quad store via DML functions (TTLP, RDF_LOAD_RDFXML, etc.), the following procedure needs to be run from isql to rebuild the free text indexes each time:

    VT_INC_INDEX_DB_DBA_RDF_OBJ ()

URI Labels

  1. Go to http://<cname> fct
  2. Enter a free text search pattern (for example, "Camcorder" as consumer product), and click Search:



  3. Your initial query results page will display a list of literal value snippets where for each URL will be displayed a label:



  4. Click for ex. on the URL link of the first row result.
  5. The product description page should be shown and a list of Attributes and Values will be presented:





Usage Statistics

  1. Use the Faceted Browser Search and Find User Interface to search for information on "Michael Jackson":



  2. Results of the following form should be returned for the Network Resource being fetched:



  3. Click the "Types" link under "Entity Relations Navigation".
  4. Results about "Michael Jackson" as Type/Label/Count list should be displayed:



  5. You can navigate amongst the search results pages by using the "<" and ">" buttons. Click for ex. ">" in order to view the next page results:



  6. Click a type link, for ex.:

    yago:EpicRecordsArtist

  7. Displaying Ranked Entity Names and Text summaries should be shown:



  8. Click the link:

    dbpedia:Michael_Jackson

  9. Results about "Michael Jackson" as Attribute/Value list should be presented:







  10. Go to "Metadata" tab.
  11. Results of usage statistics for "Michael Jackson" grouped in 4 tabs should be shown:
    1. Source Documents: shows how many times the URI is found as object in the relevant graph(s):

      SPARQL SELECT ?g COUNT (*) WHERE { GRAPH ?g { ?s ?p <URI> } } GROUP BY ?g ORDER BY DESC 2 LIMIT 20 ;





    2. Referenced by Documents: shows how many times the URI is found as subject in the relevant graph(s):

      SPARQL SELECT ?g COUNT (*) WHERE { GRAPH ?g { <URI> ?p ?o } } GROUP BY ?g ORDER BY DESC 2 LIMIT 20 ;





    3. Explicit Coreferences: shows results as subject and calculated rank, based on running transitive closure over owl:sameAs of the URI in subject or object:

      SPARQL SELECT ?syn ( sql:rnk_scale (<LONG::IRI_RANK> (?syn))) WHERE { { SELECT ?s ?syn WHERE { { ?syn owl:sameAs ?s } UNION { ?s owl:sameAs ?syn } } } OPTION (transitive, t_distinct, t_min (0), T_in (?s), t_out (?syn)) . FILTER (!isliteral (?syn) && ?s = <URI> ) } ORDER BY DESC 2 LIMIT 20 ;





    4. Implicit Coreferences: shows expanded results for objects concur with the URI by IFP:

      SPARQL SELECT DISTINCT ?syn ?p ?o (sql:rnk_scale (<LONG::IRI_RANK> (?syn))) WHERE { <URI> ?p ?o . FILTER (0 != (<LONG::bif:rdf_is_sub> ("b3sifp", ?p, lod:ifp_like, 3))) . ?syn ?p ?o . } ORDER BY DESC 4 LIMIT 20 ;





Tutorials

Related