This HTML5 document contains 51 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

PrefixIRI
dctermshttp://purl.org/dc/terms/
atomhttp://atomowl.org/ontologies/atomrdf#
foafhttp://xmlns.com/foaf/0.1/
n6http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/
n20http://vos.openlinksw.com/dataspace/services/wiki/
n12http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/VirtSetCrawlerJobsGuideSitemaps/
oplhttp://www.openlinksw.com/schema/attribution#
n2http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/
dchttp://purl.org/dc/elements/1.1/
n18http://vos.openlinksw.com/dataspace/dav#
rdfshttp://www.w3.org/2000/01/rdf-schema#
n21http://rdfs.org/sioc/services#
siocthttp://rdfs.org/sioc/types#
n9http://vos.openlinksw.com/dataspace/person/dav#
n8http://vos.openlinksw.com/dataspace/owiki/wiki/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
n15http://vos.openlinksw.com/dataspace/owiki#
n14http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#
xsdhhttp://www.w3.org/2001/XMLSchema#
n17http://cname:port/
n13http://vos.openlinksw.com/dataspace/%28NULL%29/wiki/VOS/
n16http://vos.openlinksw.com/dataspace/person/owiki#
siochttp://rdfs.org/sioc/ns#

Statements

Subject Item
n9:this
foaf:made
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n18:this
sioc:creator_of
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n20:item
n21:services_of
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n15:this
sioc:creator_of
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n8:VOS
sioc:container_of
n2:VirtSetCrawlerJobsGuideSitemaps
atom:entry
n2:VirtSetCrawlerJobsGuideSitemaps
atom:contains
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtSetCrawlerJobsGuideSemanticSitemapsFuncExample
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtSetCrawlerJobsGuideSemanticSitemaps
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtSetCrawlerJobsGuideSitemaps
rdf:type
sioct:Comment atom:Entry
dcterms:created
2017-06-13T05:48:33.027482
dcterms:modified
2017-06-13T05:48:33.027482
rdfs:label
VirtSetCrawlerJobsGuideSitemaps
foaf:maker
n9:this n16:this
dc:title
VirtSetCrawlerJobsGuideSitemaps
opl:isDescribedUsing
n12:sioc.rdf
sioc:has_creator
n15:this n18:this
sioc:attachment
n6:cr11.png n6:cr11a.png n6:cr1.png n6:cr12.png n6:cr12a.png n6:cr11ab.png n6:cr11b.png n6:cr15.png n6:cr2.png n6:cr13.png n6:cr14.png n6:cr3.png
sioc:content
%META:TOPICPARENT{name="VirtSetCrawlerJobsGuide"}% ---+Setting up a Content Crawler Job to retrieve Sitemaps The following guide describes how to set up a crawler job for getting content of a basic Sitemap where the source includes RDFa. 1 From the Virtuoso Conductor User Interface i.e. http://cname:port/conductor, login as the "dba" user. 1 Go to "Web Application Server" tab. %BR%%BR%<a href="%ATTACHURLPATH%/cr1.png" target="_blank"><img src="%ATTACHURLPATH%/cr1.png" width="600px" /></a>%BR%%BR% 1 Go to the "Content Imports" tab. %BR%%BR%<a href="%ATTACHURLPATH%/cr2.png" target="_blank"><img src="%ATTACHURLPATH%/cr2.png" width="600px" /></a>%BR%%BR% 1 Click on the "New Target" button. %BR%%BR%<a href="%ATTACHURLPATH%/cr3.png" target="_blank"><img src="%ATTACHURLPATH%/cr3.png" width="600px" /></a>%BR%%BR% 1 In the form displayed: * Enter a name of choice in the "Crawl Job Name" text-box: <verbatim> Basic Sitemap Crawling Example </verbatim> * Enter the URL of the site to be crawled in the "Data Source Address (URL)" text-box: <verbatim> http://psclife.pscdog.com/catalog/seo_sitemap/product/&nbsp </verbatim> * Enter the location in the Virtuoso WebDAV repository the crawled should stored in the "Local WebDAV Identifier" text-box, for example, if user demo is available, then: <verbatim> /DAV/home/demo/basic_sitemap/ </verbatim> * Choose the "Local resources owner" for the collection from the list-box available, for ex: user demo. * Select the "Accept RDF" check-box. %BR%%BR%<a href="%ATTACHURLPATH%/cr11.png" target="_blank"><img src="%ATTACHURLPATH%/cr11.png" width="600px" /></a>%BR%<a href="%ATTACHURLPATH%/cr11a.png" target="_blank"><img src="%ATTACHURLPATH%/cr11a.png" width="600px" /></a>%BR%%BR% 1 Click the "Create" button to create the import: %BR%%BR%<a href="%ATTACHURLPATH%/cr12.png" target="_blank"><img src="%ATTACHURLPATH%/cr12.png" width="600px" /></a>%BR%%BR% 1 Click the "Import Queues" button. 1 For the "Robot targets" with label "Basic Sitemap Crawling Example " click the "Run" button. 1 This will result in the Target site being crawled and the retrieved pages stored locally in DAV and any sponged triples in the RDF Quad store. %BR%%BR%<a href="%ATTACHURLPATH%/cr13.png" target="_blank"><img src="%ATTACHURLPATH%/cr13.png" width="600px" /></a>%BR%%BR% 1 Go to the "Web Application Server" -> "Content Management" tab. %BR%%BR%<a href="%ATTACHURLPATH%/cr14.png" target="_blank"><img src="%ATTACHURLPATH%/cr14.png" width="600px" /></a>%BR%%BR% 1 Navigate to the location of newly created DAV collection: <verbatim> /DAV/home/demo/basic_sitemap/ </verbatim> 1 The retrieved content will be available in this location. %BR%%BR%<a href="%ATTACHURLPATH%/cr15.png" target="_blank"><img src="%ATTACHURLPATH%/cr15.png" width="600px" /></a>%BR%%BR% ---++Related * [[VirtSetCrawlerJobsGuide][Setting up Crawler Jobs Guide using Conductor]] * [[http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#rdfinsertmethodvirtuosocrawler][Setting up a Content Crawler Job to Add RDF Data to the Quad Store]] * [[VirtSetCrawlerJobsGuideSemanticSitemaps][Setting up a Content Crawler Job to Retrieve Semantic Sitemaps (a variation of the standard sitemap)]] * [[VirtSetCrawlerJobsGuideDirectories][Setting up a Content Crawler Job to Retrieve Content from Specific Directories]] * [[VirtCrawlerSPARQLEndpoints][Setting up a Content Crawler Job to Retrieve Content from SPARQL endpoint]]
sioc:id
8e3b28cf81a7848dc1ce50585dfeebf2
sioc:link
n2:VirtSetCrawlerJobsGuideSitemaps
sioc:has_container
n8:VOS
n21:has_services
n20:item
atom:title
VirtSetCrawlerJobsGuideSitemaps
sioc:links_to
n2:VirtSetCrawlerJobsGuideDirectories n2:VirtSetCrawlerJobsGuide n13:VirtCrawlerSPARQLEndpoints n14:rdfinsertmethodvirtuosocrawler n17:conductor, n2:WebDAV
atom:source
n8:VOS
atom:author
n9:this
atom:published
2017-06-13T05:48:33Z
atom:updated
2017-06-13T05:48:33Z
sioc:topic
n8:VOS
Subject Item
n2:VirtCrawlerSPARQLEndpoints
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtCrawlerGuideAtom
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps