This HTML5 document contains 44 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Prefix   Namespace IRI
dcterms  http://purl.org/dc/terms/
atom     http://atomowl.org/ontologies/atomrdf#
foaf     http://xmlns.com/foaf/0.1/
n26      http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/VirtSetCrawlerJobsGuideDirctories/sioc.
n5       http://vos.openlinksw.com/dataspace/services/wiki/
opl      http://www.openlinksw.com/schema/attribution#
n2       http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/
n27      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/cr2.
n24      http://localhost:8890/
dc       http://purl.org/dc/elements/1.1/
n8       http://vos.openlinksw.com/dataspace/dav#
rdfs     http://www.w3.org/2000/01/rdf-schema#
n6       http://rdfs.org/sioc/services#
n23      http://vos.openlinksw.com/dataspace/person/dav#
sioct    http://rdfs.org/sioc/types#
n4       http://vos.openlinksw.com/dataspace/owiki/wiki/
n11      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/d5.
n13      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/d6.
n14      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/d3.
n15      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/d4.
n16      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/d1.
n17      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/d2.
n10      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/d7.
n29      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/cr1.
rdf      http://www.w3.org/1999/02/22-rdf-syntax-ns#
n28      http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/cr3.
n7       http://vos.openlinksw.com/dataspace/owiki#
n22      http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#
xsdh     http://www.w3.org/2001/XMLSchema#
n12      http://vos.openlinksw.com/dataspace/%28NULL%29/wiki/VOS/
n21      http://vos.openlinksw.com/dataspace/person/owiki#
sioc     http://rdfs.org/sioc/ns#
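
The statements that follow use prefixed names such as n2:VirtSetCrawlerJobsGuideDirctories. As a minimal sketch of how such a name maps back to a full IRI, the Python below concatenates the namespace IRI and the local part. The PREFIXES dictionary copies only a few rows of the prefix table, and the function name expand is a hypothetical helper chosen for illustration, not part of any tool mentioned in this document.

```python
# A few rows copied from the prefix table; the remaining rows follow the same pattern.
PREFIXES = {
    "n2": "http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/",
    "n4": "http://vos.openlinksw.com/dataspace/owiki/wiki/",
    "n29": "http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideDirctories/cr1.",
    "sioc": "http://rdfs.org/sioc/ns#",
    "dcterms": "http://purl.org/dc/terms/",
}


def expand(name, prefixes=PREFIXES):
    """Expand a prefixed name (hypothetical helper) into a full IRI."""
    prefix, sep, local = name.partition(":")
    if not sep or prefix not in prefixes:
        raise KeyError("unknown prefix in %r" % name)
    # The full IRI is the namespace IRI followed directly by the local part.
    return prefixes[prefix] + local


if __name__ == "__main__":
    print(expand("n2:VirtSetCrawlerJobsGuideDirctories"))
    print(expand("n29:png"))
```

Under this reading, an attachment value such as n29:png is the namespace ending in "cr1." joined with "png", which gives the IRI of the cr1.png image.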
Subject Item
n23:this
foaf:made
n2:VirtSetCrawlerJobsGuideDirctories
Subject Item
n8:this
sioc:creator_of
n2:VirtSetCrawlerJobsGuideDirctories
Subject Item
n5:item
n6:services_of
n2:VirtSetCrawlerJobsGuideDirctories
Subject Item
n7:this
sioc:creator_of
n2:VirtSetCrawlerJobsGuideDirctories
Subject Item
n4:VOS
sioc:container_of
n2:VirtSetCrawlerJobsGuideDirctories
atom:entry
n2:VirtSetCrawlerJobsGuideDirctories
atom:contains
n2:VirtSetCrawlerJobsGuideDirctories
Subject Item
n2:VirtSetCrawlerJobsGuideDirctories
rdf:type
sioct:Comment atom:Entry
dcterms:created
2017-06-13T05:38:19.214673
dcterms:modified
2017-06-13T05:38:19.214673
rdfs:label
VirtSetCrawlerJobsGuideDirctories
foaf:maker
n21:this n23:this
dc:title
VirtSetCrawlerJobsGuideDirctories
opl:isDescribedUsing
n26:rdf
sioc:has_creator
n7:this n8:this
sioc:attachment
n10:png n11:png n13:png n14:png n15:png n16:png n17:png n27:png n28:png n29:png
sioc:content
---+Guide for Setting up Crawler Jobs for Directories

The following guide describes how to set up a crawler job for retrieving directories using Conductor.

   1 Go to the Conductor UI, for example at http://localhost:8890/conductor .
   1 Enter the dba credentials.
   1 Go to "Web Application Server". %BR%%BR%<img src="%ATTACHURLPATH%/cr1.png" style="wikiautogen"/>%BR%%BR%
   1 Go to "Content Imports". %BR%%BR%<img src="%ATTACHURLPATH%/cr2.png" style="wikiautogen"/>%BR%%BR%
   1 Click "New Target". %BR%%BR%<img src="%ATTACHURLPATH%/cr3.png" style="wikiautogen"/>%BR%%BR%
   1 In the form that is shown:
      * For "Target description", enter:
<verbatim>
Gov.UK data
</verbatim>
      * For "Target URL", enter:
<verbatim>
http://source.data.gov.uk/data/
</verbatim>
      * For "Copy to local DAV collection", enter a collection of an available user, for example demo:
<verbatim>
/DAV/home/demo/gov.uk/
</verbatim>
      * From the "Local resources owner" list, choose a user, for example demo. %BR%%BR%<img src="%ATTACHURLPATH%/d1.png" style="wikiautogen"/>%BR%%BR%
      * Click the "Create" button.
   1 As a result, the Robot target is created: %BR%%BR%<img src="%ATTACHURLPATH%/d2.png" style="wikiautogen"/>%BR%%BR%
   1 Click "Import Queues". %BR%%BR%<img src="%ATTACHURLPATH%/d3.png" style="wikiautogen"/>%BR%%BR%
   1 For the "Robot target" labeled "Gov.UK data", click "Run".
   1 As a result, the status of the pages is shown: retrieved, pending, or waiting. %BR%%BR%<img src="%ATTACHURLPATH%/d4.png" style="wikiautogen"/>%BR%%BR%
   1 Click "Retrieved Sites".
   1 As a result, the total number of pages retrieved should be shown. %BR%%BR%<img src="%ATTACHURLPATH%/d5.png" style="wikiautogen"/>%BR%%BR%
   1 Go to "Web Application Server" -> "Content Management".
   1 Enter the path:
<verbatim>
DAV/home/demo/gov.uk
</verbatim>
   %BR%%BR%<img src="%ATTACHURLPATH%/d6.png" style="wikiautogen"/>%BR%%BR%
   1 Go to the path:
<verbatim>
DAV/home/demo/gov.uk/data
</verbatim>
   1 As a result, the retrieved content is shown. %BR%%BR%<img src="%ATTACHURLPATH%/d7.png" style="wikiautogen"/>%BR%%BR%

---++Related

   * [[VirtSetCrawlerJobsGuide][Setting up Crawler Jobs Guide using Conductor]]
   * [[http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#rdfinsertmethodvirtuosocrawler][Setting up Crawler Job for inserting RDF data]]
   * [[VirtSetCrawlerJobsGuideSitemaps][Setting up Crawler Job for retrieving Sitemaps (basic where the source has RDFa)]]
   * [[VirtSetCrawlerJobsGuideSemanticSitemaps][Setting up Crawler Job for retrieving Semantic Sitemaps -- a variation of standard sitemap]]
sioc:id
e04c88cf189ae48746a4a2959589cac3
sioc:link
n2:VirtSetCrawlerJobsGuideDirctories
sioc:has_container
n4:VOS
n6:has_services
n5:item
atom:title
VirtSetCrawlerJobsGuideDirctories
sioc:links_to
n12:VirtSetCrawlerJobsGuide n22:rdfinsertmethodvirtuosocrawler n12:VirtSetCrawlerJobsGuideSitemaps n12:VirtSetCrawlerJobsGuideSemanticSitemaps n24:conductor
atom:source
n4:VOS
atom:author
n23:this
atom:published
2017-06-13T05:38:19Z
atom:updated
2017-06-13T05:38:19Z
sioc:topic
n4:VOS
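
The crawler guide carried in the sioc:content value uses the target URL http://source.data.gov.uk/data/ and the local collection /DAV/home/demo/gov.uk/, and then browses to DAV/home/demo/gov.uk/data to see the result. Judging only from that one example, the retrieved content appears to land under the collection path followed by the URL path. The Python below is a rough sketch of that relationship. The function name expected_dav_path is hypothetical, and the mapping rule is an assumption inferred from the example paths, not a documented crawler behavior.

```python
import posixpath
from urllib.parse import urlparse


def expected_dav_path(target_url, local_collection):
    """Guess where crawled content lands (assumption based on the guide's example).

    Appends the path component of the target URL to the local DAV collection.
    """
    url_path = urlparse(target_url).path.strip("/")
    base = local_collection.rstrip("/")
    if not url_path:
        # A target URL with no path maps to the collection itself.
        return base
    return posixpath.join(base, url_path)


if __name__ == "__main__":
    # Values taken from the guide's example form entries.
    print(expected_dav_path("http://source.data.gov.uk/data/", "/DAV/home/demo/gov.uk/"))
```

With the guide's values this yields /DAV/home/demo/gov.uk/data, which matches the path opened in "Content Management" at the end of the walkthrough. Other targets may be stored differently, so the result should be checked in the Conductor UI.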