Show simple item record

Depositordc.contributorPareti, Paolo
Funderdc.contributor.otherSICSA - Scottish Informatics and Computer Science Allianceen_UK
Time Perioddc.coverage.temporalstart=2014-06-16; end=2014-07-16; scheme=W3C-DTFen
Data Creatordc.creatorPareti, Paolo
Data Creatordc.creatorKlein, Ewan H.
Date Accessioneddc.date.accessioned2016-04-29T14:33:47Z
Date Availabledc.date.available2016-04-29T14:33:47Z
Citationdc.identifier.citationPareti, Paolo; Klein, Ewan H.. (2016). The Human Know-How Dataset, 2014 [dataset]. https://doi.org/10.7488/ds/1394.en
Persistent Identifierdc.identifier.urihttps://hdl.handle.net/10283/1985
Persistent Identifierdc.identifier.urihttps://doi.org/10.7488/ds/1394
Dataset Description (abstract)dc.description.abstractThe Human Know-How Dataset describes 211,696 human activities from many different domains. These activities are decomposed into 2,609,236 entities (each with an English textual label). These entities represent over two million actions and half a million pre-requisites. Actions are interconnected both according to their dependencies (temporal/logical orders between actions) and decompositions (decomposition of complex actions into simpler ones). This dataset has been integrated with DBpedia (259,568 links). For more information see: - The project website: http://homepages.inf.ed.ac.uk/s1054760/prohow/index.htm - The data is also available on datahub: https://datahub.io/dataset/human-activities-and-instructions ---------------------------------------------------------------- * Quickstart: if you want to experiment with the most high-quality data before downloading all the datasets, download the file "9of11_knowhow_wikihow", and optionally files "Process - Inputs", "Process - Outputs", "Process - Step Links" and "wikiHow categories hierarchy". * Data representation based on the PROHOW vocabulary: http://w3id.org/prohow# Data extracted from existing web resources is linked to the original resources using the Open Annotation specification * Data Model: an example of how the data is represented within the datasets is available in the attached Data Model PDF file. The attached example represents a simple set of instructions, but instructions in the dataset can have more complex structures. For example, instructions could have multiple methods, steps could have further sub-steps, and complex requirements could be decomposed into sub-requirements. ---------------------------------------------------------------- Statistics: * 211,696: number of instructions. From wikiHow: 167,232 (datasets 1of11_knowhow_wikihow to 9of11_knowhow_wikihow). From Snapguide: 44,464 (datasets 10of11_knowhow_snapguide to 11of11_knowhow_snapguide). * 2,609,236: number of RDF nodes within the instructions From wikiHow: 1,871,468 (datasets 1of11_knowhow_wikihow to 9of11_knowhow_wikihow). From Snapguide: 737,768 (datasets 10of11_knowhow_snapguide to 11of11_knowhow_snapguide). * 255,101: number of process inputs linked to 8,453 distinct DBpedia concepts (dataset Process - Inputs) * 4,467: number of process outputs linked to 3,439 distinct DBpedia concepts (dataset Process - Outputs) * 376,795: number of step links between 114,166 different sets of instructions (dataset Process - Step Links)en_UK
Dataset Description (TOC)dc.description.tableofcontentsInstruction datasets: * Datasets 1of11_knowhow_wikihow to 9of11_knowhow_wikihow contain instructions from wikiHow. Instructions are allocated in the datasets in order of popularity. This means that the most popular and high-quality instructions are found in 9of11_knowhow_wikihow, while the least popular ones are in dataset 1of11_knowhow_wikihow. These instructions are also classified according to the hierarchy found in wikiHow categories hierarchy. * Datasets 10of11_knowhow_snapguide to 11of11_knowhow_snapguide contain instructions from Snapguide. Instructions coming from Snapguide are not sorted by their popularity. Links datasets: * The Process - Inputs datasets contain detailed information about the inputs of the sets of instructions, including links to DBpedia resources * The Process - Outputs datasets contains detailed information about the outputs of the sets of instructions, including links to DBpedia resources * The Process - Step Links datasets contains links between different sets of instructions Other datasets: *The wikiHow categories hierarchy dataset contains information on how the various wikiHow categories are hierarchically structureden_UK
Languagedc.language.isoengen_UK
Relation (Is Version Of)dc.relation.isversionofhttps://datahub.io/dataset/human-activities-and-instructionsen_UK
Relation (Is Referenced By)dc.relation.isreferencedbyhttps://doi.org/10.1007/978-3-319-13704-9_30en_UK
Relation (Is Referenced By)dc.relation.isreferencedbyPareti P, Testu B, Ryutaro I, Klein E, Barker A "Integrating Know-How into the Linked Data Cloud", chapter in "Knowledge Engineering and Knowledge Management", Volume 8876 of the series Lecture Notes in Computer Science pp 385-396 http://link.springer.com/chapter/10.1007%2F978-3-319-13704-9_30en
Rightsdc.rightsDataset released under the Creative Commons Attribution-NonCommercial 4.0 International licence: http://creativecommons.org/licenses/by-nc/4.0/ Attribution to this dataset should be given by citing the following publication (https://doi.org/10.1007/978-3-319-13704-9_30): Paolo Pareti, Benoit Testu, Ryutaro Ichise, Ewan Klein and Adam Barker. Integrating Know-How into the Linked Data Cloud. Knowledge Engineering and Knowledge Management, volume 8876 of Lecture Notes in Computer Science, pages 385-396. Springer International Publishing (2014) N.B. the reason for the 'non-commercial use only' restriction is that part of the data comes from wikiHow and Snapguide, which do not allow the reuse of their data for commercial purposes.en
Sourcedc.sourcehttp://www.wikihow.com/en_UK
Sourcedc.sourcehttps://snapguide.com/en_UK
Subjectdc.subjectLinked Dataen_UK
Subjectdc.subjectCommon Sense Reasoningen_UK
Subjectdc.subjectKnow-Howen_UK
Subjectdc.subjectHuman Activitiesen_UK
Subjectdc.subjectInstructionsen_UK
Subjectdc.subjectProceduresen_UK
Subjectdc.subjectProcessesen_UK
Subjectdc.subjectWorkflowsen_UK
Subjectdc.subjectSemantic Weben_UK
Subject Classificationdc.subject.classificationMathematical and Computer Sciencesen_UK
Titledc.titleThe Human Know-How Dataseten_UK
Alternative Titledc.title.alternativeThe Web of Know-How: Human Activities and Instructionsen_UK
Typedc.typedataseten_UK

Download All
zip file MD5 Checksum: 573ca467eb0b74c62c02294af914fdad

Files in this item

Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record