Person dataset

Description:

This dataset includes 502,529 persons from Wikipedia.

The person graph is extracted from DBpedia 2014 and represented as RDF triples, including each person's categories. A separate person JSON document, crawled from Wikipedia through its API, contains each person's image URL.

Complete Data Statistics:

RDF graph (Person)
  Source nodes     502,529
  Labeled edges    48
  Target nodes     417,315
  Triples          4,471,823

JSON document (Person)
  Rows             153,134
  Attributes       10

Sample Data:

person.graph

Tom_Hanks birthDate "1956-07-09"^^<http://www.w3.org/2001/XMLSchema#date>

Tom_Hanks birthYear "1956"^^<http://www.w3.org/2001/XMLSchema#gYear>

Tom_Hanks activeYearsStartYear "1978"^^<http://www.w3.org/2001/XMLSchema#gYear>

Tom_Hanks networth "3.9E8"^^<http://dbpedia.org/datatype/usDollar>

Tom_Hanks birthPlace Concord,_California

Tom_Hanks occupation Tom_Hanks__1

Tom_Hanks residence Los_Angeles

Tom_Hanks residence California

Tom_Hanks education Chabot_College

Tom_Hanks education California_State_University,_Sacramento

Tom_Hanks religion Eastern_Orthodox_Church

Tom_Hanks spouse Rita_Wilson

Tom_Hanks child Colin_Hanks

Tom_Hanks relation Jim_Hanks
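
The sample lines above suggest one triple per line, written as subject, predicate, and object separated by whitespace, where the object is either a node name or a typed literal of the form "value"^^<datatype>. The following is a minimal sketch of a line parser under that assumption; the names Triple and parse_triple are hypothetical and not part of the dataset, and the actual file may contain cases this sketch does not handle.

```python
import re
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str                 # node name, or the literal's value
    datatype: Optional[str]  # datatype URI for typed literals, else None


# Assumed literal shape: "value"^^<datatype-uri>
_LITERAL = re.compile(r'^"(?P<value>.*)"\^\^<(?P<datatype>[^>]+)>$')


def parse_triple(line: str) -> Triple:
    """Parse one 'subject predicate object' line (assumed layout)."""
    # Split on the first two runs of whitespace only, so the object
    # keeps any characters that follow. Raises ValueError if the line
    # has fewer than three fields.
    subject, predicate, obj = line.strip().split(None, 2)
    match = _LITERAL.match(obj)
    if match:
        return Triple(subject, predicate,
                      match.group("value"), match.group("datatype"))
    return Triple(subject, predicate, obj, None)


if __name__ == "__main__":
    print(parse_triple("Tom_Hanks spouse Rita_Wilson"))
    print(parse_triple(
        'Tom_Hanks birthYear "1956"^^<http://www.w3.org/2001/XMLSchema#gYear>'
    ))
```

Keeping the datatype separate from the value makes it possible to convert dates, years, and dollar amounts to native types later without re-parsing the line.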

person.json

{
  "batchcomplete": "",
  "query": {
    "normalized": [{
      "from": "Tom_Hanks",
      "to": "Tom Hanks"}],
    "pages": {
      "43568": {
        "pageid": 43568,
        "ns": 0,
        "title": "Tom Hanks",
        "thumbnail": {
          "original": "https://upload.wikimedia.org/wikipedia/commons/f/f5/TomHanks1989.png"
        }
      }
    }
  }
}
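
To show how the image URL can be read from a record shaped like the sample above, here is a minimal sketch. It assumes every record follows the same nesting (query, then pages, then thumbnail, then original); the function name image_urls and the SAMPLE constant are hypothetical, and how records are delimited inside person.json is not stated here, so the sketch handles a single record passed as a string.

```python
import json
from typing import Dict

# A single record, copied from the sample in this document.
SAMPLE = """
{"batchcomplete": "",
 "query": {
   "normalized": [{"from": "Tom_Hanks", "to": "Tom Hanks"}],
   "pages": {
     "43568": {
       "pageid": 43568,
       "ns": 0,
       "title": "Tom Hanks",
       "thumbnail": {
         "original": "https://upload.wikimedia.org/wikipedia/commons/f/f5/TomHanks1989.png"
       }
     }
   }
 }
}
"""


def image_urls(document: str) -> Dict[str, str]:
    """Map each page title to its image URL (assumed record layout)."""
    data = json.loads(document)
    pages = data.get("query", {}).get("pages", {})
    urls = {}
    for page in pages.values():
        # Pages without an image are assumed to lack the thumbnail key.
        original = page.get("thumbnail", {}).get("original")
        if original:
            urls[page["title"]] = original
    return urls


if __name__ == "__main__":
    for title, url in image_urls(SAMPLE).items():
        print(title, url)
```

Note that the page title uses spaces ("Tom Hanks") while the graph uses underscores ("Tom_Hanks"); the "normalized" entry in the sample records that mapping, so replacing spaces with underscores is one plausible way to join the two files.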

Example Queries:

Query   Description
q1      Find the top-k people who look like Barack Obama and live in the USA.

Download Files: Person.zip