Patent dataset

Description:

This dataset spans 37 years (January 1, 1963 to December 30, 1999), and includes all the utility patents granted during that period, totaling 3,923,922 patents.

The citation graph includes all citations made by patents granted between 1975 and 1999, totaling 16,522,438 citations. The data was originally released by NBER (http://www.nber.org/patents/), and then processed as five tables and a graph, including patent table, inventor table, assignee table, class table, category table, citation graph.

Relational Table

Patent

Inventor Assignee Class Category
Rows 2,923,922 4,301,229 175,115 426 36
Attributes 23 11 7 4 5
Directed Graph
Citation
Nodes 3,774,768
Edges 16,518,948

Sample Data:

Patent.table

"PATENT","GYEAR","GDATE","APPYEAR","COUNTRY","POSTATE","ASSIGNEE","ASSCODE","CLAIMS","NCLASS",...

3070801,1963,1096,,"BE","",,1,,269,...

3070802,1963,1096,,"US","TX",,1,,2,...

3070803,1963,1096,,"US","IL",,1,,2,...

3070804,1963,1096,,"US","OH",,1,,2,...

3070805,1963,1096,,"US","CA",,1,,2,...

where

  • "PATENT" is "Patent number"
  • "GYEAR" is "Grant year"
  • "GDATE" is "Grant date"
  • "APPYEAR" is "Application year"
  • "COUNTRY" is "Country of first inventor"
  • "POSTATE" is "State of first inventor (if US)"
  • "ASSIGNEE" is "Assignee identifier (missing 1963-1967)"
  • "ASSCODE" is "Assignee Type",
  • "CLAIMS" is "Number of claims",
  • "NCLASS" is "Main patent class (3 digit) "

inventor.table

"PATENT","LASTNAM","FIRSTNAM","MIDNAM","MODIFNAM","STREET","CITY","POSTATE","COUNTRY","ZIP","INVSEQ"

3858241,"Durand","Philip","E.","","","Hudson","MA","US","",1

3858241,"Norris","Lonnie","H.","","","Milford","MA","US","",2

3858242,"Gooding","Elwyn","R.","","120 Darwin Rd.","Pinckney","MI","US","48169",1

3858243,"Pierron","Claude","Raymond","","","Epinal","","FR","",1

3858243,"Jenny","Jean","Paul","","","Decines","","FR","",2

where

  • "PATENT" is "Patent number"
  • "LASTNAM" is "Last name of inventor"
  • "FIRSTNAM" is "First name of inventor"
  • "MIDNAM" is "Middle name of inventor"
  • "MODIFNAM" is "Surname modifier"
  • "STREET" is "Street address"
  • "CITY" is "City"
  • "POSTATE" is "State code"
  • "COUNTRY" is "Country code"
  • "ZIP" is "Zip code"
  • "INVSEQ" is "Inventor sequence number"

assignee.table

Lorem ips"ASSIGNEE","ASSNAME","CNAME","CUSIP","OWN","PNAME","SNAME"

5,"AAA ASSOCIATES, INC.",,,,,

10,"AAA OFFICE COFFEE SERVICE, INC.",,,,,

15,"AAA PIPE CLEANING CORPORATION",,,,,

20,"AAA PRODUCTS INTERNATIONAL INC.",,,,,

25,"AAA SALES & ENGINEERING, INC.",,,,,um

where

  • "ASSIGNEE" is "Assignee identifier"
  • "ASSNAME" is "Patent assignee name"
  • "CNAME" is "Compustat name"
  • "CUSIP" is "Compustat CUSIP"
  • "OWN" is "Percent of ownership"
  • "PNAME" is "Parent name"
  • "SNAME" is "Subsidiary name"

category.table

"CAT","SUBCAT","SUBCATNAME","CATNAMESHORT","CATENAMELONG"

1,11,Agriculture & Food & Textiles,Chemical,Chemical

1,12,Coating,Chemical,Chemical

1,13,Gas,Chemical,Chemical

1,14,Organic Compounds,Chemical,Chemical

1,15,Resins,Chemical,Chemical

where

  • "CAT" is "Technological category"
  • "SUBCAT" is "Technological sub-category"
  • "SUBCATNAME" is "Sub-category name"
  • "CATNAMESHORT" is "Category short name"
  • "CATENAMELONG" is "Category long name"

class.table

"CLASS","CNAME","SUBCAT","CAT"

1,** Classification Undetermined **,69,6

2,Apparel,63,6

4,Baths, Closets, Sinks, and Spittoons,65,6

5,Beds,65,6 7,Compound Tools,59,5

where

  • "CLASS" is "Class identifier"
  • "CNAME" is "Class name"
  • "SUBCAT" is "Technological sub-category"
  • "CAT" is "Technological category"

citation.graph:

"CITING","CITED"

3858241,956203

3858241,1324234

3858241,3398406

3858242,1515701

3858242,3319261

Example Queries:

query description
q1 Find all patents which has more than 5 times citation.
q2 Find all patents which has more than 5 times citation and belongs to chemical category.
q3 Find all patents which is invented by influencing inventors (whose citation is more than 10 times).
q4 Find all assignees which assigns the patents cited more than 5 times and belongs to CUSIP.
q5 Find all assignees which assigns the patents cited by chemical category patents.
q6 Find the inventor who has most citation in chemical category.
q7 Count average citation of each inventor.
q8 Find all inventors who has been cited more than average citation.
q9 Find all inventors who has invented more than 3 categories patents with at least 3 times citation.
q10 Find all inventors who cooperates with the influencing inventors, but not belong to influencing inventors.

Download Files: Patent.zip