U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Method and system for reconstructing original distributions from randomized numeric data

Patent 6687691 Issued on February 3, 2004. Estimated Expiration Date: Icon_subject January 19, 2020. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Data mining method and system for generating a decision tree classifier for data records based on a minimum description length (MDL) and presorting of records
Patent #: 5787274
Issued on: 07/28/1998
Inventor: Agrawal, et al.

Method and system for generating a decision-tree classifier independent of system memory size
Patent #: 5799311
Issued on: 08/25/1998
Inventor: Agrawal, et al.

Method and system for generating a decision-tree classifier in parallel in a multi-processor system
Patent #: 5870735
Issued on: 02/09/1999
Inventor: Agrawal, et al.

Method for performing targeted marketing over a large computer network
Patent #: 6055510
Issued on: 04/25/2000
Inventor: Henrick, et al.

Method for refining the initial conditions for clustering with applications to small and large database clustering
Patent #: 6115708
Issued on: 09/05/2000
Inventor: Fayyad, et al.

Method and system for generating a decision-tree classifier in parallel in a multi-processor system
Patent #: 6138115
Issued on: 10/24/2000
Inventor: Agrawal, et al.

Parallel classification for data mining in a shared-memory multiprocessor system
Patent #: 6230151
Issued on: 05/08/2001
Inventor: Agrawal, et al.

Decision tree classifier with integrated building and pruning phases
Patent #: 6247016
Issued on: 06/12/2001
Inventor: Rastogi ,   et al.

Generating a model for raw variables from a model for cooked variables
Patent #: 6405200
Issued on: 06/11/2002
Inventor: Heckerman

Method and system for building a decision-tree classifier from privacy-preserving data Patent #: 6546389
Issued on: 04/08/2003
Inventor: Agrawal, et al.

Inventors

Application

No. 487642 filed on 01/19/2000

US Classes:

707/6, Pattern matching access707/10, Distributed or remote access707/101Manipulating data structure (e.g., compression, compaction, compilation)

Examiners

Primary: Corrielus, Jean B.

Attorney, Agent or Firm

International Class

G06F 017/30

Abstract

A system and method for mining data while preserving a user's privacy includes perturbing user-related information at the user's computer and sending the perturbed data to a Web site. At the Web site, perturbed data from many users is aggregated, and from the distribution of the perturbed data, the distribution of the original data is reconstructed, although individual records cannot be reconstructed. Based on the reconstructed distribution, a decision tree classification model or a Naive Bayes classification model is developed, with the model then being provided back to the users, who can use the model on their individual data to generate classifications that are then sent back to the Web site such that the Web site can display a page appropriately configured for the user's classification. Or, the classification model need not be provided to users, but the Web site can use the model to, e.g., send search results and a ranking model to a user, with the ranking model being used at the user computer to rank the search results based on the user's individual classification data.

Other References

  • Adam et al, "Security-control methods for statistical databases: A comparative study", ACM Computing Surveys, vol.21, No.4, pp. 516-556, Dec. 1989.
  • Tendick et al., "A modified random perturbation method for database security", ACM, vol.19, No.1, PP. 48-63, Mar. 1994.
  • Muralidhar et al., Security of Random data perturbation methods, ACM, vol.24, No.4, pp. 487-493, Dec. 1999.
  • Traub et al., "The statistical security of a Statistical database", ACM, vol.9, No.4, pp. 672-679, Dec. 1984.
  • Palley et al., "The use of regression methodology for the compromise of confidential information in statistical databases", ACM, vol.12, No.4, pp.593-608, Dec. 198
PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$16.95more info
 
Sign InRegister
Username  
Password   
forgot password?