Skip to main content
Log in

Ethical issues in web data mining

  • Published:
Ethics and Information Technology Aims and scope Submit manuscript

Abstract

Web mining refers to the whole of data miningand related techniques that are used toautomatically discover and extract informationfrom web documents and services. When used in abusiness context and applied to some type ofpersonal data, it helps companies to builddetailed customer profiles, and gain marketingintelligence. Web mining does, however, pose athreat to some important ethical values likeprivacy and individuality. Web mining makes itdifficult for an individual to autonomouslycontrol the unveiling and dissemination of dataabout his/her private life. To study thesethreats, we distinguish between `content andstructure mining' and `usage mining.' Webcontent and structure mining is a cause forconcern when data published on the web in acertain context is mined and combined withother data for use in a totally differentcontext. Web usage mining raises privacyconcerns when web users are traced, and theiractions are analysed without their knowledge.Furthermore, both types of web mining are oftenused to create customer files with a strongtendency of judging and treating people on thebasis of group characteristics instead of ontheir own individual characteristics and merits(referred to as de-individualisation). Althoughthere are a variety of solutions toprivacy-problems, none of these solutionsoffers sufficient protection. Only a combinedsolution package consisting of solutions at anindividual as well as a collective level cancontribute to release some of the tensionbetween the advantages and the disadvantages ofweb mining. The values of privacy andindividuality should be respected and protectedto make sure that people are judged and treatedfairly. People should be aware of theseriousness of the dangers and continuouslydiscuss these ethical issues. This should be ajoint responsibility shared by web miners (bothadopters and developers), web users, andgovernments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • M.J.A. Berry and G.S. Linoff. Mining the Web: Transforming Customer Data. John Wiley & Sons, New York, 2002.

    Google Scholar 

  • R. Clarke. 'Profiling’ and Its Privacy Implications. Privacy Law & Policy Reporter, 1: 128, 1994.

    Google Scholar 

  • R. Clarke. Platform for Privacy Preferences: A Critique. Privacy Law & Policy Reporter, 5 (3): 46–48, 1998.

    Google Scholar 

  • B. Custers. Data Mining and Group Profiling on the Internet. In A. Vedder, editor, Ethics and the Internet, pages 87–104. Intersentia, Antwerpen Groningen Oxford, 2001.

    Google Scholar 

  • O. Etzioni. The World Wide Web: Quagmire or Gold Mine? Communications of the ACM, 39 (11): 65–68, 1996.

    Google Scholar 

  • D.R. Fordham, D.A. Riordan and M. Riordan. Business Intelligence. Management Accounting, 83 (11): 24–29, 2002.

    Google Scholar 

  • J.S. Fulda. Data Mining and Privacy. In R. Spinello and H. Tavani, editors, Readings in CyberEthics, pages 413–417. Jones and Bartlett, Sudbury MA, 2001.

    Google Scholar 

  • D.G. Johnson. Computer Ethics, 3rd. edition. Prentice-Hall, Upper Saddle River New Jersey, 2001.

    Google Scholar 

  • J.F. Johnson. Immunity from the Illegitimate Focused Attention of Others: An Explanation of our Thinking and Talking about Privacy. In A. Vedder, editor, Ethics and the Internet, pages 49–70. Intersentia, Antwerpen Groningen Oxford, 2001.

    Google Scholar 

  • R. Kosala, H. Blockeel and F. Neven. An Overview of Web Mining. In J. Meij, editor, Dealing with the Data Flood: Mining Data, Text and Multimedia, pages 480–497. STT, Rotterdam, 2002.

    Google Scholar 

  • S.K. Madria, S.S. Bhowmick, W.-K. Ng and E.P. Lim. Research Issues in Web-data Mining. Lecture Notes in Computer Science, 1676: 303–312, 1999.

    Google Scholar 

  • B. Mobasher, R. Cooley and J. Srivastava. Automatic Personalization Based on Web Usage Mining. Communications of the ACM, 43 (8): 142–151, 2000.

    Google Scholar 

  • M.D. Mulvenna, S.S. Anand and A.G. Büchner. Personalization on the Net Using Web Mining. Communications of the ACM, 43 (8): 123–125, 2000.

    Google Scholar 

  • H. Nissenbaum. Toward an Approach to Privacy in Public: Challenges of Information Technology. Ethics & Behavior, 7 (3): 207–220, 1997.

    Google Scholar 

  • E. Philips, D. Vriens and G. Kienhorst, editors. Business Intelligence. Kluwer, Deventer, 1999.

    Google Scholar 

  • R. Pierik. Group Profiles, Equality, and the Power of Numbers. In A. Vedder, editor, Ethics and the Internet, pages 105–123. Intersentia, Antwerpen Groningen Oxford, 2001.

    Google Scholar 

  • K. Scribbins. Privacy@net, An International Comparative Study of Consumer Privacy on the Internet. Consumers International (http://www.consumersinternational.org/news/press releases/fprivreport.pdf), 2001.

  • J. Srivastava, R. Cooley, M. Deshpande and P.N. Tan. Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. ACM SIGKDD, 1(2): 12–23, 2000.

    Google Scholar 

  • H.T. Tavani. Informational Privacy, Data Mining, and the Internet.’ Ethics and Information Technology, 1: 137–145, 1999a.

    Google Scholar 

  • H.T. Tavani. KDD, Data Mining, and the Challenge to Normative Privacy. Ethics and Information Technology, 1: 265–273, 1999a.

    Google Scholar 

  • H.T. Tavani. Privacy Enhancing Technologies as a Panacea for Online Privacy Concerns: Some Ethical Considerations. Ethics and Information Technology, 9: 26–36, 2000.

    Google Scholar 

  • H.T. Tavani and J. Moor. Privacy Protection, Control of Information, and Privacy-Enhancing Technologies. Computers and Society, 31: 6–11, 2001.

    Google Scholar 

  • A. Vedder. KDD: The Challenge to Individualism. Ethics and Information Technology, 1: 275–281, 1999.

    Google Scholar 

  • A. Vedder. Privacy and Confidentiality. Medical Data, New Information Technologies, and the Need for Normative Principles Other than Privacy Rules. Law and Medicine, 3: 441–459, 2000.

    Google Scholar 

  • A. Vedder. KDD, Privacy, Individuality, and Fairness. In R. Spinello and H. Tavani, editors, Readings in CyberEthics, pages 404–412. Jones and Bartlett, Sudbury MA, 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

van Wel, L., Royakkers, L. Ethical issues in web data mining. Ethics and Information Technology 6, 129–140 (2004). https://doi.org/10.1023/B:ETIN.0000047476.05912.3d

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/B:ETIN.0000047476.05912.3d

Navigation