
Academic Data Collection in Electronic Environments: Defining Acceptable Use of Internet Resources

MIS Quarterly, Vol. 30, No. 3, pp. 599-610, 2006

Minnesota Legal Studies Research Paper No. 06-48

Posted: 12 Sep 2006  

Gove N. Allen

Tulane University - A.B. Freeman School of Business

Dan L. Burk

University of California, Irvine School of Law

Gordon B. Davis

University of Minnesota - Twin Cities - Carlson School of Management

Abstract

Academic researchers access commercial websites to collect research data. This research practice is likely to increase. Is this appropriate? Is this legal? Such commercial websites are maintained to achieve business objectives; research access uses site resources for other purposes. Website administrators may, therefore, deem academic data collection inappropriate. Is there a process to make research access more open and acceptable to website owners and administrators? These are significant issues. This article clarifies the problems and suggests possible approaches to handle the issues with sensitivity and openness.

Research access to commercial websites may be manual (using a standard web browser) or automated (using automated data collection agents). These approaches have different effects on websites. Researchers using manual access tend to make a limited number of page requests because manual access is costly to perform. Researchers using automated access methods can request large numbers of pages at a low cost. Therefore, website administrators tend to view manual access and automated access very differently.

Because of their volume and nonbusiness purpose, automated research requests for data are sometimes blocked by site administrators using a variety of means (both technological and legal). This paper details the pertinent legal issues, including trespass, copyright violation, and breach of contract. It also explains the nature of express and implied consent by site administration for research access.

Based on the issues presented, guidelines for researchers are proposed to reduce objections to research activities, to facilitate communication with website administration, and to achieve express or implied consent. These include notification to website administration of intended automated research activity, description of the research project posted as a web page, and clear identification of automated requests for web pages. To encourage good research practices in automated data collection, suggestions are made for disclosing the methods used in research papers and for self-regulation by academic associations.
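As an illustration only, the guideline on clear identification of automated requests could look roughly like the Python sketch below. It is not taken from the paper. The project URL, contact address, and agent name are hypothetical placeholders, and the robots.txt check and the pause between requests are added assumptions about courteous practice rather than guidelines stated in the abstract.

```python
import time
import urllib.request
import urllib.robotparser

# Hypothetical values: substitute the real project page and contact address.
PROJECT_URL = "https://example.edu/research/data-collection-study"
CONTACT_EMAIL = "researcher@example.edu"
# The User-Agent names the agent and points to the page describing the project.
USER_AGENT = f"AcademicResearchBot/1.0 (+{PROJECT_URL}; {CONTACT_EMAIL})"


def build_request(url: str) -> urllib.request.Request:
    """Build a request that identifies itself as automated research access."""
    headers = {"User-Agent": USER_AGENT, "From": CONTACT_EMAIL}
    return urllib.request.Request(url, headers=headers)


def allowed_by_robots(robots_lines: list[str], url: str) -> bool:
    """Check a URL against robots.txt content that was already retrieved.

    Assumption: honoring robots.txt is treated here as one signal of the
    site's wishes; the abstract itself does not mention it.
    """
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(USER_AGENT, url)


class Throttle:
    """Enforce a minimum pause between requests to limit load on the site."""

    def __init__(self, min_interval_seconds: float):
        self.min_interval = min_interval_seconds
        self._last = None  # time of the previous request, if any

    def wait(self) -> float:
        """Sleep if needed and return the number of seconds slept."""
        delay = 0.0
        if self._last is not None:
            elapsed = time.monotonic() - self._last
            delay = max(0.0, self.min_interval - elapsed)
        if delay > 0:
            time.sleep(delay)
        self._last = time.monotonic()
        return delay
```

A collection loop would call `Throttle.wait()` before each page, skip any URL for which `allowed_by_robots` returns `False`, and pass the result of `build_request` to `urllib.request.urlopen`. Notifying the site administration and publishing the project description page remain manual steps that code cannot replace.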

Keywords: Internet, research, automated data collection, trespass, ethics

JEL Classification: C81, C87, C88, K11, K42

Suggested Citation

Allen, Gove N. and Burk, Dan L. and Davis, Gordon B., Academic Data Collection in Electronic Environments: Defining Acceptable Use of Internet Resources. MIS Quarterly, Vol. 30, No. 3, pp. 599-610, 2006; Minnesota Legal Studies Research Paper No. 06-48. Available at SSRN: https://ssrn.com/abstract=929482

Gove N. Allen

Tulane University - A.B. Freeman School of Business

7 McAlister Drive
New Orleans, LA 70118
United States

HOME PAGE: http://gove.net/

Dan L. Burk (Contact Author)

University of California, Irvine School of Law

4500 Berkeley Place
Irvine, CA 92697-1000
United States
949-824-9325 (Phone)

Gordon B. Davis

University of Minnesota - Twin Cities - Carlson School of Management

19th Avenue South
Minneapolis, MN 55455
United States
