jeremykidwell.info/static/files/bookdown/data_ethics-law_course/exploring-the-world-of-user...

175 lines
14 KiB
HTML
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

<!DOCTYPE html>
<html >
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<title>what can I do with stuff I find online?</title>
<meta name="description" content="A short course on the ethics and legality of working with data">
<meta name="generator" content="bookdown 0.7 and GitBook 2.6.7">
<meta property="og:title" content="what can I do with stuff I find online?" />
<meta property="og:type" content="book" />
<meta property="og:url" content="http://jeremykidwell.info/files/bookdown/data_ethics-law_course/" />
<meta property="og:description" content="A short course on the ethics and legality of working with data" />
<meta name="github-repo" content="kidwellj/data_ethics-law_course" />
<meta name="twitter:card" content="summary" />
<meta name="twitter:title" content="what can I do with stuff I find online?" />
<meta name="twitter:description" content="A short course on the ethics and legality of working with data" />
<meta name="author" content="Alex Fenlon and Jeremy H. Kidwell">
<meta name="date" content="2016-12-02">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="apple-mobile-web-app-capable" content="yes">
<meta name="apple-mobile-web-app-status-bar-style" content="black">
<link rel="prev" href="can-you-use-stuff-online-for-research.html">
<link rel="next" href="exercise-1.html">
<script src="libs/jquery-2.2.3/jquery.min.js"></script>
<link href="libs/gitbook-2.6.7/css/style.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-bookdown.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-highlight.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-search.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-fontsettings.css" rel="stylesheet" />
<link rel="stylesheet" href="style.css" type="text/css" />
</head>
<body>
<div class="book without-animation with-summary font-size-2 font-family-1" data-basepath=".">
<div class="book-summary">
<nav role="navigation">
<ul class="summary">
<li><a href="./">An OER on the ethics and legality of working with digital data in research</a></li>
<li class="divider"></li>
<li class="chapter" data-level="1" data-path="index.html"><a href="index.html"><i class="fa fa-check"></i><b>1</b> Introduction to the module</a></li>
<li class="chapter" data-level="2" data-path="can-you-use-stuff-online-for-research.html"><a href="can-you-use-stuff-online-for-research.html"><i class="fa fa-check"></i><b>2</b> Can you use stuff online for research?</a><ul>
<li class="chapter" data-level="2.1" data-path="can-you-use-stuff-online-for-research.html"><a href="can-you-use-stuff-online-for-research.html#iframe-video-here"><i class="fa fa-check"></i><b>2.1</b> iframe / video here</a></li>
<li class="chapter" data-level="2.2" data-path="can-you-use-stuff-online-for-research.html"><a href="can-you-use-stuff-online-for-research.html#video-transcript"><i class="fa fa-check"></i><b>2.2</b> Video Transcript:</a></li>
</ul></li>
<li class="chapter" data-level="3" data-path="exploring-the-world-of-user-generated-data.html"><a href="exploring-the-world-of-user-generated-data.html"><i class="fa fa-check"></i><b>3</b> Exploring the world of user-generated data</a><ul>
<li class="chapter" data-level="3.1" data-path="exploring-the-world-of-user-generated-data.html"><a href="exploring-the-world-of-user-generated-data.html#video"><i class="fa fa-check"></i><b>3.1</b> Video</a></li>
<li class="chapter" data-level="3.2" data-path="exploring-the-world-of-user-generated-data.html"><a href="exploring-the-world-of-user-generated-data.html#transcript-jeremy-kidwell-speaking"><i class="fa fa-check"></i><b>3.2</b> Transcript (Jeremy Kidwell speaking)</a></li>
</ul></li>
<li class="chapter" data-level="4" data-path="exercise-1.html"><a href="exercise-1.html"><i class="fa fa-check"></i><b>4</b> Exercise 1</a></li>
<li class="chapter" data-level="5" data-path="exercise-2-documentary-analysis.html"><a href="exercise-2-documentary-analysis.html"><i class="fa fa-check"></i><b>5</b> Exercise 2 - documentary analysis</a></li>
<li class="chapter" data-level="6" data-path="exercise-3-reading.html"><a href="exercise-3-reading.html"><i class="fa fa-check"></i><b>6</b> Exercise 3 - reading!</a></li>
<li class="chapter" data-level="7" data-path="reading.html"><a href="reading.html"><i class="fa fa-check"></i><b>7</b> Reading:</a></li>
<li class="chapter" data-level="8" data-path="other-media.html"><a href="other-media.html"><i class="fa fa-check"></i><b>8</b> Other media:</a></li>
<li class="chapter" data-level="9" data-path="copyright-licenses-and-data-as-property.html"><a href="copyright-licenses-and-data-as-property.html"><i class="fa fa-check"></i><b>9</b> Copyright, licenses, and data as property?</a></li>
<li class="chapter" data-level="10" data-path="confidentiality-anonymity-consent-and-privacy.html"><a href="confidentiality-anonymity-consent-and-privacy.html"><i class="fa fa-check"></i><b>10</b> Confidentiality, anonymity, consent, and privacy</a></li>
<li class="chapter" data-level="11" data-path="transcript-of-the-video.html"><a href="transcript-of-the-video.html"><i class="fa fa-check"></i><b>11</b> Transcript of the video:</a></li>
<li class="chapter" data-level="12" data-path="resources.html"><a href="resources.html"><i class="fa fa-check"></i><b>12</b> Resources:</a></li>
<li class="chapter" data-level="13" data-path="final-steps-how-do-we-decide-what-to-do.html"><a href="final-steps-how-do-we-decide-what-to-do.html"><i class="fa fa-check"></i><b>13</b> Final Steps - How do we decide what to do?</a></li>
<li class="divider"></li>
<li><a href="https://github.com/rstudio/bookdown" target="blank">Published with bookdown</a></li>
</ul>
</nav>
</div>
<div class="book-body">
<div class="body-inner">
<div class="book-header" role="navigation">
<h1>
<i class="fa fa-circle-o-notch fa-spin"></i><a href="./">what can I do with stuff I find online?</a>
</h1>
</div>
<div class="page-wrapper" tabindex="-1" role="main">
<div class="page-inner">
<section class="normal" id="section-">
<div id="exploring-the-world-of-user-generated-data" class="section level1">
<h1><span class="header-section-number">Chapter 3</span> Exploring the world of user-generated data</h1>
<div id="video" class="section level2">
<h2><span class="header-section-number">3.1</span> Video</h2>
</div>
<div id="transcript-jeremy-kidwell-speaking" class="section level2">
<h2><span class="header-section-number">3.2</span> Transcript (Jeremy Kidwell speaking)</h2>
<p>Youve probably heard by now of the company “Cambridge Analytica” <a href="https://www.theregister.co.uk/2018/05/02/cambridge_analytica_shutdown/">recently renamed to Emerdata</a>. As several media outlets reported in 2017, a little known firm called Cambridge analytica surprised many by claiming that their “evolutionary approach to data-driven communication has played such an integral part in President-elect Trumps extraordinary win.” As details emerged, it became clear that this was not mere bluster, but that this firm had managed to amass a trove of personal data about individuals, as the Washington Post suggested, up to 5000 pieces of data on each American citizen and then sought to nudge or manipulate voting behaviours by creating highly-targetted content, including ads on major social media platforms and so-called “fake news” stories.</p>
<p>Data ethics is always easier in hindsight, but Id like to nonetheless look into the structure of this data collection to raise some issues about how data gets “out there” in the first place.</p>
<p>Facebook is a central character in this story about data and this isnt surprising given their dominance of internet communication in recent years. In some cases, more persons answering surveys claim to be using facebook than the internet. While this is logically impossible - facebook is merely a service which sits on top of the internet, at least for now it gets towards the ubiquity of facebook use. Given this centrality, it is sensible to begin our look here to see how things are in terms of data. The story of privacy and data protection on facebook is, to be generous, an evolving one. Much of the data that users put on facebook was completely public until 2012, including the complete catalogue of your “likes”. For a company like Cambridge Analytica, this information was pure gold - enabling them to build up what psychologists call a “psychometric” profile using this data. If this information was on the internet in plain sight, could any user have assumed that their activity on facebook was private? Should they have? Since likes were made private, facebook has had a number of “gaffes” in which new features or <a href="https://www.cnbc.com/2018/06/07/facebook-bug-made-private-posts-of-up-to-14-million-users-public.html">bugs</a> have forced this data back out into the public. Much of the reporting of the cambridge analytica scandal have referred to their access of data as a “breach” implying that Facebook had been trying to keep data that users generated private in good faith and that this company had found improper or possibly even illegal ways to harvest it, but this is actually quite misleading. Companies like CA and it is worth noting that there are probably hundreds of <a href="https://www.zdnet.com/article/data-firm-leaks-48-million-user-profiles-it-scraped-from-facebook-linkedin-others/">other similar operations</a> which have been harvesting similarly massive datasets - can put together millions of tiny pieces of tiny information scattered across the internet - the number of contacts you have on a social network platform, or the number of profile pictures youve cycled through, hint at personality traits.</p>
<p>The controversial part that some persons are (in my opinion inaccurately) calling a breach relates to another approach that CA took on, shortly after facebook began to make its data privacy policies a bit less free-wheeling. They used <a href="https://www.fastcompany.com/40548348/how-amazon-helped-cambridge-analytica-harvest-americans-facebook-data">Amazons mechanical turk platform</a>, where companies can hire consultants to do tiny tasks for small sums of money, sometimes just a single pound (or dollar in this case) to answer a personality survey. Over 200k persons answered this survey, which had a hidden gem at the end - users were asked to share their facebook profiles, with their (now private) likes and friends. Thousands compiled unwittingly. Some people who took the survey complained to Amazon that this violated Amazons terms of service, but Amazon didnt discontinue the surveys until more than a year later.</p>
<p>Is this kind of data collection ethical? Well, Ill get into these kinds of questions from the perspective of a researcher a bit later, after we hear about the monkey selfie. For now, I want us to start thinking about ourselves as generators of data. This is a good ethical exercise, to place ourselves in a situation and see how we feel - so that we turn this dynamic around and begin to think of ourselves as collectors (and not producers) of data, we have some sensitivity to how things might be a bit complex.</p>
<p>For this session, wed like to have you try a few exercises which will get you acquainted with the idea of “terms and conditions”. Youve likely seen dozens of T&amp;Cs as theyre called by now, but because theyre all in legalese and often dozens of pages long, we hardly ever read them. In fact, the Guardian reported in 2011 that <a href="https://www.theguardian.com/money/2011/may/11/terms-conditions-small-print-big-problems">less than 7% of Britons</a> ever read T&amp;Cs and that 1/10 would rather read the whole phone book. Another <a href="https://www.theguardian.com/technology/2017/mar/03/terms-of-service-online-contracts-fine-print">more recent study</a> found that only 1 in 4 students take the time to read terms and conditions. Jonathan Obar at York University did a study which found that it would take the average user 40 minutes a day to actually read through privacy and T&amp;C documents in which theyre implicated. Yep, thats 40 minutes out of every single day.</p>
<p>Whether this situation is deliberate as some scholars have suggested, or merely an unforunate accident, theres a problem here relating to user literacy of data policies. So were going to ask you to actually read through one of these documents and then to debrief how this knowledge changes your perspective on putting your data on social network platforms. Were also going to ask you to do an informal study of a digital chat.</p>
<p>We hope youll find this exercise illuminating, and will look forward to telling you about that monkey selfie in our next session.</p>
</div>
</div>
</section>
</div>
</div>
</div>
<a href="can-you-use-stuff-online-for-research.html" class="navigation navigation-prev " aria-label="Previous page"><i class="fa fa-angle-left"></i></a>
<a href="exercise-1.html" class="navigation navigation-next " aria-label="Next page"><i class="fa fa-angle-right"></i></a>
</div>
</div>
<script src="libs/gitbook-2.6.7/js/app.min.js"></script>
<script src="libs/gitbook-2.6.7/js/lunr.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-search.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-sharing.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-fontsettings.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-bookdown.js"></script>
<script src="libs/gitbook-2.6.7/js/jquery.highlight.js"></script>
<script>
gitbook.require(["gitbook"], function(gitbook) {
gitbook.start({
"sharing": {
"github": false,
"facebook": true,
"twitter": true,
"google": false,
"linkedin": false,
"weibo": false,
"instapper": false,
"vk": false,
"all": ["facebook", "google", "twitter", "linkedin", "weibo", "instapaper"]
},
"fontsettings": {
"theme": "white",
"family": "sans",
"size": 2
},
"edit": {
"link": null,
"text": null
},
"download": null,
"toc": {
"collapse": "subsection"
}
});
});
</script>
</body>
</html>