Recording Industry vs The People: Study finds that record company methods for detecting infringement are inconclusive

Thursday, June 05, 2008

As reported by the New York Times, an academic study out of the University of Washington has found the record industry's methods of detecting infringement among BitTorrent users to be "inconclusive". I would appreciate input from the technical community, in our "comments" section, on the extent to which these findings would be applicable to MediaSentry's supposed "detection" of infringement among FastTrack users, as opposed to BitTorrent users, since every single lawsuit of which I am aware involves the FastTrack or Gnutella protocols, rather than BitTorrent. Thanks to my many friends who alerted me to this article and study. -R.B.

The Inexact Science Behind DMCA Takedown Notices
By Brad Stone
June 5, 2008
New York Times Technology Section

A new study from the University of Washington suggests that media industry trade groups are using flawed tactics in their investigations of users who violate copyrights on peer-to-peer file sharing networks.

Those trade groups, including the Motion Picture Association of America (M.P.A.A.), Entertainment Software Association (E.S.A.), and Recording Industry Association of America (R.I.A.A.), send universities and other network operators an increasing number of takedown notices each year, alleging that their intellectual property rights have been violated under the Digital Millennium Copyright Act.

Many universities pass those letters directly on to students without questioning the veracity of the allegations. The R.I.A.A. in particular follows up some of those notices by threatening legal action and forcing alleged file-sharers into a financial settlement.

But the study, released today by Tadayoshi Kohno, an assistant professor, Michael Piatek, a graduate student, and Arvind Krishnamurthy, a research assistant professor, all at the University of Washington, argues that perhaps those takedown notices should be viewed more skeptically.

Complete article

The underlying study: "Challenges and Directions for Monitoring P2P File Sharing Networks – or – Why My Printer Received a DMCA Takedown Notice" By Michael Piatek, Tadayoshi Kohno, and Arvind Krishnamurthy (PDF)

Commentary & discussion:

Electronic Frontier Foundation
Linha Defensiva (Portuguese)

10 comments:

Anonymous said...: On first glance, it appears that the monitoring methods for BitTorrent are very different from the monitoring methods used in these cases.

It appears that DMCA notices over suspected BitTorrent users are generated without downloading anything from the user. They just look at the list of IP addresses the tracker says are participating. My guess is that the process is mostly automated, so it's no surprise the process generates so many garbage complaints.
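
The difference between flagging whatever the tracker lists and flagging only hosts that actually hand over data can be sketched in a few lines of Python. This is a toy model, not the monitoring agencies' actual software: the `Peer` class, the `serves_data` flag, and the IP addresses (taken from the reserved documentation range) are all assumptions made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Peer:
    ip: str
    serves_data: bool  # True only if the host really uploads data when asked


def indirect_detection(tracker_list):
    """Flag every address the tracker reports, with no further check."""
    return {p.ip for p in tracker_list}


def direct_detection(tracker_list):
    """Flag only addresses that actually deliver data when contacted."""
    return {p.ip for p in tracker_list if p.serves_data}


# Hypothetical tracker response.
tracker_list = [
    Peer("192.0.2.10", serves_data=True),   # real participant
    Peer("192.0.2.11", serves_data=True),   # real participant
    Peer("192.0.2.50", serves_data=False),  # e.g. a printer someone else registered
    Peer("192.0.2.51", serves_data=False),  # stale or bogus entry
]

flagged_indirect = indirect_detection(tracker_list)
flagged_direct = direct_detection(tracker_list)
false_positives = flagged_indirect - flagged_direct

print(sorted(false_positives))  # ['192.0.2.50', '192.0.2.51']
```

In this sketch the list-only method flags all four addresses, while the download check flags only the two hosts that really serve data.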

In the cases discussed on this blog, involving Gnutella, eMule, FastTrack, etc., the RIAA/MPAA or their agents claim to have downloaded and verified at least some of the content. Whether the verification was performed by a human is unclear, since MediaSentry is uncooperative with discovery requests, but it probably was. That would distinguish it from the DMCA spam mill that is the subject of this study.; June 5, 2008 at 5:49:00 PM EDT
Nohwhere Man said...: Having given the paper a quick read, IMHO it was fairly well written. Nothing jumped out at me as bad science, the descriptions and technical conclusions all make sense, and are quite likely defensible.

It would be interesting to set up a 'honey-pot' node (using maybe a printer or a network monitoring box), wait for a takedown notice, and say "see you in court". It would be even more interesting to see the discovery request for the hard disk of a printer.

z!
(30 yrs of messing about with computers); June 5, 2008 at 6:23:00 PM EDT
Justin Olbrantz (Quantam) said...: I was considering submitting this story to you. I posted it on my blog a couple hours ago, with the following commentary:

This was actually a study I've been wanting to see done for some time. The other study that I think is very important but has not yet been done is to determine empirically how, on a system like eDonkey, where users search all peers for a certain file, the number of requests a single computer gets for a single file varies with the popularity of the file. The basis of this investigation is the claim by the RIAA and others that users could be sharing thousands or millions of copies of each copyrighted work, and that constitutional limitations on civil damage awards therefore do not apply.

Clearly files that are popular (e.g. the latest hit song) will be downloaded more (in total) than files which are unpopular. But does this mean any single computer will upload popular files significantly more often than unpopular files? I believe the answer is no, because when files are more popular, not only are they downloaded more, but they are also available from more computers. In theory, the increase in demand is accompanied by a proportionate increase in supply, keeping the ratio invariant regardless of demand. According to this belief, I have argued on forums (one example here) that most of the people the RIAA has sued have, according to simple probability, not uploaded more than a single copy of each file, on average (so about $0.70 of damage per file, if you assume 1 download = 1 lost sale, which itself is highly suspect).; June 5, 2008 at 6:27:00 PM EDT
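
The supply-and-demand argument in the comment above comes down to simple arithmetic, sketched here in Python. The download counts and the `SHARE_FRACTION` value are invented for illustration, and the model assumes that sharers are simply earlier downloaders who kept the file available and that uploads are spread evenly among them. Only the $0.70 figure comes from the comment itself.

```python
SHARE_FRACTION = 1.0    # assumed fraction of downloaders who go on to share the file
DAMAGE_PER_COPY = 0.70  # per-copy figure used in the comment above


def uploads_per_sharer(total_downloads: int, share_fraction: float) -> float:
    """Average copies uploaded by each sharing computer.

    Assumes the number of sharers scales with the number of downloads and
    that uploads are spread evenly across sharers.
    """
    sharers = total_downloads * share_fraction
    return total_downloads / sharers


# Hypothetical download totals for a popular and an unpopular file.
files = {"hit song": 1_000_000, "obscure track": 500}

ratios = {name: uploads_per_sharer(n, SHARE_FRACTION) for name, n in files.items()}
damages = {name: r * DAMAGE_PER_COPY for name, r in ratios.items()}

for name in files:
    print(f"{name}: {ratios[name]:.1f} uploads per sharer, ${damages[name]:.2f} per file")
```

Under these assumptions the popularity of the file cancels out: both files come to one upload per sharer. If only half of all downloaders shared, the average would rise to two, still far from thousands.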
Anonymous said...: Disclaimer:
I'm not even close to an expert on the FastTrack network's protocol. It's a proprietary protocol, so I wasn't able to find much on it; you'll want to try to have the inner workings of the protocol clarified somehow. What's posted below is based on what I was able to find on the 'net about the protocol.
---

Using the FastTrack reverse engineering doc:

It seems that when performing a search on the FastTrack network your local KaZaA client will send its request to a supernode that aggregates the available files of all of its clients. The search is then performed on that supernode, and passed on to other supernodes that it knows about; at no point during a search operation does your client actually connect to the peers that are reported to have files available.

Furthermore, it looks like the only time that your client connects to other peers is when it's trying to download a file from said peer.

Of particular interest is that it looks like packet types 0x20 and 0x21 are used by your client to request a list of files from a peer. The important thing to note about this packet pair is that it's between you and the supernode; at no point does it contact the peer you want the file list of.

Based on this, it appears that the cached information about what a client is sharing is discarded as soon as the client disconnects from the supernode. However, without knowing the frequency of keep-alive pings, it's impossible to know how long it takes a supernode to identify that a client is no longer connected. If keep-alive pings are only sent every X minutes, then the stale data about a client will stick around on a supernode for at most X minutes after an unclean disconnect from the supernode (killing off the KaZaA program without giving it a chance to properly close its TCP connections, such as by a power outage, turning off your computer, your router/computer crashing, etc.).

So, with that all in mind, the "mistimed reports" scenario of section 4.2 of the UWashington study is certainly plausible, especially in a University residence environment where DHCP leasing of IP addresses could potentially quickly reassign an IP when someone disconnects. How plausible depends very much on the amount of time between keep-alive pings from supernodes, though. The plausibility of this scenario is further increased if the RIAA investigators don't actually try to download the offending file from the peer, for example if they're just testing whether the IP address is still around via a ping, which tells them nothing about whether the person's connected to the FastTrack network.
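
To make the timing window concrete, here is a toy Python model of the scenario described above. It is not FastTrack's real behavior: the `Supernode` class, the 300-second `KEEPALIVE_INTERVAL`, the address, and the file name are all assumptions, since the actual keep-alive frequency is exactly the unknown in question.

```python
from dataclasses import dataclass, field

KEEPALIVE_INTERVAL = 300  # seconds; hypothetical, the real value is unknown


@dataclass
class Supernode:
    # ip -> (shared file list, time of last successful keep-alive)
    cache: dict = field(default_factory=dict)

    def register(self, ip, files, now):
        self.cache[ip] = (files, now)

    def file_list(self, ip, now):
        """Return the cached file list, dropping it once a keep-alive is overdue."""
        entry = self.cache.get(ip)
        if entry is None:
            return None
        files, last_seen = entry
        if now - last_seen > KEEPALIVE_INTERVAL:
            del self.cache[ip]
            return None
        return files


dhcp_lease = {"10.0.0.7": "student A"}  # who currently holds the address
node = Supernode()

node.register("10.0.0.7", ["song.mp3"], now=0)  # t=0: A connects and shares
# t=10: A's machine loses power, so no clean disconnect reaches the supernode.
dhcp_lease["10.0.0.7"] = "student B"            # t=60: DHCP reassigns the address

stale = node.file_list("10.0.0.7", now=120)     # investigator queries the supernode
blamed = dhcp_lease["10.0.0.7"] if stale else None
later = node.file_list("10.0.0.7", now=400)     # after the keep-alive window

print(stale, blamed, later)  # ['song.mp3'] student B None
```

In this model, a query made inside the keep-alive window returns student A's file list while the address already belongs to student B. Once the window has passed, the same query returns nothing.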

This all makes knowing more about FastTrack very important. To wit:
- How long does it take for a supernode to identify that a client has disconnected?
- Is it definitely the case that you only connect to another peer when trying to download from them?

It's also important to know the RIAA investigator's methodology. At the least:
- Given that search results aren't returned directly from each peer, and that a peer's file list isn't sent directly from said peer (unless the peer is a supernode and they connected directly to it), how do they verify that an IP address is sharing files? Ping/traceroute? Connecting through KaZaA and downloading?
- Do they record whether an offending peer is a supernode or an ordinary client node? If it's a supernode, and they got the file list directly from it, then the UWashington scenario is pretty much impossible. But, if it's an ordinary node then it's possible.

Enjoy,
Dan; June 5, 2008 at 7:55:00 PM EDT
Rick Boatright said...: Sadly Ray, this article has nothing at all to do with the various Gnutella variant (emule, fasttrack, limewire etc) p2p programs.

In the bittorrent p2p programs any one computer may well be part of a "swarm" of computers and may, or may _not_ actually participate in the act of being downloaded from.

On the other hand, the gnutella variant programs establish a one-to-one relationship between the downloader and the uploader.

It's totally different technology.; June 5, 2008 at 8:45:00 PM EDT
Alter_Fritz said...: well, it actually does not need expensive and dryly serious studies to come to such a conclusion.

Some oversimplified picture
found via http://thepiratebay.org/blog/111 lets one reach similar conclusions too, I guess. ;-); June 5, 2008 at 8:50:00 PM EDT
Justin Olbrantz (Quantam) said...: "It would be interesting to set up a 'honey-pot' node (using maybe a printer or a network monitoring box), wait for a takedown notice, and say "see you in court". It would be even more interesting to see the discovery request for the hard disk of a printer."

LOL @ setting up a honeypot and then suing for filing false DMCA notices. That would be beyond godly. And I bet it would very quickly bring an end to mass-mailing of DMCA notices.; June 5, 2008 at 10:57:00 PM EDT
Anonymous said...: Those (upcoming) lawyers among you readers of this blog who already these days defend the real innocent dolphins like Mrs. Andersen, Mrs. Santangelo, and Mrs. Lindor against the MAFIAA, while it is still reaping its "extortion" money from the p2p systems of choice in use in 2004-2007, should be aware of:

Something that might be extremely noteworthy for you lawyers and could become important information in litigation in 3 or 4 years from now for alleged wrongdoings in 2008 by your "then clients":

Oh, and what also should be noted with respect to false positives of printers stealing Indiana Jones:

The tracker software (opentracker) that is used by one of the largest trackers in the world is known to be configurable to report to peers bogus IPs that are not actually in the swarm and are not doing ANYthing even remotely resembling copyright infringement.

The german programmer of that software calls that feature “Perfect Deniability”

http://opentracker.blog.h3q.com/?p=22

Everybody receiving a takedown notice should be made aware of that “defense”

Maybe thepiratebay.org has switched that feature to “on” too?
— Posted by kdsde; June 6, 2008 at 5:16:00 AM EDT
Anonymous said...: The study itself demonstrates that, in the real world, IP addresses can easily be spoofed, and that you can appear to be some other device on the network with little effort. This concept is applicable to more than just the BitTorrent discussed in the article. By extension, although not covered in the article, all you have to do to make your own computer appear to be that of your despised dorm rival down the hall is to reset your MAC address (easily done for most network adapters, and doable through having a simple router if you can't figure out how to do it on your own hardware) to match his MAC address. To your university, your computer now appears to be his computer when IP addresses are assigned and logs written. Spoofing of IP addresses is no longer some theoretical concept of the kind: "Yes, it can happen, but how likely is it really, your Honor?"
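
The dorm-rival scenario can be sketched as a toy Python model of how a campus network might map an IP address back to a person through its DHCP log. Everything here is hypothetical: the `DhcpServer` class, the registration table, and the MAC and IP values are invented, and real DHCP servers are considerably more involved.

```python
# Hypothetical registration database: MAC address -> registered owner.
registered_owner = {
    "aa:bb:cc:00:00:01": "rival",
    "aa:bb:cc:00:00:02": "spoofer",
}


class DhcpServer:
    """Simplified server that hands out addresses and logs (mac, ip) pairs."""

    def __init__(self):
        self.log = []
        self._next_host = 10

    def lease(self, presented_mac):
        ip = f"10.0.0.{self._next_host}"
        self._next_host += 1
        # The server only sees the MAC the client presents, not who sent it.
        self.log.append((presented_mac, ip))
        return ip


def who_had(ip, log, registry):
    """Attribute an IP to a person the way the logs suggest: via the MAC."""
    for mac, leased_ip in reversed(log):
        if leased_ip == ip:
            return registry.get(mac)
    return None


server = DhcpServer()

# The spoofer sets their adapter to present the rival's MAC address.
spoofed_ip = server.lease("aa:bb:cc:00:00:01")

attributed_to = who_had(spoofed_ip, server.log, registered_owner)
print(attributed_to)  # rival
```

Because the log records only the MAC address that was presented, the lookup names the rival even though the spoofer's machine held the address.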

Justin Olbrantz, you make the excellent point that, in fact, the record companies may have suffered no damages at all from downloading, because there is no indication at all that any download has ever equated to any lost sale. Other factors can explain completely the downturn in record sales (the rise of DVD games, the drop in quality of the music being sold, the insanely stubborn high prices of music CDs, the downturn in the economy, all competing for a limited pool of spending) without blaming all, or any, of it on filesharing.

XxX; June 6, 2008 at 11:28:00 AM EDT
StephenH said...: I believe that the RIAA and MPAA need to learn that IP address logs do not identify people the same way that DNA does, and the idea of having automated bots identify candidates for DMCA notices is bad, because they can easily reach innocent users.

This paper clearly shows the errors the RIAA and others made when sending takedown notices, and how they can easily reach innocent victims. I personally believe that one should have at least some right of recourse if a DMCA notice reaches an innocent victim. Personally, if it were up to me, I would abolish the DMCA altogether.

The DMCA has done nothing positive for technological innovation.; June 8, 2008 at 11:55:00 AM EDT
