If this quote from "the mother of The Matrix" is accurate, and I think it is, the issue of expression poses some not-inconsiderable challenges to a project like PM/FEM. In short, my comment on the Good communication discussion page suggesting that there is no law covering "plagiarism" would appear to have some exceptions. Yes, ideas can not be copyrighted, but the expression of some collection of ideas can be. So, taking something someone said or wrote (e.g. "The Third Eye") and reworking it, even very considerably, into something different (e.g. "The Terminator" or "The Matrix") can still be copyright infringement.
I'm not completely certain that Sophia Stewart's case hinges on a claim of copyright infringement rather than "something bigger" - anyway, the case seems very interesting - maybe someone still has time to cover it for the Free Culture and the DL conference, as an example of non-free culture. It doesn't have a whole lot to do with digital libraries… except - listen to me now - I'm trying to point out to everyone involved in the production of PM/FEM that simply rewording a definition you find in a book does not suffice as a "dodge" against the requirements of copyright law. Just ask the Wachowskis how the Stewart case is going (well, OK, that might be hard to do – and it might be going swimmingly for them, I have no idea; it would certainly be interesting to see some hard facts on the case).
I'll note that the only reason that the HDM might be expected to work (legally), is that its software is supposed to be able to "extract" ideas from expressions, and rework them into new expressions. This is not intended to be a matter of simply rewording things, but how exactly it would work is still up in the air. Insofar as math is a subject of expressions, above and beyond ideas, HDM will have met its fatal limit. If math is found to truely be a subject of ideas whose expressions are in the end inconsequential, then the HDM has a hope.
A subject like math is palpably full of ideas, and one may sometimes lose sight of the forest for the trees. It is my strong caution to all writers for the PM project and editors for the FEM project that they pay careful attention to the difference between ideas and expressions. Don't be caught in the rewording trap. Authors should be prepared to document the process they use to come up with the expression given to ideas in an entry. If the process is "I don't know anymore how I learned this definition, it is basically common knowledge," that is probably a good thing. If the process is "I looked it up in this book and that's basically what they said," there are some serious questions: has the author of the entry identified the main ideas of the source document? Has the author found a way to express those ideas independently of each other, and independently of the expressions (and cumulative expression) given in the original source? If these questions can be answered in the affirmative, then it seems likely that the article is going to be OK. But without documentation of this process, a reader or editor ought to be skeptical. Citing sources is not enough, the author should also describe her or his relationship to those sources.
From the purely legal point of view, what allows such copying from a book is the doctrine of fair use, which is codified in section 107 of copyright law. However, as the U. S. Copyright office points out, there are no hard and fast guidelines as to what constitutes fair use. Rather, it is decided on a case-by-case basis in accordance with the four general guidelines laid out in the law.
Since I know next to nothing about copyright law, don't look to me for any advice on this subject but, in light of what of was mentioned, it certainly seems that caution is in order when dealing with commercial works (as opposed to works in the public domain or released under some form of free license).
In particular, I noticed that one of the provisions of the law is that the effect on the marketability of a work is to be considered in determining fair use. In connection with this guideline, let me remind everyone of what happened with Math World, especially since that indirectly led to the formation of Planet Math.
Again, I have no idea of how relevant it is legally, but from the standpoint of common sense, the suggestions on the process used to write an entry seem reasonable. Also, I like Aaron's suggestion that, when basing an entry on an existing work, one should consider improving the work and adding value and that that too could serve as some sort of defense against plagarism.
Since ignorance of the law is no excuse, it might not be a bad idea for all of us involved with this business of Free Math to learn more about copyright law and how it has been interpreted. A factor which complicates this is that contributors to Planet Math come from different countries. Since the copyrights to the different entries belong to their respective authors, this might be a potential source for added complexity.
Moving from law to language, I would say that, in mathematical writing, there is an important distinction to be made between formal and expository writing. For the present purpose, let me define "formal mathematical writing" to mean something written in natural language which is equivalent to a purely symbolic expression. For instance, consider the following excerpt:
The notion of continuity plays an important role in analysis. A real function is said to be continuous if and only if, for every open set, the preimage of that open set is an open set.
In this excerpt, I would call the first sentence expository and the second sentence formal. The second sentence means the same thing as the following symbolic formula:
Moreover, in formal, as opposed to expository, mathematical writing, literary considerations and devices play little or no role. Given a particular mathematical formula, an author does not have much leeway in choosing how to express it in words. Rather, formal mathematical prose is standardized and abounds with idioms like "there exists", "if and only if", "the following are equivalent". The only creativity on the part of the author in formal mathematical writing is the creativity that goes into coming up with the definitions and proofs and has nothing to do with creative writing.
I would argue that, in assessing an author's reliance upon sources, one needs to keep this fact in mind. Formal mathematical sentences expressing the same equation are bound to be identical or nearly identical in different sources whether or not they were copied from each other because of the nature of formal mathematical writing. Therefore, it would seem to me, the concerns about expression of ideas would only apply to the expository parts of mathematical writing and to the manner in which the formal parts are presented. --rspuzio 9 March 2005
The point about formal writing that Ray made above is particularly interesting and important. This point, as far as I understand it, really ought to take the form of a question - does is the fact that something is written formally nullify its copyrightability? I would say that the answer is "no". Computer programs are written formally – more formally than most mathematics, because they typically are designed to be run on computers. Yet software is copyrightable, and copyright law is the "domain of enforcement" for the free software movement.
However, that does not mean that formally written pieces of mathematics, once written down, can never be written again by anyone new. (Words, once said, can never be un-said, however they can be said again, and again, and again… and they often are.) Many formal sentences in mathematics assert facts, and facts are not copyrightable. Moreover, these sentences are written in the language of mathematics, and I don't think that anyone can bar anyone else from using this language. (Children and childish adults can say "don't say what I said", but I don't know if there are any cases in which someone said "you can't use the language I use." Such a case would certainly be interesting, but I don't think it exists; anyway, languages are tools, and therefore are not subject to copyright - though they could be subject to patent.) You might think that made them "basically public domain" – but actually, I think issues of "expression" may still come into play.
If the author lays down certain meta-rules for self-expression (whether tacitly or explicitly) then I think it should be easy for that author to show on a case by case basis how her or his writing conforms exactly to this personal writing spec. This is my plan for HDM, which will have a style as horrifyingly clear and precisely defined as what you'd see if Russell, Bourbaki, Rudin, and Knuth were working together to write a monograph.
And even still (!) there may be some difficulties. The very collections of ideas or facts you find embodied in a work have some "expression" associated with them, simply by dint of being found together in the same place. Also, one could argue that definitions are not statements of fact, that rather, that they are "expressive" labelings of certain mathematical objects.
The recourse I envision for the HDM lies in the view that mathematics, collections of mathematical facts, and mathematical definitions, are, in the end, essentially all tools (just like language and words) and that, as stated previously, tools are not subject to copyright. In order to wield a mathematical tool, one must be able to build it. If someone comes around complaining that the HDM has robbed them of their "expression", I hope to be able to say that, no, it only built a copy of the tool they exhibited plans for.
Is this different from stealing "The Third Eye" and claiming that manuscripts are also simply tools? Perhaps not in a certain view - but I think that there may be some philosophers wise (or clever) enough to be able to make the distinction. --jcorneli Wed Mar 09 22:57:05 2005 UTC
In response to "Quick point no. 3", I would say that it is much more than hope. Unless an author specifically says so, the copyright remains with the author. I, for one, have not yet signed any documents transferring copyright of my entries to Planet Math. While I have given Planet Math permission to distribute these entries by posting them to the website, that is quite a different matter from giving away my copyright.
Perhaps I wasn't sufficiently clear on the point, but my argument not based on a principle that "the fact that something is written formally nullify its copyrightability". I was basing my argument of the more specific premise that mathematical equations cannot be protected by law. As you point out, there are types of formal expresssion, such as computer programs, which can be protected by law. However, I would also maintain that it is a generally accepted principle that the author of a mathematical equation cannot claim legal protection for that equation. It might be a good idea to double-check about the extent to which it is impossible to obtain legal protection for a mathematical formula; I am quite sure that this is correct, but I only know this second-hand and indirectly, so it wouldn't be a good idea to double-check before making anything important ride upon this principle. If so, then my argument still holds.
I am even more certain of the principle that mathematical ideas and objects cannot be legally protected by their discoverer or inventor. I would agrue that, as far as the law goes (as opposed to philosophical principles involved), this principle would be the primary distinction between the matrix article on Planet Math and the Matrix film. In the latter case, using the same plot and characters even if one changed the names constituted infringement whilst in the former case using the same definitions, theorems and proofs with the same names is legal. --rspuzio 9 March 2005
Weird stuff, huh? --jcorneli Thu Mar 10 03:23:52 2005 UTC
As a rather different example, consider the following:
http://www.clwbar.org/resipsa/december-january02/article460.html
Comparing annd contrasting these two examples rather dramatically points up the difference between "derivative works" vs. "transformative use". I think that we need to be clear on this point since it may prove important in understanding what one may or may not do when writing math entries. --rspuzio 14 March 2005