Open Knowledge Registry/AsWiki

This page is now "retired" as it has been superseded by the Comprehensive Knowledge Archive Network (CKAN) site.


= General =


 * Wikipedia: DONE
   * dc.title: wikipedia encyclopedia
   * url: http://download.wikimedia.org/wikipedia/
     * English: http://download.wikimedia.org/wikipedia/en/
   * releases: N/A. Continuously updated so no fixed release dates.
   * dct.created: 2001
 * www.archive.org (WONTDO -- not a single entity)
 * Project Gutenberg DONE
 * Christian Classics Ethereal Library DONE
   * url: http://www.ccel.org/
   * doesn't state a license but they do provide source xml, html, rtf etc!
 * world wide molecular matrix - at Cambridge University DSpace (DONE - crystaleye)
 * Website Attica: http://www.chass.utoronto.ca/attica/
 * MIT OpenCourseWare (not really open, but should probably do ...)
 * Connexions DONE
   * Knowledge should be free, open, and shared. Connexions is a rapidly growing collection of free scholarly materials and a powerful set of free software tools to help
   * http://cnx.rice.edu/
 * http://www.opencontent.org/ (WONTDO - no specific open material)
   * OpenContent.org is reinventing itself as a portal into high quality, open access educational materials and educational discussions. Using the box above you can search the OpenCourseWare collection and its official Español and Portugues translations, the Connexions collection, and the Open Learning Support support forums
 * Mathematics (WONTDO - not sure of size or status)
   * http://www.archim.org.uk/notes/ - the Archimedeans, Cambridge. Use is open.
 * http://www.mathforge.net ?? not so sure
 * http://www.opentextbook.org + http://www.opengeodata.org. Both starter projects. (WONTDO: not sources of data)
 * https://www.bioforge.net Community for Biological Innovation (WONTDO - inactive)
 * http://www.keithbriggs.net -- online departure board information (WONTDO - no data it seems)
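
The entries above follow a loose pattern: a name, an optional status marker (DONE or WONTDO), and "key: value" properties such as dc.title, url and dct.created. As a rough sketch only, the pattern could be read into records as shown below. This assumes sub-properties are written as bullets indented beneath their entry; the names RegistryEntry, parse_entries, STATUS and SAMPLE are hypothetical and are not part of any registry software.

```python
import re
from dataclasses import dataclass, field

# Status markers as they appear on this page.
STATUS = re.compile(r"\b(DONE|WONTDO)\b")


@dataclass
class RegistryEntry:
    """One registry entry (hypothetical structure, for illustration only)."""
    name: str
    status: str = ""
    properties: dict = field(default_factory=dict)


def parse_entries(lines):
    """Group indented 'key: value' bullets under the preceding top-level bullet."""
    entries = []
    for raw in lines:
        if not raw.strip():
            continue
        indent = len(raw) - len(raw.lstrip(" "))
        text = raw.strip().lstrip("*").strip()
        if indent <= 1:
            # Top-level bullet: a name, optionally followed by a status marker.
            match = STATUS.search(text)
            name = text[:match.start()] if match else text
            status = match.group(1) if match else ""
            entries.append(RegistryEntry(name=name.rstrip(" :("), status=status))
        elif entries and ": " in text:
            # Indented bullet: a property of the most recent entry.
            key, value = text.split(": ", 1)
            entries[-1].properties[key] = value
    return entries


SAMPLE = """\
 * Wikipedia: DONE
   * dc.title: wikipedia encyclopedia
   * url: http://download.wikimedia.org/wikipedia/
   * dct.created: 2001
 * www.archive.org (WONTDO -- not a single entity)
"""

entries = parse_entries(SAMPLE.splitlines())
for entry in entries:
    print(entry.name, entry.status, entry.properties)
```

Free-text sub-bullets without a "key: value" shape (for example a bare description) are simply skipped by this sketch; a fuller version would need a rule for them.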

= Property Data =
 * Land Registry UK (WONTDO - commercial)
   * country: gb
   * url: http://www.landregistry.gov.uk/property_info/
   * comments: have to pay for most of the detailed data

= Software =
 * okftext
 * text, pdf, latex, html, xml
 * lilypond
 * tags: music

= Geodata =


 * http://mappinghacks.com/data/

= Music =

MusicBrainz DONE


 * Recorded Music CD Database
 * license: cc-by-nc-sa
 * format: rdf
 * url: http://musicbrainz.org/

Texts

 * Mutopia Project DONE
 * url: http://www.mutopiaproject.org/
 * data: 579 works as of 2005-11-11
 * format: All music is available as Postscript (.ps) and PDF (.pdf) files, for both A4 and Letter paper sizes, as well as Lilypond's own file format (.ly)
 * The Choral Public Domain Library: DONE
 * url: http://www.cpdl.org/
 * Project Gutenberg's music section DONE
 * url: http://gutenberg.org/music
 * The Werner Icking Music Archive DONE
 * url: http://icking-music-archive.org/
 * Lilypond
 * type: data format
 * url: http://lilypond.org/

= Economics =


 * title: innovation in specific industries DONE
   * url: http://www.hss.cmu.edu/departments/sds/faculty/klepper/archive.html
   * description: The first data set contains all the data used in the analyses reported in the paper by Steven Klepper and Kenneth L. Simons entitled, "The Making of an Oligopoly: Firm Survival and Technological Change in the Evolution of the U.S. Tire Industry," Journal of Political Economy, 2000, vol. 108, no. 4, pp. 728-760. The other two data sets contain all the data used in the analyses reported in the paper by Steven Klepper and Kenneth L. Simons entitled, "Dominance by Birthright: Entry of Prior Radio Producers and Competitive Ramifications in the U.S. Television Receiver Industry," Strategic Management Journal, vol. 21, pp. 997-1016; the first of these pertains to firms that produced radios, the second to firms that produced televisions. The page links to an explanation of how each data set is organized and to the downloads.
 * title: repec bibliography DONE
   * url: http://repec.org/ http://www.ecommunics.com/
   * description: processing the RePEc database. Making biblio generally available.

= History =


 * history event markup language DONE
   * url: http://www.heml.org/
   * status: inactive (no change since 2004)
   * type: tool and data
   * description: tools for producing timelines and geographic charts. Data is there to demo the tool rather than to be comprehensive

= Closed Data =

Bibliomania.com
Bibliomania.com went bankrupt, but its site stated:

What is the copyright status of texts on Bibliomania.com?

Most texts on our site are in the public domain. However Bibliomania.com Ltd has copyright in the HTML versions we have created for our web site. You are free to download these texts for personal use, but they may not be used for any commercial purpose, or republished in any form (including on the internet) without our prior email permission. Bibliomania.com Ltd has and will take legal action worldwide to protect its rights.

Please use the comments board to email us for copyright permission.

How do I cite a Bibliomania work?

We do not have full bibliographic data for the texts on Bibliomania, and they were typed from scratch, repaginated and reformatted hence these works are an original edition and should be cited as copyright Bibliomania.com Ltd 2000.

genuki.org.uk/big/eng/YKS/
Genuki provides historical and genealogical information, including information on Yorkshire. Its claims relate not to copyright (though what copyright could there be in 100-year-old photographs?) but to database rights in information, much of which comes from 1892.

http://www.genuki.org.uk/big/eng/YKS/Misc/conditions.html

((( All the material which is to be found on the Genuki Yorkshire site (any page which has a URL starting with "www.genuki.org.uk/big/eng/YKS/") is held in a database by me, and software to which I own the copyright, is used to extract the relevant data and generate the pages which you see on the Genuki Yorkshire site. A United Kingdom Act of 1997 specifically covers the compiling and use of database material. The notice below is required to be displayed in order to give me protection under this Act: Database Right, all databases used for this website are covered by the 1997 Database Regulations. Colin Hinson (and others as stated on the relevant pages) are the makers of the database used for this website and the owner of the database rights. First published in 1997. )))

Music Databases
Music databases (including open ones such as Mutopia) claim copyright in the typesetting of their musical scores. While there is a compilation-type copyright (for presentation) in most jurisdictions, I don't really see how this would cover the representation of the score in a musical notation such as LilyPond or **kern when the original music is out of copyright.

One of the more outrageous (and stupid) examples of using this copyright to close access is on http://www.musedata.org/ (what makes it particularly bad is that this is an academic project):

The research license: http://www.musedata.org/legal/licen.html -- MuseData files are provided free of charge to academic and non-commercial users but they remain the intellectual property of the Center for Computer Assisted Research in the Humanities, Braun #129, Stanford University, Stanford, CA, 94305-3067, USA.

Before downloading any materials from this site, please indicate your acceptance of the terms of this license agreement

All other prospective users must contact the Center for Computer Assisted Research in the Humanities, Braun #129, Stanford University, Stanford, CA 94305-3076, USA, before downloading, copying, or redistributing any data, in whole or in part, as found here or in derivative versions, in any format, electronic or otherwise, found at this site. --

The same site also runs Themefinder, whose about page starts with:

Both the notated images and underlying data representations used by Themefinder are protected by international copyright laws. Visitors to this site are free to use Themefinder to search for musical themes for personal, teaching and non-commercial research purposes. However, any attempt to download the database, in whole or in part, will be considered a breach of copyright, and may lead to denial of access or legal action.

The notated images are copyright © 1999-2000 by the Center for Computer Assisted Research in the Humanities. The encoded thematic material is copyright © 1999-2000 by David Huron and CCARH. The encoded European folksongs are copyright © 1999 by the estate of Helmut Schaffrath and used by permission. The Latin Motet thematic material is copyright © 1993 by Harry B. Lincoln and used by permission.

Shakespeare
Given Shakespeare's public domain status, it is incredible how much material claims copyright in his works or in related information, e.g.:


 * http://shakespeare.palomar.edu/timeline/summarychart.htm
   * ©1998 Terry A. Gray - Do not duplicate or use without permission - Last Modified 09/16/00
 * http://www.shakespeare-online.com/keydates/playchron.html
   * http://www.shakespeare-online.com/siteinfo/copyright.html: 'All information provided by Shakespeare Online is owned by Shakespeare Online (excluding outside links) and any user is permitted to store, manipulate, analyze, print, and display the information on Shakespeare Online only for such user's personal use. In no event shall any user publish or redistribute or otherwise reproduce any Shakespeare Online graphics or written content in any format to anyone, and no user shall use any Shakespeare Online information in connection with any business enterprise.'
 * http://www.shaksper.net/archives/files/chronology.html
   * Copyright © 2005, Hardy M. Cook

Open Source Shakespeare
Despite its name, the site bears this statement at the bottom of each page:

Program code and database © 2003-2006 Bernini Communications LLC. If copyrighted, texts are the property of their respective owners.


 * http://www.opensourceshakespeare.org/

The site does contain interesting information on how most Shakespeare texts end up being copyrighted again:
 * http://www.opensourceshakespeare.org/info/moby_shakespeare.php

National archives

 * free to access and transcribe
 * 25GBP per copy

Not yet investigated properly

 * Land Registry
 * Companies House
 * ONS (Office for National Statistics)
 * met office: can't even find out what is available and what it costs

= Sites Contacted without Response =


 * History of Economic Thought Website
 * http://cepa.newschool.edu/het/
 * Contacted: 2004-10